Cisco Systems, Inc.
170 West Tasman Drive
San Jose, CA 95134-1706
USA
http://www.cisco.com
Tel: 408 526-4000
800 553-NETS (6387)
Fax: 408 526-4100
Text Part Number: OL-1237-01
THE SPECIFICATIONS AND INFORMATION REGARDING THE PRODUCTS IN THIS MANUAL ARE SUBJECT TO CHANGE WITHOUT NOTICE. ALL
STATEMENTS, INFORMATION, AND RECOMMENDATIONS IN THIS MANUAL ARE BELIEVED TO BE ACCURATE BUT ARE PRESENTED WITHOUT
WARRANTY OF ANY KIND, EXPRESSED OR IMPLIED. USERS MUST TAKE FULL RESPONSIBILITY FOR THEIR APPLICATION OF ANY PRODUCTS.
THE SOFTWARE LICENSE AND LIMITED WARRANTY FOR THE ACCOMPANYING PRODUCT ARE SET FORTH IN THE INFORMATION PACKET THAT
SHIPPED WITH THE PRODUCT AND ARE INCORPORATED HEREIN BY THIS REFERENCE. IF YOU ARE UNABLE TO LOCATE THE SOFTWARE LICENSE
OR LIMITED WARRANTY, CONTACT YOUR CISCO REPRESENTATIVE FOR A COPY.
The following inform ation is for FCC compliance of Class A devices: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant
to part 15 of the FCC rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial
environment. This equipment generates, uses, and can radiate radio-frequency energy and, if not installed and used in accordance with the instruction manual, may cause
harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case users will be required
to correct the interference at their own expense.
The following information is for FCC compliance of Class B devices: The equipment described in this manual generates and may radiate radio-frequency energy. If it is not
installed in accordance with Cisco’s installation instructions, it may cause interference with radio and television reception. This equipment has been tested and found to
comply with the limits for a Class B digital device in accordance with the specifications in part 15 of the FCC rules. These specifications are designed to provide reasonable
protection against such interference in a residential installation. However, there is no guarantee that interference will not occur in a particular installation.
Modifying the equipment without Cisco’s written authorization may result in the equipment no longer complying with FCC requirements for Class A or Class B digital
devices. In that event, your right to use the equipment may be limited by FCC regulations, and you may be required to correct any interference to radio or television
communications at your own expense.
You can determine whether your equipment is causing interference by turning it off. If the interference stops, it was probably caused by the Cisco equipment or one of its
peripheral devices. If the equipment causes interference to radio or television reception, try to correct the interference by using one or more of the following measures:
• Turn the television or radio antenna until the interference stops.
• Move the equipment to one side or the other of the television or radio.
• Move the equipment farther away from the television or radio.
• Plug the equipment into an outlet that is on a different circuit from the television or radio. (That is, make certain the equipment and the television or radio are on circuits
controlled by different circuit breakers or fuses.)
Modifications to this product not authorized by Cisco Systems, Inc. could void the FCC approval and negate your authority to operate the product.
NOTWITHSTANDING ANY OTHER WARRANTY HEREIN, ALL DOCUMENT FILES AND SOFTWARE OF THESE SUPPLIERS ARE PROVIDED “AS IS” WITH
ALL FAULTS. CISCO AND THE ABOVE-NAMED SUPPLIERS DISCLAIM ALL WARRANTIES, EXPRESSED OR IMPLIED, INCLUDING, WITHOUT
LIMITATION, THOSE OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT OR ARISING FROM A COURSE OF
DEALING, USAGE, OR TRADE PRACTICE.
IN NO EVENT SHALL CISCO OR ITS SUPPLIERS BE LIABLE FOR ANY INDIRECT, SPECIAL, CONSEQUENTIAL, OR INCIDENTAL DAMAGES, INCLUDING,
WITHOUT LIMITATION, LOST PROFITS OR LOSS OR DAMAGE TO DATA ARISING OUT OF THE USE OR INABILITY TO USE THIS MANUAL, EVEN IF CISCO
OR ITS SUPPLIERS HAVE BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
CCVP, the Cisco Logo, and the Cisco Square Bridge logo are trademarks of Cisco Systems, Inc.; Changing the Way We Work, Live, Play, and Learn is a service mark of Cisco Systems,
Inc.; and Access Registrar, Aironet, BPX, Catalyst, CCDA, CCDP, CCIE, CCIP, CCNA, CCNP, CCSP, Cisco, the Cisco Certified Internetwork Expert logo, Cisco IOS, Cisco
Cisco Systems, Cisco Systems Capital, the Cisco Systems logo, Cisco Unity, Enterprise/Solver, EtherChannel, EtherFast, EtherSwitch, Fast Step, Follow Me Browsing,
FormShare, GigaDrive, HomeLink, Internet Quotient, IOS, iPhone, IP/TV, iQ Expertise, the iQ logo, iQ Net Readiness Scorecard, iQuick Study, LightStream, Linksys,
MeetingPlace, MGX, Networking Academy, Network Registrar, Pac ke t, PIX, ProConnect, RateMUX, ScriptShare, SlideCast, SMARTnet, StackWise, The Fastest Way to Increase
Your Internet Quotient, and TransPath are registered trademarks of Cisco Systems, Inc. and/or its affiliates in the United States and certain other countries.
All other trademarks mentioned in this document or Website are the property of their respective owners. The use of the word partner does not imply a partnership relationship
between Cisco and any other company. (0704R)
Troubleshooting System Crashes3-8
High CPU Utilization Problems3-9
ARP Traffic3-9
CPUHOG Errors3-11
Debug and System Messages3-11
Exec and Virtual Exec Processes3-11
Interrupts are Consuming a Large Amount of Resources3-12
Invalid Scheduler Allocate Configuration3-12
IP Input Processing3-12
One or More Processes is Consuming an Excessive Amount of Resources3-12
Problems with Access Lists3-13
SNMP Traffic3-13
Bus Errors3-13
Memory Problems3-15
Alignment Errors3-15
Low Memory Errors3-16
Memory Parity Errors3-16
Particle Pool Fallbacks3-17
Spurious Interrupts3-18
Spurious Memory Accesses3-19
CHAPTER
iv
4Troubleshooting Line Cards4-1
General Information for Troubleshooting Line Card Crashes4-2
Cache Parity Errors4-4
Bus Errors4-5
Software-Forced Crashes4-6
Troubleshooting the Timing, Communication, and Control Plus Card4-8
Troubleshooting the OC-12 Packet-Over-SONET Line Card4-12
Troubleshooting the OC-12 Dynamic Packet Transport Spatial Reuse Protocol WAN Card4-14
Troubleshooting the Cisco uBR10012 OC-48 DPT/POS Line Card4-16
Troubleshooting the Gigabit Ethernet Line Card4-18
This guide documents processes and procedures for user level hardware troubleshooting on the
Cisco uBR10012 universal broadband router. For complete configuration instructions, please refer to the
Cisco uBR10012 Universal Broadband Router Software Configuration Guide and the documents listed
in the “Related Documentation” section on page viii.
• Purpose, page vii
• Audience, page vii
• Document Organization, page viii
• Related Documentation, page viii
• Obtaining Documentation, page ix
Purpose
Audience
• Documentation Feedback, page ix
• Obtaining Technical Assistance, page x
• Obtaining Additional Publications and Information, page xi
The Cisco uBR10012 router provides data and Voice over IP (VoIP) services to cable modems (CMs)
and customer premises equipment (CPE) devices over a cable TV (CATV) network, supplying
high-speed Internet and voice connectivity over the coaxial cable that provides TV and other signals.
Many of the Cisco uBR10012 modules are available in redundant configurations, so that the failure of
one module does not affect systems operations. This guide provides troubleshooting steps for a failed
component that you can take before system failure occurs and before intervention from higher level
support agencies becomes necessary.
To benefit from this guide, you must be experienced using Cisco IOS and have some responsibility for
installing, configuring, or operating the Cisco uBR10012 router. Knowledge of basic cable data network
operations and of the Data-Over-Cable Service Interface Specifications (DOCSIS), which define the
transmission of data and other services over a coaxial cable TV network.
Chapter 1, “Basic Troubleshooting Tasks and
Startup Issues”
Chapter 2, “PEM Faults and Fan Assembly
Failures”
Chapter 3, “Troubleshooting PRE-1 Modules” How to troubleshoot Performance Routing Engine (PRE-1) modules. It
Chapter 4, “Troubleshooting Line Cards”Troubleshooting faults for all following Cisco uBR10012 line cards.
Chapter 5, “Replacing or Recovering
Passwords”
Appendix A, “Unsupported Commands”A list of the commands that are not supported in Cisco IOS Release
Appendix B, “Recommended Tools and Test
Equipment”
Basic procedures that users should perform before undertaking a detailed
troubleshooting analysis of the Cisco uBR10012 router or logging a case
with the Cisco Technical Assistance Center (TAC).
Methods for troubleshooting faults involving the Cisco uBR10012 Power
Entry Modules (PEMs) and blower modules.
provides information on troubleshooting PRE-1 fault states, the
management Ethernet port, and the serial port.
How to recover a lost enable or console login password, and how to replace
a lost enable secret password on the Cisco uBR10012 router.
12.2(15)BC1 for the Cisco uBR10012 router.
A list of basic tools and test equipment necessary to perform maintenance
and troubleshooting tasks on the Cisco uBR10012 router.
Related Documentation
When troubleshooting the Cisco uBR10012 router, you should use the Cisco uBR10012 Universal
Broadband Router Troubleshooting Guide with the following documents:
• Cisco uBR10012 Universal Broadband Router Release Notes—Provides the most up-to-date
information about software version requirements for using the router. It also provides information
about bugs and workarounds. See the following URL:
• Cisco uBR10012 Universal Broadband Router Hardware Installation Guide—Contains information
about the hardware of the Cisco uBR10012 router, how to install the router, connect its cables, and
start the system up for the first time. See the following URL:
For more information about the IOS software that runs on the Cisco uBR10012 router, see the Cisco IOS
command reference books and configuration guides:
• Cisco Broadband Cable Command Reference Guide—Describes the cable specific commands used
on the Cisco uBR10012 router. See the following URL:
Cisco documention and additional literature are available on Cisco.com. Cisco also provides several
ways to obtain technical assistance and other technical resources. These sections explain how to obtain
technical information from Cisco Systems.
Obtaining Documentation
Cisco.com
You can access the most current Cisco documentation on the World Wide Web at this URL:
http://www.cisco.com/univercd/home/home.htm
You can access the Cisco website at this URL:
http://www.cisco.com
International Cisco websites can be accessed from this URL:
You can submit comments by using the response card (if present) behind the front cover of your
document or by writing to the following address:
Cisco Systems
Attn: Customer Document Ordering
170 West Tasman Drive
San Jose, CA 95134-9883
We appreciate your comments.
Obtaining Technical Assistance
For all customers, partners, resellers, and distributors who hold valid Cisco service contracts, the Cisco
Technical Assistance Center (TAC) provides 24-hour-a-day, award-winning technical support services,
online and over the phone. Cisco.com features the Cisco TAC website as an online starting point for
technical assistance. If you do not hold a valid Cisco service contract, please contact your reseller.
Cisco TAC Website
Preface
The Cisco TAC website provides online documents and tools for troubleshooting and resolving technical
issues with Cisco products and technologies. The Cisco TAC website is available 24 hours a day, 365
days a year. The Cisco TAC website is located at this URL:
http://www.cisco.com/tac
Accessing all the tools on the Cisco TAC website requires a Cisco.com user ID and password. If you
have a valid service contract but do not have a login ID or password, register at this URL:
http://tools.cisco.com/RPF/register/register.do
Opening a TAC Case
Using the online TAC Case Open Tool is the fastest way to open P3 and P4 cases. (P3 and P4 cases are
those in which your network is minimally impaired or for which you require product information.) After
you describe your situation, the TAC Case Open Tool automatically recommends resources for an
immediate solution. If your issue is not resolved using the recommended resources, your case will be
assigned to a Cisco TAC engineer. The online TAC Case Open Tool is located at this URL:
http://www.cisco.com/tac/caseopen
For P1 or P2 cases (P1 and P2 cases are those in which your production network is down or severely
degraded) or if you do not have Internet access, contact Cisco TAC by telephone. Cisco TAC engineers
are assigned immediately to P1 and P2 cases to help keep your business operations running smoothly.
To open a case by telephone, use one of the following numbers:
To ensure that all cases are reported in a standard format, Cisco has established case priority definitions.
Priority 1 (P1)—Your network is “down” or there is a critical impact to your business operations. You
and Cisco will commit all necessary resources around the clock to resolve the situation.
Priority 2 (P2)—Operation of an existing network is severely degraded, or significant aspects of your
business operation are negatively affected by inadequate performance of Cisco products. You and Cisco
will commit full-time resources during normal business hours to resolve the situation.
Priority 3 (P3)—Operational performance of your network is impaired, but most business operations
remain functional. You and Cisco will commit resources during normal business hours to restore service
to satisfactory levels.
Priority 4 (P4)—You require information or assistance with Cisco product capabilities, installation, or
configuration. There is little or no effect on your business operations.
Obtaining Additional Publications and Information
Information about Cisco products, technologies, and network solutions is available from various online
and printed sources.
• Cisco Marketplace provides a variety of Cisco books, reference guides, and logo merchandise. Go
to this URL to visit the company store:
http://www.cisco.com/go/marketplace/
• The Cisco Product Catalog describes the networking products offered by Cisco Systems, as well as
ordering and customer support services. Access the Cisco Product Catalog at this URL:
http://cisco.com/univercd/cc/td/doc/pcat/
• Cisco Press publishes a wide range of general networking, training and certification titles. Both new
and experienced users will benefit from these publications. For current Cisco Press titles and other
information, go to Cisco Press online at this URL:
http://www.ciscopress.com
• Packet magazine is the Cisco quarterly publication that provides the latest networking trends,
technology breakthroughs, and Cisco products and solutions to help industry professionals get the
most from their networking investment. Included are networking deployment and troubleshooting
tips, configuration examples, customer case studies, tutorials and training, certification information,
and links to numerous in-depth online resources. You can access Packet magazine at this URL:
http://www.cisco.com/packet
• iQ Magazine is the Cisco bimonthly publication that delivers the latest information about Internet
business strategies for executives. You can access iQ Magazine at this URL:
http://www.cisco.com/go/iqmagazine
OL-1237-01
• Internet Protocol Journal is a quarterly journal published by Cisco Systems for engineering
professionals involved in designing, developing, and operating public and private internets and
intranets. You can access the Internet Protocol Journal at this URL:
This section describes the basic procedures that users should perform before undertaking a detailed
troubleshooting analysis of the Cisco uBR10012 router or logging a case with the Cisco Technical
Assistance Center (TAC).
These basic troubleshooting checks are organized as follows:
• Basic Troubleshooting Checklist, page 1-1
• Confirming the Hardware Installation, page 1-2
• Displaying the Cisco IOS Software Version, page 1-3
• Displaying System Environment Information, page 1-4
• Hardware Troubleshooting Flowchart, page 1-4
• Cisco uBR10012 System Startup Sequence, page 1-5
Basic Troubleshooting Checklist
If you encounter a problem after you install the Cisco uBR10012 router, go through the following
troubleshooting checklist to check for the most common error conditions before you contact the Cisco
Technical Assistance Center (TAC) or before you perform a detailed troubleshooting analysis:
1. Is the power on?
2. Is each Power Entry Module (PEM) securely inserted into the router? Is each PEM connected to a
power source that is supplying voltage in the proper AC or DC range? Are all power leads and cables
firmly connected at both ends?
3. Is the fan assembly module installed in the chassis and operating? Can you hear the fans operating,
and when you put your hand in front of the fan blowers, can you feel the air flow? Are all empty
slots covered with blank front panels, to ensure the correct air flow through the chassis for cooling?
4. Is each PRE-1 module firmly seated and securely inserted in the chassis?
5. Is at least one Timing, Communication and Control Plus (TCC+) card installed in the router?
6. Are the other line cards firmly seated and securely screwed to the chassis?
7. Are all data cables firmly connected at both ends?
8. Are the ports properly configured? Refer to the Cisco uBR10012 Universal Broadband Router
Software Configuration Guide for configuration examples.
After going through this checklist, go through the remaining sections in this chapter to verify the
installation and to perform basic troubleshooting.
Start troubleshooting the installation by issuing the show hardware command. The show hardware
command displays all hardware components that are recognized by the system. These components can
include the following:
• Performance Routing Engine (PRE-1) modules (minimum of one, maximum of two)
• FastEthernet Interface (onboard the active PRE-1 module)
• Cable Interface line cards (minimum of one, maximum of eight):
–
Cisco uBR10-MC5X20S-D
–
Cisco uBR-LCP2-MC16C
–
Cisco uBR-LCP2-MC16E
–
Cisco uBR-LCP2-MC16S
–
Cisco uBR-LCP2-MC28C
• WAN interface uplink line cards (minimum of one, maximum of four):
–
Cisco uBR10-1GE Gigabit Ethernet (GigE)
–
Cisco uBR10-1OC12/P-SMI Packet Over SONET (POS)
Chapter 1 Basic Troubleshooting Tasks and Startup Issues
–
Cisco uBR10-SRP-OC12SML Dynamic Packet Transport (DPT) Spatial Reuse Protocol (SRP)
–
Cisco uBR10-OC-48 DPT/POS
• Timing, Communication and Control Plus (TCC+) card (minimum of one, maximum of two)
If an installed item does not appear in the command output, make sure the item is properly installed. For
example, make sure the line cards are fully inserted into the slot and the captive screws are tightened. If
the problem persists, consult the Cisco uBR10012 release notes to confirm that this is not an existing
problem. Finally, you should consider replacing the component.
The following example shows typical output from the show hardware command:
UBR10K-ROUTER1#show hardware
Cisco Internetwork Operating System Software
IOS (tm) 10000 Software (UBR10K-P6-M), Released Version 12.2(8)BC2
Copyright (c) 1986-2002 by cisco Systems, Inc.
Compiled Mon 12-Aug-02 17:53 slacmar
Image text-base: 0x60008940, data-base: 0x61730000
ROM: System Bootstrap, Version 12.0(9r)SL2, RELEASE SOFTWARE (fc1)
BOOTLDR: 10000 Software (C10K-EBOOT-M), Version 12.0(17)ST, RELEASE SOFTWARE)
UBR10K-ROUTER1 uptime is 3 weeks, 21 hours, 43 minutes
System returned to ROM by power-on
System restarted at 13:00:51 PDT Mon Dec 13 2003
System image file is “disk0:/ubr10k-k9p6-mz”
cisco uBR10000 (PRE1-RP) processor with 425983K/98304K bytes of memory.
Processor board ID DEFGHIJKLMN
R7000 CPU at 262Mhz, Implementation 39, Rev 2.1, 256KB L2, 2048KB L3 Cache
Backplane version 1.0, 8 slot
1-2
Last reset from power-on
PXF processor tmc0 is running.
PXF processor tmc1 is running.
2 TCCplus card(s)
1 FastEthernet/IEEE 802.3 interface(s)
125440K bytes of ATA PCMCIA card at slot 1 (Sector size 512 bytes).
32768K bytes of Flash internal SIMM (Sector size 256KB).
Configuration register is 0x2102
UBR10K-ROUTER1#
Displaying the Cisco IOS Software Version
Use the show version command to confirm that the router is running the proper version of Cisco IOS
software and has a sufficient amount of system memory. The command also reports the system uptime
and the method by which the system was powered up.
In the following sample of output from the show version command, some of the information that may
be useful for troubleshooting appears in bold type:
UBR10K-ROUTER1# show version
Displaying the Cisco IOS Software Version
Cisco Internetwork Operating System Software
IOS (tm) 10000 Software (UBR10K-P6-M), Released Version 12.2(8)BC2
Copyright (c) 1986-2002 by cisco Systems, Inc.
Compiled Thu 19-Apr-01 13:47 by skabar
Image text-base: 0x60008960, data-base: 0x612B0000
ROM: System Bootstrap, Version 12.0(9r)SL1, RELEASE SOFTWARE (fc1)
BOOTFLASH: 10000 Software (C10K-EBOOT-M), Released Version 12.2(1)
UBR10K-ROUTER1 uptime is 3 weeks, 21 hours, 43 minutes
System returned to ROM by power-on
System restarted at 13:00:51 PDT Mon Dec 13 2003
cisco uBR10000 (PRE-1-RP) processor with 393215K/131072K bytes of memory.
Processor board ID DEFGHIJKLMN
R7000 CPU at 262Mhz, Implementation 39, Rev 2.1, 256KB L2, 2048KB L3 Cache
Backplane version 1.0, 8 slot
Chapter 1 Basic Troubleshooting Tasks and Startup Issues
Displaying System Environment Information
Displaying System Environment Information
Use the show environment command to display the basic system environment status, to verify the
following:
• Make sure the system operating temperature is equal to or less than 41° F at the inlet and 104° F
degrees at the core (5° C and 40° C).
• That the fan assembly module is installed in the chassis and operating properly.
• Report the operational status of the PEMs and blower
If the operating temperature is not between 41° F and 104° F, refer to the “Fan Assembly Module Faults”
section on page 2-7.
The following example is sample output from the show environment command for a system with two
DC PEMs installed:
UBR10K-ROUTER1# show environment
Temperature normal:chassis inlet measured at 29C/84F
Temperature normal:chassis core measured at 39C/98F
Fan: OK
Power Entry Module 0 type DC status: OK
Power Entry Module 0 Power: 555w
Power Entry Module 0 Voltage: 62v
Power Entry Module 1 type DC status: OK
Power Entry Module 1 Power: 558w
Power Entry Module 1 Voltage: 62v
UBR10K-ROUTER1#
Hardware Troubleshooting Flowchart
Use Figure 1-1 to determine which component of your Cisco uBR10012 router is malfunctioning.
Figure 1-1 describes a series of hardware dependent startup events that must take place for a
Cisco uBR10012 router to allow the passage of IP traffic. At each main point of the flowchart, there are
pointers to the chapters in this guide that describe how to troubleshoot individual pieces of hardware.
NoteThis flowchart does not address software configuration problems.
The following sections provide methods for troubleshooting faults involving the Cisco uBR10012 DC
Power Entry Modules (PEMs), the optional 2400W AC-input power shelf, and fan assembly module.
This chapter contains the following major sections:
• AC PEM Faults, page 2-1
• DC PEM Faults, page 2-3
• 2400W AC-Input Power Shelf, page 2-5
• Other Electrical Problems, page 2-6
• Fan Assembly Module Faults, page 2-7
AC PEM Faults
CHAPTER
2
On the Cisco uBR10012 router, two AC PEMs are installed in a redundant configuration, which allows
one AC PEM to fail without affecting system operations. A single PEM can power the router for
sufficient time to request and install a new PEM to replace the one that failed.
TipTo quickly check the functional status of your PEMs, use the show environment command.
AC PEM faults can occur for the following reasons:
• PEM failure
• Invalid AC-input power being supplied by the power source
• Backplane interface failures or damage
Figure 2-1 illustrates the AC PEM and its indicators. Table 2-1 describes the indicators.
Table 2-2AC PEM Fault Symptoms and Corrective Action (continued)
DC PEM Faults
PEM experiences
problems in one
slot but operates
normally in a
different slot
Fault LED is lit
yellow
1. Ensure that the input power to both slots is correct.
2. Verify that no connections have been made to the DC-power connectors
underneath each PEM.
3. If the problem persists, contact Cisco TAC.
1. Verify that no connections have been made to the DC-power connectors
underneath each PEM.
2. Verify that the PEM is fully inserted into the power bay and that its captive
screws have been tightened.
3. Check to see if the power switch is set to the standby position. If so, set the
switch to the ON position.
4. If the problem persists, flip the power switch on the PEM to the standby
position, wait several seconds, and then back to the ON position.
5. Replace PEM with a known good replacement.
6. Contact Cisco TAC.
TipSecurely tighten the captive screws on your PEMs to prevent heightened levels of electromagnetic
interference.
DC PEM Faults
On the Cisco uBR10012 router, two DC PEMs are in a redundant configuration, which allows one DC
PEM to fail without affecting system operations. A single PEM can usually power the router for
sufficient time to request and install a new PEM to replace the one that failed.
TipTo quickly check the functional status of your PEMs, use the show environment command.
DC PEM faults can occur for the following reasons:
• PEM failure
• Reversed power cables
• Backplane interface failures or damage
Two models of the DC PEM exist.
• Figure 2-2 shows the front panel of the original DC PEM (UBR10-PWR-DC) that was initially
produced for the Cisco uBR10012 router.
• Figure 2-3 shows the front panel of the DC PEM that is currently being produced for the
Cisco uBR10012 router. The new model of the DC PEM (UBR10-PWR-DC-M) is identical in form
and function to the first version, except that it includes a connector on the front panel for connecting
to the alarm status connectors on the optional 2400-watt AC-input power shelf.
Table 2-3 describes the indicators on the front panel of both models of DC PEM.
Power (green)PEM is powered on and is operational.
Fault (yellow)PEM is not operating correctly or the circuit breaker is in the OFF position.
Miswire (yellow)Input DC power cables are wired incorrectly and should be reversed.
Table 2-4 lists the DC PEM fault symptoms and corrective actions.
Table 2-4DC PEM Fault Symptoms and Corrective Action
Fault SymptomCorrective Action
Green LED on PEM
fails to light
2400W AC-Input Power Shelf
1. Make sure the circuit breaker on the PEM is turned on.
2. Make sure the PEM is properly seated and screwed in place.
3. Make sure power leads are properly connected to power connectors on the
backplane. If connections are loose or their polarity is reversed, the chassis
does not receive power.
4. Check the external power source.
5. Move the PEM to the other PEM slot. If the PEM still fails, replace it.
PEM experiences
problems in one
slot but operates
1. Ensure that the input power to both slots is correct.
2. If the problem persists, contact Cisco TAC.
normally in a
different slot
Fault LED is lit
yellow
Miswire LED is lit
yellow
1. Check to see if the circuit breaker (on/off switch) has tripped. If it has,
return the switch to the ON position.
2. Replace PEM with a known good replacement.
3. Contact Cisco TAC.
If the MISWIRE LED is on, the power cables are reversed. Power off the PEM
and the external power source and reconnect the wires correctly. See the
Cisco uBR10012 Universal Broadband Router Hardware Installation Guide.
TipSecurely tighten the captive screws on your PEMs to prevent heightened levels of electromagnetic
interference.
2400W AC-Input Power Shelf
OL-1237-01
The 2400W AC-input power shelf converts AC-output power from an external AC power source into DC
power that is suitable for powering the Cisco uBR10012 router. The power shelf supplies –54 VDC
output power to the two DC PEMs in the Cisco uBR10012 chassis.
The power shelf includes three 1200-watt (W) AC-input power modules that plug into a common power
backplane in the 2400W AC-input power shelf. Two 1200W AC-input power modules are capable of
powering a fully configured Cisco uBR10012 router. The third power module provides full redundancy.
During normal operation, the three AC-input power modules provide automatic load-sharing with each
power module supporting 33 percent of the power load. When you remove one of the AC-input power
modules, the remaining power modules immediately ramp up to full power and maintain uninterrupted
system power for a limited time. This allows you to replace the affected module without impacting
system operations.
Faults on the 2400W AC-input power shelf can occur for the following reasons:
• The AC-input power to one or more power modules has failed.
• The AC power plug to one or more power modules has been removed or unplugged.
• One or more power modules has failed and must be replaced.
Figure 2-4 illustrates the AC PEM and its indicators. Table 2-5 describes the indicators.
Figure 2-4AC-Input Power Shelf Front Panel
D
C
O
K
A
C
O
K
F
A
U
L
T
D
C
O
K
A
C
O
K
F
A
U
L
T
Table 2-5AC-Input Power Shelf Module LEDs
LEDColorDescription
AC OKGreenThe AC-input power to the power module is present and is within the proper
range.
DC OKGreen The power module is producing DC output power in the proper range.
FAULTRed This particular power module has failed and must be replaced. The 2400W
AC-input power shelf can continue operating with only two out of the three power
modules installed, but the failed module should still be replaced as soon as
possible.
Other Electrical Problems
D
C
O
K
A
C
O
K
F
A
U
L
T
36137
2-6
If the electrical problem cannot be traced to a PEM, check the unit for:
• Improper power cable connections to the Cisco uBR10012 router
• Improper installation of other field-replaceable units (FRUs)
• Improperly grounded equipment, particularly equipment racks and power grounds
• Fluctuating voltage, which can result from excessive power drains caused by other equipment (such
as air conditioning units)
• Cable corrosion or defective power panels, circuit breakers or fuses, or cable connections
• Undersized power cables or excessive power cable lengths
• Excessive power demand on backup power systems or batteries when alternate power sources
are used
Fan Assembly Module Faults
The fan assembly module is critical to the operation of the Cisco uBR10012 router because it allows the
router to maintain proper operating temperatures. Severe overheating can result in system failure, so a
fan assembly module must always be present in the chassis while the router is operating.
Figure 2-5 shows the fan assembly module front panel and its LED indicators.
Fan Assembly Module Faults
Figure 2-5Fan Assembly Module
FANS OK
SINGLE FAN FAILURE
MULTIPLE FAN FAILURE
Fan
assembly
CISCO
10000
CISCO
10000
AUX
AU
X
AC
TIVIT
E
Y
AC
THE
TIVITY
RN
LINK
E
ETH
T
ERN
LIN
ET
K
56479
The Cisco uBR10012 fan assembly module contains four fans in a redundant configuration. One fan can
fail without affecting system operations. If more than one fan fails, however, the fan assembly module
must be replaced immediately to avoid overheating the system.
The fan assembly module draws air in from the bottom front of the Cisco uBR10012 router, through the
air filter at the bottom of the front bezel. The air is drawn up through the line cards, and then exits
through the vents at the top rear of the router.
OL-1237-01
Figure 2-6 shows the air circulation pattern of the Cisco uBR10012 router when two DC PEMs are
installed. The air flow when two AC PEMs are installed is similar. The front bezel is not shown for
clarity.
The LEDs on the front panel indicate the current status of the fans. Tab le 2-6 lists the fan assembly
module fault indications and recommended actions.
Table 2-6Fan Assembly Module Fault Indications and Recommended Action
SymptomSteps to Take
Fans OK LED is not lit
1. Make sure the fan assembly module is fully inserted into the chassis.
2. Place your hand in front of the fan assembly module outlet to determine if the fans are
operating. If the fans are running, remove the fan assembly module and inspect the wiring
to the LEDs and fans to ensure that the wires are not nicked or cut.
3. Make sure that two AC PEM or two DC PEM modules are installed in the chassis.
Although only one PEM is required to power the chassis, two PEMs should be installed
for proper airflow. (If one PEM fails, leave the failed module in the chassis until the
replacement module can be installed.)
4. If you use DC PEMs, make sure the wiring is not reversed.
5. Replace the fan assembly module.
SINGLE FAN FAILUR E
LED is lit
MULTI-FAN FAILURE LED
is lit
One fan in the fan assembly module has failed. The fan assembly can cool the chassis
sufficiently with three working fans, but replace the failed fan as soon as possible.
More than one fan has failed, and the fan assembly cannot sufficiently cool the chassis.
Replace the failed fans immediately. If necessary, power down the chassis until replacements
are available.
Fans run but the system
overheats
1. Make sure that all intake and exhaust vents on the front and rear of the chassis are free
of blockages.
Fan Assembly Module Faults
2. Make sure that the ambient temperature and other environmental factors in the system
area are within the ranges specified in the “Displaying System Environment
Information” section on page 1-4.
3. Make sure all line cards and blank faceplates are in place. Make sure two PEM modules
are installed in the chassis. The cooling system cannot operate effectively unless the
chassis is fully enclosed.
4. Check the air filter, and, if necessary, clean or replace it.
5. Reduce the ambient temperature of the area surrounding the Cisco uBR10012 chassis.
This can be done using air conditioning, using fans to circulate the air in the room, and
closing the blinds on any windows that are facing the sun.
This chapter describes how to troubleshoot Performance Routing Engine (PRE-1) modules. It provides
information on troubleshooting PRE-1 fault states, the management Ethernet port, and the serial port.
• Information Required for Troubleshooting PRE-1 Modules, page 3-1
• PRE Module Not Supported, page 3-2
• PRE-1 Module Status Screen, page 3-2
• Booting Up with Redundant PRE-1 Modules, page 3-3
• PRE-1 Module Faults, page 3-4
• Ethernet Connection Problems, page 3-6
• Console Port Serial Connection Problems, page 3-7
• Troubleshooting Common System Problems, page 3-8
Information Required for Troubleshooting PRE-1 Modules
The PRE-1 module is the primary processor for the Cisco uBR10012 router, and any problems with the
PRE-1 module affect all operations. If you suspect a problem with the PRE-1 module, please collect the
following information before proceeding further, to aid in troubleshooting the problem:
Step 1Capture all console logs and system messages.
Step 2Capture the output of the show tech-support command. Registered users on Cisco.com can decode the
output of this command by using the Output Interpreter tool, which is at the following URL:
Step 3Capture the complete bootup sequence, especially if the router is reporting errors at bootup.
Step 4If the router is unresponsive, or if it refuses to boot to the Cisco IOS prompt, reboot the router to the
ROMMON prompt and capture a stack trace, using the stack ROMMON command. For more
information on this procedure, see the Obtaining a Stack Trace from ROM Monitor section in the Troubleshooting Router Hangs document, at the following URL:
The Cisco uBR10012 router supports only the PRE-1 module in Cisco IOS Release 12.2(8)BC1, and
later releases. If you attempt to boot the Cisco uBR10012 router with a PRE module with one of these
software releases, the router prints the following error message and falls through to the ROM monitor:
%%Error: PRE not supported with this image
rommon>
To correct this error, replace the PRE modules in the router with PRE-1 modules. To continue using the
original PRE modules, you must be reload the router with Cisco IOS Release 12.2(4)BC1 or an earlier
12.2 BC release.
NoteFor information on the replacement of PRE modules with PRE-1 modules, see the Field Notice,
Cisco uBR10000 Proactive Upgrade of PRE to PRE1, at the following URL:
http://www.cisco.com/en/US/products/hw/cable/ps2209/products_field_notice09186a00800946c5.sht
ml
Chapter 3 Troubleshooting PRE-1 Modules
PRE-1 Module Status Screen
The PRE-1 module contains a small LED screen that displays the current state of the boot process on the
active and standby PRE-1 modules. Ta b le 3-1 lists each message and its meaning.
Table 3-1LED Messages on the PRE-1 Modules
Message Description
BLDRSTRT The PRE-1 module is starting the boot loader software.
BLDREXCThe boot loader software has begun to execute.
BLDRMEM The boot loader software is initializing the memory on the PRE-1 module.
BLDRFILEThe boot loader software is initializing the router’s file systems.
BLDRDRVRThe boot loader software is initializing the driver subsystems.
BLDRLIBThe boot loader software is initializing the subsystem libraries.
BLDRPROTThe boot loader software is initializing the protocol subsystems.
BLDRMGMTThe boot loader software is initializing the management subsystems.
BLDRINTFThe boot loader software is initializing the router’s interfaces.
BLDRSTBYThe boot loader software is running and the PRE-1 module is running as the
standby PRE-1 module.
LOADIOSThe boot loader software has finished initializing and has begun to load the
Cisco IOS software.
IOS STRTThe PRE-1 module is starting the Cisco IOS software.
IOS EXCThe Cisco IOS software has begun to execute.
IOS MEM The Cisco IOS software is initializing the memory on the PRE-1 module.
IOS FILEThe Cisco IOS software is initializing the router’s file systems.
Table 3-1LED Messages on the PRE-1 Modules (continued)
Message Description
IOS DRVRThe Cisco IOS software is initializing the driver subsystems.
IOS LIBThe Cisco IOS software is initializing the subsystem libraries.
IOS PROTThe Cisco IOS software is initializing the protocol subsystems.
IOS MGMTThe Cisco IOS software is initializing the management subsystems.
IOS INTFThe Cisco IOS software is initializing the router’s interfaces.
IOS CONFThe Cisco IOS software has begun to load the startup configuration file.
IOS RUNThe Cisco IOS software is running and the PRE-1 module is running as the
IOS STBYThe Cisco IOS software is running and the PRE-1 module is running as the
Booting Up with Redundant PRE-1 Modules
active PRE-1 module. This could indicate that the PRE-1 module originally
booted up as the active module, or that a switchover put this module into the
active state.
NoteThis message indicates that the Cisco IOS router is running a Cisco IOS
software image. This is typically the full Cisco IOS image that was
found on a Flash disk or TFTP server. However, if an error occurs during
bootup, this could be the boot Cisco IOS image that is permanently
written in the router’s bootflash and is used when the router cannot boot
the full Cisco IOS image.
standby PRE-1 module. This could indicate that the PRE-1 module originally
booted up as the standby module, or that the PRE-1 module was originally the
active PRE-1module, but that a switchover put it into the standby state.
Booting Up with Redundant PRE-1 Modules
When two PRE-1 modules are installed in the Cisco uBR10012 router, the active PRE-1 module is
whichever module that first loads the Cisco IOS software and asserts control over the shared bus between
the two modules. The other PRE-1 module automatically boots the Cisco IOS software and enters the
standby mode.
Typically, the PRE-1 module in slot A (the left-most PRE-1 module slot as you face the chassis) boots
the Cisco IOS software more quickly than the PRE-1 module in slot B (the PRE-1 slot on the right). This
is because the PRE-1 module in slot B adds a slight delay in its bootup sequence, so as to allow the
module in slot A to boot first.
However, the selection of the active PRE-1 module does not affect the operations of the Cisco uBR10012
router. The router can operate normally with either the slot A or the slot B PRE-1 module acting as the
active PRE-1 module.
If you notice that the slot B PRE-1 module is always becoming the active PRE-1 module, and you would
like the slot A PRE-1 module to become the active PRE-1 module, check for the following:
• Check to see if the slot A PRE-1 module is booting Cisco IOS software from a Flash Disk in slot0
or slot1, which indicates it is using an old-style 16 or 20 MB PCMCIA card. These Flash Disk
memory cards operate more slowly than the new ATA-style 48 MB, 64 MB, or 128 MB Flash Disk
cards. If possible, boot the PRE-1 module using an ATA-style card in disk0 or disk1.
• If using an ATA-style Flash Disk is not possible, consider booting the Cisco IOS software image
• Verify that both PRE-1 modules are booting the same version of Cisco IOS software. Slight
variations in the loading of different images could allow the slot B PRE-1 module to boot first.
PRE-1 Module Faults
The PRE-1 module provides the IP routing and forwarding functionality in the Cisco uBR10012 router.
Thus, in a non-redundant PRE-1 configuration, a PRE-1 failure is a system failure. A redundant PRE-1
configuration is recommended because it allows the redundant PRE-1 module to automatically assume
full functionality upon failure of the primary PRE-1 module.
If the PRE-1 module fails, the yellow PRE-1 STATUS LED lights. If this occurs, try the following steps:
• Reboot the Cisco uBR10012 router
• Move the PRE-1 module to the other PRE-1 module slot
• Replace the PRE-1 module with a spare module
In addition, you should capture any error messages that appear on the console, as well as the state of the
PRE-1 LEDs and alphanumeric display. Then contact the Cisco Technical Assistance Center (TAC).
Figure 3-1 describes the LED indicators on the PRE-1 faceplate. Use these descriptions to verify the
Table 3-2PRE-1 Module Fault Indications and Recommended Action
Chapter 3 Troubleshooting PRE-1 Modules
The PRE-1 initializes, but you cannot
establish a console connection
1. Ensure that the terminal settings are properly set.
2. If you still cannot connect, check the console cable. Is it firmly connected? Is it
the correct type of cable with proper connectors?
3. If the cable checks out and you cannot establish a console or Telnet session,
reinsert the PRE-1 module. If the problem persists, replace the PRE-1 module.
4. Enter show log to review console messages recorded in the system log.
Card cannot be fully inserted into its
Make sure that you are using the correct slot (A or B) for the PRE-1 module.
slot
An alarm LED is lit
1. Enter the show facility-alarm status command and examine the output to
determine which system component raised the alarm.
2. Troubleshoot using a procedure appropriate to the module or FRU responsible for
the alarm.
Ethernet Connection Problems
If the management Fast Ethernet interface (F0/0/0) on the PRE-1 fails to work properly, and the
corresponding Link LED is not lit (steady green):
• Visually check that an Ethernet cable is connected to the correct Ethernet port on the
Cisco uBR10012 router.
• Verify that you are using the correct type of cable for a 100BaseT Ethernet.
• Check to see if the cable is bad or broken.
• Make sure the primary PRE-1 module booted up properly by checking the Status LED on its
faceplate. This LED on the primary PRE-1 module should be steady green. If a redundant PRE-1
module is installed, its STATUS LED should be flashing green. If this is not the case with either
PRE-1 module, remove and reinsert the module and boot it up again.
NoteThe show interface command also shows that there is an Ethernet interface (E0/0/0) on the PRE-1
module, but this is an internal interface that the router uses to communicate between PRE-1 modules and
line cards. This Ethernet interface is not configurable and can be used only by the router’s internal
subsystems.
If the Link LED is lit (steady green), but the Ethernet port is not working properly, make sure that the
port in question is configured properly and is not administratively shut down. If you have a working
console connection, perform the following steps:
Step 1At the switch prompt, enter show interface fastethernet0/0/0. If the port is administratively down, enter
these commands to enable it:
c10000# configure terminal
Enter configuration commands, one per line. End with CNTL/Z.
c10000(config)#interface fastethernet0/0/0
c10000(config-if)# no shut
c10000(config-if)# exit
c10000(config)# exit
c10000#
Step 2Check that the Ethernet port in question is assigned a valid IP address.
For more information about configuring Ethernet ports, refer to the Cisco uBR10012 Universal
Broadband Router Software Configuration Guide.
If the cable, connections, power, and configuration all check out, and you still cannot connect to the
Ethernet port on the module, replace the module in question. If the problem persists, contact the Cisco
TAC for further assistance. Refer to the “Obtaining Technical Assistance” section on page x for
instructions on contacting the Cisco TAC.
Console Port Serial Connection Problems
If the console screen connected to a Cisco uBR10012 console port appears frozen or fails to work
properly, check the following steps:
Step 1Refer to the “Cisco uBR10012 System Startup Sequence” section on page 1-5. If the display stops
responding during this process, there is no console output.
Console Port Serial Connection Problems
Step 2Check the console cable and make sure it is properly connected to the console port on the active PRE-1
module at one end and to your terminal equipment or terminal server at the other end.
NoteYou cannot connect to the console port on the standby PRE-1 module. You must connect to
the console port on the currently active PRE-1 module. If a switchover occurs, you must
switch the serial cable to the new active PRE-1 module to maintain the console connection.
Step 3Verify that you are using the right type of cable and adapter. For information about pin-out connections
and installation instructions, refer to the Cisco uBR10012 Universal Broadband Router Hardware
Installation Guide.
Step 4Make sure the cable is not defective or broken. Replace the cable with another high quality cable if
possible, and check to see if the console port starts working.
Step 5Check that the terminal equipment is configured with the correct settings for the console port. The
default console port settings are:
• 9600 baud
• 8 data bits
• 1 stop bit
• No parity
• No flow control
Step 6Check the LEDs on the PRE-1 faceplate to make sure it has powered up properly. If necessary, remove
and reinsert both PRE-1 modules to power them up again. Also, make sure the terminal equipment is
working properly.
Step 7The console can appear frozen if the PRE-1 processor is busy performing other tasks, such as parsing a
large configuration file or passing a large burst of traffic. These periods are usually only temporary, and
normal reaction resumes after a few moments.
Step 8The console can be frozen if the PRE-1 process is generating a large volume of debug messages. If this
is the case, hit the return key a couple of times and type no debug all to attempt to turn off the debug
messages. This will not work if the router is in global configuration mode, but try typing do no debug all to execute this EXEC mode command in global configuration mode.
If the cable, connections, power, and terminal settings all check out and you still cannot connect to the
console port on the module, replace the module in question. If the problem persists, contact the Cisco
TAC for further assistance.
Troubleshooting Common System Problems
This section describes how to troubleshoot the following common system problems on the
Cisco uBR10012 router:
• Troubleshooting System Crashes, page 3-8
• High CPU Utilization Problems, page 3-9
• Bus Errors, page 3-13
Chapter 3 Troubleshooting PRE-1 Modules
• Memory Problems, page 3-15
Troubleshooting System Crashes
System crashes occur when the router experiences an unexpected situation from which it cannot recover.
In response, the router stops all processes and reloads. Crashes can result from either hardware or
software problems.
When the router crashes, it is extremely important to gather as much information as possible about the
crash before doing a manual reload or power-cycling the router. All information about the crash, except
that which has been stored in the crashinfo file, is lost after a manual reload or power-cycle.
In particular, use the following commands to gather more information about the crash:
• All console, system, and message logs.
• Crashinfo file, if one was generated at the time of the crash.
• All output from the following commands:
–
show version
–
show context
–
show stacks
–
show tech-support
3-8
NoteRegistered Cisco.com users can decode the output of these show commands by using the Output
The PRE-1 module can experience high CPU utilization, where the CPU processor approaches 100%
usage for extended periods of time, for several reasons. See the following sections for possible causes
and solutions.
• ARP Traffic, page 3-9
• CPUHOG Errors, page 3-11
Troubleshooting Common System Problems
ARP Traffic
• Debug and System Messages, page 3-11
• Exec and Virtual Exec Processes, page 3-11
• Interrupts are Consuming a Large Amount of Resources, page 3-12
High volumes of Address Resolution Protocol (ARP) requests and responses can occupy a significant
portion of the CPU time, because the router cannot use fast-switching to process ARP packets, but must
instead forward them to the route processor (RP). Because of this, processing a large volume of ARP
traffic can also prevent the router from handling normal traffic.
Theft-of-service and denial-of-service (DNS) attacks also often generate a large number of ARP packets
on the network. Many viruses also use ARP requests to discover computers that might be vulnerable to
attack, and if these computers become infected, they are used to propagate the virus, generating even
more ARP traffic on the network.
OL-1237-01
ARP requests are broadcast packets, so they are broadcast to all devices on that particular network
segment. In some cases, a router can also forward ARP broadcasts to an ARP proxy for further
processing. Some low-end routers commonly used by subscribers for home networks can also incorrectly
respond to all ARP requests, which generates even more traffic.
In addition, the Cisco CMTS router automatically monitors ARP traffic and enters the IP addresses
found in ARP requests into its own ARP table, in the expectation that a device will eventually be found
with that IP address. Unacknowledged IP addresses remain in the router’s ARP table for 60 seconds,
which means that a large volume of ARP traffic can fill the router’s ARP table.
If ARP traffic is excessive, you can try the following ways to limit this traffic:
Step 1Disable the forwarding of ARP requests on a cable interface by using the no cable arp command in
interface configuration mode.
Step 2Disable the use of proxy-ARP on a cable interface by using the no cable proxy-arp command in
interface configuration mode.
NoteUsing the no cable arp and no cable proxy-arp commands shifts all responsibility for the
management of the IP addresses used by CMs and CPE devices to the DHCP server and
provisioning system.
Chapter 3 Troubleshooting PRE-1 Modules
Another approach would be to identify the cable modems and customer premises equipment (CPE) that
are generating the ARP traffic. A simple way of doing this is by using an access list to log requests for
an unassigned IP address in the subnet being used on a cable interface.
Step 1Reserve at least one IP address on each cable interface’s subnet and ensure that it is not being assigned
to any cable modems or CPE devices. For example, if a cable interface is using the subnet
192.168.100.0/24, you could choose to reserve IP address 192.168.100.253 for this purpose. Ensure that
the IP addresses you have chosen are not assigned to devices by your provisioning system.
Step 2If you currently have an access list applied to the cable interface, add a line that logs requests for this
particular IP address. If you are not currently using an access list on the cable interface, create one for
this purpose. In both cases, the relevant line would be:
Router(config)# access-list
number
permit ip any host 192.168.100.253 log
where number is the number for the access-list. Change the IP address to whatever address you have
selected to be reserved for this cable interface.
NoteIf you are creating a new access list, ensure that the last line of the list is access-list number
permit ip any any. Otherwise, all other traffic will be blocked on the interface.
Step 3Apply the access list to the cable interface using the ip access-group command:
Router(config-if)# ip access-group
Step 4After applying the access list, regularly examine the message log to find the devices that are attempting
number
in
to access the reserved IP address. If a cable modem or CPE device is repeatedly sending ARP requests
or replies for this IP address, it could be part of a virus or theft-of-service attack, or it could indicate a
cable modem with defective software.
3-10
Step 5After identifying these devices, you can further investigate the matter, and if necessary, block these
The router displays a %SYS-3-CPUHOG error message when a process is using an excessive amount of
processor cycles. For example, using the logging buffered command to allocate a significant amount of
memory (for example, 200 MB) for log buffers could generate a %SYS-3-CPUHOG message, because
allocating such an amount of memory requires a large amount of processor time.
For more information on what could cause this problem and how to resolve it, see the document What Causes %SYS-3-CPUHOG Messages, at the following URL:
http://www.cisco.com/en/US/products/sw/iosswrel/ps1831/products_tech_note09186a00800a6ac4.sht
ml
Debug and System Messages
A large volume of debugging messages or system messages can take a significant amount of processor
time, because the PRE-1 module must spend a significant amount of time displaying these messages on
the console port. In particular, this can happen when using the verbose or detail mode of a debug
command, or if the debug command is dumping the contents of packets or packet buffers.
Use the following techniques to reduce the number of these messages:
Troubleshooting Common System Problems
1. Turn off the debugging messages by entering the no debug all command in privileged EXEC mode:
Router# no debug all
All possible debugging has been turned off
Router#
2. Disable console messages by using the no logging console command in global configuration mode:
Router# configure terminal
Router(config)# no logging console
Router(config)#
To keep the logging of console messages, but to limit the number of messages that can be displayed,
use the logging rate-limit command. You can rate-limit all messages (including debug messages),
or just the console messages, using one of the following commands:
Router(config)# logging rate-limit console
Router(config)# logging rate-limit all
3. If you have logged into the router using a Telnet connection, you can disable debug messages using
the terminal default monitor command in privileged EXEC mode:
Router# terminal default monitor
Router#
Exec and Virtual Exec Processes
number-of-messages-per-second
number-of-messages-per-second
OL-1237-01
The Exec process is the Cisco IOS process that handles the TTY serial lines (console, auxiliary,
asynchronous), and the Virtual Exec process handles the Virtual TTY (VTY) Telnet sessions. These
processes run as mid-level processes, so if either one is exceptionally busy, it could generate a high CPU
usage level.
For information on resolving problems with high CPU usage caused by the Exec and Virtual EXEC
processes, see the document High CPU Utilization in Exec and Virtual Exec Processes, at the following
URL:
Interrupts are Consuming a Large Amount of Resources
Interrupts allow software processes to request resources when needed, as opposed to waiting for time to
be allocated to the process. If a process requests too many interrupts, however, it could impact CPU
usage, resulting in less time available to other processes.
For more information, see the document Troubleshooting High CPU Utilization Due to Interrupts, at the
following URL:
The scheduler allocate command guarantees the minimum amount of time that can be allocated for
fast-switching during each network interrupt context, and the minimum amount of time that can be
allocated for non-interrupt-driven processes. An incorrect configuration for the scheduler allocate
command can cause high CPU usage, especially when too much time is allocated for non-interrupt
processes. This could result in messages such as
was received from the PRE
We recommend using the default configuration, which can be restored by giving the default scheduler
allocate command in global configuration mode:
The Cisco IOS software uses a process named IP input to process IP packets that cannot be processed
using the fast-switching process. If the router is process-switching a lot of IP traffic, it could result in
excessively high CPU usage.
The most common reasons for excessive IP Input processing are that fast-switching has been disabled
on one or more interfaces, and that the router is receiving a large volume of traffic that must be
process-switched. For more information on resolving problems with the IP Input process, see the
Troubleshooting High CPU Utilization in IP Input Process document at the following URL:
One or More Processes is Consuming an Excessive Amount of Resources
High CPU usage could occur if one or more processes is consuming an excessive amount of resources.
For example, the router might have an excessive number of TCP connections open, or the TTY
background process is busy displaying logging or debugging messages.
The PRE-1 module could experience high CPU usage if the router has been configured with an access
list (ACL) that is too complex or inefficiently written. Access lists are processed for top-down, starting
with the first entry in the list and continuing through each entry until a match is found. The router can
easily reach high CPU usage if it has to process dozens or hundreds of ACL entries for each packet it
receives or transmits.
To resolve the problem, reorganize the list so that the most frequently matched entries are listed first.
Also examine the list to see if multiple statements can be consolidated into a single entry. For example,
instead of listing multiple addresses on the same subnet, use one entry with a wildcard mask that matches
all of the individual addresses.
Consider using the Turbo ACL feature, which compiles the access lists so that they can be searched more
efficiently. Enable the use of Turbo ACLs by giving the access-list compiled command in global
configuration mode.
Troubleshooting Common System Problems
SNMP Traffic
Bus Errors
For more information on access lists, see the Configuring IP Services chapter in the IP Addressing and
Services section of the Cisco IOS IP Configuration Guide, Release 12.2, at the following URL:
TipIf you are using Ciscoworks to manage your network, consider using the Ciscoworks Access
Control List Manager to manage access lists.
High volumes of Simple Network Management Protocol (SNMP) traffic can occupy a significant portion
of the CPU time, as the processor receives SNMP requests and sets the appropriate attributes on the
router, or sends the appropriate information back to the SNMP manager. For information on controlling
SNMP traffic, see the Application Note, IP Simple Network Management Protocol (SNMP) Causes High CPU Utilization, at the following URL:
Bus errors occur when the router tries to access a memory location that either does not exist (which
indicates a software error) or that does not respond (which indicates a hardware error). Bus errors
generated by the PRE-1 module typically cause a crash and force the router to reload.
Use the following procedure to determine the cause of a bus error and to resolve the problem. Perform
these steps as soon as possible after the bus error, before manually reloading or power cycling the router.
OL-1237-01
Step 1Use the show version command to display the reason for the last system reload:
Cisco Internetwork Operating System Software
IOS (tm) 10000 Software (UBR10K-P6-M), Experimental Version 12.2(20031215:22350]
Copyright (c) 1986-2003 by cisco Systems, Inc.
Compiled Mon 15-Dec-03 17:28 by mnagai
Image text-base: 0x60008968, data-base: 0x61B80000
ROM: System Bootstrap, Version 12.0(9r)SL2, RELEASE SOFTWARE (fc1)
BOOTLDR: 10000 Software (C10K-EBOOT-M), Version 12.0(17)ST, EARLY DEPLOYMENT RE)
ubr10k uptime is 6 days, 18 hours, 59 minutes
System returned to ROM by bus error at PC 0x0, address 0x0 at 04:15:55 UTC Thu Dec 11 2003
System restarted at 04:18:56 UTC Thu Dec 11 2003
...
Router#
Step 2Determine whether the memory address for the bus error is a valid address. If the address is valid, the
problem is most likely a hardware problem. If the address is an invalid address (such as the above
example of 0x0), the problem is software-related.
Step 3If the problem is hardware-related, you can map the memory address to a particular hardware component
by using the show region command.
Router# show region
Chapter 3 Troubleshooting PRE-1 Modules
Region Manager:
Start End Size(b) Class Media Name
0x0A000000 0x0FFFFFFF 100663296 Iomem R/W iomem
0x2A000000 0x2FFFFFFF 100663296 Iomem R/W iomem:(iomem_cwt)
0x60000000 0x69FFFFFF 167772160 Local R/W main
0x60008968 0x61B7FFFF 28800664 IText R/O main:text
0x61B80000 0x61CC1ADF 1317600 IData R/W main:data
0x61CC1AE0 0x627663BF 11159776 IBss R/W main:bss
0x627663C0 0x69FFFFFF 126458944 Local R/W main:heap
0x70000000 0x7FFFFFFB 268435452 Local R/W heap2
0x80000000 0x89FFFFFF 167772160 Local R/W main:(main_k0)
0xA0000000 0xA9FFFFFF 167772160 Local R/W main:(main_k1)
Router#
Step 4When you have identified the hardware that is generating the bus error, try removing and reinserting the
hardware into the chassis. If this does not correct the problem, replace the DRAM chips on the hardware.
If the problem persists, replace the hardware.
Step 5If the problem is software-related, verify that you are running a released version of software, and that
this release of software supports all of the hardware that is installed in the router. If necessary, upgrade
the router to the latest version of software.
Step 6To further troubleshoot the problem, registered users on Cisco.com can also decode the output of
multiple show commands by using the Output Interpreter tool, which is at the following URL:
TipThe most effective way of using the Output Interpreter tool is to capture the output of the
show stacks and show tech-support commands and upload the output into the tool. If the
problem appears related to a line card, you can also try decoding the show context command.
For more information on troubleshooting bus errors, see the Troubleshooting Bus Error Crashes
document, at the following URL:
http://www.cisco.com/en/US/products/sw/iosswrel/ps1831/products_tech_note09186a00800cdd51.sht
ml
Memory Problems
This section describes the following types of memory problems:
• Alignment Errors, page 3-15
• Low Memory Errors, page 3-16
Troubleshooting Common System Problems
• Memory Parity Errors, page 3-16
• Particle Pool Fallbacks, page 3-17
• Spurious Interrupts, page 3-18
• Spurious Memory Accesses, page 3-19
Alignment Errors
Alignment errors occur when the software attempts to read or write data using a data size that is not
aligned with the memory address being used. For example, an alignment error occurs when attempting
to read two bytes from a memory address that is not an even multiple of two bytes.
Alignment errors are always caused by a software bug, and can be either correctable or fatal. See the
following sections for more information. Also see the document Troubleshooting Spurious Accesses, Alignment Errors, and Spurious Interrupts, at the following URL:
http://www.cisco.com/en/US/products/sw/iosswrel/ps1828/products_tech_note09186a00800a65d1.sht
ml
Correctable Alignment Errors
The Cisco IOS software can automatically correct most alignment errors. When it does so, the router
generates a system error message similar to the following:
%ALIGN-3-CORRECT: Alignment correction made at 0x60262478 reading/writing 0x60A9FF5C
OL-1237-01
Occasional alignment errors do not necessarily require operator intervention, because the Cisco IOS
software can correct these errors and continue with normal operations. However, correcting alignment
errors consumes processor resources and could impact performance if the errors continuously repeat.
If the alignment error was a fatal error, it displays a message similar to the following:
%ALIGN-1-FATAL: Corrupted program counter error.
ERROR: Slot 0, NPE300/IOFE2/VXR, CACHE, External Data Cache Memory Test:
Fatal alignment errors are most likely a hardware fault on the processor card. The card itself could be
faulty, or the memory on the card could be faulty. Try replacing the processor card and rebooting the
router. If a replacement card is not available, try replacing the memory on the processor card, making
sure that the new memory meets the specifications that are required by the card.
Low Memory Errors
The router can experience low memory errors for a number of reasons, including the following possible
causes:
• The router is handling an excessively large volume of traffic. In particular, the router could be
experiencing a large volume of traffic that requires special handling, such as ARP requests.
• Abnormal processes are using excessive amounts of memory.
Chapter 3 Troubleshooting PRE-1 Modules
*** Data Expected= 0x99999999 ***
• Large amounts of memory are still allocated to dead processes.
• Software errors could have resulted in memory leaks.
• Hardware problems with the memory on the processor card or line card.
• Hardware problems on the processor card or line card.
Low memory problems are usually indicated by one or more system messages (for example,
SYS-2-MALLOCFAIL). For troubleshooting steps to resolve problems with low memory, see the Tech
Note titled Troubleshooting Memory Problems, at the following URL:
A memory parity error means that one or more bits at a memory location were unexpectedly changed
after they were originally written. This error could indicate a potential problem with the Dynamic
Random Access Memory (DRAM) that is onboard the PRE-1 module.
Parity errors are not expected during normal operations and might force the router to reload. If the router
did reload because of a parity error, the show version command displays a message such as “System
restarted by processor memory parity error” or “System restarted by shared memory parity error.” For
example:
Router# show version
Cisco Internetwork Operating System Software
IOS (tm) 10000 Software (UBR10K-P6-M), Experimental Version 12.2(20031215:22350]
Copyright (c) 1986-2003 by cisco Systems, Inc.
Compiled Mon 15-Dec-03 17:28 by mnagai
Image text-base: 0x60008968, data-base: 0x61B80000
3-16
ROM: System Bootstrap, Version 12.0(9r)SL2, RELEASE SOFTWARE (fc1)
BOOTLDR: 10000 Software (C10K-EBOOT-M), Version 12.0(17)ST, EARLY DEPLOYMENT RE)
System returned to ROM by processor memory parity error at PC 0x60301298, address 0x0 at
17:19:47 PDT Mon Dec 15 2003
System restarted at 17:19:47 PDT Mon Dec 15 2003
...
Router#
Parity errors can be categorized in two different ways:
• Soft parity errors occur when an energy level within the DRAM memory changes a bit from a one
to a zero, or a zero to a one. Soft errors are rare and are most often the result of normal background
radiation. When the CPU detects a soft parity error, it attempts to recover by restarting the affected
subsystem, if possible. If the error is in a portion of memory that is not recoverable, it could cause
the system to crash. Although soft parity errors can cause a system crash, you do not need to swap
the board or any of the components, because the problem is not defective hardware.
• Hard parity errors occur when a hardware defect in the DRAM or processor board causes data to be
repeatedly corrupted at the same address. In general, a hard parity error occurs when more than one
parity error in a particular memory region occurs in a relatively short period of time (several weeks
to months).
When parity occurs, take the following steps to resolve the problem:
Troubleshooting Common System Problems
Step 1Determine whether this is a soft parity error or a hard parity error. Soft parity errors are 10 to 100 times
more frequent than hard parity errors. Therefore, wait for a second parity error before taking any action.
Monitor the router for several weeks after the first incident, and if the problem reoccurs, assume that the
problem is a hard parity error and proceed to the next step.
Step 2When a hard parity error occurs (two or more parity errors at the same memory location), try removing
and reinserting the PRE-1 module, making sure to fully insert the card and to securely tighten the
restraining screws on the front panel.
Step 3If this does not resolve the problem, remove and reseat the DRAM chips. If the problem continues,
replace the DRAM chips.
Step 4If parity errors occur, the problem is either with the PRE-1 module or the router chassis. Replace the
PRE-1 module.
Step 5If the problems continue, contact Cisco TAC for further instructions.
For more information about parity errors, see the Processor Memory Parity Errors document, at the
following URL:
Private particle pools are buffers in I/O memory that store packets while they are being processed. The
Cisco IOS software allocates a fixed number of private particle pools during system initialization, and
these buffers are reserved for packet use, so as to minimize system contention for memory resources.
OL-1237-01
The system uses buffer control structures called “rings” to manage the entries in the particle pools. Each
ring is a circular linked-list of pointers to each packet in the particle pool. The system creates a pair of
rings for each interface, with one ring for packets waiting to be transmitted and another ring for packets
that are being received.
The system also allocates public pools in a number of different sizes for more general use. If a packet
requires special handling, or if a packet cannot be completely processed at interrupt time, the system
copies the packet into a portion of contiguous memory in the public pool, so it can be processed
switched.
TipUse the show buffers command to display the current status of the router’s particle pools.
Fallbacks with particle pools occur when bursts of traffic produce more packets than would fit in the
available buffer space. When an interface runs out of space in the private particle pools, it falls back to
using the normal public memory. Fallbacks are expected during periods of bursty traffic, and the router
should be considered to be operating normally in these situations.
If fallbacks occur more frequently, however, it could indicate a problem. In particular, if the private
particle pools are consistently producing fallbacks, it could result in the router using excessive amounts
of public memory for packet processing, reducing the resources that are available to the other router
processes. If this is the case, look for the following possible causes.
• Extremely fast interfaces are handling large volumes of traffic with a high rate of throughput that is
approaching the maximum rate on the interface.
• The Fast Ethernet interfaces on the processor card could be heavily loaded.
Chapter 3 Troubleshooting PRE-1 Modules
For more information on resolving problems with particle pool buffers, see the document Buffer Tuning,
at the following URL:
A spurious interrupt occurs when the Cisco IOS software generates an unnecessary interrupt for packet
that has been processed already. This is a software error that is usually caused by an improper
initialization of interrupt handling routines, or by a race condition where two processes compete to
handle the same process.
Spurious interrupts can occasionally be expected during normal operations, and the occasional spurious
interrupt has no discernible impact on the router’s performance. However, action might be needed if the
number of spurious interrupts is high or increasing, and performance is being degraded, with packets
being dropped.
For information on resolving the problem with spurious interrupts, see the document Troubleshooting Spurious Accesses, Alignment Errors, and Spurious Interrupts, at the following URL:
• The Cisco IOS software has a memory leak that is not releasing the memory in the private particle
pool after the interface has finished processing a packet.
3-18
http://www.cisco.com/en/US/products/sw/iosswrel/ps1828/products_tech_note09186a00800a65d1.sht
ml
A spurious memory access occurs when a Cisco IOS software process attempts to access memory in the
lowest 16 KB region of memory, which is a restricted location. Typically, such errors display a system
error message similar to the following:
%ALIGN-3-SPURIOUS: Spurious memory access made at 0x60968C44 reading 0x0
%ALIGN-3-TRACE: -Traceback= 60968C44 60269808 602389D8 00000000 00000000 00000000
00000000 00000000
Where possible, the Cisco IOS software handles spurious memory accesses by returning a value of zero
to the calling routine, and then displaying the above error message. If this is not possible, the router
crashes with a Segment Violation (SegV) error. In either case, the cause of the error is almost always a
bug in the Cisco IOS software.
If possible, upgrade to the latest release of the Cisco IOS software. If the bug still exists on the router,
see the section Spurious Accesses in the document Troubleshooting Spurious Accesses, Alignment Errors, and Spurious Interrupts, at the following URL:
http://www.cisco.com/en/US/products/sw/iosswrel/ps1828/products_tech_note09186a00800a65d1.sht
ml
General Information for Troubleshooting Line Card Crashes
General Information for Troubleshooting Line Card Crashes
Line card crashes occur when the hardware or software encounter unexpected situations that are not
expected in the current design. As a general rule, they usually indicate a configuration error, a software
error, or a hardware problem.
Table 4-1 lists the show commands that are most useful in collecting information to troubleshoot line
card crashes.
Table 4-1Relevant Show Commands for Line Card Crashes
CommandDescription
show versionProvides general information about the system's hardware and software
configurations
show logging Displays the general logs of the router
show diag [slot/subslot]Provides specific information about a particular slot: type of engine,
hardware revision, firmware revision, memory configuration, and so on.
show context [summary |
slot [slot/subslot] ]
Provides context information about the most recent crashes. This is
often the most useful command for troubleshooting line card crashes.
Use the following procedure if you suspect that a line card has crashed.
Step 1If you can identify the particular card that has crashed or is experiencing problems, first use the other
sections in this chapter to perform basic troubleshooting. In particular, ensure that the line card is fully
inserted into the proper slot, and that all cables are properly connected.
Step 2If any system messages were displayed on the console or in the SYSLOG logs at the time of the crash,
consult the Cisco CMTS System Messages guide and the Cisco IOS System Messages Guide for possible
suggestions on the source of the problem.
Step 3Line cards can crash or appear to crash when an excessive number of debug messages are being
generated. In particular, this can happen when using the verbose or detail mode of a debug command,
or if the debug command is dumping the contents of packets or packet buffers. If the console contains a
large volume of debug output, turn off all debugging with the no debug all command.
Step 4If the system message log contains messages that indicate the line card is not responding (for example,
%IPCOIR-3-TIMEOUT), and the card’s LEDs are not lit, the line card might have shut down because of
overheating. Ensure that all chassis slots either have the proper card or module installed in them. If a slot
is blank, ensure that the slot has a blank front panel installed, so that proper airflow and cooling can be
maintained in the chassis.
Step 5Use the show context summary command to identify all of the line cards that have experienced a crash:
General Information for Troubleshooting Line Card Crashes
Ta b l e 4 - 2SI G Va l u e Ty p e s
SIG ValueSIG NameError Reason
22SIGERRORFatal hardware error
23SIGRELOADSoftware-forced crash
Step 8The vast majority of line card crashes are either Cache Parity Exception (SIG type=20), Bus Error
Exception (SIG type=10), and Software-forced Crashes (SIG type=23). Use the following sections to
further troubleshoot these problems:
• Cache Parity Errors, page 4-4
• Bus Errors, page 4-5
• Software-Forced Crashes, page 4-6
If the line card crashed for some other reason, capture the output of the show tech-support command.
Registered Cisco.com users can decode the output of this command by using the Output Interpreter tool,
which is at the following URL:
Step 9If you cannot resolve the problem using the information from the Output Interpreter, collect the
following information and contact Cisco TAC:
• All relevant information about the problem that you have available, including any troubleshooting
you have performed.
• Any console output that was generated at the time of the problem.
• Output of the show tech-support command.
• Output of the show log command (or the log that was captured by your SYSLOG server, if
available).
For information on contacting TAC and opening a case, see the “Obtaining Technical Assistance” section
on page x.
Cache Parity Errors
A cache parity error (SIG type is 20) means that one or more bits at a memory location were
unexpectedly changed after they were originally written. This error could indicate a potential problem
with the Dynamic Random Access Memory (DRAM) that is onboard the line card.
Parity errors are not expected during normal operations and could force the line card to crash or reload.
These memory errors can be categorized in two different ways:
• Soft parity errors occur when an energy level within the DRAM memory changes a bit from a one
to a zero, or a zero to a one. Soft errors are rare and are most often the result of normal background
radiation. When the CPU detects a soft parity error, it attempts to recover by restarting the affected
subsystem, if possible. If the error is in a portion of memory that is not recoverable, it could cause
the system to crash. Although soft parity errors can cause a system crash, you do not need to swap
the board or any of the components, because the problem is not defective hardware.
• Hard parity errors occur when a hardware defect in the DRAM or processor board causes data to be
repeatedly corrupted at the same address. In general, a hard parity error occurs when more than one
parity error in a particular memory region occurs in a relatively short period of time (several weeks
to months).
When parity occurs, take the following steps to resolve the problem:
Step 1Determine whether this is a soft parity error or a hard parity error. Soft parity errors are 10 to 100 times
more frequent than hard parity errors. Therefore, wait for a second parity error before taking any action.
Monitor the router for several weeks after the first incident, and if the problem reoccurs, assume that the
problem is a hard parity error and proceed to the next step.
Step 2When a hard parity error occurs (two or more parity errors at the same memory location), try removing
and reinserting the line card, making sure to fully insert the card and to securely tighten the restraining
screws on the front panel.
Step 3If this does not resolve the problem, remove and reseat the DRAM chips. If the problem continues,
replace the DRAM chips.
Step 4If parity errors occur, the problem is either with the line card or the router chassis. Try removing the line
card and reinserting it. If the problem persists, try removing the line card from its current slot and
reinserting it in another slot, if one is available. If that does not fix the problem, replace the line card.
General Information for Troubleshooting Line Card Crashes
Step 5If the problems continue, collect the following information and contact Cisco TAC:
Bus Errors
• All relevant information about the problem that you have available, including any troubleshooting
you have performed.
• Any console output that was generated at the time of the problem.
• Output of the show tech-support command.
• Output of the show log command (or the log that was captured by your SYSLOG server, if
available).
For information on contacting TAC and opening a case, see the “Obtaining Technical Assistance” section
on page x.
Bus errors (SIG type is 10) occur when the line card tries to access a memory location that either does
not exist (which indicates a software error) or that does not respond (which indicates a hardware error).
Use the following procedure to determine the cause of a bus error and to resolve the problem.
Perform these steps as soon as possible after the bus error. In particular, perform these steps before
manually reloading or power cycling the router, or before performing an Online Insertion/Removal
(OIR) of the line card, because doing so eliminates much of the information that is useful in debugging
line card crashes.
OL-1237-01
Step 1Capture the output of the show stacks, show context, and show tech-support commands. Registered
Cisco.com users can decode the output of this command by using the Output Interpreter tool, which is
at the following URL:
General Information for Troubleshooting Line Card Crashes
Step 2If the results from the Output Interpreter indicate a hardware-related problem, try removing and
reinserting the hardware into the chassis. If this does not correct the problem, replace the DRAM chips
on the hardware. If the problem persists, replace the hardware.
Step 3If the problem appears software-related, verify that you are running a released version of software, and
that this release of software supports all of the hardware that is installed in the router. If necessary,
upgrade the router to the latest version of software.
TipThe most effective way of using the Output Interpreter tool is to capture the output of the
show stacks and show tech-support commands and upload the output into the tool. If the
problem appears related to a line card, you can also try decoding the show context command.
Upgrading to the latest version of the Cisco IOS software eliminates all fixed bugs that can cause line
card bus errors. If the crash is still present after the upgrade, collect the relevant information from the
above troubleshooting, as well as any information about recent network changes, and contact Cisco TAC.
Software-Forced Crashes
Chapter 4 Troubleshooting Line Cards
Software-forced crashes (SIG type is 23) occur when the Cisco IOS software encounters a problem with
the line card and determines that it can no longer continue, so it forces the line card to crash. The original
problem could be either hardware-based or software-based.
The most common reason for a software-forced crash on a line card is a “Fabric Ping Timeout,” which
occurs when the PRE-1 module sends five keepalive messages (fabric pings) to the line card and does
not receive a reply. If this occurs, you should see error messages similar to the following in the router’s
console log:
%GRP-3-FABRIC_UNI: Unicast send timed out (4)
%GRP-3-COREDUMP: Core dump incident on slot 4, error: Fabric ping failure
Fabric ping timeouts are usually caused by one of the following problems:
• High CPU Utilization—Either the PRE-1 module or line card is experiencing high CPU utilization.
The PRE-1 module or line card could be so busy that either the ping request or ping reply message
was dropped. Use the show processes cpu command to determine whether CPU usage is
exceptionally high (at 95 percent or more). If so, see the “High CPU Utilization Problems” section
on page 3-9 for information on troubleshooting the problem.
• CEF-Related Problems—If the crash is accompanied by system messages that begin with “%FIB,”
it could indicate a problem with Cisco-Express Forwarding (CEF) on one of the line card’s
interfaces. For more information, see Troubleshooting CEF-Related Error Messages, at the
following URL:
http://www.cisco.com/en/US/products/hw/routers/ps359/products_tech_note09186a0080110d68.s
html
4-6
• IPC Timeout—The InterProcess Communication (IPC) message that carried the original ping
request or the ping reply was lost. This could be caused by a software bug that is disabling interrupts
for an excessive period of time, high CPU usage on the PRE-1 module, or by excessive traffic on the
line card that is filling up all available IPC buffers.
If the router is not running the most current Cisco IOS software, upgrade the router to the latest
software release, so that any known IPC bugs are fixed. If the show processes cpu shows that CPU
usage is exceptionally high (at 95 percent or more), or if traffic on the line card is excessive, see the
“High CPU Utilization Problems” section on page 3-9.
If the crash is accompanied by %IPC-3-NOBUFF messages, see Troubleshooting IPC-3-NOBUFF
Messages on the Cisco 12000, 10000 and 7500 Series, at the following URL:
http://www.cisco.com/en/US/products/hw/routers/ps133/products_tech_note09186a00800945a1.s
html
• Hardware Problem—The card might not be fully inserted into its slot, or the card hardware itself
could have failed. In particular, if the problem began occurring after the card was moved or after a
power outage, the card could have been damaged by static electricity or a power surge. Only a small
number of fabric ping timeouts are caused by hardware failures, so check for the following before
replacing the card:
–
Reload the software on the line card, using the hw-module slot reset command.
–
Remove and reinsert the line card in its slot.
–
Try moving the card to another slot, if one is available.
If software-forced crashes continue, collect the following information and contact Cisco TAC:
• All relevant information about the problem that you have available, including any troubleshooting
you have performed.
• Any console output that was generated at the time of the problem.
General Information for Troubleshooting Line Card Crashes
• Output of the show tech-support command.
• Output of the show log command (or the log that was captured by your SYSLOG server, if
available).
For information on contacting TAC and opening a case, see the “Obtaining Technical Assistance” section
Troubleshooting the Timing, Communication, and Control Plus Card
Troubleshooting the Timing, Communication, and Control Plus
Card
At least one working Timing, Communication, and Control Plus (TCC+) card must be installed in the
Cisco uBR10012 router for normal operations. The TCC+ card acts as a secondary processor that
performs the following functions:
• Generates and distributes 10.24 MHz clock references to each cable interface line card.
• Generates and distributes 32-bit time stamp references to each cable interface line card.
• Allows software to independently power off any or all cable interface line cards.
• Provides support for Online Insertion/Removal (OIR) operations of line cards.
• Drives the LCD panel used to display system configuration and status information.
• Monitors the supply power usage of the chassis.
• Provides two redundant RJ-45 ports for external timing clock reference inputs such as a Global
Positioning System (GPS) or BITS clock.
If the Cisco uBR10012 router does not have a working TCC+ card installed, the WAN and cable interface
line cards will experience excessive packet drops, or all traffic will be dropped, because of an invalid
timing signal. Also, if no TCC+ card is installed, the cable power command is disabled, because this
function is performed by the TCC+ card.
NoteBecause the TCC+ card is considered a half-height card, use slot numbers 1/1 or 2/1 to display
information for the TCC+ card using the show diag command. The show cable clock and show
controllers clock-reference commands also use these slot numbers when displaying clock-related
information.
Figure 4-1TCC+ Front Panel
56418
The front panel on the TCC+ card has seven LEDs. Tab le 4-3 describes each LED on the TCC+ card.
Troubleshooting the Timing, Communication, and Control Plus Card
Table 4-4TCC+ Card Faults and Recommended Responses
Fault TypeResponse
The show cable clock command shows that no
TCC+ cards are installed:
Router# show cable clock
Number of TCCplus Cards in the Chassis: 0
TCCplus Cards are not yet configured
Router#
1. Verify that at least one TCC+ card is installed in the chassis. If not,
install a TCC+ card, because it is required for normal operations.
2. If only one TCC+ card is installed, it might not have been properly
initialized, so remove it from its slot, wait approximately 30 seconds,
and reinsert it.
3. If only one TCC+ card is installed, its slot might be fault, so remove
the card from its slot and install it in the other TCC+ card slot.
Chapter 4 Troubleshooting Line Cards
The console also typically displays the error
message %UBR10KTCC-1-NOTCC: No working
TCCplus card available in the system.
An IPC error message (IPCGRP-3-SYSCALL)
occurs for the TCC+ card slots (slot 1/1 or slot
2/1).
The console displays the following error message:
%UBR10KTCC-1-BADTCC: TCCplus card in
slot put under maintenance: reason
4. Replace the TCC+ card with a known, working TCC+ card.
1. If this message results after a line card is reset using the hw-module
reset command, it is an informational-only message that indicates
only that an IPC message was missed while the processor was
performing the reset of the line card. This error message can be
ignored.
2. Upgrade the Cisco uBR10012 router to Cisco IOS Release
12.2(11)BC2 or later release.
The Cisco uBR10012 router detected that the TCC+ card in the indicated
slot was faulty. If a redundant card is installed, the system passed control
to it. The faulty TCC+ card has been put into maintenance mode. The
following errors are possible:
• Holdover rcvd by Active Card—The active TCC+ card generated a
holdover interrupt to indicate an error in its clock source.
• Holdover rcvd by Backup Card—The backup TCC+ card generated a
holdover interrupt to indicate an error in the clock source being
received by either the Active or Backup card.
• Bad Clock reported by CLC(s)—A cable interface line card reported
that the clock signal being received from the TCC+ card is faulty.
• TRU Loss of Sync—The clock hardware on the TCC+ card reported a
4-10
• Unknown—An unknown failure occurred, possibly a hardware
failure.
To correct the problem:
1. If an external clock source is being used, check that the clock source
is valid national clock source, such as a GPS receiver or BITS clock.
2. If the clock source is valid, remove and reinsert the faulty TCC+ card.
3. If the MAINTENANCE LED on the faulty TCC+ card is still lit,
Table 4-4TCC+ Card Faults and Recommended Responses
Fault TypeResponse
The show controllers clock-reference command
displays compare errors between the two TCC+
cards installed in the Cisco uBR10012 router.
1. If this occurs at system startup or only occasionally at other times,
this error can be ignored, because the system typically records a
slight delay when the TCC+ cards synchronize with each other. These
initial compare errors can be ignored and cleared with the cable clock clear-counters command.
2. If this error repeatedly occurs, it could indicate a hardware problem
with one of the TCC+ cards. Try replacing each card to see if the
problem disappears.
The show controllers command for a cable
interface displays the message “Timestamp is
from local oscillator.” This command should
show that the timestamp is coming either from an
external source or from the TCC+ card.
1. Verify that at least one TCC+ card is installed in the chassis. If not,
install a TCC+ card.
2. Verify that a valid national clock source, such as a GPS receiver or
BITS clock, is plugged into the TCC+ card’s RJ-45 connector.
3. Replace the TCC+ card.
Troubleshooting the Timing, Communication, and Control Plus Card
Troubleshooting the OC-12 Dynamic Packet Transport Spatial Reuse Protocol WAN Card
Troubleshooting the OC-12 Dynamic Packet Transport Spatial
Reuse Protocol WAN Card
Figure 4-3 shows and Ta b l e 4-6 describes the LEDs on the Cisco BR10-SRP-OC12SML Dynamic
Packet Transport (DPT) Spatial Reuse Protocol (SRP) WAN card.
Figure 4-3OC12 SRP/DPT WAN Line Card LEDs
68499
Table 4-6Cisco uBR10-SRP-OC12SML DPT WAN Line Card LEDs
LEDStatusDescription
POWERGreen
Off
STATUS - bi-color Yellow
Green
MAINTENANCEOff
Indicates that power is being supplied to the
Cisco uBR10-SRP- OC12SML DPT WAN line card.
Power off.
Indicates that the CPU is in the bootup process, self test, or
downloading code.
Indicates that the CPU has successfully completed the boot,
self test, and code download process, and that the
Cisco uBR10-SRP- OC12SML DPT WAN line card is the
active card.
Normally off. Indicates that no maintenance action is required.
Ye ll ow
RX CARRIER–BGreen
Off
ACTIVEGreen
Off
RX PKTS (Packets)Blinking
Green
Off
RX CARRIER–AGreen
Off
ACTIVEGreen
Off
Indicates a required maintenance operation and that the
Cisco uBR10-SRP- OC12SML DPT WAN line card can be
hot-swapped.
Indicates that the DPT port WAN has detected valid SONET or
SDH framing on the received carrier.
No valid SONET or SDH framing.
Indicates that side B of the DPT port line is functioning.
Not active.
Indicates that the DPT port line has received a packet. This
LED flickers in normal operation, indicating traffic.
No traffic.
Indicates that the DPT port line has detected valid SONET or
SDH framing on the received carrier.
No valid SONET or SDH framing.
Indicates that side A of the DPT port line is functioning.
Troubleshooting the Cisco uBR10012 OC-48 DPT/POS Line Card
Troubleshooting the Cisco uBR10012 OC-48 DPT/POS Line Card
The Cisco OC-48 DP T⁄PO S interf ace module has a pair of OC-48c , fiber-optic standard connector (SC)
duplex ports that provide an SC connection for either the single-mode short-reach or single-mode
long-reach version. Figure 4-4 shows the faceplate on the Cisco OC-48 DPT⁄POS interface module,
and Ta bl e 4 -7 describes each LED.
Table 4-8 describes the gigabit Ethernet line card fault indications and suggests responses to each.
Table 4-8Gigabit Ethernet Line Card Faults and Recommended Responses
Fault TypeResponse
Fail LED is lit yellow indicating that a major fault
has disabled the card
Fail LED blinks then lights steadily repeatedly
Troubleshooting the Gigabit Ethernet Line Card
1. Reinsert the line card.
2. Insert the line card into another slot.
3. Replace the line card.
4. If neither of the above responses to a card
failure succeeds, call the Cisco TAC.
1. Check for bent pins on the backplane.
or
Card seems to be passing traffic (Tx/Rx lights),
2. If there are no bent pins, replace with a new
line card.
Try inserting the line card in a different slot.
but cannot communicate with the PRE
If the card works in a different slot, the
Cisco uBR10012 backplane may be defective.
Call the Cisco TAC.
Fail LED blinks steadilyThis is a user correctable problem. The steadily
blinking LED indicates a transmit failure.
To correct the problem:
1. Reinsert the GBIC.
If reinsertion fails:
2. Replace the GBIC.
Link LED does not light but the port is enabled
1. Make sure the fiber optic cable is plugged in
properly, unbroken, and undamaged.
2. Make sure that you are using the correct type
of fiber optic cable.
3. If you have autonegotiation enabled on the
local gigabit Ethernet interface, make sure
that it is enabled on the remote interface also.
If autonegotiation is disabled, it must be
disabled at the remote interface as well.
This section describes how to recover a lost enable or console login password, and how to replace a lost
enable secret password on the Cisco uBR10012 router.
NoteIt is possible to recover the enable or console login password. The enable secret password is encrypted,
however, and must be replaced with a new enable secret password.
Password Recovery Procedure Overview
The following is an overview of the steps in the password recovery procedure.
• If you can log in to the router, enter the show version command to determine the existing
configuration register value.
CHAPTER
5
• Press the Break key to go to the bootstrap program prompt (ROM monitor). You might need to
reload the system image by power-cycling the router.
• Change the configuration register to 0x2142 so that the router ignores the startup configuration file
during bootup. This allows you to log in without using a password and to display the startup
configuration password.
• Power cycle the router by typing reload atthe rommon> prompt.
• Log in to the router and enter the privileged EXEC mode.
• Enter the show startup-config command to display the passwords.
• Recover or replace the displayed passwords.
• Change the configuration register back to its original setting.
NoteTo recover a lost password if the break function is disabled on the router, you must have physical access
to the router.
Password Recovery Procedure
To recover or replace a lost enable, enable secret, or console login password, use this procedure:
Step 1Attach an ASCII terminal to the console port on the router.
Step 2Configure the terminal to operate at 9600 baud, 8 data bits, no parity, and 1 stop bit.
Step 3If you can log in to the router as a nonprivileged user, enter the show version command to display the
existing configuration register value, then go to Step 6. If you cannot log in to the router at all, go to the
next step.
Step 4Press the Break key or send a break signal from the console terminal.
• If break is enabled, the router enters the ROM monitor, indicated by the ROM monitor prompt
• If break is disabled, power cycle the router (turn off the router or unplug the power cord, and then
Step 5Within 60 seconds of restoring the power to the router, press the break key or send a break signal. This
action causes the router to enter the ROM monitor and display the ROM monitor prompt (
Step 6Set the configuration register using the configuration register utility. Enter the confreg command at the
ROM monitor prompt as follows:
rommon> confreg
Answer yes to the enable “ignore system config info?” Press the return key at all other prompts to accept
the existing value.
(
rommon>). Go to Step 6.
restore power). Then go to Step 5.
Chapter 5 Replacing or Recovering Passwords
rommon>).
Step 7Reboot the router by entering the reset command:
rommon> reset
The router initializes, the configuration register is set to 0x142, and the router boots the system image
from Flash memory and enters the system configuration dialog (setup):
--- System Configuration Dialog --
Step 8Enter no in response to the system configuration dialog prompts until the following message appears:
Press RETURN to get started!
Step 9Press Return. The user EXEC prompt appears:
Router>
Step 10Enter the enable command to enter privileged EXEC mode. Then enter the show startup-config
command to display the passwords in the configuration file as follows:
Router# show startup-config
Step 11Scan the configuration file display, looking for the passwords (the enable passwords are usually located
near the beginning of the file, and the console login or user EXEC password is near the end). The
passwords displayed appear similar to the following:
enable secret 5 $1$ORPP$s9syZt4uKn3SnpuLDrhuei
enable password 23skiddoo
.
.
line con 0
password onramp
5-2
The enable secret password is encrypted and cannot be recovered; it must be replaced. Go to the next
step to replace an enable secret, console login, or enable password. If there is no enable secret password,
note the enable and console login passwords. If the enable and console login passwords are not
encrypted, go to Step 16.
CautionDo not execute the next step unless you have determined you must change or replace the enable, enable
secret, or console login passwords. Failure to follow the steps as shown might cause you to erase the
router configuration.
Step 12Enter the copy startup-config running-config command to load the startup configuration file into
running memory. This action allows you to modify or replace passwords in the configuration.
Router# copy startup-config running-config
Step 13Enter the privileged EXEC command configure terminal to enter configuration mode:
Router# configure terminal
Step 14Change all three passwords using the following commands:
Router(config)# enable secret
Router(config)# enable password
Router(config)# line con 0
Router(config-line)# password
Change only the passwords necessary for your configuration. You can remove individual passwords by
using the no form of the above commands. For example, entering the no enable secret command
removes the enable secret password.
Password Recovery Procedure
newpassword1
newpassword2
newpassword3
Step 15You must configure all interfaces to avoid having the system be administratively shut down:
Router(config)# interface fastethernet 0/0
Router(config-int)# no shutdown
Enter the equivalent commands for all interfaces that were originally configured. If you omit this step,
all interfaces are administratively shut down and unavailable when the router is restarted.
Step 16Use the config-register command to set the configuration register to the original value noted in Step 3
or Step 7, or to the factory default value 0x2102.
Router(config)# config-register 0x2102
Step 17Press Ctrl-Z (hold down the Control key while you press Z) or enter end to exit configuration mode
and return to the EXEC command interpreter.
CautionDo not execute the next step unless you have changed or replaced a password. If you skipped Step 12
through Step 15, go to Step 19. Failure to observe this caution causes you to erase the router
configuration file.
Step 18Enter the copy running-config startup-config command to save the new configuration to NVRAM.
Step 19Enter the reload command to reboot the router.
Step 20Log in to the router using the new or recovered passwords.
The following are lists of the commands that are present but not supported for the
Cisco uBR10012 router in various releases of the Cisco IOS software.
• Unsupported Frame Relay Commands, page A-1
• HCCP Commands, page A-2
• MLPPP Commands, page A-2
• Unsupported MPLS VPN Commands, page A-3
• Unsupported PPP Commands, page A-3
• Spectrum Management Commands, page A-3
• Unsupported Telco-Return Commands, page A-3
Cisco strongly advises against using unsupported Cisco IOS commands, even if described, because such
commands can have undesirable effects upon the performance of the Cisco CMTS. In particular, Cisco advises
against using any unsupported commands that pertain to service-policy or to Modular Quality of Service
command-line interface (MQC) while Parallel Express Forwarding (PXF) is running on the Cisco CMTS.
Such commands may cause the Cisco CMTS to hang with unpredictable recovery times.
Unsupported Frame Relay Commands
The following commands are not supported in any Cisco IOS software release.
• FRF.12 fragmentation commands
• FRF.11 VoFR commands
• frame-relay adaptive-shaping {becn | foresight}
• frame-relay bc {in | out} bits
• frame-relay be {in | out} bits
• frame-relay cir {in | out} bps
• frame-relay custom-queue-listlist-number
• frame-relay de-groupgroup-number dlci
• frame-relay ip [rtp | tcp] header-compression[passive]
• frame-relay map ip ip-address dlci[broadcast] [cisco | ietf][nocompress]tcp
header-compression {active | passive}
• frame-relay mincir {in | out} bps
• frame-relay traffic-rateaverage [peak]
• frame-relay traffic-shaping
• show frame-relay ip [rtp | tcp] header-compression [interface]
HCCP Commands
The following commands are supported in Cisco IOS Release 12.2(8)BC2 and later 12.2 BC releases.
These commands are not supported in previous releases.
• hccp authenticate
• hccp authenticate key-chain
• hccp ds-switch
• hccp lockout
Appendix A Unsupported Commands
• hccp protect
• hccp revert
• hccp reverttime
• hccp switch
• hccp timers
• hccp track
• hccp unlockout
• hccp working
• show hccp
• show hccp interface
• debug hccp authentication
• debug hccp events
• debug hccp sync
MLPPP Commands
The following commands are supported in Cisco IOS Release 12.2(4)BC1 and later 12.2 BC releases.
These commands are not supported in previous releases.
The following commands are supported in Cisco IOS Release 12.2(8)BC2 and later 12.2 BC releases.
These commands are not supported in previous releases.
• cable upstream hop-priority
• cable upstream threshold
• show cable modem [ip-address | interface | mac-address] snr
• show controllers cable upstream spectrum
Unsupported Telco-Return Commands
OL-1237-01
The following commands are not supported in any Cisco IOS software release.
Table B-1 lists the basic tools and test equipment necessary to perform general maintenance and
troubleshooting tasks on the Cisco uBR10012 router.
Table B-1Recommended Tools and Test Equipment
Equipment Description
Number 2 Phillips and flat-head
screwdrivers
Voltage testerRefer to the “Testing with Digital Multimeters and Cable
Optical fiber test equipmentRefer to the “Testing with Digital Multimeters and Cable
Cable testing equipmentRefer to the “Testing with Digital Multimeters and Cable
ESD-preventive wrist or ankle
strap with connection cord
Small and medium-sized.
Testers” section on page B-1.
Testers” section on page B-1.
Testers” section on page B-1.
—
The following sections describe advanced testing equipment to aid in complex problem isolation.
Testing with Digital Multimeters and Cable Testers
Use a digital multimeter to measure parameters such as AC and DC voltage, current, resistance,
capacitance, cable continuity. Use cable testers, also, to verify physical connectivity.
Use cable testers (scanners) to check physical connectivity. Cable testers are available for shielded
twisted pair (STP), unshielded twisted pair (UTP), 10BaseT, and coaxial and twinax cables. A given
cable tester might be able to perform any of the following functions:
• Test and report on cable conditions, including near-end crosstalk (NEXT), attenuation, and noise.
• Perform time domain reflectometer (TDR), traffic monitoring, and wire map functions.
• Display Media Access Control (MAC) layer information about LAN traffic, provide statistics such
as network utilization and packet error rates, and perform limited protocol testing (for example,
TCP/IP tests such as ping).
Test fiber-optic cable both before installation (on-the-reel testing) and after installation. Continuity
testing of the fiber requires either a visible light source or a reflectometer. Light sources capable of
providing light at the three predominant wavelengths, 850 nanometers (nm), 1300 nm, and 1550 nm, are
used with power meters that can measure the same wavelengths and test attenuation and return loss in
the fiber.
Testing with TDRs and OTDRs
This section describes time domain reflectometers (TDRs) and optical time domain reflectometers
(OTDRs), which are typically used to detect cable defects.
Testing with TDRs
Use time domain reflectometers to test for the following cable defects:
• Open and short circuits
• Crimps, kinks, and sharp bends
• Impedance mismatches
Appendix B Recommended Tools and Test Equipment
• Other defects
A TDR works by “bouncing” a signal off the end of the cable. Open circuits, short circuits and other
problems reflect the signal back at different amplitudes, depending on the problem.
A TDR measures:
• the amount of time it takes for the signal to reflect
• The physical distance to a fault in the cable
• The length of a cable
Some TDRs can also calculate the propagation rate based on a configured cable length.
Testing with OTDRs
Use optical time domain reflectometers to:
• Locate fiber breaks
• Measure attenuation
• Measure the length of a fiber
• Measure splice or connector losses
An OTDR can be used to identify the “signature” of a particular installation, noting attenuation and
splice losses. This baseline measurement can then be compared with future signatures if you suspect a
problem in the system.
Testing with Breakout Boxes, Fox Boxes, and BERTs/BLERTs
Testing with Breakout Boxes, Fox Boxes, and BERTs/BLERTs
Use breakout boxes, fox boxes, and bit/block error rate testers (BERTs/BLERTs) to measure the digital
signals present at:
• PCs
• Printers
• Modems
• CSU/DSUs
These devices can monitor data line conditions, analyze and trap data, and diagnose problems common
to data communication systems. Traffic from data terminal equipment (DTE) through data
communications equipment (DCE) can be examined to:
• Isolate problems
• Identify bit patterns
• Ensure that the correct cabling is installed
These devices cannot test media signals such as Ethernet, Token Ring, or FDDI.
Testing with Network Monitors
Use network monitors to:
• Track packets crossing a network
• Provide an accurate picture of network activity at any moment
• Provide a historical record of network activity over a period of time
Network monitors do not decode the contents of frames. Monitors are useful for baselining, in which the
activity on a network is sampled over a period of time to establish a normal performance profile, or
baseline.
Monitors collect information such as packet sizes, the number of packets, error packets, overall usage of
a connection, the number of hosts and their MAC addresses, and details about communications between
hosts and other devices. This data can be used to:
Use network analyzers (also called protocol analyzers) to decode protocol layers in a recorded frame and
present the layers as readable abbreviations or summaries, detailing which layer is involved (physical,
data link, and so forth) and the function each byte or byte content serves.
Most network analyzers can perform many of the following functions:
• Filter traffic that meets certain criteria so that, for example, all traffic to and from a particular device
can be captured.
• Time-stamp captured data.
• Present protocol layers in an easily readable form.
• Generate frames and transmit them onto the network.
• Incorporate an “expert” system in which the analyzer uses a set of rules, combined with information
about the network configuration and operation, to diagnose and solve, or offer potential solutions
to, network problems.