Cisco Systems Troubleshooting Common Cisco Parity Errors in Network Routers

Page 44

Chapter 3 Troubleshooting PRE-1 Modules

Troubleshooting Common System Problems

Fatal Alignment Errors

If the alignment error was a fatal error, it displays a message similar to the following:

%ALIGN-1-FATAL: Corrupted program counter error.

ERROR: Slot 0, NPE300/IOFE2/VXR, CACHE, External Data Cache Memory Test:

*** Data Expected= 0x99999999 ***

Fatal alignment errors are most likely a hardware fault on the processor card. The card itself could be faulty, or the memory on the card could be faulty. Try replacing the processor card and rebooting the router. If a replacement card is not available, try replacing the memory on the processor card, making sure that the new memory meets the specifications that are required by the card.

Low Memory Errors

The router can experience low memory errors for a number of reasons, including the following possible causes:

The router is handling an excessively large volume of traffic. In particular, the router could be experiencing a large volume of traffic that requires special handling, such as ARP requests.

Abnormal processes are using excessive amounts of memory.

Large amounts of memory are still allocated to dead processes.

Software errors could have resulted in memory leaks.

Hardware problems with the memory on the processor card or line card.

Hardware problems on the processor card or line card.

Low memory problems are usually indicated by one or more system messages (for example, SYS-2-MALLOCFAIL). For troubleshooting steps to resolve problems with low memory, see the Tech Note titled Troubleshooting Memory Problems, at the following URL:

http://www.cisco.com/warp/customer/63/mallocfail.shtml

Memory Parity Errors

 

 

 

A memory parity error means that one or more bits at a memory location were unexpectedly changed

 

 

 

after they were originally written. This error could indicate a potential problem with the Dynamic

 

 

 

Random Access Memory (DRAM) that is onboard the PRE-1 module.

 

 

 

Parity errors are not expected during normal operations and might force the router to reload. If the router

 

 

 

did reload because of a parity error, the show version command displays a message such as “System

 

 

 

restarted by processor memory parity error” or “System restarted by shared memory parity error.” For

 

 

 

example:

 

 

 

Router# show version

 

 

 

Cisco Internetwork Operating System Software

 

 

 

IOS (tm) 10000 Software (UBR10K-P6-M), Experimental Version 12.2(20031215:22350]

 

 

 

Copyright (c) 1986-2003 by cisco Systems, Inc.

 

 

 

Compiled Mon 15-Dec-03 17:28 by mnagai

 

 

 

Image text-base: 0x60008968, data-base: 0x61B80000

 

 

 

ROM: System Bootstrap, Version 12.0(9r)SL2, RELEASE SOFTWARE (fc1)

 

 

 

BOOTLDR: 10000 Software (C10K-EBOOT-M), Version 12.0(17)ST, EARLY DEPLOYMENT RE)

 

 

 

ubr10k uptime is 6 days, 18 hours, 59 minutes

 

 

 

Cisco uBR10012 Universal Broadband Router Troubleshooting Guide

 

 

 

 

3-16

 

OL-1237-01

 

 

 

 

Image 44
Contents Corporate Headquarters Text Part Number OL-1237-01Copyright 2001-2004, Cisco Systems, Inc All rights reserved N T E N T S ARP Traffic Testing with Digital Multimeters and Cable Testers B-1 OL-1237-01 Purpose AudienceChapter Description Document OrganizationRelated Documentation Obtaining Documentation Documentation FeedbackCisco.com Ordering DocumentationOpening a TAC Case Obtaining Technical AssistanceCisco TAC Website Obtaining Additional Publications and Information TAC Case Priority DefinitionsXii Basic Troubleshooting Checklist Basic Troubleshooting Tasks and Startup IssuesConfirming the Hardware Installation Last reset from power-on Displaying the Cisco IOS Software VersionHardware Troubleshooting Flowchart Displaying System Environment InformationCisco uBR10012 System Startup Sequence TCC+Startup Event Event Description PEM Faults and Fan Assembly Failures AC PEM FaultsFault Symptom Corrective Action Color DescriptionDC PEM Faults DC PEM Front Panel original model, UBR10-PWR-DC 2400W AC-Input Power Shelf Other Electrical Problems FaultAC OK DC OKFan Assembly Module Faults Fan Assembly ModuleFan Assembly Air Circulation Pattern MULTI-FAN Failure LED Symptom Steps to TakeSingle FAN Failure OL-1237-01 Troubleshooting PRE-1 Modules Message Description PRE Module Not SupportedPRE-1 Module Status Screen Booting Up with Redundant PRE-1 Modules IOS ProtIOS Intf IOS RUNPRE-1 Module Faults Fault Steps to Take LEDEthernet Connection Problems C10000config#interface fastethernet0/0/0Console Port Serial Connection Problems Troubleshooting Common System Problems Troubleshooting System CrashesHigh CPU Utilization Problems ARP TrafficRouterconfig-if# ip access-groupnumber Exec and Virtual Exec Processes Cpuhog ErrorsDebug and System Messages IP Input Processing Invalid Scheduler Allocate ConfigurationInterrupts are Consuming a Large Amount of Resources Snmp Traffic Bus ErrorsProblems with Access Lists Region Manager Start End Sizeb Class Media Name 0x0A000000 Memory Problems Alignment ErrorsLow Memory Errors Memory Parity ErrorsParticle Pool Fallbacks Spurious Interrupts Spurious Memory Accesses OL-1237-01 Troubleshooting Line Cards General Information for Troubleshooting Line Card Crashes Command DescriptionSIG Value SIG Name Error Reason Sigreload Cache Parity ErrorsSigerror Bus Errors Software-Forced Crashes Troubleshooting Line Cards TCC+ Front Panel Status Description PowerMaintenance Fault Type Response Show controllers clock-reference command Troubleshooting the OC-12 Packet-Over-SONET Line Card Fault Corrective Action RX CARRIER-A RX CARRIER-BActive Enabled PASS-THROUGHEnable FailPOS SRPPass Thru SyncWrap Troubleshooting the Gigabit Ethernet Line Card Gigabit Ethernet Line Card Faceplate and LED DescriptionsGigabit Ethernet Line Card Faults and Recommended Responses OL-1237-01 Password Recovery Procedure Overview Password Recovery ProcedurePress Return. The user Exec prompt appears Change all three passwords using the following commands OL-1237-01 Unsupported Commands Unsupported Frame Relay CommandsHccp Commands Mlppp CommandsUnsupported Mpls VPN Commands Unsupported PPP CommandsSpectrum Management Commands Unsupported Telco-Return CommandsOL-1237-01 Testing with Digital Multimeters and Cable Testers Equipment DescriptionTesting with OTDRs Testing with TDRs and OTDRsTesting with TDRs Testing with Network Monitors Testing with Breakout Boxes, Fox Boxes, and BERTs/BLERTsTesting with Network Analyzers Enable LED Active LEDBert BlertENABLE, OC-48 DPT/POS MAINTENANCE, OC-12 SRP/DPTMAINTENANCE, TCC+ POWER, OC-12 DPT/SRP POWER, TCC+Maintenance LED Power LEDSTATUS, OC-12 DPT/SRP STATUS, TCC+ SYNC, OC-48 DPT/POS TX, OC-48 DPT/POS WRAP, OC-48 DPT/POSOC-12 DPT/SRP TCC+ Present LED TCC+RX Carrier LED RX LED RX Pkts LEDWrap LED TDR B-2TX LED OC-48 DPT/POS IN-6