Contents

 

 

 

 

4.5

Late-BIST

33

 

4.6

QuickBoot Feature

34

 

 

4.6.1

Configuring QuickBoot

34

 

4.7

Event Log Area and Event Management

35

 

4.8

OS Flash Corruption Detection and Recovery Design

35

 

 

4.8.1

Monitoring the Static Images

35

 

 

4.8.2

Monitoring the Dynamic Images

36

 

 

4.8.3

CMM Failover

36

 

4.9

BIST Test Descriptions

36

 

 

4.9.1

Flash Checksum Test

36

 

 

4.9.2

Base Memory Test

36

 

 

4.9.3

Extended Memory Tests

36

 

 

4.9.4

FPGA Version Check

37

 

 

4.9.5

DS1307 RTC (Real-Time Clock) Test

37

 

 

4.9.6

NIC Presence/Local PCI Bus Test

37

 

 

4.9.7

OS Image Checksum Test

37

 

 

4.9.8

CRC32 Checksum

37

 

 

4.9.9

IPMB Bus Busy/Not Ready Test

38

5

Re-enumeration

39

 

5.1

Overview

39

 

5.2

Re-enumeration on Failover

39

 

5.3

Re-enumeration of M5 FRU

40

 

5.4

Resolution of EKeys

40

 

5.5

Events Regeneration

40

6

Process Monitoring and Integrity

41

 

6.1

Overview

41

 

 

6.1.1

Process Existence Monitoring

41

 

 

6.1.2

Thread Watchdog Monitoring

41

 

 

6.1.3

Process Integrity Monitoring

42

 

6.2

Processes Monitored

42

 

6.3

Process Monitoring Targets

42

 

6.4

Process Monitoring Dataitems

43

 

 

6.4.1

Examples

43

 

6.5

SNMP MIB Commands

44

 

6.6

Process Monitoring CMM Events

44

 

6.7

Failure Scenarios and Eventing

45

 

 

6.7.1

No Action Recovery

45

 

 

6.7.2

Successful Restart Recovery

46

 

 

6.7.3

Successful Failover/Restart Recovery

47

 

 

6.7.4

Successful Failover/Reboot Recovery

48

 

 

6.7.5

Failed Failover/Reboot Recovery, Non-Critical

48

 

 

6.7.6

Failed Failover/Reboot Recovery, Critical

49

 

 

6.7.7

Excessive Restarts, Escalate No Action

50

 

 

6.7.8

Excessive Restarts, Successful Escalate Failover/Reboot

51

 

 

6.7.9

Excessive Restarts, Failed Escalate Failover/Reboot, Non-Critical

52

 

 

6.7.10

Excessive Restarts, Failed Escalate Failover/Reboot, Critical

52

 

 

6.7.11

Process Administrative Action

53

 

 

6.7.12

Excessive Failover/Reboots, Administrative Action

54

4MPCMM0001 Chassis Management Module Software Technical Product Specification

Page 4
Image 4
Intel MPCMM0001 manual Failed Failover/Reboot Recovery, Non-Critical