NAS / Data Domain: Resetting Hung Alert

We had a power supply fail on one of our Data Domain 510 appliances about two weeks ago. A replacement power supply was shipped out and installed. After the replacement, the alert was still showing in both the web interface and the CLI (Command Line Interface). Since this is a standby backup unit, I also tried a reboot, but that did not clear the alert either. Remember, I am in no way responsible for anyone ruining their system or voiding their warranty.

Snippet from the autosupport(email) log
Power Supply
Status
--------
DEGRADED
--------

From the CLI
# alerts show current
Alert Time         Description
----------------   -----------------------------------------------------
Mon Dec  8 08:38   Encl 1 A power supply module has failed.
----------------   -----------------------------------------------------
There is 1 active alert
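
If you want to check for the alert from a script (for example, to confirm it is gone after the fix), the output above can be parsed. This is only a rough sketch in Python: the row format is assumed from the single alert shown here, `parse_alerts` is a made-up helper name, and you would have to capture the `alerts show current` output yourself.

```python
import re

# Assumed row shape, based only on the one alert shown in this post:
#   "Mon Dec  8 08:38   Encl 1 A power supply module has failed."
ALERT_ROW = re.compile(
    r"^([A-Z][a-z]{2} [A-Z][a-z]{2}\s+\d{1,2} \d{2}:\d{2})\s+(.+)$"
)

def parse_alerts(output):
    """Return (time, description) tuples for each alert row in the output."""
    alerts = []
    for line in output.splitlines():
        match = ALERT_ROW.match(line.strip())
        if match:
            alerts.append((match.group(1), match.group(2)))
    return alerts

# Sample text copied from the CLI output above (separator lines omitted).
sample = """Alert Time         Description
Mon Dec  8 08:38   Encl 1 A power supply module has failed.
There is 1 active alert"""

for when, description in parse_alerts(sample):
    print(when, "|", description)
```

The header and the "There is 1 active alert" summary line are skipped because they do not start with a weekday, month, day and time.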

I needed to go over to that colocation anyway, so I checked that remote hands hadn't messed anything up. The new power supply had good status indicators, and the power cable had been replaced, so this alert is definitely hung. Removing it has to be done through the CLI in SE mode. To do this, I SSHed into the Data Domain and escalated my privileges as follows.

Retrieve the serial number
# system show serialno
Serial number: XXXXXXXXXXX

Escalate to SE privileges
# priv set SE
Enter system password:[enter serial number]

Once in SE mode, the prompt will change to something resembling the following
SE@hostname##

Now, the ‘reg’ command is extended to its full feature set
SE@hostname## reg
Commands matching “reg”:

reg removekey <key>           Remove a key from the configuration
reg set reg-default <key>     Set the specified key to the default value
reg set unset-defaults        Set unset defaults.  Useful for upgrades
reg set <key> = <value>       Set a configuration key to the given value
reg setraw <key> = <value>    Set an encoded reg string
reg show defaults             Show registry defaults
reg show obsolete             Show obsolete keys
reg show stats                Show registry stats
reg showraw [nokey]           Show encoded registry data
reg show [nokey] <key>        Show a value for a configuration key

The autosupport contains the registry key that needs to be removed
dynamic.ems.701.x.1000.x.9.1.5 = Encl 1 A power supply

Cut out the part of the key that is needed
dynamic.ems.701.x.1000.x.9.1.5
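
If the autosupport is long, picking the key out by hand is error-prone. Here is a minimal sketch in Python that pulls matching keys out of autosupport text. It assumes every relevant line looks like the one above (`key = alert text`) and that the key starts with `dynamic.ems.`; I have only seen this one example, so treat the pattern as a guess. `find_alert_keys` is just a name I made up.

```python
import re

# Assumed line shape, taken from the single example in this post:
#   <registry key> = <alert text>
KEY_LINE = re.compile(r"^\s*(dynamic\.ems\.\S+)\s*=\s*(.*)$")

def find_alert_keys(autosupport_text, needle):
    """Return registry keys whose alert text contains `needle` (case-insensitive)."""
    keys = []
    for line in autosupport_text.splitlines():
        match = KEY_LINE.match(line)
        if match and needle.lower() in match.group(2).lower():
            keys.append(match.group(1))
    return keys

# The line from the autosupport quoted above.
sample = "dynamic.ems.701.x.1000.x.9.1.5 = Encl 1 A power supply"
print(find_alert_keys(sample, "power supply"))
```

Copy the key exactly as printed into the `reg removekey` command; a dropped segment would point at a different key.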

The following removes the hung alert from the registry
SE@hostname## reg removekey dynamic.ems.701.x.1000.x.9.1.5

Note: Unfortunately, the alert does not say which of the 3 power supplies has failed. This was not a huge problem, since the remote hands just looked for the power supply that had no status lights on.

~ by Kevin Goodman on December 29, 2008.
