Difference between revisions of "Kaiser Server Room Network Failure (Mar 2013)"

From ECE Information Technology Services
Jump to navigationJump to search
m (Date)
(Repeat incident 2013-03-22)
Line 10: Line 10:
 
* CMC CAD tools
 
* CMC CAD tools
 
* Several research groups' servers hosted in the Kaiser server room
 
* Several research groups' servers hosted in the Kaiser server room
 +
 +
**UPDATE 22 March 2013**
 +
 +
The same network switch failed again 22 March 19:33.  We are investigating the possibility of replacing the switch.

Revision as of 21:32, 22 March 2013

At 2 am on 21 March 2013, a network switch in the Kaiser server room spontaneously failed. Service was restored at 4:15 am by power-cycling the switch.

As a result, the following services were unavailable during the outage:

  • Authentication to the UBC_ECE domain
  • The ability to change account passwords
  • ssh-linux5, ssh-linux6, ssh-linux7
  • Electronic Software Distribution
  • Graduate Application Data Store
  • Several software license servers
  • CMC CAD tools
  • Several research groups' servers hosted in the Kaiser server room
    • UPDATE 22 March 2013**

The same network switch failed again 22 March 19:33. We are investigating the possibility of replacing the switch.