Break reports

All breaks in HIIT's IT services are listed here.

Break in elrond.it.hiit.fi at 2012-08-16 17:05-17:10

Schedule: 
2012-08-16 17:05 to 17:10
Duration: 
5 min
Affected services: 
wiki.hiit.fi
Description: 

Rebooted elrond.it.hiit.fi, which hosts wiki.hiit.fi, because the kernel's multipath driver was unable to mount a partition using the device-mapper path alias.

Break in arod.it.hiit.fi, hasufel.it.hiit.fi and meriadoc.it.hiit.fi at 2012-08-16 08:00-08:15

Schedule: 
2012-08-16 08:00 to 08:15
Duration: 
15 min
Affected services: 
Remote Windows server (rwin.hiit.fi)
Description: 

Updates will be installed on the Windows servers. The only visible break is on the Remote Windows server at 08:00. Other services are redundant, so no other visible breaks will occur.

Each break lasts less than 5 minutes.

Break in network connections at 2012-08-07 19:00-21:30

Schedule: 
2012-08-07 19:00 to 21:30
Duration: 
3 min
Affected services: 
University of Helsinki Department of Computer Science's Ukko cluster, some virtualised services
Description: 

Firmware upgrade on the datacenter core switches. A few short breaks will occur. In most cases spanning tree will take care of rerouting traffic, so the breaks will be short.

Update at 21:16: Some Ukko nodes failed to switch network traffic to another network interface card (NIC), so network connections to those nodes were interrupted for approximately 3 minutes while the switch carrying traffic to their blade chassis was rebooted. NIC bonding configurations will be fixed so that this kind of unnecessary break can be avoided.

Update at 21:29: The break is over.

Update at 2012-08-13 21:18: The Ukko cluster's network switches are now configured to drop the host port (access port) immediately if the uplink port goes down. This fixes the aforementioned problem, in which the hosts' NIC bonding configuration did not notice the loss of the uplink.
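
The failure mode in the update above can be illustrated with a small model. This is only a sketch under assumptions: the report does not state which bonding mode or link monitoring the Ukko nodes use, so the code assumes the host chooses its active NIC purely from the state of its own link to the switch. The names SwitchPath, pick_active and host_reachable are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass
class SwitchPath:
    """One path from a host NIC through a chassis switch (hypothetical model)."""
    name: str
    uplink_up: bool = True                    # switch-to-core link
    drop_access_on_uplink_loss: bool = False  # the switch setting described above

    @property
    def access_port_up(self) -> bool:
        # With the setting enabled, the host-facing port follows the uplink state.
        if self.drop_access_on_uplink_loss:
            return self.uplink_up
        return True

    @property
    def carries_traffic(self) -> bool:
        # Traffic only gets through if both the access port and the uplink are up.
        return self.access_port_up and self.uplink_up


def pick_active(paths):
    """Assumed failover rule: use the first path whose host-facing link is up."""
    for path in paths:
        if path.access_port_up:
            return path
    return None


def host_reachable(paths) -> bool:
    active = pick_active(paths)
    return active is not None and active.carries_traffic


for drop in (False, True):
    paths = [
        SwitchPath("switch-a", drop_access_on_uplink_loss=drop),
        SwitchPath("switch-b", drop_access_on_uplink_loss=drop),
    ]
    paths[0].uplink_up = False  # the first switch loses its uplink
    print(drop, pick_active(paths).name, host_reachable(paths))
```

Without the setting, the host keeps using switch-a because its local link still looks healthy, and traffic is lost. With the setting, the access port drops together with the uplink, so the host fails over to switch-b.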

Break in elrond.it.hiit.fi at 2012-07-20 11:30-11:42

Schedule: 
2012-07-20 11:30 to 11:42
Duration: 
12 min
Affected services: 
wiki.hiit.fi
Description: 

A configuration change on elrond.it.hiit.fi caused the Confluence application to crash.

A restart of the Tomcat application server was required.

Update at 11:48: The break is over.

Break in arod.it.hiit.fi, hasufel.it.hiit.fi and meriadoc.it.hiit.fi at 2012-07-17 15:39-15:56

Schedule: 
2012-07-17 15:39 to 15:56
Duration: 
5 min
Affected services: 
Remote Windows server (rwin.hiit.fi)
Description: 

Updates will be installed on the Windows servers. The only visible break is on the Remote Windows server at 15:39. Other services are redundant, so no other visible breaks will occur.

Each break lasts less than 5 minutes.
