drbd-user March 2013 archive

[DRBD-user] Primary fully unavailable with "time expired" errors

From: AZ 9901 <az9901_at_nospam>
Date: Tue Mar 05 2013 - 06:21:21 GMT
To: drbd-user@lists.linbit.com

// I made some errors in my previous mail; this is the corrected version.

Hello,

I ran into a serious problem with DRBD.

OS : Linux Debian 6
Kernel : 2.6.32-46
DRBD : 8.3.14

My primary server (srv2-2) was effectively unreachable: it only replied to ping.
Apache, SSH, etc. were no longer responding.

So I connected to my secondary server (srv2-1) and cut the network communication between the two nodes.
This made srv2-2 available again!
However, I decided to switch srv2-1 from Secondary to Primary and to reboot srv2-2.

Following are logs from srv2-2 and srv2-1, with some comments.
srv2-2 : http://pastebin.com/raw.php?i=zkHV5Tr9
srv2-1 : http://pastebin.com/raw.php?i=WX4vNR6d

On srv2-2, sar shows that some of my CPU cores were 100% busy (100% iowait) for the entire time frame in which the "time expired" errors were logged.
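
For anyone who wants to cross-check the per-core iowait figure without sar, here is a rough sketch in Python that derives it from two snapshots of /proc/stat. It assumes the usual counter order on each "cpuN" line (user, nice, system, idle, iowait, irq, softirq, steal). The function name iowait_percent is made up for illustration, and this is only an approximation of what sar reports, not its actual implementation.

```python
def iowait_percent(before: str, after: str) -> dict:
    """Return {cpu_name: iowait share in percent} between two /proc/stat snapshots.

    Assumption: each per-core line is "cpuN user nice system idle iowait irq
    softirq steal ..."; only these first 8 counters are used.
    """
    def parse(text):
        cpus = {}
        for line in text.splitlines():
            parts = line.split()
            # Keep per-core lines ("cpu0", "cpu1", ...); skip the aggregate "cpu" line.
            if parts and parts[0].startswith("cpu") and parts[0] != "cpu":
                cpus[parts[0]] = [int(x) for x in parts[1:9]]
        return cpus

    first, second = parse(before), parse(after)
    result = {}
    for name, new in second.items():
        old = first.get(name)
        if old is None:
            continue  # core not present in the first snapshot
        deltas = [n - o for n, o in zip(new, old)]
        total = sum(deltas)
        # iowait is the fifth counter (index 4).
        result[name] = 100.0 * deltas[4] / total if total > 0 else 0.0
    return result
```

To use it, read /proc/stat once, wait a few seconds, read it again, and pass both strings to the function. Cores that stay close to 100 over repeated intervals would match what sar showed here.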

Could you help me, please?

Thank you very much,

Ben

_______________________________________________
drbd-user mailing list
drbd-user@lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user