drbd-user March 2010 archive
Main Archive Page > Month Archives  > drbd-user archives
drbd-user: Re: [DRBD-user] DRBD module won't load

Re: [DRBD-user] DRBD module won't load

From: <Wood.Chris_at_nospam>
Date: Thu Mar 18 2010 - 22:52:52 GMT
To: drbd-user@lists.linbit.com

drbd-user-bounces@lists.linbit.com wrote on 03/18/2010 03:27:10 PM:

> From: "Matt Graham" <danceswithcrows@usa.net>
> To: <drbd-user@lists.linbit.com>
> Date: 03/18/2010 03:33 PM
> Subject: Re: [DRBD-user] DRBD module won't load
> Sent by: drbd-user-bounces@lists.linbit.com
>
> From: Wood.Chris@tatravelcenters.com
> > I'm trying to start drbd on an Oracle Virtual Server (Xen) machine.
> > Starting DRBD resources: Can not load the drbd module.
> > $ rpm -qa|grep drbd
> > drbd-8.3.6-1.el5
> > kmod-drbd-8.0.16-5.el5_3
>
> drbd 8.3.6, drbd kernel module 8.0.16. The two really should match, no?
> And which /lib/modules/2.6.* dir did the module get installed in? If it
> got put in the wrong dir (possible) then modprobe won't find it. Where
> did the kernel module RPM come from? It has to be built against the
> kernel that's running, otherwise it won't work.
>
> > Any help would be much appreciated - do I have to build the module
from
> > scratch?
>
> Building the kernel module from source should be pretty easy; just go
> into the drbd source dir and do "make rpm
KDIR=/usr/src/kernels/2.6.18-blah"
> and you should get RPMs for userland and kernelspace drbd components.
> Not applicable to very recent kernels since drbd is now in the vanilla
> kernel source, but CentOS 5 doesn't have a recent kernel.

Ok, I uninstalled all the other packages... so I have no idea if I'll be
able to get this to work with heartbeat and openais... but I built the
8.3.4 versions and installed on both hosts.

I set up dedicated interfaces on each, they are directly connected. I
opened the firewall for that interface. I can ping each server from the
other over this link, although the xenbr1 interface is bridging it - I
don't really want that, but can't find anything on how to disable xen from
grabbing it... so I can ping each server, but when I start drbd on each
node, it just waits and waits and waits for the other node to come up...

Here's what I see in the log on server0

Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: initialized. Version: 8.3.4
(api:88/proto:86-91)
Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: GIT-hash:
70a645ae080411c87b4482a135847d69dc90a6a2 build by root@admin-lab-ovs0,
2010-03-18 11:28:37
Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: registered as block device
major 147
Mar 18 13:08:52 admin-lab-ovs0 kernel: drbd: minor_table @ 0xdf50c0c0
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Starting worker thread
(from cqueue/0 [120])
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: disk( Diskless ->
Attaching )
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: No usable activity log
found.
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Method to ensure write
ordering: barrier
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: max_segment_size ( =
BIO size ) = 32768
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: drbd_bm_resize called
with capacity == 1942729272
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: resync bitmap:
bits=242841159 words=7588788
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: size = 926 GB
(971364636 KB)
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: recounting of set bits
took additional 15 jiffies
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: 926 GB (242841159
bits) marked out-of-sync by on disk bit-map.
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: disk( Attaching ->
Inconsistent )
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: conn( StandAlone ->
Unconnected )
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: Starting receiver
thread (from drbd0_worker [2871])
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: receiver (re)started
Mar 18 13:08:52 admin-lab-ovs0 kernel: block drbd0: conn( Unconnected ->
WFConnection )
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: Handshake successful:
Agreed network protocol version 91
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: conn( WFConnection ->
WFReportParams )
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: Starting asender
thread (from drbd0_receiver [2881])
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: data-integrity-alg:
<not-used>
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: drbd_sync_handshake:
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: self
0000000000000004:0000000000000000:0000000000000000:0000000000000000
bits:242841159 flags:0
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: peer
0000000000000004:0000000000000000:0000000000000000:0000000000000000
bits:242841159 flags:0
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: uuid_compare()=0 by
rule 10
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: No resync, but
242841159 bits in bitmap!
Mar 18 13:09:51 admin-lab-ovs0 kernel: block drbd0: peer( Unknown ->
Secondary ) conn( WFReportParams -> Connected ) pdsk( DUnknown ->
Inconsistent )
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: peer( Secondary ->
Unknown ) conn( Connected -> Disconnecting )
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: short read expecting
header on sock: r=-512
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: asender terminated
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Terminating asender
thread
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Connection closed
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( Disconnecting ->
StandAlone )
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: receiver terminated
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Terminating receiver
thread
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( StandAlone ->
Unconnected )
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: Starting receiver
thread (from drbd0_worker [2871])
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: receiver (re)started
Mar 18 13:28:42 admin-lab-ovs0 kernel: block drbd0: conn( Unconnected ->
WFConnection )

[root@admin-lab-ovs0 cwood]# cat /proc/drbd
version: 8.3.4 (api:88/proto:86-91)
GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by
root@admin-lab-ovs0, 2010-03-18 11:28:37
 0: cs:WFConnection ro:Secondary/Unknown ds:Inconsistent/Inconsistent C
r----
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b
oos:971364636

And server1

Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: initialized. Version: 8.3.4
(api:88/proto:86-91)
Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: GIT-hash:
70a645ae080411c87b4482a135847d69dc90a6a2 build by root@admin-lab-ovs0,
2010-03-18 11:28:37
Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: registered as block device
major 147
Mar 18 18:03:47 admin-lab-ovs1 kernel: drbd: minor_table @ 0xf718b6c0
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Starting worker thread
(from cqueue/0 [120])
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: disk( Diskless ->
Attaching )
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: No usable activity log
found.
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Method to ensure write
ordering: barrier
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: max_segment_size ( =
BIO size ) = 32768
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: drbd_bm_resize called
with capacity == 1942729272
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: resync bitmap:
bits=242841159 words=7588788
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: size = 926 GB
(971364636 KB)
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: recounting of set bits
took additional 18 jiffies
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: 926 GB (242841159
bits) marked out-of-sync by on disk bit-map.
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: disk( Attaching ->
Inconsistent )
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: conn( StandAlone ->
Unconnected )
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: Starting receiver
thread (from drbd0_worker [3206])
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: receiver (re)started
Mar 18 18:03:47 admin-lab-ovs1 kernel: block drbd0: conn( Unconnected ->
WFConnection )
Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: Handshake successful:
Agreed network protocol version 91
Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: conn( WFConnection ->
WFReportParams )
Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: Starting asender
thread (from drbd0_receiver [3216])
Mar 18 18:04:06 admin-lab-ovs1 kernel: block drbd0: data-integrity-alg:
<not-used>
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: drbd_sync_handshake:
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: self
0000000000000004:0000000000000000:0000000000000000:0000000000000000
bits:242841159 flags:0
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: peer
0000000000000004:0000000000000000:0000000000000000:0000000000000000
bits:242841159 flags:0
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: uuid_compare()=0 by
rule 10
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: No resync, but
242841159 bits in bitmap!
Mar 18 18:04:07 admin-lab-ovs1 kernel: block drbd0: peer( Unknown ->
Secondary ) conn( WFReportParams -> Connected ) pdsk( DUnknown ->
Inconsistent )
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: peer( Secondary ->
Unknown ) conn( Connected -> TearDown )
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: asender terminated
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Terminating asender
thread
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Connection closed
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: conn( TearDown ->
Unconnected )
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: receiver terminated
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: Restarting receiver
thread
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: receiver (re)started
Mar 18 18:22:58 admin-lab-ovs1 kernel: block drbd0: conn( Unconnected ->
WFConnection )

[root@admin-lab-ovs1 cwood]# cat /proc/drbd
version: 8.3.4 (api:88/proto:86-91)
GIT-hash: 70a645ae080411c87b4482a135847d69dc90a6a2 build by
root@admin-lab-ovs0, 2010-03-18 11:28:37
 0: cs:WFConnection ro:Secondary/Unknown ds:Inconsistent/Inconsistent C
r----
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b
oos:971364636
_______________________________________________
drbd-user mailing list
drbd-user@lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user