Hi, I'm using GlusterFS V3.0.2 on Fedora 12.

I have configured AFR with 3 nodes and mounted the volume on the client. It works fine, but when a node fails, the file system on the client locks up and I can't execute any operations for about 40 to 50 seconds. After 40 to 50 seconds the file system on the client starts working again.

How can I resolve this problem? The file system can't be inaccessible for such a long time.

Below are my server configuration and client configuration:

--------------------------------------------------------------------------
Server Configuration:
--------------------------------------------------------------------------

volume s3ap2posix
  type storage/posix
  option directory /gluster/s3ap2
end-volume

volume s3ap2locks
  type features/locks
  subvolumes s3ap2posix
end-volume

volume s3ap2brick
  type performance/io-threads
  option thread-count 8
  subvolumes s3ap2locks
end-volume

volume server-tcp
  type protocol/server
  option transport-type tcp
  option auth.addr.s3ap2brick.allow 192.168.30.*
  option transport.socket.listen-port 6996
  option transport.socket.nodelay on
  subvolumes s3ap2brick
end-volume

--------------------------------------------------------------------------
Client Configuration
--------------------------------------------------------------------------

volume 192.168.30.1-s3ap2
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.30.1
  option transport.socket.nodelay on
  option transport.remote-port 6996
  option remote-subvolume s3ap2brick
end-volume

volume 192.168.30.2-s3ap2
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.30.2
  option transport.socket.nodelay on
  option transport.remote-port 6996
  option remote-subvolume s3ap2brick
end-volume

volume 192.168.30.3-s3ap2
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.30.3
  option transport.socket.nodelay on
  option transport.remote-port 6996
  option remote-subvolume s3ap2brick
end-volume

volume s3ap2afr
  type cluster/afr
  subvolumes 192.168.30.1-s3ap2 192.168.30.2-s3ap2 192.168.30.3-s3ap2
end-volume

volume s3ap2writebehind
  type performance/write-behind
  option cache-size 4MB
  subvolumes s3ap2afr
end-volume

volume s3ap2readahead
  type performance/read-ahead
  option page-count 4
  subvolumes s3ap2writebehind
end-volume

volume s3ap2iocache
  type performance/io-cache
  option cache-size 1GB
  option cache-timeout 1
  subvolumes s3ap2readahead
end-volume

volume s3ap2quickread
  type performance/quick-read
  option cache-timeout 1
  option max-file-size 64kB
  subvolumes s3ap2iocache
end-volume

volume s3ap2
  type performance/stat-prefetch
  subvolumes s3ap2quickread
end-volume

Thanks,
Eduardo
Hello Eduardo,

We had the same problem over here, with two nodes that are each both server and client. You can try lowering the ping-timeout in the client volume file:

  option ping-timeout 5

Sadly, 5 seconds is the lowest possible ping-timeout; our applications on the main node can still hang for about 5 seconds when the standby node fails (even though we have a dedicated interconnect). Maybe the gluster developers have a better solution for this.

Regards,
Richard

> Hi, I'm using GlusterFS V3.0.2 on Fedora 12.
>
> I have configured AFR with 3 nodes and mounted the volume on the client. It
> works fine, but when a node fails, the file system on the client locks up
> and I can't execute any operations for about 40 to 50 seconds.
> After 40 to 50 seconds the file system on the client starts working again.
> How can I resolve this problem? The file system can't be inaccessible for
> such a long time.
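For reference, here is a minimal sketch of how that option could be placed in Eduardo's client volfile, based only on Richard's suggestion above. The 5-second value is Richard's example, not a recommendation, and the option would need to be repeated in each of the three protocol/client volumes so every brick connection times out equally fast:

  # Sketch: one of the three protocol/client volumes from the client volfile,
  # with ping-timeout added. The default ping-timeout (around 42 seconds)
  # would explain the 40-50 second hang observed when a node fails.
  volume 192.168.30.1-s3ap2
    type protocol/client
    option transport-type tcp
    option remote-host 192.168.30.1
    option transport.socket.nodelay on
    option transport.remote-port 6996
    option remote-subvolume s3ap2brick
    option ping-timeout 5        # declare the brick dead after ~5s instead of the default
  end-volume

Keep in mind that a very low ping-timeout also makes the client more likely to drop a brick during short network glitches or server load spikes, so the value is a trade-off between failover speed and stability.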