Hello there,

Our Gluster client crashed twice last night. The Gluster partition was unavailable for a long time and I had to remount it manually. In the Gluster log, I saw entries like these:

pending frames:
frame : type(1) op(LOOKUP)
frame : type(1) op(LOOKUP)

patchset: git://git.gluster.com/glusterfs.git
signal received: 11
time of crash: 2012-06-28 00:19:24
configuration details:
argp 1
backtrace 1
dlfcn 1
fdatasync 1
libpthread 1
llistxattr 1
setfsid 1
spinlock 1
epoll.h 1
xattr.h 1
st_atim.tv_nsec 1
package-string: glusterfs 3.2.5
/lib64/libc.so.6[0x3cf0c32900]

Here is my "gluster volume info" output:

Volume Name: xxxxxx
Type: Distribute
Status: Started
Number of Bricks: 2
Transport-type: tcp
Bricks:
Brick1: Gnode1:/mnt/store
Brick2: Gnode2:/mnt/store2
Options Reconfigured:
nfs.addr-namelookup: off
nfs.rpc-auth-allow: 10.0.0.245,10.0.0.244,10.0.0.247,10.0.0.54,10.0.0.55
auth.allow: 10.*
cluster.min-free-disk: 5
performance.io-thread-count: 64
performance.cache-size: 512MB
nfs.disable: off
performance.write-behind-window-size: 4MB
cluster.data-self-heal: off
performance.stat-prefetch: off

I googled around and found that other people have had the same problem, but I haven't found a solution or any clue about what happened. Is it a bug? If so, could you please tell me how I can work around it?

Many thanks
It keeps crashing every day. I tried reinstalling (from source), but it didn't help. I will try installing from the RPM to see whether my compiler is the cause.

Any ideas, guys? Any help would be highly appreciated :)
On 06/27/2012 09:20 PM, tungdam wrote:
> pending frames:
> frame : type(1) op(LOOKUP)
> frame : type(1) op(LOOKUP)
>
> patchset: git://git.gluster.com/glusterfs.git
> signal received: 11
> time of crash: 2012-06-28 00:19:24
> [...]
> package-string: glusterfs 3.2.5
> /lib64/libc.so.6[0x3cf0c32900]

Can you please post the backtrace that usually follows the above in the log file?

Thanks,
Vijay
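One way to pull the requested backtrace out of a large client log is sketched below. This is only an illustration, not an official Gluster tool: the function name `last_crash_block` and the `max_lines` cutoff are hypothetical, and it assumes each crash dump begins with a "pending frames:" line, as in the excerpt quoted above.

```python
from typing import Iterable, List

# Assumption: every crash dump in the log starts with this line,
# as it does in the excerpt posted in this thread.
CRASH_MARKER = "pending frames:"


def last_crash_block(lines: Iterable[str], max_lines: int = 60) -> List[str]:
    """Return the most recent crash dump found in `lines`.

    The block starts at the last line containing CRASH_MARKER and
    includes the lines that follow it, up to `max_lines` lines in
    total, so the backtrace after "package-string:" is captured too.
    """
    block: List[str] = []
    for line in lines:
        text = line.rstrip("\n")
        if CRASH_MARKER in text:
            # A newer crash dump begins here; discard the older one.
            block = [text]
        elif block and len(block) < max_lines:
            block.append(text)
    return block
```

To use it, open the client log (its location depends on the installation) and pass the file object to `last_crash_block`, then paste the returned lines into a reply.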