thr3ads.net - Gluster users - [Gluster-users] faied start the glusterd after reboot [Feb 2016]

If this information is useful, please help other people find it:
Share via:

Atin Mukherjee

2016-Feb-25 10:25 UTC

[Gluster-users] faied start the glusterd after reboot

I believe you and Abhishek are from the same group and sharing the
common set up. Could you check the content of /var/lib/glusterd/* in
board B (post reboot and before starting glusterd) matches with
/var/lib/glusterd/* from board A?

~Atin

On 02/25/2016 03:48 PM, songxin wrote:> Hi,
> I have a problem as below when I start the gluster after reboot a board.
> 
> precondition: 
> I use two boards do this test.
> The version of glusterfs is 3.7.6.
> 
> A board ip:128.224.162.255 
> B board ip:128.224.95.140 
> 
> reproduce steps?
> 
> 1.systemctl start glusterd (A board) 
> 2.systemctl start glusterd (B board) 
> 3.gluster peer probe 128.224.95.140 (A board) 
> 4.gluster volume create gv0 replica 2 128.224.95.140:/tmp/brick1/gv0
> 128.224.162.255:/data/brick/gv0 force (local board)
> 5.gluster volume start gv0 (A board) 
> 6.press the reset button on the A board.It is a develop board so it has
> a reset button that is similar to reset button on pc (A board) 
> 7.run command "systemctl start glusterd" after A board reboot.
And
> command failed because the file /var/lib/glusterd/snaps/.nfsxxxxxxxxx
> (local board) .
> Log is as below.
> [2015-12-07 07:55:38.260084] E [MSGID: 101032]
> [store.c:434:gf_store_handle_retrieve] 0-: Path corresponding to
> /var/lib/glusterd/snaps/.nfs0000000001722f4000000002
> [2015-12-07 07:55:38.260120] D [MSGID: 0]
> [store.c:439:gf_store_handle_retrieve] 0-: Returning -1                
>                                               
> [2015-12-07 07:55:38.260152] E [MSGID: 106200]
> [glusterd-store.c:3332:glusterd_store_update_snap] 0-management: snap
> handle is NULL                                 
> [2015-12-07 07:55:38.260180] E [MSGID: 106196]
> [glusterd-store.c:3427:glusterd_store_retrieve_snap] 0-management:
> Failed to update snapshot for .nfs0000000001722f40
> [2015-12-07 07:55:38.260208] E [MSGID: 106043]
> [glusterd-store.c:3589:glusterd_store_retrieve_snaps] 0-management:
> Unable to restore snapshot: .nfs0000000001722f400
> [2015-12-07 07:55:38.260241] D [MSGID: 0]
> [glusterd-store.c:3607:glusterd_store_retrieve_snaps] 0-management:
> Returning with -1                              
> [2015-12-07 07:55:38.260268] D [MSGID: 0]
> [glusterd-store.c:4339:glusterd_restore] 0-management: Returning -1    
>                                                  
> [2015-12-07 07:55:38.260325] E [MSGID: 101019]
> [xlator.c:428:xlator_init] 0-management: Initialization of volume
> 'management' failed, review your volfile again    
> [2015-12-07 07:55:38.260355] E [graph.c:322:glusterfs_graph_init]
> 0-management: initializing translator failed                            
>                          
> [2015-12-07 07:55:38.260374] E [graph.c:661:glusterfs_graph_activate]
> 0-graph: init failed                                 
> 
> 8.rm /var/lib/glusterd/snaps/.nfsxxxxxxxxx (A board) 
> 9..run command "systemctl start glusterd" and success. 
> 10.at this point the peer status is Peer in Cluster (Connected) and all
> process is online. 
> 
> If a node abnormal reset, must I remove
> the  /var/lib/glusterd/snaps/.nfsxxxxxx before starting the glusterd?
> 
> I want to know if it is nomal.
> 
> Thanks,
> Xin
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
>  
> 
> 
> 
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-users
>

songxin

2016-Feb-25 10:42 UTC

head link

[Gluster-users] faied start the glusterd after reboot

Thanks for your reply.


Do I need check all files in /var/lib/glusterd/*? 
Must all files be same in A node and B node?


I found that the size of file 
/var/lib/glusterd/snaps/.nfs0000000001722f4000000002 is 0 bytes after A board
reboot.
So glusterd can't restore by this snap file on A node.
Is it right?








At 2016-02-25 18:25:50, "Atin Mukherjee" <amukherj at
redhat.com> wrote:>I believe you and Abhishek are from the same group and sharing the
>common set up. Could you check the content of /var/lib/glusterd/* in
>board B (post reboot and before starting glusterd) matches with
>/var/lib/glusterd/* from board A?
>
>~Atin
>
>On 02/25/2016 03:48 PM, songxin wrote:
>> Hi,
>> I have a problem as below when I start the gluster after reboot a
board.
>> 
>> precondition: 
>> I use two boards do this test.
>> The version of glusterfs is 3.7.6.
>> 
>> A board ip:128.224.162.255 
>> B board ip:128.224.95.140 
>> 
>> reproduce steps?
>> 
>> 1.systemctl start glusterd (A board) 
>> 2.systemctl start glusterd (B board) 
>> 3.gluster peer probe 128.224.95.140 (A board) 
>> 4.gluster volume create gv0 replica 2 128.224.95.140:/tmp/brick1/gv0
>> 128.224.162.255:/data/brick/gv0 force (local board)
>> 5.gluster volume start gv0 (A board) 
>> 6.press the reset button on the A board.It is a develop board so it has
>> a reset button that is similar to reset button on pc (A board) 
>> 7.run command "systemctl start glusterd" after A board
reboot. And
>> command failed because the file /var/lib/glusterd/snaps/.nfsxxxxxxxxx
>> (local board) .
>> Log is as below.
>> [2015-12-07 07:55:38.260084] E [MSGID: 101032]
>> [store.c:434:gf_store_handle_retrieve] 0-: Path corresponding to
>> /var/lib/glusterd/snaps/.nfs0000000001722f4000000002
>> [2015-12-07 07:55:38.260120] D [MSGID: 0]
>> [store.c:439:gf_store_handle_retrieve] 0-: Returning -1                
>>                                               
>> [2015-12-07 07:55:38.260152] E [MSGID: 106200]
>> [glusterd-store.c:3332:glusterd_store_update_snap] 0-management: snap
>> handle is NULL                                 
>> [2015-12-07 07:55:38.260180] E [MSGID: 106196]
>> [glusterd-store.c:3427:glusterd_store_retrieve_snap] 0-management:
>> Failed to update snapshot for .nfs0000000001722f40
>> [2015-12-07 07:55:38.260208] E [MSGID: 106043]
>> [glusterd-store.c:3589:glusterd_store_retrieve_snaps] 0-management:
>> Unable to restore snapshot: .nfs0000000001722f400
>> [2015-12-07 07:55:38.260241] D [MSGID: 0]
>> [glusterd-store.c:3607:glusterd_store_retrieve_snaps] 0-management:
>> Returning with -1                              
>> [2015-12-07 07:55:38.260268] D [MSGID: 0]
>> [glusterd-store.c:4339:glusterd_restore] 0-management: Returning -1    
>>                                                  
>> [2015-12-07 07:55:38.260325] E [MSGID: 101019]
>> [xlator.c:428:xlator_init] 0-management: Initialization of volume
>> 'management' failed, review your volfile again    
>> [2015-12-07 07:55:38.260355] E [graph.c:322:glusterfs_graph_init]
>> 0-management: initializing translator failed
>>                          
>> [2015-12-07 07:55:38.260374] E [graph.c:661:glusterfs_graph_activate]
>> 0-graph: init failed                                 
>> 
>> 8.rm /var/lib/glusterd/snaps/.nfsxxxxxxxxx (A board) 
>> 9..run command "systemctl start glusterd" and success. 
>> 10.at this point the peer status is Peer in Cluster (Connected) and all
>> process is online. 
>> 
>> If a node abnormal reset, must I remove
>> the  /var/lib/glusterd/snaps/.nfsxxxxxx before starting the glusterd?
>> 
>> I want to know if it is nomal.
>> 
>> Thanks,
>> Xin
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>>  
>> 
>> 
>> 
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> http://www.gluster.org/mailman/listinfo/gluster-users
>> -------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://www.gluster.org/pipermail/gluster-users/attachments/20160225/db8534d5/attachment.html>

Gluster users - Feb 2016 - faied start the glusterd after reboot

[Gluster-users] faied start the glusterd after reboot

[Gluster-users] faied start the glusterd after reboot