thr3ads.net - Gluster users - [Gluster-users] gluster peer disconnected [Jul 2020]

Pat Haley

2020-Jul-30 17:22 UTC

[Gluster-users] gluster peer disconnected

Hi,

We have a cluster whose common storage is a gluster volume consisting of
4 bricks residing on 2 servers (more details at bottom). Yesterday we
experienced a power outage. To start the gluster volume after the power
came back I had to

  * manually start a gluster daemon on one of the servers (mseas-data3)
  * start the gluster volume on the other server (mseas-data2)
      o I had first tried starting the gluster volume without manually
        starting the other daemon, but that was unsuccessful.
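
For readers following along, the two manual steps above might look roughly like the sketch below. The post does not show the exact commands that were run, so these are assumptions based on the standard `service` and `gluster` command-line tools. The `run` helper and the `DRY_RUN` variable are hypothetical; by default the script only prints what it would do.

```shell
#!/bin/sh
# Sketch only: the post does not show the exact commands that were run.
# DRY_RUN and run() are hypothetical helpers. With DRY_RUN=1 (the default)
# commands are printed instead of executed.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Step 1, on mseas-data3: start the management daemon by hand.
start_daemon() {
    run service glusterd start
}

# Step 2, on mseas-data2: start the named volume once both daemons are up.
start_volume() {
    run gluster volume start "$1"
}
```

Calling `start_daemon` on one server and `start_volume data-volume` on the other mirrors the order described in the list. Set `DRY_RUN=0` only after checking the printed commands.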

As I recall, the peers were talking to each other after this.

Today I was looking around and noticed that the mseas-data3 server is in 
a disconnected state (even though the compute nodes of our cluster are 
seeing the full gluster volume)

-----------------------

[root@mseas-data2 ~]# gluster peer status
Number of Peers: 1

Hostname: mseas-data3
Uuid: b39d4deb-c291-437e-8013-09050c1fa9e3
State: Peer in Cluster (Disconnected)

-----------------------
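
On a pool with more peers, scanning this output by eye gets tedious. A minimal sketch of a filter that prints only the disconnected peers, assuming the output keeps the `Hostname:` / `State:` layout shown above (the function name is made up for illustration):

```shell
#!/bin/sh
# list_disconnected_peers: read `gluster peer status` output on stdin and
# print the hostname of every peer whose State line contains "(Disconnected)".
list_disconnected_peers() {
    awk '
        /^Hostname:/ { host = $2 }
        /^State:/ && /\(Disconnected\)/ { print host }
    '
}

# Usage on a live node:
#   gluster peer status | list_disconnected_peers
```

Fed the output above, this would print `mseas-data3`.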

Following the advice on
https://lists.gluster.org/pipermail/gluster-users/2015-April/021597.html,
I confirmed that the 2 servers can ping each other. The gluster
daemon on mseas-data2 is active but the daemon on mseas-data3 shows

--------------------------------

[root@mseas-data3 ~]# service glusterd status
glusterd dead but pid file exists

--------------------------------
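
That status message usually means the PID file was left behind while the process it names is gone. A small sketch for confirming this by hand follows. The function name is hypothetical, and the PID file path in the usage comment is an assumed default that should be checked against the init script on the server.

```shell
#!/bin/sh
# pid_file_state FILE: report whether the PID stored in FILE is alive.
# Prints one of: "missing", "stale", "running".
# Run as root: kill -0 also fails for a live process we may not signal,
# which this sketch would then report as "stale".
pid_file_state() {
    pidfile="$1"
    if [ ! -f "$pidfile" ]; then
        echo "missing"
        return
    fi
    pid=$(cat "$pidfile")
    # kill -0 sends no signal; it only tests whether the process can be signalled.
    if [ -n "$pid" ] && kill -0 "$pid" 2>/dev/null; then
        echo "running"
    else
        echo "stale"
    fi
}

# Assumed default location; verify it on your system before relying on it:
#   pid_file_state /var/run/glusterd.pid
```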

Is it safe to just restart that daemon on mseas-data3? Is there some
other procedure I should follow? I ask because we have a number of jobs
running that appear to be successfully writing to the gluster volume and
I'd prefer that they continue if possible.

Any advice would be appreciated. Thanks

---------------------------------------------------

[root@mseas-data2 ~]# gluster volume info

Volume Name: data-volume
Type: Distribute
Volume ID: c162161e-2a2d-4dac-b015-f31fd89ceb18
Status: Started
Number of Bricks: 4
Transport-type: tcp
Bricks:
Brick1: mseas-data2:/mnt/brick1
Brick2: mseas-data2:/mnt/brick2
Brick3: mseas-data3:/export/sda/brick3
Brick4: mseas-data3:/export/sdc/brick4
Options Reconfigured:
diagnostics.client-log-level: ERROR
network.inode-lru-limit: 50000
performance.md-cache-timeout: 60
performance.open-behind: off
disperse.eager-lock: off
auth.allow: *
server.allow-insecure: on
nfs.exports-auth-enable: on
diagnostics.brick-sys-log-level: WARNING
performance.readdir-ahead: on
nfs.disable: on
nfs.export-volumes: off
cluster.min-free-disk: 1%
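
To see which bricks sit on a given server, the `Brick<N>:` lines above can be filtered by host. A minimal sketch, assuming the `host:/path` layout shown in this output (the function name is made up for illustration):

```shell
#!/bin/sh
# bricks_on_host HOST: read `gluster volume info` output on stdin and print
# the brick paths that live on HOST, one per line.
bricks_on_host() {
    awk -v host="$1" '
        /^Brick[0-9]+:/ {
            split($2, parts, ":")   # "host:/path" -> parts[1], parts[2]
            if (parts[1] == host) print parts[2]
        }
    '
}

# Usage on a live node:
#   gluster volume info data-volume | bricks_on_host mseas-data3
```

For the volume above this would list `/export/sda/brick3` and `/export/sdc/brick4`. The volume type is Distribute, so files placed on those two bricks are not also stored on mseas-data2.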

-- 

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Pat Haley                          Email:  phaley at mit.edu
Center for Ocean Engineering       Phone:  (617) 253-6824
Dept. of Mechanical Engineering    Fax:    (617) 253-8125
MIT, Room 5-213                    http://web.mit.edu/phaley/www/
77 Massachusetts Avenue
Cambridge, MA  02139-4301


Strahil Nikolov

2020-Jul-30 20:14 UTC

Is 'gluster pool list' consistent on all nodes?
Do you have all your bricks properly mounted on the affected node?
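
One way to run both checks is sketched below. It assumes `gluster pool list` prints a header row followed by UUID, Hostname and State columns, and that each brick sits on its own mounted filesystem. Both function names are hypothetical.

```shell
#!/bin/sh
# pool_uuids: read `gluster pool list` output on stdin and print the sorted
# peer UUIDs. Hostnames are ignored because each node lists itself as
# "localhost", so only the UUID column is comparable between nodes.
pool_uuids() {
    awk 'NR > 1 && NF >= 3 { print $1 }' | sort
}

# mount_point_of PATH: print the mount point of the filesystem holding PATH.
# Assumes the device name and mount point contain no spaces.
mount_point_of() {
    df -P "$1" | awk 'NR == 2 { print $6 }'
}

# Usage (run the first command on every node and compare the results):
#   gluster pool list | pool_uuids
#   mount_point_of /export/sda/brick3
```

If `mount_point_of` reports `/` for a brick path that is supposed to be on a dedicated disk, that brick's filesystem is probably not mounted.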

Best Regards,
Strahil Nikolov

