Pablo J Rebollo Sosa
2018-Jul-23 20:17 UTC
[Gluster-users] Gluster 3.12.11 geo-replication connection to peer is broken
Hi, I?m having problem with Gluster 3.12.11 geo-replication in CentOS 7.5. The process starts the geo-replication but after few minutes the log shows ?connection to peer is broken?. The ?status detail? looks ok but no files are replicated. [root at gluster1 vol_replicated]# gluster volume geo-replication vol_replicated geoaccount1 at 10.20.220.12::georep_1 status detail | sort ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MASTER NODE MASTER VOL MASTER BRICK SLAVE USER SLAVE SLAVE NODE STATUS CRAWL STATUS LAST_SYNCED ENTRY DATA META FAILURES CHECKPOINT TIME CHECKPOINT COMPLETED CHECKPOINT COMPLETION TIME gluster1 vol_replicated /export/brick1/vol_replicated geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.12 Active Hybrid Crawl N/A 8191 6550 0 0 N/A N/A N/A gluster2 vol_replicated /export/brick1/vol_replicated geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.13 Passive N/A N/A N/A N/A N/A N/A N/A N/A N/A gluster3 vol_replicated /export/brick1/vol_replicated geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.12 Passive N/A N/A N/A N/A N/A N/A N/A N/A N/A gluster4 vol_replicated /export/brick1/vol_replicated geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.13 Active Hybrid Crawl N/A 8191 6532 0 0 N/A N/A N/A These are the messages on the log file. [2018-07-23 19:35:50.18026] I [gsyncdstatus(/export/brick1/vol_replicated):276:set_active] GeorepStatus: Worker Status Change status=Active [2018-07-23 19:35:50.19126] I [gsyncdstatus(/export/brick1/vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status Change status=History Crawl [2018-07-23 19:35:50.19480] I [master(/export/brick1/vol_replicated):1432:crawl] _GMaster: starting history crawl turns=1 stime=(0, 0) entry_stime=None etime=1532374550 [2018-07-23 19:35:50.20056] E [repce(/export/brick1/vol_replicated):117:worker] <top>: call failed: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker res = getattr(self.obj, rmeth)(*in_data[2:]) File "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", line 54, in history num_parallel) File "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line 103, in cl_history_changelog raise ChangelogHistoryNotAvailable() ChangelogHistoryNotAvailable [2018-07-23 19:35:50.20999] E [repce(/export/brick1/vol_replicated):209:__call__] RepceClient: call failed on peer call=39755:140602890745664:1532374550.02 method=history error=ChangelogHistoryNotAvailable [2018-07-23 19:35:50.21156] I [resource(/export/brick1/vol_replicated):1675:service_loop] GLUSTER: Changelog history not available, using xsync [2018-07-23 19:35:50.28688] I [master(/export/brick1/vol_replicated):1543:crawl] _GMaster: starting hybrid crawl stime=(0, 0) [2018-07-23 19:35:50.30505] I [gsyncdstatus(/export/brick1/vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status Change status=Hybrid Crawl [2018-07-23 19:35:54.35396] I [master(/export/brick1/vol_replicated):1554:crawl] _GMaster: processing xsync changelog path=/var/lib/misc/glusterfsd/vol_replicated/ssh%3A%2F%2Fgeoaccount1%4010.20.220.12%3Agluster%3A%2F%2F127.0.0.1%3Ageorep_1/a68ebfef8cdf86c3c6e9a0d85969cd3f/xsync/XSYNC-CHANGELOG.1532374550 [2018-07-23 19:36:11.590595] E [syncdutils(/export/brick1/vol_replicated):304:log_raise_exception] <top>: connection to peer is broken Anyone have some clues to what might be wrong? Best regards, Pablo J. Rebollo-Sosa -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.gluster.org/pipermail/gluster-users/attachments/20180723/9478a9ea/attachment.html> -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: Message signed with OpenPGP URL: <http://lists.gluster.org/pipermail/gluster-users/attachments/20180723/9478a9ea/attachment.sig>
Kotresh Hiremath Ravishankar
2018-Jul-24 04:44 UTC
[Gluster-users] Gluster 3.12.11 geo-replication connection to peer is broken
Hi Pablo, The geo-rep status should go to Faulty if he connection to peer is broken. Does node log files failing with same error? Are these logs repeating? Does stop and start geo-rep giving the same error? Thanks, Kotresh HR On Tue, Jul 24, 2018 at 1:47 AM, Pablo J Rebollo Sosa <pablo.rebollo at upr.edu> wrote:> Hi, > > I?m having problem with Gluster 3.12.11 geo-replication in CentOS 7.5. > The process starts the geo-replication but after few minutes the log shows > ?connection to peer is broken?. > > The ?status detail? looks ok but no files are replicated. > > [root at gluster1 vol_replicated]# gluster volume geo-replication > vol_replicated geoaccount1 at 10.20.220.12::georep_1 status detail | sort > > ------------------------------------------------------------ > ------------------------------------------------------------ > ------------------------------------------------------------ > ------------------------------------------------------------ > ------------------------------------- > MASTER NODE MASTER VOL MASTER BRICK SLAVE > USER SLAVE SLAVE NODE STATUS > CRAWL STATUS LAST_SYNCED ENTRY DATA META FAILURES > CHECKPOINT TIME CHECKPOINT COMPLETED CHECKPOINT COMPLETION TIME > gluster1 vol_replicated /export/brick1/vol_replicated > geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.12 > Active Hybrid Crawl N/A 8191 6550 0 0 > N/A N/A N/A > gluster2 vol_replicated /export/brick1/vol_replicated > geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.13 > Passive N/A N/A N/A N/A N/A N/A > N/A N/A N/A > gluster3 vol_replicated /export/brick1/vol_replicated > geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.12 > Passive N/A N/A N/A N/A N/A N/A > N/A N/A N/A > gluster4 vol_replicated /export/brick1/vol_replicated > geoaccount1 geoaccount1 at 10.20.220.12::georep_1 10.20.220.13 > Active Hybrid Crawl N/A 8191 6532 0 0 > N/A N/A N/A > > These are the messages on the log file. > > [2018-07-23 19:35:50.18026] I [gsyncdstatus(/export/brick1/ > vol_replicated):276:set_active] GeorepStatus: Worker Status Change > status=Active > [2018-07-23 19:35:50.19126] I [gsyncdstatus(/export/brick1/ > vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status > Change status=History Crawl > [2018-07-23 19:35:50.19480] I [master(/export/brick1/vol_replicated):1432:crawl] > _GMaster: starting history crawl turns=1 stime=(0, 0) > entry_stime=None etime=1532374550 > [2018-07-23 19:35:50.20056] E [repce(/export/brick1/vol_replicated):117:worker] > <top>: call failed: > Traceback (most recent call last): > File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in > worker > res = getattr(self.obj, rmeth)(*in_data[2:]) > File "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", line > 54, in history > num_parallel) > File "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line > 103, in cl_history_changelog > raise ChangelogHistoryNotAvailable() > ChangelogHistoryNotAvailable > [2018-07-23 19:35:50.20999] E [repce(/export/brick1/vol_replicated):209:__call__] > RepceClient: call failed on peer call=39755:140602890745664:1532374550.02 > method=history error=ChangelogHistoryNotAvailable > [2018-07-23 19:35:50.21156] I [resource(/export/brick1/vol_replicated):1675:service_loop] > GLUSTER: Changelog history not available, using xsync > [2018-07-23 19:35:50.28688] I [master(/export/brick1/vol_replicated):1543:crawl] > _GMaster: starting hybrid crawl stime=(0, 0) > [2018-07-23 19:35:50.30505] I [gsyncdstatus(/export/brick1/ > vol_replicated):248:set_worker_crawl_status] GeorepStatus: Crawl Status > Change status=Hybrid Crawl > [2018-07-23 19:35:54.35396] I [master(/export/brick1/vol_replicated):1554:crawl] > _GMaster: processing xsync changelog path=/var/lib/misc/glusterfsd/ > vol_replicated/ssh%3A%2F%2Fgeoaccount1%4010.20.220.12% > 3Agluster%3A%2F%2F127.0.0.1%3Ageorep_1/a68ebfef8cdf86c3c6e9a0d85969cd > 3f/xsync/XSYNC-CHANGELOG.1532374550 > [2018-07-23 19:36:11.590595] E [syncdutils(/export/brick1/ > vol_replicated):304:log_raise_exception] <top>: connection to peer is > broken > > Anyone have some clues to what might be wrong? > > Best regards, > > Pablo J. Rebollo-Sosa > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users >-- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.gluster.org/pipermail/gluster-users/attachments/20180724/15546c7d/attachment-0001.html>