Displaying 20 results from an estimated 1000 matches similar to: "Glusterd proccess hangs on reboot"
2017 Aug 23
0
Glusterd proccess hangs on reboot
Hi Atin,
Do you have time to check the logs?
On Wed, Aug 23, 2017 at 10:02 AM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Same thing happens with 3.12.rc0. This time perf top shows hanging in
> libglusterfs.so and below is the glusterd logs, which are different
> from 3.10.
> With 3.10.5, after 60-70 minutes CPU usage becomes normal and we see
> brick processes come
2017 Aug 23
0
Glusterd proccess hangs on reboot
Could you be able to provide the pstack dump of the glusterd process?
On Wed, 23 Aug 2017 at 20:22, Atin Mukherjee <amukherj at redhat.com> wrote:
> Not yet. Gaurav will be taking a look at it tomorrow.
>
> On Wed, 23 Aug 2017 at 20:14, Serkan ?oban <cobanserkan at gmail.com> wrote:
>
>> Hi Atin,
>>
>> Do you have time to check the logs?
>>
2017 Aug 23
2
Glusterd proccess hangs on reboot
Same thing happens with 3.12.rc0. This time perf top shows hanging in
libglusterfs.so and below is the glusterd logs, which are different
from 3.10.
With 3.10.5, after 60-70 minutes CPU usage becomes normal and we see
brick processes come online and system starts to answer commands like
"gluster peer status"..
[2017-08-23 06:46:02.150472] E [client_t.c:324:gf_client_ref]
2017 Aug 23
2
Glusterd proccess hangs on reboot
Not yet. Gaurav will be taking a look at it tomorrow.
On Wed, 23 Aug 2017 at 20:14, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Hi Atin,
>
> Do you have time to check the logs?
>
> On Wed, Aug 23, 2017 at 10:02 AM, Serkan ?oban <cobanserkan at gmail.com>
> wrote:
> > Same thing happens with 3.12.rc0. This time perf top shows hanging in
> >
2017 Aug 24
0
Glusterd proccess hangs on reboot
Restarting glusterd causes the same thing. I tried with 3.12.rc0,
3.10.5. 3.8.15, 3.7.20 all same behavior.
My OS is centos 6.9, I tried with centos 6.8 problem remains...
Only way to a healthy state is destroy gluster config/rpms, reinstall
and recreate volumes.
On Thu, Aug 24, 2017 at 8:49 AM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Here you can find 10 stack trace samples
2017 Aug 24
0
Glusterd proccess hangs on reboot
Thank you Gaurav,
Here is more findings:
Problem does not happen using only 20 servers each has 68 bricks.
(peer probe only 20 servers)
If we use 40 servers with single volume, glusterd cpu %100 state
continues for 5 minutes and it goes to normal state.
with 80 servers we have no working state yet...
On Thu, Aug 24, 2017 at 1:33 PM, Gaurav Yadav <gyadav at redhat.com> wrote:
>
> I am
2017 Aug 29
0
Glusterd proccess hangs on reboot
Till now I haven't found anything significant.
Can you send me gluster logs along with command-history-logs for these
scenarios:
Scenario1 : 20 servers
Scenario2 : 40 servers
Scenario3: 80 Servers
Thanks
Gaurav
On Mon, Aug 28, 2017 at 11:22 AM, Serkan ?oban <cobanserkan at gmail.com>
wrote:
> Hi Gaurav,
> Any progress about the problem?
>
> On Thursday, August 24,
2017 Aug 29
0
Glusterd proccess hangs on reboot
I believe logs you have shared logs which consist of create volume followed
by starting the volume.
However, you have mentioned that when a node from 80 server cluster gets
rebooted, glusterd process hangs.
Could you please provide the logs which led glusterd to hang for all the
cases along with gusterd process utilization.
Thanks
Gaurav
On Tue, Aug 29, 2017 at 2:44 PM, Serkan ?oban
2017 Aug 29
0
Glusterd proccess hangs on reboot
glusterd returned to normal, here is the logs:
https://www.dropbox.com/s/41jx2zn3uizvr53/80servers_glusterd_normal_status.zip?dl=0
On Tue, Aug 29, 2017 at 1:47 PM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Here is the logs after stopping all three volumes and restarting
> glusterd in all nodes. I waited 70 minutes after glusterd restart but
> it is still consuming %100 CPU.
2017 Sep 04
0
Glusterd proccess hangs on reboot
>1. On 80 nodes cluster, did you reboot only one node or multiple ones?
Tried both, result is same, but the logs/stacks are from stopping and
starting glusterd only on one server while others are running.
>2. Are you sure that pstack output was always constantly pointing on strcmp being stuck?
It stays 70-80 minutes in %100 cpu consuming state, the stacks I send
is from first 5-10 minutes.
2017 Sep 04
0
Glusterd proccess hangs on reboot
I have been using a 60 server 1560 brick 3.7.11 cluster without
problems for 1 years. I did not see this problem with it.
Note that this problem does not happen when I install packages & start
glusterd & peer probe and create the volumes. But after glusterd
restart.
Also note that this still happens without any volumes. So it is not
related with brick count I think...
On Mon, Sep 4, 2017
2017 Aug 24
2
Glusterd proccess hangs on reboot
I am working on it and will share my findings as soon as possible.
Thanks
Gaurav
On Thu, Aug 24, 2017 at 3:58 PM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Restarting glusterd causes the same thing. I tried with 3.12.rc0,
> 3.10.5. 3.8.15, 3.7.20 all same behavior.
> My OS is centos 6.9, I tried with centos 6.8 problem remains...
> Only way to a healthy state is
2017 Sep 02
0
Glusterd proccess hangs on reboot
Hi Milind,
Anything new about the issue? Can you able to find the problem,
anything else you need?
I will continue with two clusters each 40 servers, so I will not be
able to provide any further info for 80 servers.
On Fri, Sep 1, 2017 at 10:30 AM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Hi,
> You can find pstack sampes here:
>
2017 Aug 28
2
Glusterd proccess hangs on reboot
Hi Gaurav,
Any progress about the problem?
On Thursday, August 24, 2017, Serkan ?oban <cobanserkan at gmail.com> wrote:
> Thank you Gaurav,
> Here is more findings:
> Problem does not happen using only 20 servers each has 68 bricks.
> (peer probe only 20 servers)
> If we use 40 servers with single volume, glusterd cpu %100 state
> continues for 5 minutes and it goes to
2017 Aug 29
2
Glusterd proccess hangs on reboot
Here is the requested logs:
https://www.dropbox.com/s/vt187h0gtu5doip/gluster_logs_20_40_80_servers.zip?dl=0
On Tue, Aug 29, 2017 at 7:48 AM, Gaurav Yadav <gyadav at redhat.com> wrote:
> Till now I haven't found anything significant.
>
> Can you send me gluster logs along with command-history-logs for these
> scenarios:
> Scenario1 : 20 servers
> Scenario2 : 40
2017 Aug 29
2
Glusterd proccess hangs on reboot
Here is the logs after stopping all three volumes and restarting
glusterd in all nodes. I waited 70 minutes after glusterd restart but
it is still consuming %100 CPU.
https://www.dropbox.com/s/pzl0f198v03twx3/80servers_after_glusterd_restart.zip?dl=0
On Tue, Aug 29, 2017 at 12:37 PM, Gaurav Yadav <gyadav at redhat.com> wrote:
>
> I believe logs you have shared logs which consist of
2017 Sep 01
0
Glusterd proccess hangs on reboot
Serkan,
I have gone through other mails in the mail thread as well but responding
to this one specifically.
Is this a source install or an RPM install ?
If this is an RPM install, could you please install the glusterfs-debuginfo
RPM and retry to capture the gdb backtrace.
If this is a source install, then you'll need to configure the build with
--enable-debug and reinstall and retry
2017 Sep 03
0
Glusterd proccess hangs on reboot
----- Original Message -----
> From: "Milind Changire" <mchangir at redhat.com>
> To: "Serkan ?oban" <cobanserkan at gmail.com>
> Cc: "Gluster Users" <gluster-users at gluster.org>
> Sent: Saturday, September 2, 2017 11:44:40 PM
> Subject: Re: [Gluster-users] Glusterd proccess hangs on reboot
>
> No worries Serkan,
> You can
2017 Sep 04
2
Glusterd proccess hangs on reboot
On Fri, Sep 1, 2017 at 8:47 AM, Milind Changire <mchangir at redhat.com> wrote:
> Serkan,
> I have gone through other mails in the mail thread as well but responding
> to this one specifically.
>
> Is this a source install or an RPM install ?
> If this is an RPM install, could you please install the
> glusterfs-debuginfo RPM and retry to capture the gdb backtrace.
>
2017 Sep 04
2
Glusterd proccess hangs on reboot
On Mon, Sep 4, 2017 at 5:28 PM, Serkan ?oban <cobanserkan at gmail.com> wrote:
> >1. On 80 nodes cluster, did you reboot only one node or multiple ones?
> Tried both, result is same, but the logs/stacks are from stopping and
> starting glusterd only on one server while others are running.
>
> >2. Are you sure that pstack output was always constantly pointing on
>