similar to: Hosted VM Pause when one node of gluster goes down

Displaying 20 results from an estimated 1000 matches similar to: "Hosted VM Pause when one node of gluster goes down"

2018 Apr 17
0
Getting glusterfs to expand volume size to brick size
To clarify, I was on 3.13.2 previously, recently updated to 4.0.1, and the bug seems to persist in 4.0.1. Sincerely, Artem
2018 Apr 18
0
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Btw, I've now noticed at least 5 variations in toggling binary option values: yes/no, true/false, True/False, on/off, enable/disable. Are they all interchangeable, or will using the wrong value not work in some cases? It's quite a confusing, inconsistent practice, especially given that many options will accept any value without erroring out or validating it. Sincerely, Artem
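(For reference, not from the thread: a quick way to see how Gluster stores a boolean value is to set it and read it back; the volume name and option below are placeholders.)
    gluster volume set myvol performance.readdir-ahead on    # any of the boolean spellings can be tried here
    gluster volume get myvol performance.readdir-ahead       # shows the value glusterd actually stored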
2018 Apr 18
1
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
On 04/18/2018 11:59 AM, Artem Russakovskii wrote: > Btw, I've now noticed at least 5 variations in toggling binary option > values. Are they all interchangeable, or will using the wrong value > not work in some cases? > > yes/no > true/false > True/False > on/off > enable/disable > > It's quite a confusing/inconsistent practice, especially given that
2018 Apr 17
1
Getting glusterfs to expand volume size to brick size
That might be the reason. Perhaps the volfiles were not regenerated after upgrading to the version with the fix. There is a workaround detailed in [2] for the time being (you will need to copy the shell script into the correct directory for your Gluster release). [2] https://bugzilla.redhat.com/show_bug.cgi?id=1517260#c19 On 17 April 2018 at 09:58, Artem Russakovskii <archon810 at
2018 Apr 17
1
Getting glusterfs to expand volume size to brick size
I just remembered that I didn't run https://docs.gluster.org/en/v3/Upgrade-Guide/op_version/ for this test volume/box like I did for the main production gluster, and one of these ops - either the heal or the op-version bump - resolved the issue. I'm now seeing: pylon:/var/lib/glusterd/vols/dev_apkmirror_data # ack shared-brick-count dev_apkmirror_data.pylon.mnt-pylon_block3-dev_apkmirror_data.vol
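(A rough sketch of the op-version step referenced above; the target number 40100 is an assumption for 4.0.1, so check the release notes for your build.)
    gluster volume get all cluster.op-version          # current cluster op-version
    gluster volume set all cluster.op-version 40100    # bump only after every node is upgraded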
2018 Apr 24
0
Hosted VM Pause when one node of gluster goes down
Hi, I have a 3-node hyper-converged cluster running glusterfs with replica 3 arbiter 1 volumes. When I shut down one node, I have problems with high-load VMs pausing due to a storage error. What areas should I look into to get this to work? Russell Wecker
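(Not part of the original mail: one place to start when debugging such pauses is the quorum and ping-timeout settings on the affected volume; `vmstore` is a placeholder volume name.)
    gluster volume get vmstore cluster.quorum-type
    gluster volume get vmstore cluster.server-quorum-type
    gluster volume get vmstore network.ping-timeout    # default is 42 seconds; clients block while a down node times out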
2018 Apr 18
0
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
On 04/18/2018 10:35 AM, Artem Russakovskii wrote: > Hi Ravi, > > Could you please expand on how these would help? > > By forcing full here, we move the logic from the CPU to network, thus > decreasing CPU utilization, is that right? Yes, 'diff' employs the rchecksum FOP, which does a (sha256?) checksum that can consume CPU. So yes, it is sort of shifting the load from CPU
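(For context, switching the heal algorithm being discussed is a single volume option; `myvol` is a placeholder.)
    gluster volume set myvol cluster.data-self-heal-algorithm full   # copy whole files instead of checksumming blocks
    gluster volume reset myvol cluster.data-self-heal-algorithm      # go back to the default (null) behaviour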
2018 Apr 10
0
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Hi Vlad, I actually saw that post already and even asked a question 4 days ago ( https://serverfault.com/questions/517775/glusterfs-direct-i-o-mode#comment1172497_540917). The accepted answer also seems to go against your suggestion to enable direct-io-mode as it says it should be disabled for better performance when used just for file accesses. It'd be great if someone from the Gluster team
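(To test both sides of that serverfault answer, direct-io-mode is a FUSE mount option; server, volume and mount point below are placeholders.)
    mount -t glusterfs -o direct-io-mode=disable server1:/myvol /mnt/myvol   # the accepted answer's suggestion
    mount -t glusterfs -o direct-io-mode=enable  server1:/myvol /mnt/myvol   # the suggestion earlier in this thread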
2018 Apr 18
0
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
On 04/18/2018 10:14 AM, Artem Russakovskii wrote: > Following up here on a related and very serious for us issue. > > I took down one of the 4 replicate gluster servers for maintenance > today. There are 2 gluster volumes totaling about 600GB. Not that much > data. After the server comes back online, it starts auto healing and > pretty much all operations on gluster freeze for
2018 Apr 18
2
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Thanks for the link. Looking at the status of that doc, it isn't quite ready yet, and there's no mention of the option. Does it mean that whatever is ready now in 4.0.1 is incomplete but can be enabled via granular-entry-heal=on, and when it is complete, it'll become the default and the flag will simply go away? Is there any risk in enabling the option now in 4.0.1? Sincerely, Artem
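(For reference, the option can be toggled like any other volume option, and some releases also expose a dedicated heal subcommand; `myvol` is a placeholder.)
    gluster volume set myvol cluster.granular-entry-heal on
    gluster volume heal myvol granular-entry-heal enable    # equivalent subcommand on releases that ship it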
2018 Apr 10
0
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Hi Vlad, I'm using only localhost: mounts. Can you please explain what effect each option has on the performance issues shown in my posts? "negative-timeout=10,attribute-timeout=30,fopen-keep-cache,direct-io-mode=enable,fetch-attempts=5" From what I remember, direct-io-mode=enable didn't make a difference in my tests, but I suppose I can try again. The explanations about
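(For readers who want to try the same set of FUSE mount options, they can go into /etc/fstab as quoted above; hostname, volume and mount point are placeholders.)
    # single /etc/fstab line
    localhost:/myvol  /mnt/myvol  glusterfs  negative-timeout=10,attribute-timeout=30,fopen-keep-cache,direct-io-mode=enable,fetch-attempts=5  0 0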
2018 Apr 18
2
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Hi Ravi, Could you please expand on how these would help? By forcing full here, we move the logic from the CPU to network, thus decreasing CPU utilization, is that right? This is assuming the CPU and disk utilization are caused by the differ and not by lstat and other calls or something. > Option: cluster.data-self-heal-algorithm > Default Value: (null) > Description: Select between
2018 Apr 17
0
Getting glusterfs to expand volume size to brick size
Ok, it looks like the same problem. @Amar, this fix is supposed to be in 4.0.1. Is it possible to regenerate the volfiles to fix this? Regards, Nithya On 17 April 2018 at 09:57, Artem Russakovskii <archon810 at gmail.com> wrote: > pylon:/var/lib/glusterd/vols/dev_apkmirror_data # ack shared-brick-count > dev_apkmirror_data.pylon.mnt-pylon_block3-dev_apkmirror_data.vol > 3:
2018 Apr 05
2
[dht-selfheal.c:2328:dht_selfheal_directory] 0-data-dht: Directory selfheal failed: Unable to form layout for directory /
Hi, I noticed that when I run gluster volume heal data info, the following message shows up in the log, along with other stuff: [dht-selfheal.c:2328:dht_selfheal_directory] 0-data-dht: Directory selfheal failed: Unable to form layout for directory / I'm seeing it on Gluster 4.0.1 and 3.13.2. Here's the full log after running heal info:
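(A minimal way to reproduce and collect that message, assuming the volume is named `data` and logs are in the default location.)
    gluster volume heal data info
    grep dht_selfheal_directory /var/log/glusterfs/*.log    # mount and self-heal daemon logs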
2018 Apr 17
0
Getting glusterfs to expand volume size to brick size
Hi Artem, Was the volume size correct before the bricks were expanded? This sounds like [1] but that should have been fixed in 4.0.0. Can you let us know the values of shared-brick-count in the files in /var/lib/glusterd/vols/dev_apkmirror_data/ ? [1] https://bugzilla.redhat.com/show_bug.cgi?id=1541880 On 17 April 2018 at 05:17, Artem Russakovskii <archon810 at gmail.com> wrote: > Hi
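(A sketch of that check with plain grep instead of ack, using the path from the thread.)
    grep -n shared-brick-count /var/lib/glusterd/vols/dev_apkmirror_data/*.vol
    # values above 1 for bricks on separate filesystems point at the bug in [1]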
2018 Apr 18
1
Very slow rsync to gluster volume UNLESS `ls` or `find` scan dir on gluster volume first
Nithya, Amar, Any movement here? There could be a significant performance gain here that may also address other bottlenecks I'm experiencing, which make gluster close to unusable at times. Sincerely, Artem
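(Based on the subject line, the workaround being tested amounts to priming the directory metadata before rsync runs; paths are placeholders.)
    find /mnt/myvol/target/dir > /dev/null    # warm up the gluster metadata cache first
    rsync -av /source/dir/ /mnt/myvol/target/dir/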
2018 Apr 05
0
[dht-selfheal.c:2328:dht_selfheal_directory] 0-data-dht: Directory selfheal failed: Unable to form layout for directory /
On Thu, Apr 5, 2018 at 10:48 AM, Artem Russakovskii <archon810 at gmail.com> wrote: > Hi, > > I noticed when I run gluster volume heal data info, the follow message > shows up in the log, along with other stuff: > > [dht-selfheal.c:2328:dht_selfheal_directory] 0-data-dht: Directory >> selfheal failed: Unable to form layout for directory / > > > I'm
2018 Apr 06
1
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
I restarted rsync, and this has been sitting there for almost a minute, barely moved several bytes in that time: 2014/11/545b06baa3d98/com.google.android.apps.inputmethod.zhuyin-2.1.0.79226761-armeabi-v7a-175-minAPI14.apk 6,389,760 45% 18.76kB/s 0:06:50 I straced each of the 3 processes rsync created and saw this (note: every time there were several seconds of no output, I
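(For anyone reproducing the trace, attaching strace to the running rsync processes looks roughly like this; the PID is a placeholder.)
    strace -f -tt -T -p <rsync_pid> -o /tmp/rsync.strace    # -f follows forks, -tt timestamps, -T time per syscall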
2018 Apr 18
3
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
Following up here on a related issue that is very serious for us. I took down one of the 4 replicate gluster servers for maintenance today. There are 2 gluster volumes totaling about 600GB. Not that much data. After the server comes back online, it starts auto healing, and pretty much all operations on gluster freeze for many minutes. For example, I was trying to run an ls -alrt in a folder with 7300
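(One way to watch how far the post-maintenance heal has progressed, assuming a volume named `myvol`; not from the original mail.)
    gluster volume heal myvol statistics heal-count    # pending entries per brick
    gluster volume heal myvol info summary             # available on newer releases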
2018 Apr 10
2
performance.cache-size for high-RAM clients/servers, other tweaks for performance, and improvements to Gluster docs
I wish I knew, or was able to get, a detailed description of those options myself. Here is direct-io-mode: https://serverfault.com/questions/517775/glusterfs-direct-i-o-mode Same as you, I ran tests on a large volume of files and found that the main delays are in attribute calls, ending up with those mount options to improve performance. I discovered those options basically by googling this user list with
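(If the bottleneck really is attribute calls, gluster's built-in profiler can confirm which FOPs dominate; `myvol` is a placeholder.)
    gluster volume profile myvol start
    gluster volume profile myvol info     # per-FOP latency and call counts after running the workload
    gluster volume profile myvol stop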