On Wed, Feb 11, 2015 at 10:27:50AM +0100, Nacho del Rey wrote:> Hi Volker and the list > > Yesterday the problem appeared again > > I could launch a strace command over a samba PID to see what was going on > and this is the result > > [PRO] [root at pf3il0024 ~]# strace -frp 57686 > Process 57686 attached - interrupt to quit > 0.000000 fcntl(14, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, > start=35824, len=1}fd 14, is that really a file on acfs? What does ls -l /proc/57686/fd say? Volker -- SerNet GmbH, Bahnhofsallee 1b, 37081 G?ttingen phone: +49-551-370000-0, fax: +49-551-370000-9 AG G?ttingen, HRB 2816, GF: Dr. Johannes Loxen http://www.sernet.de, mailto:kontakt at sernet.de
Hi Volker Unfortunately I didn't keep this information but fd=14 now (samba pids) points to [PRO] [root at pf3il0024 ~]# for i in `ps -ef | grep smbd| grep -v grep| awk '{print $2}'`; do ls -la /proc/$i/fd/14; done lrwx------ 1 root root 64 Feb 10 21:16 /proc/3238/fd/14 -> /var/lib/samba/locking.tdb lrwx------ 1 root root 64 Feb 1 01:33 /proc/18405/fd/14 -> /var/lib/samba/locking.tdb lrwx------ 1 root root 64 Feb 1 01:33 /proc/18408/fd/14 -> /var/lib/samba/locking.tdb lrwx------ 1 root root 64 Feb 1 01:35 /proc/21975/fd/14 -> /var/lib/samba/locking.tdb lrwx------ 1 root root 64 Feb 1 01:35 /proc/21976/fd/14 -> /var/lib/samba/locking.tdb lrwx------ 1 root root 64 Feb 11 10:42 /proc/38279/fd/14 -> /var/lib/samba/locking.tdb so I guess yesterday the file descriptor was pointing the same place I hope it helps Regards Nacho. 2015-02-11 11:09 GMT+01:00 Volker Lendecke <Volker.Lendecke at sernet.de>:> On Wed, Feb 11, 2015 at 10:27:50AM +0100, Nacho del Rey wrote: > > Hi Volker and the list > > > > Yesterday the problem appeared again > > > > I could launch a strace command over a samba PID to see what was going on > > and this is the result > > > > [PRO] [root at pf3il0024 ~]# strace -frp 57686 > > Process 57686 attached - interrupt to quit > > 0.000000 fcntl(14, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET, > > start=35824, len=1} > > fd 14, is that really a file on acfs? What does > > ls -l /proc/57686/fd > > say? > > Volker > > -- > SerNet GmbH, Bahnhofsallee 1b, 37081 G?ttingen > phone: +49-551-370000-0, fax: +49-551-370000-9 > AG G?ttingen, HRB 2816, GF: Dr. Johannes Loxen > http://www.sernet.de, mailto:kontakt at sernet.de >
On Wed, Feb 11, 2015 at 11:16:06AM +0100, Nacho del Rey wrote:> Hi Volker > > Unfortunately I didn't keep this information but fd=14 now (samba pids) > points to > > [PRO] [root at pf3il0024 ~]# for i in `ps -ef | grep smbd| grep -v grep| awk > '{print $2}'`; do ls -la /proc/$i/fd/14; done > lrwx------ 1 root root 64 Feb 10 21:16 /proc/3238/fd/14 -> > /var/lib/samba/locking.tdb > lrwx------ 1 root root 64 Feb 1 01:33 /proc/18405/fd/14 -> > /var/lib/samba/locking.tdb > lrwx------ 1 root root 64 Feb 1 01:33 /proc/18408/fd/14 -> > /var/lib/samba/locking.tdb > lrwx------ 1 root root 64 Feb 1 01:35 /proc/21975/fd/14 -> > /var/lib/samba/locking.tdb > lrwx------ 1 root root 64 Feb 1 01:35 /proc/21976/fd/14 -> > /var/lib/samba/locking.tdb > lrwx------ 1 root root 64 Feb 11 10:42 /proc/38279/fd/14 -> > /var/lib/samba/locking.tdb > > so I guess yesterday the file descriptor was pointing the same placeYep, that's what I thought. /var/lib/samba should not be on ACFS, it should be a local and locking-reliable file system. Is that the case? Volker -- SerNet GmbH, Bahnhofsallee 1b, 37081 G?ttingen phone: +49-551-370000-0, fax: +49-551-370000-9 AG G?ttingen, HRB 2816, GF: Dr. Johannes Loxen http://www.sernet.de, mailto:kontakt at sernet.de
Ok, it is becoming a nightmare ;-) Last question Volker Is it possible the problem comes from a network device? At job, we have been suffering problems with a F5 Load Balancer. It seems that, randomly, it sets the transmission frame to 0 bytes during a while and routing services don't work very well It is planned that this F5 will be upgraded next week to solve these problems Do you think this behaviour makes a collateral effect on samba connections? Many thanks for your help Nacho 2015-02-11 11:32 GMT+01:00 Volker Lendecke <Volker.Lendecke at sernet.de>:> On Wed, Feb 11, 2015 at 11:30:11AM +0100, Nacho del Rey wrote: > > Yes, it is in ext4 fs > > So the syscall you list should not be the problem. The > problem is somehwere else. > > Volker >
On Wed, Feb 11, 2015 at 11:44:24AM +0100, Nacho del Rey wrote:> Ok, it is becoming a nightmare ;-) > Last question Volker > > Is it possible the problem comes from a network device? > At job, we have been suffering problems with a F5 Load Balancer. It seems > that, randomly, it sets the transmission frame to 0 bytes during a while > and routing services don't work very well > > It is planned that this F5 will be upgraded next week to solve these > problems > > Do you think this behaviour makes a collateral effect on samba connections?Windows clients heavily depend on a reliable network connection towards SMB servers they connect to. So flaky network gear between those two can of course negatively impact drive mappings. But without further analysis I can't really say what the problem is here, I don't even exactly know what your users are experiencing :-) Volker -- SerNet GmbH, Bahnhofsallee 1b, 37081 G?ttingen phone: +49-551-370000-0, fax: +49-551-370000-9 AG G?ttingen, HRB 2816, GF: Dr. Johannes Loxen http://www.sernet.de, mailto:kontakt at sernet.de