On Wed, Feb 11, 2015 at 10:27:50AM +0100, Nacho del Rey wrote:
> Hi Volker and the list
>
> Yesterday the problem appeared again
>
> I could launch a strace command over a samba PID to see what was going on
> and this is the result
>
> [PRO] [root at pf3il0024 ~]# strace -frp 57686
> Process 57686 attached - interrupt to quit
> 0.000000 fcntl(14, F_SETLKW, {type=F_RDLCK, whence=SEEK_SET,
> start=35824, len=1}

fd 14, is that really a file on acfs? What does

ls -l /proc/57686/fd

say?

Volker

--
SerNet GmbH, Bahnhofsallee 1b, 37081 Göttingen
phone: +49-551-370000-0, fax: +49-551-370000-9
AG Göttingen, HRB 2816, GF: Dr. Johannes Loxen
http://www.sernet.de, mailto:kontakt at sernet.de

Hi Volker
Unfortunately I didn't keep that information, but fd 14 of the smbd
processes running now points to:
[PRO] [root at pf3il0024 ~]# for i in `ps -ef | grep smbd| grep -v grep| awk '{print $2}'`; do ls -la /proc/$i/fd/14; done
lrwx------ 1 root root 64 Feb 10 21:16 /proc/3238/fd/14 -> /var/lib/samba/locking.tdb
lrwx------ 1 root root 64 Feb 1 01:33 /proc/18405/fd/14 -> /var/lib/samba/locking.tdb
lrwx------ 1 root root 64 Feb 1 01:33 /proc/18408/fd/14 -> /var/lib/samba/locking.tdb
lrwx------ 1 root root 64 Feb 1 01:35 /proc/21975/fd/14 -> /var/lib/samba/locking.tdb
lrwx------ 1 root root 64 Feb 1 01:35 /proc/21976/fd/14 -> /var/lib/samba/locking.tdb
lrwx------ 1 root root 64 Feb 11 10:42 /proc/38279/fd/14 -> /var/lib/samba/locking.tdb
So I guess the file descriptor was pointing to the same place yesterday.
I hope it helps
Regards
Nacho.
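
The ps/grep/awk loop in the message above can be tightened a little. What follows is only a sketch, assuming a Linux /proc file system and the procps `pgrep` tool; the helper name `fd_target` is made up for illustration and is not part of Samba.

```shell
#!/bin/sh
# Print the file that descriptor FD of process PID currently points to.
# Assumes Linux /proc; fd_target is a hypothetical helper name.
fd_target() {
    readlink "/proc/$1/fd/$2"
}

# Show where fd 14 points for every running smbd (prints nothing if none run).
for pid in $(pgrep -x smbd); do
    printf '%s -> %s\n' "$pid" "$(fd_target "$pid" 14)"
done
```

Reading another user's /proc/PID/fd entries normally requires root. Descriptor numbers are per process, so fd 14 is not guaranteed to be the same file in every smbd; `ls -l /proc/PID/fd`, as asked for earlier in the thread, lists all of them for one process.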
On Wed, Feb 11, 2015 at 11:16:06AM +0100, Nacho del Rey wrote:
> Hi Volker
>
> Unfortunately I didn't keep this information but fd=14 now (samba pids)
> points to
>
> [PRO] [root at pf3il0024 ~]# for i in `ps -ef | grep smbd| grep -v grep| awk
> '{print $2}'`; do ls -la /proc/$i/fd/14; done
> lrwx------ 1 root root 64 Feb 10 21:16 /proc/3238/fd/14 ->
> /var/lib/samba/locking.tdb
> lrwx------ 1 root root 64 Feb 1 01:33 /proc/18405/fd/14 ->
> /var/lib/samba/locking.tdb
> lrwx------ 1 root root 64 Feb 1 01:33 /proc/18408/fd/14 ->
> /var/lib/samba/locking.tdb
> lrwx------ 1 root root 64 Feb 1 01:35 /proc/21975/fd/14 ->
> /var/lib/samba/locking.tdb
> lrwx------ 1 root root 64 Feb 1 01:35 /proc/21976/fd/14 ->
> /var/lib/samba/locking.tdb
> lrwx------ 1 root root 64 Feb 11 10:42 /proc/38279/fd/14 ->
> /var/lib/samba/locking.tdb
>
> so I guess yesterday the file descriptor was pointing the same place

Yep, that's what I thought. /var/lib/samba should not be on
ACFS, it should be a local and locking-reliable file system.
Is that the case?

Volker

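
One way to answer that question is to ask the system which file system backs the directory. This is a minimal sketch, assuming a `df` that accepts `-P` and `-T` together (GNU coreutils and BusyBox do); the helper name `fs_type` is hypothetical.

```shell
#!/bin/sh
# Print the file system type backing PATH, as reported by df.
# fs_type is a hypothetical helper name. -P keeps each file system on a
# single output line and -T adds the type column (second field).
fs_type() {
    df -PT "$1" | awk 'NR == 2 { print $2 }'
}

# Check the directory that holds locking.tdb, if it exists on this host.
if [ -d /var/lib/samba ]; then
    fs_type /var/lib/samba
fi
```

The function prints the type of whichever mount contains the path, so it also works when /var/lib/samba is an ordinary directory on the root file system and not a mount point of its own.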
Ok, it is becoming a nightmare ;-)
Last question, Volker.

Is it possible that the problem comes from a network device?
At work, we have been having problems with an F5 load balancer. It seems
that, at random, it sets the transmission frame to 0 bytes for a while,
and routing services don't work very well.

The F5 is scheduled to be upgraded next week to solve these problems.

Do you think this behaviour could have a side effect on Samba connections?

Many thanks for your help
Nacho

2015-02-11 11:32 GMT+01:00 Volker Lendecke <Volker.Lendecke at sernet.de>:
> On Wed, Feb 11, 2015 at 11:30:11AM +0100, Nacho del Rey wrote:
> > Yes, it is in ext4 fs
>
> So the syscall you list should not be the problem. The
> problem is somewhere else.
>
> Volker
>

On Wed, Feb 11, 2015 at 11:44:24AM +0100, Nacho del Rey wrote:
> Ok, it is becoming a nightmare ;-)
> Last question Volker
>
> Is it possible the problem comes from a network device?
> At job, we have been suffering problems with a F5 Load Balancer. It seems
> that, randomly, it sets the transmission frame to 0 bytes during a while
> and routing services don't work very well
>
> It is planned that this F5 will be upgraded next week to solve these
> problems
>
> Do you think this behaviour makes a collateral effect on samba connections?

Windows clients heavily depend on a reliable network
connection towards SMB servers they connect to. So flaky
network gear between those two can of course negatively
impact drive mappings. But without further analysis I can't
really say what the problem is here, I don't even exactly
know what your users are experiencing :-)

Volker