thr3ads.net - freebsd stable - NFS server hang with backing store on ZFS and quota nearly exhausted [Dec 2016]

If this information is useful, please help other people find it:
Share via:

Garrett Wollman

2016-Dec-21 06:24 UTC

NFS server hang with backing store on ZFS and quota nearly exhausted

I've opened a bug about this before, which I can't cite by number
because bugzilla appears to be down at the moment.  But I had this
problem recur tonight under otherwise idle conditions, so I was able
to get a set of kernel stacks without any confounding RPC activity
going on.  This is on 10.2; we're not scheduled to take these servers
to 10.3 until next week.

Here's the "procstat -kk" output.

  PID    TID COMM             TDNAME           KSTACK                       
 1055 101965 nfsd             -                mi_switch+0xe1
sleepq_catch_signals+0xab sleepq_wait_sig+0xf _cv_wait_sig+0x16a seltdwait+0xae
kern_select+0x8fa sys_select+0x54 amd64_syscall+0x357 Xfast_syscall+0xfb
 1058 101012 nfsd             nfsd: service    mi_switch+0xe1
sleepq_catch_signals+0xab sleepq_wait_sig+0xf _cv_wait_sig+0x16a
svc_run_internal+0x8be svc_thread_start+0xb fork_exit+0x9a fork_trampoline+0xe

[Threads with the stack trace above are simply idle and waiting for
incoming requests, and I've deleted the other 5 of them.]

 1058 101688 nfsd             nfsd: service    mi_switch+0xe1
sleepq_catch_signals+0xab sleepq_timedwait_sig+0x10 _cv_timedwait_sig_sbt+0x18b
svc_run_internal+0x4bd svc_thread_start+0xb fork_exit+0x9a fork_trampoline+0xe

[Not sure what these threads are doing: obviously they are waiting for
a condvar, but at a different spot in svc_run_internal().  I've
deleted the other 7 of them.]

 1058 101720 nfsd             nfsd: service    mi_switch+0xe1 sleepq_wait+0x3a
_cv_wait+0x16d txg_wait_open+0x85 dmu_tx_wait+0x2ac dmu_tx_assign+0x48
zfs_freebsd_write+0x544 VOP_WRITE_APV+0x149 nfsvno_write+0x13e
nfsrvd_write+0x496 nfsrvd_dorpc+0x6f1 nfssvc_program+0x54e
svc_run_internal+0xd7b svc_thread_start+0xb fork_exit+0x9a fork_trampoline+0xe
 1058 102015 nfsd             nfsd: master     mi_switch+0xe1 sleepq_wait+0x3a
_cv_wait+0x16d txg_wait_open+0x85 dmu_tx_wait+0x2ac dmu_tx_assign+0x48
zfs_freebsd_write+0x544 VOP_WRITE_APV+0x149 nfsvno_write+0x13e
nfsrvd_write+0x496 nfsrvd_dorpc+0x6f1 nfssvc_program+0x54e
svc_run_internal+0xd7b svc_run+0x1de nfsrvd_nfsd+0x242 nfssvc_nfsd+0x107
sys_nfssvc+0x9c amd64_syscall+0x357

Then there are these two threads, both servicing WRITE RPCs, both
sleeping deep inside the ZFS code.  Note that one of them is the
"master" krpc thread in this service pool; I don't know if this
accounts for the fact that requests are not getting processed even
though plenty of idle threads exist.  (Note that zfs_write() does not
appear in the stack due to tail-call optimization.)

I don't know the ZFS code well enough to understand what running out
of quota has to do with this situation (you'd think it would just
return immediately with [EDQUOT]) but perhaps it matters that the
clients are not well-behaved and that the filesystem is often almost
at quota but not quite there yet.

-GAWollman

Ben RUBSON

2016-Dec-21 07:33 UTC

head link

NFS server hang with backing store on ZFS and quota nearly exhausted

> On 21 Dec 2016, at 07:24, Garrett Wollman <wollman at bimajority.org>
wrote:
> 
> I don't know the ZFS code well enough to understand what running out
> of quota has to do with this situation (you'd think it would just
> return immediately with [EDQUOT]) but perhaps it matters that the
> clients are not well-behaved and that the filesystem is often almost
> at quota but not quite there yet.
Hi Garrett,

ZFS is slow when your are playing around the quota limit,
due to how quota are implemented.
See this thread :
https://lists.freebsd.org/pipermail/freebsd-fs/2016-September/023874.html

Ben

freebsd stable - Dec 2016 - NFS server hang with backing store on ZFS and quota nearly exhausted

NFS server hang with backing store on ZFS and quota nearly exhausted

NFS server hang with backing store on ZFS and quota nearly exhausted