Hi, I need some help with NFSv4 setup/tuning. I have a dedicated nfs server (2 x E5-2620? 8cores/16 threads each, 64GB RAM, 1x10Gb ethernet and 16x 8TB HDD) used by two servers and a small cluster (400 cores). All the servers are running CentOS 7, the cluster is running CentOS6. Time to time on the server I get: ?kernel: NFSD: client xxx.xxx.xxx.xxx testing state ID with incorrect client ID And the client xxx.xxx.xxx.xxx freeze whith: ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, still trying ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, still trying ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK There is a discussion on RedHat7 support about this but only open to subscribers. Other searches with google do not provide? useful information. Do you have an idea how to solve these freeze states ? More generally I would be really interested with some advice/tutorials to improve NFS performances in this dedicated context. There are so many [different] things about tuning NFS available on the web that I'm a little bit lost (the opposite of the previous question). So if some one has "the tutorial"...;-) Thanks Patrick
Patrick B?gou wrote:> > Hi, > > I need some help with NFSv4 setup/tuning. I have a dedicated nfs server > (2 x E5-2620 8cores/16 threads each, 64GB RAM, 1x10Gb ethernet and 16x > 8TB HDD) used by two servers and a small cluster (400 cores). All the > servers are running CentOS 7, the cluster is running CentOS6. > > Time to time on the server I get: > > kernel: NFSD: client xxx.xxx.xxx.xxx testing state ID with > incorrect client ID > > And the client xxx.xxx.xxx.xxx freeze whith: > > kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, > still trying > kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK > kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, > still trying > kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK > > There is a discussion on RedHat7 support about this but only open to > subscribers. Other searches with google do not provide useful information. > > Do you have an idea how to solve these freeze states ? > > More generally I would be really interested with some advice/tutorials > to improve NFS performances in this dedicated context. There are so many > [different] things about tuning NFS available on the web that I'm a > little bit lost (the opposite of the previous question). So if some one > has "the tutorial"...;-)How many nfsd threads are you running on the server? - current count will be in /proc/fs/nfsd/threads James Pearson
Le 12/05/2020 ? 16:10, James Pearson a ?crit?:> Patrick B?gou wrote: >> >> Hi, >> >> I need some help with NFSv4 setup/tuning. I have a dedicated nfs server >> (2 x E5-2620? 8cores/16 threads each, 64GB RAM, 1x10Gb ethernet and 16x >> 8TB HDD) used by two servers and a small cluster (400 cores). All the >> servers are running CentOS 7, the cluster is running CentOS6. >> >> Time to time on the server I get: >> >> ????? kernel: NFSD: client xxx.xxx.xxx.xxx testing state ID with >> ???? incorrect client ID >> >> And the client xxx.xxx.xxx.xxx freeze whith: >> >> ????? kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, >> ???? still trying >> ????? kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK >> ????? kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, >> ???? still trying >> ????? kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK >> >> There is a discussion on RedHat7 support about this but only open to >> subscribers. Other searches with google do not provide? useful >> information. >> >> Do you have an idea how to solve these freeze states ? >> >> More generally I would be really interested with some advice/tutorials >> to improve NFS performances in this dedicated context. There are so many >> [different] things about tuning NFS available on the web that I'm a >> little bit lost (the opposite of the previous question). So if some one >> has "the tutorial"...;-) > > How many nfsd threads are you running on the server? - current count > will be in /proc/fs/nfsd/threads > > James PearsonHi James, Thanks for your answer. I've configured 24 threads (for 16 hardware cores/ 32Threads on the NFS server with this processors) But it seams that there are buffer setup to modify too when increasing the threads number... It is not done. Load average on the server is below 1.... Patrick
On 5/12/20 2:46 AM, Patrick B?gou wrote:> Hi, > > I need some help with NFSv4 setup/tuning. I have a dedicated nfs server > (2 x E5-2620? 8cores/16 threads each, 64GB RAM, 1x10Gb ethernet and 16x > 8TB HDD) used by two servers and a small cluster (400 cores). All the > servers are running CentOS 7, the cluster is running CentOS6. > > Time to time on the server I get: > > ?kernel: NFSD: client xxx.xxx.xxx.xxx testing state ID with > incorrect client ID > > And the client xxx.xxx.xxx.xxx freeze whith: > > ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, > still trying > ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK > ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, > still trying > ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK > > There is a discussion on RedHat7 support about this but only open to > subscribers. Other searches with google do not provide? useful information.FYI - you can get access to such info with a free RHEL developers account. -- Orion Poplawski Manager of NWRA Technical Systems 720-772-5637 NWRA, Boulder/CoRA Office FAX: 303-415-9702 3380 Mitchell Lane orion at nwra.com Boulder, CO 80301 https://www.nwra.com/
Le 13/05/2020 ? 02:13, Orion Poplawski a ?crit?:> On 5/12/20 2:46 AM, Patrick B?gou wrote: >> Hi, >> >> I need some help with NFSv4 setup/tuning. I have a dedicated nfs server >> (2 x E5-2620? 8cores/16 threads each, 64GB RAM, 1x10Gb ethernet and 16x >> 8TB HDD) used by two servers and a small cluster (400 cores). All the >> servers are running CentOS 7, the cluster is running CentOS6. >> >> Time to time on the server I get: >> >> ???? ?kernel: NFSD: client xxx.xxx.xxx.xxx testing state ID with >> ???? incorrect client ID >> >> And the client xxx.xxx.xxx.xxx freeze whith: >> >> ???? ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, >> ???? still trying >> ???? ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK >> ???? ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr not responding, >> ???? still trying >> ???? ?kernel: nfs: server xxxxx.legi.grenoble-inp.fr OK >> >> There is a discussion on RedHat7 support about this but only open to >> subscribers. Other searches with google do not provide? useful >> information. > > FYI - you can get access to such info with a free RHEL developers > account. > >Thanks for your suggestion. As the problem is back I've subscribed to reach the full content of this discussion. The answer was "do not use antivirus" :-(. I do not use antivirus as I am CentOS only. Patrick