Weber, Charles (NIH/NIA/IRP) [C]
2006-Sep-19 21:02 UTC
[Samba] Hung XFS filesystems on Samba server
This is probably a hardware problem but I am posting here in case anyone else has seen it or it is actually software. If you have seen anything like it please let me know. Chuck For the last 1.5 years I have had occasional problems on a large (6.8 TB) Samba server. Two of the mounted filesystems will partially dismount at intervals between 3 days and 3 months. Files will still be open but any local access to the filesystem such as "ls" will hang. The particualr share is no longer accessable through Samba. I end up having to do a hard shutdown as rebooting will also hang trying to close the filesystem. I have found no logged errors. I have 3 HP DL585 with multiple 6404 raid controllers. Two run samba and the other is NFS only. This only occurs on one server but it is unfortunately the busiest one. I have replaced cables and 6404 cards. The filesystems have been checked using xfs_repair. HP diagnostics has been run for hours. One of our other DL585 servers is physically very close to the problem server but runs NFS instead of Samba on XFS filesystems. It has not had this problem. The only significant hardware difference between the NFS server and Samba server is that the NFS server has all U320 hard drives. Physical config: HP DL 585 with dual processor and 3 6404 4 channel SCSI raid controllers. 6 U320 converted 4200 drive chassis with 72 GB U3/U320 and 146 GB U320. 8 GB ram. Firmware for all parts including disks has been flashed repeatedly over the last two years to current levels. Firmware changes have not made any noticeable difference in this problem. I do wonder about the mix of U3 and U320 drives but each disk carrier is either U3 or U320. Each diskcarrier is set as one ADG array and logical drive. It is then partitioned and formatted such as /dev/ddiss/c2d0p1 with XFS and mounted. Software: I started with Fedora Core2 X86_64 and have worked my way to Fedora Core 5 and samba 3.0.22-1.fc5, acl 2.2.34 and xfsprogs 2.7.3-1.2.1. No software changes have made any difference that I can see in this problem. Samba shares support ACLs. Hardware possiblities: This has occurred in the same 2 disk carriers. I could change the disk carriers or U320 modules. I worry also about the mix of U320 and U3 disks. I setup a test server dl385 with a 6404 from the problem server and a disk carrier with mix of drives. I could not recreate the problem. Software possiblities: Kernel, Samba, ACLs and XFS. But I have tried many versions and not seen any logged errors or change in behavior.
Felipe Augusto van de Wiel
2006-Sep-21 15:33 UTC
[Samba] Hung XFS filesystems on Samba server
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 09/19/2006 01:17 PM, Weber, Charles (NIH/NIA/IRP) [C] escreveu:> This is probably a hardware problem but I am posting here in case anyone > else has seen it or it is actually software. > If you have seen anything like it please let me know. > Chuck[...]> Software: > I started with Fedora Core2 X86_64 and have worked my way to Fedora Core > 5 and samba 3.0.22-1.fc5, acl 2.2.34 and xfsprogs 2.7.3-1.2.1. No > software changes have made any difference that I can see in this > problem. Samba shares support ACLs. > Hardware possiblities: > This has occurred in the same 2 disk carriers. I could change the disk > carriers or U320 modules. I worry also about the mix of U320 and U3 > disks. I setup a test server dl385 with a 6404 from the problem server > and a disk carrier with mix of drives. I could not recreate the problem. > Software possiblities: > Kernel, Samba, ACLs and XFS. But I have tried many versions and not seen > any logged errors or change in behavior.I don't have such powerful infrastructure, I have 0.6 TB using XFS and I don't have any problems. But I'm using Debian Sarge with Samba 3.014a and Debian Kernel. But maybe this information could be an useful reference, at least I hope so. ;) Kind regards, - -- Felipe Augusto van de Wiel <felipe@paranacidade.org.br> Coordenadoria de Tecnologia da Informa??o (CTI) - SEDU/PARANACIDADE http://www.paranacidade.org.br/ Phone: (+55 41 3350 3300) -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Debian - http://enigmail.mozdev.org iD8DBQFFErCcCj65ZxU4gPQRAuhaAJ9tamwV7H8cDXuA6tK33TR6Bke/8wCeNrck GA1/XWU89kd7q8moEfOTCdw=AixS -----END PGP SIGNATURE-----