Matt Mitchell
2004-Dec-07 20:09 UTC
[Samba] smbd reads entire directory when creating files?
Apologies if this is a FAQ. We are saddled with some directories full of production image files that number in the hundreds of thousands to the millions. (Yes, I know this is bad.) They reside on a couple of Win 2003 boxes that are flaky and unreliable. I was hoping to copy them over to a Linux box running samba but it is taking forever. A strace on the smbd process receiving the files from the windows box (it is mapped as a share on the 2k3 server) reveals that smbd is looking up the entire directory (with getdents64) every time it writes a file. Several times, in fact. So as the number of files grows, it churns more and more. I'm sure most of it is in cache but the data still has to be moved around in memory. I know this is not necessary for writing/copying files with unix semantics, but I wonder if the case-fiddling or any other Windows imitative behavior is making this getdents64 orgy necessary. Is there any way to disable it? I was going to post a bit of the strace output but it is extremely long and (to my eyes, anyway) not very interesting. I can supply it to anyone who is interested in seeing it. Any suggestions appreciated. I also note that while using smbclient instead of mapping the drive from the Windows box might help with this copy, it will not help if we were to try to use this setup to replace the 2k3 servers (the eventual goal). So any pointers in that direction, while perhaps interesting for comparison, are not especially relevant. greenville:~# smbd -V Version 3.0.7-Debian smb.conf is attached. Not much interesting in there, it is pretty much stock. "vfs_ownerwrite" is a small VFS module I hacked up to do an extra chown on a file that is written to the share (used in a common PDC/Unix environment on some shared directories.) I am using the "deadwood_ftp" share in this case, so the module is not involved. -m -------------- next part -------------- # Samba configuration file for media servers. # ----- # # $Id: smb.conf,v 1.2.4.6 2002/03/13 18:56:16 peloy Exp $ # # This is the main Samba configuration file. You should read the # smb.conf(5) manual page in order to understand the options listed # here. Samba has a huge number of configurable options most of which # are not shown in this example # # Any line which starts with a ; (semi-colon) or a # (hash) # is a comment and is ignored. In this example we will use a # # for commentary and a ; for parts of the config file that you # may wish to enable # # NOTE: Whenever you modify this file you should run the command # "testparm" to check that you have not many any basic syntactic # errors. # #======================= Global Settings ====================== [global] # can this be turned back on? use sendfile = no # Do something sensible when Samba crashes: mail the admin a backtrace panic action = /usr/share/samba/panic-action %d # Change this for the workgroup/NT-domain name your Samba server will part of workgroup = VP # server string is the equivalent of the NT Description field server string = %h server (Samba %v) # If you want to automatically load your printer list rather # than setting them up individually then you'll need this ; load printers = yes # You may wish to override the location of the printcap file ; printcap name = /etc/printcap # 'printing = cups' works nicely ; printing = bsd ; guest account = nobody invalid users = root # This tells Samba to use a separate log file for each machine # that connects log file = /var/log/samba/log.%m # Put a capping on the size of the log files (in Kb). max log size = 1000 # If you want Samba to log though syslog only then set the following # parameter to 'yes'. Please note that logging through syslog in # Samba is still experimental. ; syslog only = no # We want Samba to log a minimum amount of information to syslog. Everything # should go to /var/log/samba/log.{smb,nmb} instead. If you want to log # through syslog you should set the following parameter to something higher. syslog = 0 # "security = user" is always a good idea. This will require a Unix account # in this server for every user accessing the server. See # security_level.txt for details. security = domain # You may wish to use password encryption. Please read ENCRYPTION.txt, # Win95.txt and WinNT.txt in the Samba documentation. Do not enable this # option unless you have read those documents encrypt passwords = true passdb backend = tdbsam guest # Using the following line enables you to customise your configuration # on a per machine basis. The %m gets replaced with the netbios name # of the machine that is connecting ; include = /home/samba/etc/smb.conf.%m # Most people will find that this option gives better performance. # See speed.txt and the manual pages for details # You may want to add the following on a Linux system: # SO_RCVBUF=8192 SO_SNDBUF=8192 socket options = TCP_NODELAY # --- Browser Control Options --- # Please _read_ BROWSING.txt and set the next four parameters according # to your network setup. The defaults are specified below (commented # out.) It's important that you read BROWSING.txt so you don't break # browsing in your network! # set local master to no if you don't want Samba to become a master # browser on your network. Otherwise the normal election rules apply ; local master = yes # OS Level determines the precedence of this server in master browser # elections. The default value should be reasonable ; os level = 20 # Domain Master specifies Samba to be the Domain Master Browser. This # allows Samba to collate browse lists between subnets. Don't use this # if you already have a Windows NT domain controller doing this job ; domain master = auto # Preferred Master causes Samba to force a local browser election on startup # and gives it a slightly higher chance of winning the election ; preferred master = auto # --- End of Browser Control Options --- # Windows Internet Name Serving Support Section: # WINS Support - Tells the NMBD component of Samba to enable it's WINS Server ; wins support = no # WINS Server - Tells the NMBD components of Samba to be a WINS Client # Note: Samba can be either a WINS Server, or a WINS Client, but NOT both wins server = 10.132.4.21 # This will prevent nmbd to search for NetBIOS names through DNS. dns proxy = no # What naming service and in what order should we use to resolve host names # to IP addresses ; name resolve order = lmhosts host wins bcast # Name mangling options ; preserve case = yes ; short preserve case = yes # This boolean parameter controlls whether Samba attempts to sync. the Unix # password with the SMB password when the encrypted SMB password in the # /etc/samba/smbpasswd file is changed. ; unix password sync = false # For Unix password sync. to work on a Debian GNU/Linux system, the following # parameters must be set (thanks to Augustin Luton <aluton@hybrigenics.fr> for # sending the correct chat script for the passwd program in Debian Potato). passwd program = /usr/bin/passwd %u passwd chat = *Enter\snew\sUNIX\spassword:* %n\n *Retype\snew\sUNIX\spassword:* %n\n . # This boolean controls whether PAM will be used for password changes # when requested by an SMB client instead of the program listed in # 'passwd program'. The default is 'no'. ; pam password change = no # The following parameter is useful only if you have the linpopup package # installed. The samba maintainer and the linpopup maintainer are # working to ease installation and configuration of linpopup and samba. ; message command = /bin/sh -c '/usr/bin/linpopup "%f" "%m" %s; rm %s' & obey pam restrictions = yes # Some defaults for winbind (make sure you're not using the ranges # for something else.) ; winbind uid = 10000-20000 ; winbind gid = 10000-20000 ; template shell = /bin/bash #======================= Share Definitions ====================== # The next two parameters show how to auto-mount a CD-ROM when the # cdrom share is accesed. For this to work /etc/fstab must contain # an entry like this: # # /dev/scd0 /cdrom iso9660 defaults,noauto,ro,user 0 0 # # The CD-ROM gets unmounted automatically after the connection to the # # If you don't want to use auto-mounting/unmounting make sure the CD # is mounted on /cdrom # ; preexec = /bin/mount /cdrom ; postexec = /bin/umount /cdrom [teaneck_m] comment = TEANECK media root path = /mnt/ha/teaneck_media public = yes writable = yes vfs objects = vfs_ownerwrite create mode = 0664 directory mode = 2775 [teaneck_ftp] comment = TEANECK FTP root path = /mnt/ha/teaneck_media/ftp public = yes writable = yes vfs objects = vfs_ownerwrite create mode = 0664 directory mode = 2775 [deadwood_e] comment = Old DEADWOOD media root path = /mnt/ha/teaneck_media public = yes writable = yes #vfs objects = vfs_ownerwrite create mode = 0664 directory mode = 2775 [deadwood_ftp] comment = Old DEADWOOD FTP root path = /mnt/ha/teaneck_media/ftp public = yes writable = yes #vfs objects = vfs_ownerwrite create mode = 0664 directory mode = 2775
Jeremy Allison
2004-Dec-07 21:25 UTC
[Samba] smbd reads entire directory when creating files?
On Tue, Dec 07, 2004 at 02:08:56PM -0600, Matt Mitchell wrote:> Apologies if this is a FAQ.Kind of.> We are saddled with some directories full of production image files that > number in the hundreds of thousands to the millions. (Yes, I know this > is bad.) They reside on a couple of Win 2003 boxes that are flaky and > unreliable. I was hoping to copy them over to a Linux box running samba > but it is taking forever.This is pretty much the worse case scenario for Samba I'm afraid.> A strace on the smbd process receiving the files from the windows box > (it is mapped as a share on the 2k3 server) reveals that smbd is looking > up the entire directory (with getdents64) every time it writes a file. > Several times, in fact. So as the number of files grows, it churns more > and more. I'm sure most of it is in cache but the data still has to be > moved around in memory. > > I know this is not necessary for writing/copying files with unix > semantics, but I wonder if the case-fiddling or any other Windows > imitative behavior is making this getdents64 orgy necessary. Is there > any way to disable it?Not currently. This is something I'm working on - would you be willing to test some prototype code once I'm done ? Jeremy.