Saurabh Nanda
2019-Feb-08 13:26 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
## QUESTION

I am sharing a 120GB folder with lots of files via Samba on a LAN (1Gbps
connection).

1) Doing an `ls -lR` on the server (on this folder) takes ~32 seconds,
compared with **72 minutes** on the client. Is this difference in
performance expected (due to network and protocol overhead)?

2) While the client is executing an `ls -lR`, one smbd process on the
server uses about 30-40% of a single CPU (on a 4c/8t machine). Is this
much CPU load expected? (Also, the client ends up consuming all 1Gbps of
network bandwidth.)

Is Samba transferring all the files from the server to the client for this
operation?

## CONFIGURATION

Both server and client are on Ubuntu 18.04 and are using the stock version
of the kernel, Samba, etc. The version being reported:

    # smbd --version
    Version 4.7.6-Ubuntu

smb.conf:

    [global]
    smb encrypt = required
    disable netbios = yes
    case sensitive = yes
    preserve case = yes
    short preserve case = yes
    workgroup = WORKGROUP
    server string = %h server (Samba, Ubuntu)
    wins support = no
    dns proxy = no
    interfaces = {{ internal_ip }}
    bind interfaces only = yes
    log file = /var/log/samba/log.%m
    max log size = 1000
    syslog = 0
    panic action = /usr/share/samba/panic-action %d
    server role = standalone server
    passdb backend = tdbsam
    obey pam restrictions = no
    unix password sync = no
    pam password change = no
    map to guest = never
    usershare allow guests = yes

    [myshare]
    path = /samba/uploaded-files
    browseable = no
    read only = no
    valid users = [redacted]

-- Saurabh.
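For reference, a minimal sketch of how the two timings can be reproduced; the cache-dropping step is an assumption (to keep the server-side number honest, not something stated above), and the client mount point is a placeholder:

    # On the server: drop the page cache so the ~32s is not a warm-cache
    # number, then time the recursive listing (needs root):
    sync && echo 3 > /proc/sys/vm/drop_caches
    time ls -lR /samba/uploaded-files > /dev/null

    # On the client, against the CIFS mount (mount point is a placeholder):
    time ls -lR /mnt/uploaded-files > /dev/null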
Jeremy Allison
2019-Feb-08 23:51 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
On Fri, Feb 08, 2019 at 06:56:48PM +0530, Saurabh Nanda via samba wrote:

> ## QUESTION
>
> I am sharing a 120GB folder with lots of files via Samba on a LAN (1Gbps
> connection).

Define "lots of files" ? What does ls | wc -l say ?
Saurabh Nanda
2019-Feb-09 01:52 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
> Define "lots of files" ? What does ls | wc -l say ?

Number of files + directories:

    # find . | wc -l
    2651882

Number of files:

    # find . -type f | wc -l
    1057105

Number of directories:

    # find . -type d | wc -l
    1594777

-- Saurabh.
Aurélien Aptel
2019-Feb-11 09:35 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
Saurabh Nanda via samba <samba at lists.samba.org> writes:

> ## QUESTION
>
> I am sharing a 120GB folder with lots of files via Samba on a LAN (1Gbps
> connection).
>
> 1) Doing an `ls -lR` on the server (on this folder) takes ~32 seconds,
> compared with **72 minutes** on the client. Is this difference in
> performance expected (due to network and protocol overhead)?

Unless you upload a network capture of you mounting and doing the ls -lR
on the client, it's hard to say what really goes on. I understand you
might not want to make it public... but if you do, you can make a capture
using:

    tcpdump -s 0 -w capture.pcap port 445 &
    tracepid=$!
    mount.cifs //foo/bar /mnt -o ....
    ls -lR /mnt
    # ..wait, or kill ls..
    kill $tracepid

Note that you can probably kill tcpdump after 10 minutes or so; this
should provide enough data. If you end up doing it, the capture will
probably be quite large, so try compressing it as well.

> 2) While the client is executing an `ls -lR`, one smbd process on the
> server uses about 30-40% of a single CPU (on a 4c/8t machine). Is this
> much CPU load expected? (Also, the client ends up consuming all 1Gbps of
> network bandwidth.)
>
> Is Samba transferring all the files from the server to the client for
> this operation?

No, it shouldn't.

> ## CONFIGURATION
>
> Both server and client are on Ubuntu 18.04 and are using the stock
> version of the kernel, Samba, etc. The version being reported:

How are you mounting your share (which mount options)?

I can't comment on the server side, but using a very recent kernel
version like 4.20 or above (which at the moment is 5.0-rc6 :/) you will
get compounding, which basically bundles multiple SMB operations in one
packet and saves a lot of round trips as a result. Note that compounding
requires mounting with SMB2 or later; use the vers= mount option, cf. the
mount.cifs man page. I suspect it won't solve the problem but it's
something.

--
Aurélien Aptel / SUSE Labs Samba Team
GPG: 1839 CB5F 9F5B FB9B AA97 8C99 03C8 A49B 521B D5D3
SUSE Linux GmbH, Maxfeldstraße 5, 90409 Nürnberg, Germany
GF: Felix Imendörffer, Jane Smithard, Graham Norton, HRB 21284 (AG Nürnberg)
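A minimal sketch of acting on that suggestion, assuming the ~4.20 kernel threshold Aurélien mentions; the share, credentials, and mount point are placeholders:

    # Check the client kernel version (per the message above, the CIFS
    # client gains compounding around 4.20 or later):
    uname -r

    # Mount with an explicit modern dialect via vers= (placeholder
    # server/share/user; older kernels may reject newer vers= values):
    mount -t cifs //server/share /mnt -o username=myuser,vers=3.1.1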
Saurabh Nanda
2019-Feb-14 12:04 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
> Unless you upload a network capture of you mounting and doing the ls -lR
> on the client it's hard to say what really goes on. I understand you
> might not want to make it public... but if you do

This is the last thing I'll try, after I've exhausted all the other
options.

> How are you mounting your share (which mount options)?

Something weird is going on with the mount options. I hadn't used vers=
in my fstab earlier, and noticed the following in syslog:

    No dialect specified on mount. Default has changed to a more secure
    dialect, SMB2.1 or later (e.g. SMB3), from CIFS (SMB1). To use the less
    secure SMB1 dialect to access old servers which do not support SMB3 (or
    SMB2.1) specify vers=1.0 on mount.

Which implies that the server & client auto-negotiated a protocol version
of SMB2.1 or greater, right? However, to be sure, I manually specified
vers= in fstab, but something strange happened. While `man mount.cifs`
claims that the following are allowed -- 1.0, 2.0, 2.1, 3.0, 3.1.1 (or
3.11) -- a few of them failed with strange errors:

    # mount -t cifs -o rw,username=myuser,password=[REDACTED],uid=myuser,gid=myuser,vers=3.11 //[REDACTED]/uploaded_files /home/myuser/shared
    mount error(11): Resource temporarily unavailable
    Refer to the mount.cifs(8) manual page (e.g. man mount.cifs)

    # tail /var/log/syslog
    [...]
    Feb 14 12:57:04 prod-backoffice kernel: [105926.746067] CIFS VFS: failed to connect to IPC (rc=-11)
    Feb 14 12:57:04 prod-backoffice kernel: [105926.767443] CIFS VFS: session 0000000044187aeb has no tcon available for a dfs referral request
    Feb 14 12:57:04 prod-backoffice kernel: [105926.770039] CIFS VFS: cifs_mount failed w/return code = -11

    # mount -t cifs -o rw,username=myuser,password=[REDACTED],uid=myuser,gid=myuser,vers=3.1 //[REDACTED]/uploaded_files /home/myuser/shared
    mount error(22): Invalid argument
    Refer to the mount.cifs(8) manual page (e.g. man mount.cifs)

    # tail /var/log/syslog
    [...]
    Feb 14 12:58:54 prod-backoffice kernel: [106036.706942] CIFS VFS: Unknown vers= option specified: 3.1

    # mount -t cifs -o rw,username=myuser,password=[REDACTED],uid=myuser,gid=myuser,vers=3.0 //[REDACTED]/uploaded_files /home/myuser/shared
    [ finally worked! ]

It seems that vers=3.0 worked, but how do I confirm that the mount
ACTUALLY happened with vers=3.0?

Further, `mount` is showing a LOT of options that I did not specify, and
I do not fully understand their impact:

    //[REDACTED]/uploaded_files on /home/myuser/shared type cifs (rw,relatime,vers=3.0,cache=strict,username=vl,domain=,uid=1000,forceuid,gid=1000,forcegid,addr=95.216.67.179,file_mode=0755,dir_mode=0755,soft,nounix,serverino,mapposix,rsize=1048576,wsize=1048576,echo_interval=60,actimeo=1)

Namely: forceuid, forcegid, nounix, serverino, & mapposix. Could any of
these be causing the problems that I'm observing?

PS: vers=3.0 didn't fix the problem!

-- Saurabh.
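A minimal sketch of answering the "how do I confirm" question, assuming a cifs.ko recent enough to report its negotiated options:

    # The options field for a cifs mount includes the negotiated vers=
    # (the same string the `mount` output above shows):
    grep cifs /proc/mounts

    # The cifs module's debug file may also list each session and its
    # dialect (requires the cifs module to be loaded):
    cat /proc/fs/cifs/DebugData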
Saurabh Nanda
2019-Feb-18 11:38 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
> (Also, the client ends up consuming all 1Gbps of network bandwidth.)

This statement is grossly incorrect. The client is not consuming 1Gbps,
but 1MBps -- I'm confusing bits per second with bytes per second, it
seems.

I am replicating the scenario with NFS to see if it's an issue with
Samba, or something to do with the underlying network stack.

-- Saurabh.
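For reference, the conversion that tripped things up, as a quick shell check (the factor of 8 is bits per byte):

    # 1 Gbps link capacity expressed in bytes per second:
    echo $(( 1000 * 1000 * 1000 / 8 ))   # 125000000, i.e. ~125 MB/s

    # Conversely, the observed ~1 MB/s is only ~8 Mbps -- under 1% of the link.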
Saurabh Nanda
2019-Feb-18 13:40 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
>> (Also, the client ends up consuming all 1Gbps of network bandwidth.)
>
> This statement is grossly incorrect. The client is not consuming 1Gbps,
> but 1MBps -- I'm confusing bits per second with bytes per second, it
> seems.
>
> I am replicating the scenario with NFS to see if it's an issue with
> Samba, or something to do with the underlying network stack.

Here's NFS vs Samba:

## Samba - Uses about 1MBps throughout

    real 87m9.739s
    user 2m9.222s
    sys  17m21.828s

## NFS - Uses about 1.5-2.0 MBps throughout

    real 34m3.902s
    user 0m55.798s
    sys  6m36.227s

I now have no clue what is going on. Are the **number** of files and
directories so large that this is kind-of expected?

-- Saurabh.
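A rough back-of-the-envelope reading of those numbers, assuming each directory entry costs at least one synchronous metadata round trip (an assumption; a capture would have to confirm the real request count):

    # Wall time for the Samba run: 87m9s ~= 5229 seconds.
    # Total directory entries (from the find counts earlier): 2651882.
    echo $(( 2651882 / 5229 ))   # ~507 entries listed per second

    # ~507 entries/sec is ~2ms per entry -- the signature of sequential
    # per-entry network round trips, not of bandwidth or disk limits.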
Götz Reinicke
2019-Feb-22 08:04 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
> Am 08.02.2019 um 14:26 schrieb Saurabh Nanda via samba <samba at lists.samba.org>:
>
> ## QUESTION
>
> I am sharing a 120GB folder with lots of files via Samba on a LAN (1Gbps
> connection).
>
> 1) Doing an `ls -lR` on the server (on this folder) takes ~32 seconds,
> compared with **72 minutes** on the client. Is this difference in
> performance expected (due to network and protocol overhead)?
>
> 2) While the client is executing an `ls -lR`, one smbd process on the
> server uses about 30-40% of a single CPU (on a 4c/8t machine). Is this
> much CPU load expected? (Also, the client ends up consuming all 1Gbps of
> network bandwidth.)
>
> Is Samba transferring all the files from the server to the client for
> this operation?

Hi,

I've been faced with similar situations in the past -- "lots" of small
files on a network share like SMB/NFS. Besides the software
configuration, a major factor for me was in most cases the type of
storage hosting the files, e.g. the RAID level and spinning disks vs.
SSDs.

So, what kind of storage do you use?

Regards . Götz
Saurabh Nanda
2019-Feb-22 09:23 UTC
[Samba] 32 seconds vs 72 minutes -- expected performance difference?
> I've been faced with similar situations in the past -- "lots" of small
> files on a network share like SMB/NFS. Besides the software
> configuration, a major factor for me was in most cases the type of
> storage hosting the files, e.g. the RAID level and spinning disks vs.
> SSDs.
>
> So, what kind of storage do you use?

NVMe/SSD on the server side.

-- Saurabh.