I have an rsync script that it is copying one computer (over ssh) to a shared CIFS mount on Gentoo Linux, kernel 6.3.4. The script runs for a while and then at some point quits knocking my ssh session offline on all terminals and it blocks ssh from being able to connect again. Even restarting sshd doesn?t help. Rsync has apparently killed it. I have to reboot. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.samba.org/pipermail/rsync/attachments/20230603/ab2e2ee3/attachment.htm>
Maurice R Volaski via rsync <maurice.volaski at lists.samba.org> wrote:> I have an rsync script that it is copying one computer (over ssh) > to a shared CIFS mount on Gentoo Linux, kernel 6.3.4. The script > runs for a while and then at some point quits knocking my ssh > session offline on all terminals and it blocks ssh from being able > to connect again. Even restarting sshd doesn't help. Rsync has > apparently killed it. I have to reboot.For starters: What OS and version is the rsync script running on? Which end do you have to reboot? The machine running the script, or the Gentoo Linux? What versions of rsync are running on each end? Can you show the command line that fails? Based on the mention of multiple terminals, it sounds as if you have a fairly complex ssh environment. Can you get it to fail in a simpler environment, ideally with only one terminal?
On Sat 03 Jun 2023, Maurice R Volaski via rsync wrote:> I have an rsync script that it is copying one computer (over ssh) to a shared CIFS mount on Gentoo Linux, kernel 6.3.4. The script runs for a while and then at some point quits knocking my ssh session offline on all terminals and it blocks ssh from being able to connect again. Even restarting sshd doesn?t help. Rsync has apparently killed it. I have to reboot.Note there's no such thing as an rsync script. You probably mean you have a shell script that runs rsync at some point. Is the script copying from the system it's running on, to the Gentoo Linux system? Is the CIFS mount actually mounted on the Gentoo Linux, or is the Gentoo Linux system serving the CIFS mount which actually is mounted on the "one computer"? In that case it would be much better to directly rsync to the filesystem on the Gentoo system. Re: the ssh stopping working: To me this would suggest that there's an out-of-memory situation going on, and sshd is being killed because of this. However that would not explain why restarting it doesn't work. What exactly do you mean when you say restarting sshd doesn't help? Does it not stay running, or is the daemon in fact running but not accepting connections? It's the age-old question: "it doesn't work" -- "_how_ is it not working?" Does dmesg give any useful information? Or perhaps journalctl? Usually the clues are in plain signt if you check logs. Paul
Maurice, You say, "knocking my ssh session offline on all terminals and it blocks ssh from being able to connect again. Even restarting sshd doesn't help". Questions: * Is the network stack on the affected machine still active? (Can it reach other services or systems on the network?) * If the network is NOT reachable, does restarting the network stack make a difference? I ask because I intermittently see what seems to be a similar behavior--rsync client (3.2.7) to a remote system with rsync (3.2.3) and a 5.11.x linux kernel that occasionally terminates with the linux system losing access to the network where restarting the network stack doesn't seem to restore access and requires a reboot of the linux system in question. On 6/2/23 10:44 PM, Maurice R Volaski via rsync wrote:> I have an rsync script that it is copying one computer (over ssh) to a > shared CIFS mount on Gentoo Linux, kernel 6.3.4. The script runs for a > while and then at some point quits knocking my ssh session offline on > all terminals and it blocks ssh from being able to connect again. Even > restarting sshd doesn?t help. Rsync has apparently killed it. I have to > reboot. > >