Hi, We downloaded the latest rsync 2.5.6 source code and compiled it on HP-UX for both 10.20 and 11.x. Rsync runs fine if we run it from the command line on both platforms. But after we put rsync command into a HP Software Distributor package configure script (just a shell script with rsync command in it) and run swinstall, which will get installation bits including the configure script, put the installation bits into place and fork process to run the configure script, rsync hang on HP-UX 10.20 boxes. From the log, we can see rsync actually already finished all the work, however, both rsync processes were still running. As the result, swinstall won't finish either. Swinstall uses tcp 2121 and udp 2121 ports. I ran trace on the hang rsync processes. The parent rsync process seemed waiting for the child rsync process which is reasonable. The child rsync process seemed traped in a loop of msleep calls. Here are some trace outputs: 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: gettimeofday(0x7b03ac00, 0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 11982: gettimeofday(0x7b03ac10, 0) = 0 Has anyone experienced similar hangs and know the possible reasons? Thank you very much for your help! Xiaoqin Qiu Technical Computing Group IT Infrastructure Services Organization Agilent Technologies, Inc. (818)879-6220 xiaoqin_qiu@agilent.com
Please invest in an enter key. On Tue, Aug 19, 2003 at 07:38:51PM -0600, xiaoqin_qiu@agilent.com wrote: [reformatted due to insanely long lines]> Hi, > > We downloaded the latest rsync 2.5.6 source code and > compiled it on HP-UX for both 10.20 and 11.x. Rsync runs > fine if we run it from the command line on both platforms. > But after we put rsync command into a HP Software > Distributor package configure script (just a shell script > with rsync command in it) and run swinstall, which will > get installation bits including the configure script, put > the installation bits into place and fork process to run > the configure script, rsync hang on HP-UX 10.20 boxes. > From the log, we can see rsync actually already finished > all the work, however, both rsync processes were still > running. As the result, swinstall won't finish either. > Swinstall uses tcp 2121 and udp 2121 ports. > > I ran trace on the hang rsync processes. The parent rsync > process seemed waiting for the child rsync process which > is reasonable. The child rsync process seemed traped in a > loop of msleep calls.The child is waiting for the parent to kill it with SIGUSR2. Evidently the signal wasn't delivered.> Here are some trace outputs: > > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > > Has anyone experienced similar hangs and know the possible reasons? > > Thank you very much for your help! > > Xiaoqin Qiu > Technical Computing Group > IT Infrastructure Services Organization > Agilent Technologies, Inc. > (818)879-6220 > xiaoqin_qiu@agilent.com > > -- > To unsubscribe or change options: http://lists.samba.org/mailman/listinfo/rsync > Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html >-- ________________________________________________________________ J.W. Schultz Pegasystems Technologies email address: jw@pegasys.ws Remember Cernan and Schmitt
Hi, Thank J.W. for your information. The strange thing is child process is waiting for parent to kill it, in the meantime, the parent process is waiting for child process. Is this a bug or what? Is there a fix for this type of problem? Thank you again for your help! I ran the installation again and here are some trace outputs from parent process: (4188 is child process ID. The trace output from child process is the same as my previous post except the process ID.) 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 Xiaoqin Qiu Technical Computing Group IT Infrastructure Services Organization Agilent Technologies, Inc. (818)879-6220 xiaoqin_qiu@agilent.com -----Original Message----- From: jw schultz [mailto:jw@pegasys.ws] Sent: Tuesday, August 19, 2003 7:05 PM To: rsync@lists.samba.org Subject: Re: Rsync hang on HP-UX 10.20 Please invest in an enter key. On Tue, Aug 19, 2003 at 07:38:51PM -0600, xiaoqin_qiu@agilent.com wrote: [reformatted due to insanely long lines]> Hi, > > We downloaded the latest rsync 2.5.6 source code and > compiled it on HP-UX for both 10.20 and 11.x. Rsync runs > fine if we run it from the command line on both platforms. > But after we put rsync command into a HP Software > Distributor package configure script (just a shell script > with rsync command in it) and run swinstall, which will > get installation bits including the configure script, put > the installation bits into place and fork process to run > the configure script, rsync hang on HP-UX 10.20 boxes. > From the log, we can see rsync actually already finished > all the work, however, both rsync processes were still > running. As the result, swinstall won't finish either. > Swinstall uses tcp 2121 and udp 2121 ports. > > I ran trace on the hang rsync processes. The parent rsync > process seemed waiting for the child rsync process which > is reasonable. The child rsync process seemed traped in a > loop of msleep calls.The child is waiting for the parent to kill it with SIGUSR2. Evidently the signal wasn't delivered.> Here are some trace outputs: > > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: gettimeofday(0x7b03ac00, 0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > 11982: select(0, 0, 0, 0, 0x7b03abf0) = 0 > 11982: gettimeofday(0x7b03ac10, 0) = 0 > > Has anyone experienced similar hangs and know the possible reasons? > > Thank you very much for your help! > > Xiaoqin Qiu > Technical Computing Group > IT Infrastructure Services Organization > Agilent Technologies, Inc. > (818)879-6220 > xiaoqin_qiu@agilent.com > > -- > To unsubscribe or change options: http://lists.samba.org/mailman/listinfo/rsync > Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html >-- ________________________________________________________________ J.W. Schultz Pegasystems Technologies email address: jw@pegasys.ws Remember Cernan and Schmitt -- To unsubscribe or change options: http://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
On Wed, Aug 20, 2003 at 11:56:28AM -0600, xiaoqin_qiu@agilent.com wrote:> Hi, > > Thank J.W. for your information. > > The strange thing is child process is waiting for parent to kill it, in the meantime, the parent process is waiting for child process. > > Is this a bug or what? Is there a fix for this type of problem? > > Thank you again for your help! > > I ran the installation again and here are some trace outputs from parent process: > (4188 is child process ID. > The trace output from child process is the same as my previous post except the process ID.)We know the state it is in. So traces once it is there are useless. The question is how did it get here. What was the return from kill()? What is the signal mask? Things like that.
Hi J.W., I asked this question before, however I could not see your second reply to my question. Could you please answer it again? Sorry about this. Refreshing the question: We downloaded the latest rsync 2.5.6 source code and compiled it on HP-UX for both 10.20 and 11.x. Rsync runs fine if we run it from the command line on both platforms. But after we put rsync command into a HP Software Distributor package configure script (just a shell script with rsync command in it) and run swinstall, which will get installation bits including the configure script, put the installation bits into place and fork process to run the configure script, rsync hang on HP-UX 10.20 boxes.>From the log, we can see rsync actually already finished all the work. However, both rsync processes were still running. As the result, swinstall won't finish either. Swinstall uses tcp 2121 and udp 2121 ports.I ran trace on the hang rsync processes. The parent rsync process seemed waiting for the child rsync process which is reasonable. The child rsync process seemed traped in a loop of msleep calls. Is this a bug or what? Here are some trace outputs from child and parent processes: Child: 4188: gettimeofday(0x7b03ac00, 0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: gettimeofday(0x7b03ac00, 0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 Parent: 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 Thank you very much for your help! Xiaoqin Qiu
Hi J.W., Thank you for your quick response. However I still couldn't see your email. Microsoft outlook couldn't open the email and gave me "follow-up flag must be 100 characters or less" error. Could you please send it again and remove everything I wrote to keep it small OR remove your follow-up flag? Sorry about this and thank you again, Xiaoqin Qiu Technical Computing Group IT Infrastructure Services Organization Agilent Technologies, Inc. (818)879-6220 xiaoqin_qiu@agilent.com -----Original Message----- From: QIU,XIAOQIN (A-Sonoma,ex1) Sent: Monday, August 25, 2003 4:11 PM To: rsync list Subject: RE: Rsync hang on HP-UX 10.20 Hi J.W., I asked this question before, however I could not see your second reply to my question. Could you please answer it again? Sorry about this. Refreshing the question: We downloaded the latest rsync 2.5.6 source code and compiled it on HP-UX for both 10.20 and 11.x. Rsync runs fine if we run it from the command line on both platforms. But after we put rsync command into a HP Software Distributor package configure script (just a shell script with rsync command in it) and run swinstall, which will get installation bits including the configure script, put the installation bits into place and fork process to run the configure script, rsync hang on HP-UX 10.20 boxes.>From the log, we can see rsync actually already finished all the work. However, both rsync processes were still running. As the result, swinstall won't finish either. Swinstall uses tcp 2121 and udp 2121 ports.I ran trace on the hang rsync processes. The parent rsync process seemed waiting for the child rsync process which is reasonable. The child rsync process seemed traped in a loop of msleep calls. Is this a bug or what? Here are some trace outputs from child and parent processes: Child: 4188: gettimeofday(0x7b03ac00, 0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: gettimeofday(0x7b03ac00, 0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 4188: gettimeofday(0x7b03ac10, 0) = 0 4188: select(0, 0, 0, 0, 0x7b03abf0) = 0 Parent: 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: select(0, 0, 0, 0, 0x7b03ac30) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 4187: waitpid(4188, 0x7b03ab50, 1) = 0 4187: gettimeofday(0x7b03ac40, 0) = 0 4187: gettimeofday(0x7b03ac50, 0) = 0 Thank you very much for your help! Xiaoqin Qiu