Hi Sriram, Once again, thanks for your timely reply.... I tested as your suggesstion and i found that i don''t have any file in the location "/var/lib/xen" [root@globushead xen]# pwd /var/lib/xen [root@globushead xen]# ls -ll total 0 Further i tested the command "xm save " and results as follows Terminal 1: xm save -c TestVM1 TestVM1.chkpt Terminal 2 : xm save -c TestVM2 TestVM2.chkpt & issued command simultaneously.. I could able to get those checkpoint files as follows & no error has occured.. [root@globushead ~]# ls -ll TestVM* -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt I hereby represents the xend.log as follows for your reference...... ************************************************ [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: /usr/lib64/xen/bin/xc_save 33 4 0 0 0 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to get the suspend evtchn port [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler suspend [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) XendDomainInfo.shutdown(suspend) [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM1 id=3 reason=suspend. [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained entries in p2m table [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM2 id=4 reason=suspend. [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H 20%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 26%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H 38%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 43%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H 55%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 61%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H 73%outbuf_write: 622592 > 598004@16179212 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H 84%outbuf_write: 2662400 > 2007024@14770192 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent 3344Mb/s, dirtied 1127Mb/s 11049 pages [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(3) [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(4) [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed ************************************************ On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote:> check /var/lib/xen/suspend_evtchn_*_lock.d > Make sure there are different lock files for each domain. > And before starting, make sure there are no stray lock files. > > Try this litmus test first. (do both commands simultaneously, in two > different > terminals) > terminal 1: xm save -c TestVM1 TestVM1.chkpt > terminal 2: xm save -c TestVM2 TestVM2.chkpt > > If this fails, then your problem is something else. > Check the log files /var/log/xen/xend.log for errors. > > > Most important of all, you are not doing any disk replication! > So, when the VM recovers on backup, it will be running with an > older version of the disk and will possibly crash > > it should be tap2:remus:backuphost:backupport|aio:.... > > > shriram > > On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com> wrote: > >> Hi, >> Thanks for your kind reply. >> I have installed xen 4.0.2 in RHEL5 and tried remus. With a single >> remus command >> running for a single virtual machine, it works fine but when i tried for >> two virtual machine, >> the steps which i have tried is as follows, >> >> i have run remus command on two terminals >> >> (i) *remus --no-net 1 192.168.36.233 >> *(ii) *remus --no-net 2 192.168.36.233* >> >> >> Terminal 1 : >> The following error i got where i run command >> >> *remus --no-net 1 192.168.36.233* >> >> SUSPEND shinfo 00076122 >> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages >> 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, >> dirtied 2359Mb/s 72 pages >> Total pages sent= 41424 (1.19x) >> (of which 0 were fixups) >> All memory is saved >> PROF: resumed at 1311824828.014423 >> PROF: flushed memory at 1311824828.015603 >> PROF: suspending at 1311824828.212492 >> SUSPEND shinfo 00076122 >> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages >> 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, >> dirtied 3506Mb/s 107 pages >> Total pages sent= 41531 (1.19x) >> (of which 0 were fixups) >> All memory is saved >> PROF: resumed at 1311824828.215125 >> PROF: flushed memory at 1311824828.216482 >> PROF: suspending at 1311824828.426094 >> domain 1 not shut down >> ERROR Internal error: Suspend request failed >> ERROR Internal error: Domain appears not to have suspended >> Save exit rc=1 >> >> >> Terminal 2 : >> The following error i got where i run command >> >> *remus --no-net 2 192.168.36.233* >> >> >> Had 0 unexplained entries in p2m table >> 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent >> 74Mb/s, dirtied 1Mb/s 517 pages >> 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, >> dirtied 0Mb/s 0 pages >> 3: sent 0, skipped 0, Start last iteration >> PROF: suspending at 1311824828.327395 >> domain 2 not shut down >> ERROR Internal error: Suspend request failed >> ERROR Internal error: Domain appears not to have suspended >> Save exit rc=1 >> >> >> -------------------------------------- >> >> On the host machine where i run remus command , i can see the two VMs in >> suspended state >> [root@globuscompute ~]# xm list >> Name ID Mem VCPUs State >> Time(s) >> Domain-0 0 1681 2 r----- >> 75.1 >> TestVM1 1 128 1 ---s-- >> 5.2 >> TestVM2 2 128 1 ---s-- >> 5.1 >> >> >> And also the destination machine i can see one VM in "b" state >> [root@globushead ~]# xm list >> Name ID Mem VCPUs State >> Time(s) >> Domain-0 0 1684 2 r----- >> 215.4 >> TestVM1 1 128 1 -b---- >> 0.5 >> >> >> Kindly help me in this situation.. >> >> I hereby specify the configuration file which i have used for your >> reference.. >> >> VM1: >> >> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >> memory=128 >> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >> disk >> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] >> name="TestVM1" >> root="/dev/xvda1 ro" >> vif=[''bridge=eth0'',] >> >> >> VM 2: >> >> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >> memory=128 >> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >> disk >> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] >> name="TestVM2" >> root="/dev/xvda2 ro" >> vif=[''bridge=eth0'',] >> >> >> >> On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote: >> >>> >>> Yes you certainly can. What kind of problem are you facing? >>> shriram >>> >>> On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com>wrote: >>> >>>> >>>> Hi Sriram, >>>> >>>> Can we run REMUS for more than one VM simultaneously ? >>>> I have tried it and found that it is not fruitful.. >>>> Kindly help me for a solution. >>>> >>>> >>>> -- >>>> Raj, >>>> CDAC. >>>> >>> >>> >> >> >> -- >> Raj, >> CDAC. >> > >-- Raj, CDAC. _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
I would suggest you upgrade your xen installation. IIRC 4.0.2 still had problems with remus. On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@gmail.com> wrote:> Hi Sriram, > > Once again, thanks for your timely reply.... > I tested as your suggesstion and i found that i don''t have any > file in the > location "/var/lib/xen" > > [root@globushead xen]# pwd > /var/lib/xen > [root@globushead xen]# ls -ll > total 0 > > Further i tested the command "xm save " and results as follows > Terminal 1: > xm save -c TestVM1 TestVM1.chkpt > > Terminal 2 : > xm save -c TestVM2 TestVM2.chkpt > > & issued command simultaneously.. I could able to get those checkpoint > files as follows & no error has occured.. > > > [root@globushead ~]# ls -ll TestVM* > -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt > -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt > > > I hereby represents the xend.log as follows for your reference...... > ************************************************ > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent> 32768 (0.94x) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were > fixups) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: > /usr/lib64/xen/bin/xc_save 33 4 0 0 0 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to get > the suspend evtchn port > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler > suspend > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) > XendDomainInfo.shutdown(suspend) > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) > XendDomainInfo.handleShutdownWatch > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) > XendDomainInfo.handleShutdownWatch > [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: > name=migrating-TestVM1 id=3 reason=suspend. > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained > entries in p2m table > [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: > name=migrating-TestVM2 id=4 reason=suspend. > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: > iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H > 20%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H > 26%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H > 38%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H > 43%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H > 55%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H > 61%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H > 73%outbuf_write: 622592 > 598004@16179212 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H > 84%outbuf_write: 2662400 > 2007024@14770192 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H > 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent > 3344Mb/s, dirtied 1127Mb/s 11049 pages > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent> 32768 (0.94x) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were > fixups) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved > [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) > XendDomainInfo.resumeDomain(3) > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) > XendDomainInfo.resumeDomain: completed > [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) > XendDomainInfo.resumeDomain(4) > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) > XendDomainInfo.resumeDomain: completed > > ************************************************ > > > > > > On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote: > >> check /var/lib/xen/suspend_evtchn_*_lock.d >> Make sure there are different lock files for each domain. >> And before starting, make sure there are no stray lock files. >> >> Try this litmus test first. (do both commands simultaneously, in two >> different >> terminals) >> terminal 1: xm save -c TestVM1 TestVM1.chkpt >> terminal 2: xm save -c TestVM2 TestVM2.chkpt >> >> If this fails, then your problem is something else. >> Check the log files /var/log/xen/xend.log for errors. >> >> >> Most important of all, you are not doing any disk replication! >> So, when the VM recovers on backup, it will be running with an >> older version of the disk and will possibly crash >> >> it should be tap2:remus:backuphost:backupport|aio:.... >> >> >> shriram >> >> On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com>wrote: >> >>> Hi, >>> Thanks for your kind reply. >>> I have installed xen 4.0.2 in RHEL5 and tried remus. With a single >>> remus command >>> running for a single virtual machine, it works fine but when i tried for >>> two virtual machine, >>> the steps which i have tried is as follows, >>> >>> i have run remus command on two terminals >>> >>> (i) *remus --no-net 1 192.168.36.233 >>> *(ii) *remus --no-net 2 192.168.36.233* >>> >>> >>> Terminal 1 : >>> The following error i got where i run command >>> >>> *remus --no-net 1 192.168.36.233* >>> >>> SUSPEND shinfo 00076122 >>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages >>> 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, >>> dirtied 2359Mb/s 72 pages >>> Total pages sent= 41424 (1.19x) >>> (of which 0 were fixups) >>> All memory is saved >>> PROF: resumed at 1311824828.014423 >>> PROF: flushed memory at 1311824828.015603 >>> PROF: suspending at 1311824828.212492 >>> SUSPEND shinfo 00076122 >>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages >>> 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, >>> dirtied 3506Mb/s 107 pages >>> Total pages sent= 41531 (1.19x) >>> (of which 0 were fixups) >>> All memory is saved >>> PROF: resumed at 1311824828.215125 >>> PROF: flushed memory at 1311824828.216482 >>> PROF: suspending at 1311824828.426094 >>> domain 1 not shut down >>> ERROR Internal error: Suspend request failed >>> ERROR Internal error: Domain appears not to have suspended >>> Save exit rc=1 >>> >>> >>> Terminal 2 : >>> The following error i got where i run command >>> >>> *remus --no-net 2 192.168.36.233* >>> >>> >>> Had 0 unexplained entries in p2m table >>> 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent >>> 74Mb/s, dirtied 1Mb/s 517 pages >>> 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, >>> dirtied 0Mb/s 0 pages >>> 3: sent 0, skipped 0, Start last iteration >>> PROF: suspending at 1311824828.327395 >>> domain 2 not shut down >>> ERROR Internal error: Suspend request failed >>> ERROR Internal error: Domain appears not to have suspended >>> Save exit rc=1 >>> >>> >>> -------------------------------------- >>> >>> On the host machine where i run remus command , i can see the two VMs in >>> suspended state >>> [root@globuscompute ~]# xm list >>> Name ID Mem VCPUs State >>> Time(s) >>> Domain-0 0 1681 2 r----- >>> 75.1 >>> TestVM1 1 128 1 >>> ---s-- 5.2 >>> TestVM2 2 128 1 >>> ---s-- 5.1 >>> >>> >>> And also the destination machine i can see one VM in "b" state >>> [root@globushead ~]# xm list >>> Name ID Mem VCPUs State >>> Time(s) >>> Domain-0 0 1684 2 r----- >>> 215.4 >>> TestVM1 1 128 1 >>> -b---- 0.5 >>> >>> >>> Kindly help me in this situation.. >>> >>> I hereby specify the configuration file which i have used for your >>> reference.. >>> >>> VM1: >>> >>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>> memory=128 >>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>> disk >>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] >>> name="TestVM1" >>> root="/dev/xvda1 ro" >>> vif=[''bridge=eth0'',] >>> >>> >>> VM 2: >>> >>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>> memory=128 >>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>> disk >>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] >>> name="TestVM2" >>> root="/dev/xvda2 ro" >>> vif=[''bridge=eth0'',] >>> >>> >>> >>> On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca >>> > wrote: >>> >>>> >>>> Yes you certainly can. What kind of problem are you facing? >>>> shriram >>>> >>>> On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com>wrote: >>>> >>>>> >>>>> Hi Sriram, >>>>> >>>>> Can we run REMUS for more than one VM simultaneously ? >>>>> I have tried it and found that it is not fruitful.. >>>>> Kindly help me for a solution. >>>>> >>>>> >>>>> -- >>>>> Raj, >>>>> CDAC. >>>>> >>>> >>>> >>> >>> >>> -- >>> Raj, >>> CDAC. >>> >> >> > > > -- > Raj, > CDAC. >_______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
remus really work ok on 4.2 unstable! walter On 31.07.2011, at 04:09, Shriram Rajagopalan <rshriram@cs.ubc.ca> wrote:> I would suggest you upgrade your xen installation. IIRC 4.0.2 still had problems with remus. > > On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@gmail.com> wrote: > Hi Sriram, > > Once again, thanks for your timely reply.... > I tested as your suggesstion and i found that i don''t have any file in the > location "/var/lib/xen" > > [root@globushead xen]# pwd > /var/lib/xen > [root@globushead xen]# ls -ll > total 0 > > Further i tested the command "xm save " and results as follows > Terminal 1: > xm save -c TestVM1 TestVM1.chkpt > > Terminal 2 : > xm save -c TestVM2 TestVM2.chkpt > > & issued command simultaneously.. I could able to get those checkpoint files as follows & no error has occured.. > > > [root@globushead ~]# ls -ll TestVM* > -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt > -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt > > > I hereby represents the xend.log as follows for your reference...... > ************************************************ > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: /usr/lib64/xen/bin/xc_save 33 4 0 0 0 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to get the suspend evtchn port > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler suspend > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) XendDomainInfo.shutdown(suspend) > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. > [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done > [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch > [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM1 id=3 reason=suspend. > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained entries in p2m table > [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM2 id=4 reason=suspend. > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H 20%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 26%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H 38%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 43%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H 55%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 61%outbuf_write: 4194304 > 4169716@12607500 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H 73%outbuf_write: 622592 > 598004@16179212 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H 84%outbuf_write: 2662400 > 2007024@14770192 > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent 3344Mb/s, dirtied 1127Mb/s 11049 pages > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) > [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved > [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(3) > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed > [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(4) > [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed > > ************************************************ > > > > > > On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca> wrote: > check /var/lib/xen/suspend_evtchn_*_lock.d > Make sure there are different lock files for each domain. > And before starting, make sure there are no stray lock files. > > Try this litmus test first. (do both commands simultaneously, in two different > terminals) > terminal 1: xm save -c TestVM1 TestVM1.chkpt > terminal 2: xm save -c TestVM2 TestVM2.chkpt > > If this fails, then your problem is something else. > Check the log files /var/log/xen/xend.log for errors. > > > Most important of all, you are not doing any disk replication! > So, when the VM recovers on backup, it will be running with an > older version of the disk and will possibly crash > > it should be tap2:remus:backuphost:backupport|aio:.... > > > shriram > > On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com> wrote: > Hi, > Thanks for your kind reply. > I have installed xen 4.0.2 in RHEL5 and tried remus. With a single remus command > running for a single virtual machine, it works fine but when i tried for two virtual machine, > the steps which i have tried is as follows, > > i have run remus command on two terminals > > (i) remus --no-net 1 192.168.36.233 > (ii) remus --no-net 2 192.168.36.233 > > > Terminal 1 : > The following error i got where i run command > > remus --no-net 1 192.168.36.233 > > SUSPEND shinfo 00076122 > delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages > 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, dirtied 2359Mb/s 72 pages > Total pages sent= 41424 (1.19x) > (of which 0 were fixups) > All memory is saved > PROF: resumed at 1311824828.014423 > PROF: flushed memory at 1311824828.015603 > PROF: suspending at 1311824828.212492 > SUSPEND shinfo 00076122 > delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages > 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, dirtied 3506Mb/s 107 pages > Total pages sent= 41531 (1.19x) > (of which 0 were fixups) > All memory is saved > PROF: resumed at 1311824828.215125 > PROF: flushed memory at 1311824828.216482 > PROF: suspending at 1311824828.426094 > domain 1 not shut down > ERROR Internal error: Suspend request failed > ERROR Internal error: Domain appears not to have suspended > Save exit rc=1 > > > Terminal 2 : > The following error i got where i run command > > remus --no-net 2 192.168.36.233 > > > Had 0 unexplained entries in p2m table > 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent 74Mb/s, dirtied 1Mb/s 517 pages > 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, dirtied 0Mb/s 0 pages > 3: sent 0, skipped 0, Start last iteration > PROF: suspending at 1311824828.327395 > domain 2 not shut down > ERROR Internal error: Suspend request failed > ERROR Internal error: Domain appears not to have suspended > Save exit rc=1 > > > -------------------------------------- > > On the host machine where i run remus command , i can see the two VMs in suspended state > [root@globuscompute ~]# xm list > Name ID Mem VCPUs State Time(s) > Domain-0 0 1681 2 r----- 75.1 > TestVM1 1 128 1 ---s-- 5.2 > TestVM2 2 128 1 ---s-- 5.1 > > > And also the destination machine i can see one VM in "b" state > [root@globushead ~]# xm list > Name ID Mem VCPUs State Time(s) > Domain-0 0 1684 2 r----- 215.4 > TestVM1 1 128 1 -b---- 0.5 > > > Kindly help me in this situation.. > > I hereby specify the configuration file which i have used for your reference.. > > VM1: > > kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" > memory=128 > ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" > disk = [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] > name="TestVM1" > root="/dev/xvda1 ro" > vif=[''bridge=eth0'',] > > > VM 2: > > kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" > memory=128 > ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" > disk = [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] > name="TestVM2" > root="/dev/xvda2 ro" > vif=[''bridge=eth0'',] > > > > On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca> wrote: > > Yes you certainly can. What kind of problem are you facing? > shriram > > On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com> wrote: > > Hi Sriram, > > Can we run REMUS for more than one VM simultaneously ? > I have tried it and found that it is not fruitful.. > Kindly help me for a solution. > > > -- > Raj, > CDAC. > > > > > -- > Raj, > CDAC. > > > > > -- > Raj, > CDAC. > > _______________________________________________ > Xen-users mailing list > Xen-users@lists.xensource.com > http://lists.xensource.com/xen-users_______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
Hi Sriram, Can i patch REMUS package alone ? or i should upgrade from XEN 4.0.2 to XEN 4.2? kindly provide me a suggestion regarding this... with regards, Raj CDAC. On Sun, Jul 31, 2011 at 7:39 AM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote:> I would suggest you upgrade your xen installation. IIRC 4.0.2 still had > problems with remus. > > > On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@gmail.com> wrote: > >> Hi Sriram, >> >> Once again, thanks for your timely reply.... >> I tested as your suggesstion and i found that i don''t have any >> file in the >> location "/var/lib/xen" >> >> [root@globushead xen]# pwd >> /var/lib/xen >> [root@globushead xen]# ls -ll >> total 0 >> >> Further i tested the command "xm save " and results as follows >> Terminal 1: >> xm save -c TestVM1 TestVM1.chkpt >> >> Terminal 2 : >> xm save -c TestVM2 TestVM2.chkpt >> >> & issued command simultaneously.. I could able to get those checkpoint >> files as follows & no error has occured.. >> >> >> [root@globushead ~]# ls -ll TestVM* >> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt >> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt >> >> >> I hereby represents the xend.log as follows for your reference...... >> ************************************************ >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>> 32768 (0.94x) >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >> fixups) >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: >> /usr/lib64/xen/bin/xc_save 33 4 0 0 0 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to >> get the suspend evtchn port >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) >> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend >> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler >> suspend >> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... >> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) >> XendDomainInfo.shutdown(suspend) >> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >> XendDomainInfo.handleShutdownWatch >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. >> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done >> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >> XendDomainInfo.handleShutdownWatch >> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: >> name=migrating-TestVM1 id=3 reason=suspend. >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained >> entries in p2m table >> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: >> name=migrating-TestVM2 id=4 reason=suspend. >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: >> iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H >> 20%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >> 26%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H >> 38%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >> 43%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H >> 55%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >> 61%outbuf_write: 4194304 > 4169716@12607500 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H >> 73%outbuf_write: 622592 > 598004@16179212 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H >> 84%outbuf_write: 2662400 > 2007024@14770192 >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H >> 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent >> 3344Mb/s, dirtied 1127Mb/s 11049 pages >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>> 32768 (0.94x) >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >> fixups) >> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >> XendDomainInfo.resumeDomain(3) >> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >> XendDomainInfo.resumeDomain: completed >> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >> XendDomainInfo.resumeDomain(4) >> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >> XendDomainInfo.resumeDomain: completed >> >> ************************************************ >> >> >> >> >> >> On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote: >> >>> check /var/lib/xen/suspend_evtchn_*_lock.d >>> Make sure there are different lock files for each domain. >>> And before starting, make sure there are no stray lock files. >>> >>> Try this litmus test first. (do both commands simultaneously, in two >>> different >>> terminals) >>> terminal 1: xm save -c TestVM1 TestVM1.chkpt >>> terminal 2: xm save -c TestVM2 TestVM2.chkpt >>> >>> If this fails, then your problem is something else. >>> Check the log files /var/log/xen/xend.log for errors. >>> >>> >>> Most important of all, you are not doing any disk replication! >>> So, when the VM recovers on backup, it will be running with an >>> older version of the disk and will possibly crash >>> >>> it should be tap2:remus:backuphost:backupport|aio:.... >>> >>> >>> shriram >>> >>> On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com>wrote: >>> >>>> Hi, >>>> Thanks for your kind reply. >>>> I have installed xen 4.0.2 in RHEL5 and tried remus. With a >>>> single remus command >>>> running for a single virtual machine, it works fine but when i tried for >>>> two virtual machine, >>>> the steps which i have tried is as follows, >>>> >>>> i have run remus command on two terminals >>>> >>>> (i) *remus --no-net 1 192.168.36.233 >>>> *(ii) *remus --no-net 2 192.168.36.233* >>>> >>>> >>>> Terminal 1 : >>>> The following error i got where i run command >>>> >>>> *remus --no-net 1 192.168.36.233* >>>> >>>> SUSPEND shinfo 00076122 >>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages >>>> 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, >>>> dirtied 2359Mb/s 72 pages >>>> Total pages sent= 41424 (1.19x) >>>> (of which 0 were fixups) >>>> All memory is saved >>>> PROF: resumed at 1311824828.014423 >>>> PROF: flushed memory at 1311824828.015603 >>>> PROF: suspending at 1311824828.212492 >>>> SUSPEND shinfo 00076122 >>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages >>>> 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, >>>> dirtied 3506Mb/s 107 pages >>>> Total pages sent= 41531 (1.19x) >>>> (of which 0 were fixups) >>>> All memory is saved >>>> PROF: resumed at 1311824828.215125 >>>> PROF: flushed memory at 1311824828.216482 >>>> PROF: suspending at 1311824828.426094 >>>> domain 1 not shut down >>>> ERROR Internal error: Suspend request failed >>>> ERROR Internal error: Domain appears not to have suspended >>>> Save exit rc=1 >>>> >>>> >>>> Terminal 2 : >>>> The following error i got where i run command >>>> >>>> *remus --no-net 2 192.168.36.233* >>>> >>>> >>>> Had 0 unexplained entries in p2m table >>>> 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent >>>> 74Mb/s, dirtied 1Mb/s 517 pages >>>> 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, >>>> dirtied 0Mb/s 0 pages >>>> 3: sent 0, skipped 0, Start last iteration >>>> PROF: suspending at 1311824828.327395 >>>> domain 2 not shut down >>>> ERROR Internal error: Suspend request failed >>>> ERROR Internal error: Domain appears not to have suspended >>>> Save exit rc=1 >>>> >>>> >>>> -------------------------------------- >>>> >>>> On the host machine where i run remus command , i can see the two VMs in >>>> suspended state >>>> [root@globuscompute ~]# xm list >>>> Name ID Mem VCPUs State >>>> Time(s) >>>> Domain-0 0 1681 2 >>>> r----- 75.1 >>>> TestVM1 1 128 1 >>>> ---s-- 5.2 >>>> TestVM2 2 128 1 >>>> ---s-- 5.1 >>>> >>>> >>>> And also the destination machine i can see one VM in "b" state >>>> [root@globushead ~]# xm list >>>> Name ID Mem VCPUs State >>>> Time(s) >>>> Domain-0 0 1684 2 r----- >>>> 215.4 >>>> TestVM1 1 128 1 >>>> -b---- 0.5 >>>> >>>> >>>> Kindly help me in this situation.. >>>> >>>> I hereby specify the configuration file which i have used for your >>>> reference.. >>>> >>>> VM1: >>>> >>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>> memory=128 >>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>> disk >>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] >>>> name="TestVM1" >>>> root="/dev/xvda1 ro" >>>> vif=[''bridge=eth0'',] >>>> >>>> >>>> VM 2: >>>> >>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>> memory=128 >>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>> disk >>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] >>>> name="TestVM2" >>>> root="/dev/xvda2 ro" >>>> vif=[''bridge=eth0'',] >>>> >>>> >>>> >>>> On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan < >>>> rshriram@cs.ubc.ca> wrote: >>>> >>>>> >>>>> Yes you certainly can. What kind of problem are you facing? >>>>> shriram >>>>> >>>>> On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com>wrote: >>>>> >>>>>> >>>>>> Hi Sriram, >>>>>> >>>>>> Can we run REMUS for more than one VM simultaneously ? >>>>>> I have tried it and found that it is not fruitful.. >>>>>> Kindly help me for a solution. >>>>>> >>>>>> >>>>>> -- >>>>>> Raj, >>>>>> CDAC. >>>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Raj, >>>> CDAC. >>>> >>> >>> >> >> >> -- >> Raj, >> CDAC. >> > >-- Raj, CDAC. _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
I dont think it would be that easy. Did you install from the source tarball? If you want to pull the bug fixes alone, you ll have to search the xen repo for patches authored by me since the 4.0.2 branch and apply those diffs on the source code and recompile/install. I am not sure if all of those patches would fit in easily but the critical ones should, hopefully. shriram On Mon, Aug 1, 2011 at 12:53 AM, Rajendar K <k.rajendar@gmail.com> wrote:> > Hi Sriram, > Can i patch REMUS package alone ? or i should upgrade from > XEN 4.0.2 to XEN 4.2? > kindly provide me a suggestion regarding this... > > with regards, > Raj > CDAC. > > > On Sun, Jul 31, 2011 at 7:39 AM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote: > >> I would suggest you upgrade your xen installation. IIRC 4.0.2 still had >> problems with remus. >> >> >> On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@gmail.com>wrote: >> >>> Hi Sriram, >>> >>> Once again, thanks for your timely reply.... >>> I tested as your suggesstion and i found that i don''t have >>> any file in the >>> location "/var/lib/xen" >>> >>> [root@globushead xen]# pwd >>> /var/lib/xen >>> [root@globushead xen]# ls -ll >>> total 0 >>> >>> Further i tested the command "xm save " and results as follows >>> Terminal 1: >>> xm save -c TestVM1 TestVM1.chkpt >>> >>> Terminal 2 : >>> xm save -c TestVM2 TestVM2.chkpt >>> >>> & issued command simultaneously.. I could able to get those checkpoint >>> files as follows & no error has occured.. >>> >>> >>> [root@globushead ~]# ls -ll TestVM* >>> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt >>> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt >>> >>> >>> I hereby represents the xend.log as follows for your reference...... >>> ************************************************ >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>>> 32768 (0.94x) >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >>> fixups) >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: >>> /usr/lib64/xen/bin/xc_save 33 4 0 0 0 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to >>> get the suspend evtchn port >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) >>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend >>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler >>> suspend >>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... >>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) >>> XendDomainInfo.shutdown(suspend) >>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >>> XendDomainInfo.handleShutdownWatch >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. >>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done >>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >>> XendDomainInfo.handleShutdownWatch >>> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has >>> shutdown: name=migrating-TestVM1 id=3 reason=suspend. >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained >>> entries in p2m table >>> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has >>> shutdown: name=migrating-TestVM2 id=4 reason=suspend. >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: >>> iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H >>> 20%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>> 26%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H >>> 38%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>> 43%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H >>> 55%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>> 61%outbuf_write: 4194304 > 4169716@12607500 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H >>> 73%outbuf_write: 622592 > 598004@16179212 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H >>> 84%outbuf_write: 2662400 > 2007024@14770192 >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H >>> 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent >>> 3344Mb/s, dirtied 1127Mb/s 11049 pages >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>>> 32768 (0.94x) >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >>> fixups) >>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >>> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >>> XendDomainInfo.resumeDomain(3) >>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >>> XendDomainInfo.resumeDomain: completed >>> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >>> XendDomainInfo.resumeDomain(4) >>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >>> XendDomainInfo.resumeDomain: completed >>> >>> ************************************************ >>> >>> >>> >>> >>> >>> On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@cs.ubc.ca >>> > wrote: >>> >>>> check /var/lib/xen/suspend_evtchn_*_lock.d >>>> Make sure there are different lock files for each domain. >>>> And before starting, make sure there are no stray lock files. >>>> >>>> Try this litmus test first. (do both commands simultaneously, in two >>>> different >>>> terminals) >>>> terminal 1: xm save -c TestVM1 TestVM1.chkpt >>>> terminal 2: xm save -c TestVM2 TestVM2.chkpt >>>> >>>> If this fails, then your problem is something else. >>>> Check the log files /var/log/xen/xend.log for errors. >>>> >>>> >>>> Most important of all, you are not doing any disk replication! >>>> So, when the VM recovers on backup, it will be running with an >>>> older version of the disk and will possibly crash >>>> >>>> it should be tap2:remus:backuphost:backupport|aio:.... >>>> >>>> >>>> shriram >>>> >>>> On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com>wrote: >>>> >>>>> Hi, >>>>> Thanks for your kind reply. >>>>> I have installed xen 4.0.2 in RHEL5 and tried remus. With a >>>>> single remus command >>>>> running for a single virtual machine, it works fine but when i tried >>>>> for two virtual machine, >>>>> the steps which i have tried is as follows, >>>>> >>>>> i have run remus command on two terminals >>>>> >>>>> (i) *remus --no-net 1 192.168.36.233 >>>>> *(ii) *remus --no-net 2 192.168.36.233* >>>>> >>>>> >>>>> Terminal 1 : >>>>> The following error i got where i run command >>>>> >>>>> *remus --no-net 1 192.168.36.233* >>>>> >>>>> SUSPEND shinfo 00076122 >>>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages >>>>> 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, >>>>> dirtied 2359Mb/s 72 pages >>>>> Total pages sent= 41424 (1.19x) >>>>> (of which 0 were fixups) >>>>> All memory is saved >>>>> PROF: resumed at 1311824828.014423 >>>>> PROF: flushed memory at 1311824828.015603 >>>>> PROF: suspending at 1311824828.212492 >>>>> SUSPEND shinfo 00076122 >>>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages >>>>> 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, >>>>> dirtied 3506Mb/s 107 pages >>>>> Total pages sent= 41531 (1.19x) >>>>> (of which 0 were fixups) >>>>> All memory is saved >>>>> PROF: resumed at 1311824828.215125 >>>>> PROF: flushed memory at 1311824828.216482 >>>>> PROF: suspending at 1311824828.426094 >>>>> domain 1 not shut down >>>>> ERROR Internal error: Suspend request failed >>>>> ERROR Internal error: Domain appears not to have suspended >>>>> Save exit rc=1 >>>>> >>>>> >>>>> Terminal 2 : >>>>> The following error i got where i run command >>>>> >>>>> *remus --no-net 2 192.168.36.233* >>>>> >>>>> >>>>> Had 0 unexplained entries in p2m table >>>>> 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent >>>>> 74Mb/s, dirtied 1Mb/s 517 pages >>>>> 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, >>>>> dirtied 0Mb/s 0 pages >>>>> 3: sent 0, skipped 0, Start last iteration >>>>> PROF: suspending at 1311824828.327395 >>>>> domain 2 not shut down >>>>> ERROR Internal error: Suspend request failed >>>>> ERROR Internal error: Domain appears not to have suspended >>>>> Save exit rc=1 >>>>> >>>>> >>>>> -------------------------------------- >>>>> >>>>> On the host machine where i run remus command , i can see the two VMs >>>>> in suspended state >>>>> [root@globuscompute ~]# xm list >>>>> Name ID Mem VCPUs State >>>>> Time(s) >>>>> Domain-0 0 1681 2 >>>>> r----- 75.1 >>>>> TestVM1 1 128 1 >>>>> ---s-- 5.2 >>>>> TestVM2 2 128 1 >>>>> ---s-- 5.1 >>>>> >>>>> >>>>> And also the destination machine i can see one VM in "b" state >>>>> [root@globushead ~]# xm list >>>>> Name ID Mem VCPUs State >>>>> Time(s) >>>>> Domain-0 0 1684 2 >>>>> r----- 215.4 >>>>> TestVM1 1 128 1 >>>>> -b---- 0.5 >>>>> >>>>> >>>>> Kindly help me in this situation.. >>>>> >>>>> I hereby specify the configuration file which i have used for your >>>>> reference.. >>>>> >>>>> VM1: >>>>> >>>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>>> memory=128 >>>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>>> disk >>>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] >>>>> name="TestVM1" >>>>> root="/dev/xvda1 ro" >>>>> vif=[''bridge=eth0'',] >>>>> >>>>> >>>>> VM 2: >>>>> >>>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>>> memory=128 >>>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>>> disk >>>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] >>>>> name="TestVM2" >>>>> root="/dev/xvda2 ro" >>>>> vif=[''bridge=eth0'',] >>>>> >>>>> >>>>> >>>>> On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan < >>>>> rshriram@cs.ubc.ca> wrote: >>>>> >>>>>> >>>>>> Yes you certainly can. What kind of problem are you facing? >>>>>> shriram >>>>>> >>>>>> On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com>wrote: >>>>>> >>>>>>> >>>>>>> Hi Sriram, >>>>>>> >>>>>>> Can we run REMUS for more than one VM simultaneously >>>>>>> ? >>>>>>> I have tried it and found that it is not fruitful.. >>>>>>> Kindly help me for a solution. >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Raj, >>>>>>> CDAC. >>>>>>> >>>>>> >>>>>> >>>>> >>>>> >>>>> -- >>>>> Raj, >>>>> CDAC. >>>>> >>>> >>>> >>> >>> >>> -- >>> Raj, >>> CDAC. >>> >> >> > > > -- > Raj, > CDAC. >_______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
He forgot to cc the list. ---------- Forwarded message ---------- From: Tiantian Liu <tiantianl@gmail.com> Date: Tue, Aug 2, 2011 at 9:47 AM Subject: Re: [Xen-users] Re: Reg REMUS on two VMs To: rshriram@cs.ubc.ca Hi: I have built xen+remus successfully on CentOS5.5. In fact, there are many bugs in Xen4.0 with remus. So I tell you: Xen4.0 no chance! I built Xen4.2 unstable and kernel-2.6.32.40, It works with remus well! you can follow the instructions: http://remusha.wikidot.com/configuring-and-installing-remus good luck! On Tue, Aug 2, 2011 at 10:44 AM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote:> I dont think it would be that easy. Did you install from the source > tarball? > If you want to pull the bug fixes alone, you ll have to search the xen repo > for > patches authored by me since the 4.0.2 branch and apply those diffs on the > source code and recompile/install. I am not sure if all of those patches > would > fit in easily but the critical ones should, hopefully. > > shriram > > On Mon, Aug 1, 2011 at 12:53 AM, Rajendar K <k.rajendar@gmail.com>wrote: > >> >> Hi Sriram, >> Can i patch REMUS package alone ? or i should upgrade from >> XEN 4.0.2 to XEN 4.2? >> kindly provide me a suggestion regarding this... >> >> with regards, >> Raj >> CDAC. >> >> >> On Sun, Jul 31, 2011 at 7:39 AM, Shriram Rajagopalan <rshriram@cs.ubc.ca>wrote: >> >>> I would suggest you upgrade your xen installation. IIRC 4.0.2 still had >>> problems with remus. >>> >>> >>> On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@gmail.com>wrote: >>> >>>> Hi Sriram, >>>> >>>> Once again, thanks for your timely reply.... >>>> I tested as your suggesstion and i found that i don''t have >>>> any file in the >>>> location "/var/lib/xen" >>>> >>>> [root@globushead xen]# pwd >>>> /var/lib/xen >>>> [root@globushead xen]# ls -ll >>>> total 0 >>>> >>>> Further i tested the command "xm save " and results as follows >>>> Terminal 1: >>>> xm save -c TestVM1 TestVM1.chkpt >>>> >>>> Terminal 2 : >>>> xm save -c TestVM2 TestVM2.chkpt >>>> >>>> & issued command simultaneously.. I could able to get those checkpoint >>>> files as follows & no error has occured.. >>>> >>>> >>>> [root@globushead ~]# ls -ll TestVM* >>>> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt >>>> -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt >>>> >>>> >>>> I hereby represents the xend.log as follows for your reference...... >>>> ************************************************ >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>>>> 32768 (0.94x) >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >>>> fixups) >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >>>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: >>>> /usr/lib64/xen/bin/xc_save 33 4 0 0 0 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to >>>> get the suspend evtchn port >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) >>>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend >>>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In >>>> saveInputHandler suspend >>>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... >>>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) >>>> XendDomainInfo.shutdown(suspend) >>>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >>>> XendDomainInfo.handleShutdownWatch >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. >>>> [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done >>>> [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) >>>> XendDomainInfo.handleShutdownWatch >>>> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has >>>> shutdown: name=migrating-TestVM1 id=3 reason=suspend. >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained >>>> entries in p2m table >>>> [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has >>>> shutdown: name=migrating-TestVM2 id=4 reason=suspend. >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory >>>> pages: iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 11%^H^H^H^H 20%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 26%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 32%^H^H^H^H 38%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 43%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 49%^H^H^H^H 55%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 61%outbuf_write: 4194304 > 4169716@12607500 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 67%^H^H^H^H 73%outbuf_write: 622592 > 598004@16179212 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 78%^H^H^H^H 84%outbuf_write: 2662400 > 2007024@14770192 >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H >>>> 90%^H^H^H^H 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target >>>> 0%, sent 3344Mb/s, dirtied 1127Mb/s 11049 pages >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent>>>> 32768 (0.94x) >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were >>>> fixups) >>>> [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved >>>> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >>>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >>>> XendDomainInfo.resumeDomain(3) >>>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >>>> XendDomainInfo.resumeDomain: completed >>>> [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0 >>>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) >>>> XendDomainInfo.resumeDomain(4) >>>> [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) >>>> XendDomainInfo.resumeDomain: completed >>>> >>>> ************************************************ >>>> >>>> >>>> >>>> >>>> >>>> On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan < >>>> rshriram@cs.ubc.ca> wrote: >>>> >>>>> check /var/lib/xen/suspend_evtchn_*_lock.d >>>>> Make sure there are different lock files for each domain. >>>>> And before starting, make sure there are no stray lock files. >>>>> >>>>> Try this litmus test first. (do both commands simultaneously, in two >>>>> different >>>>> terminals) >>>>> terminal 1: xm save -c TestVM1 TestVM1.chkpt >>>>> terminal 2: xm save -c TestVM2 TestVM2.chkpt >>>>> >>>>> If this fails, then your problem is something else. >>>>> Check the log files /var/log/xen/xend.log for errors. >>>>> >>>>> >>>>> Most important of all, you are not doing any disk replication! >>>>> So, when the VM recovers on backup, it will be running with an >>>>> older version of the disk and will possibly crash >>>>> >>>>> it should be tap2:remus:backuphost:backupport|aio:.... >>>>> >>>>> >>>>> shriram >>>>> >>>>> On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@gmail.com>wrote: >>>>> >>>>>> Hi, >>>>>> Thanks for your kind reply. >>>>>> I have installed xen 4.0.2 in RHEL5 and tried remus. With a >>>>>> single remus command >>>>>> running for a single virtual machine, it works fine but when i tried >>>>>> for two virtual machine, >>>>>> the steps which i have tried is as follows, >>>>>> >>>>>> i have run remus command on two terminals >>>>>> >>>>>> (i) *remus --no-net 1 192.168.36.233 >>>>>> *(ii) *remus --no-net 2 192.168.36.233* >>>>>> >>>>>> >>>>>> Terminal 1 : >>>>>> The following error i got where i run command >>>>>> >>>>>> *remus --no-net 1 192.168.36.233* >>>>>> >>>>>> SUSPEND shinfo 00076122 >>>>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages >>>>>> 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, >>>>>> dirtied 2359Mb/s 72 pages >>>>>> Total pages sent= 41424 (1.19x) >>>>>> (of which 0 were fixups) >>>>>> All memory is saved >>>>>> PROF: resumed at 1311824828.014423 >>>>>> PROF: flushed memory at 1311824828.015603 >>>>>> PROF: suspending at 1311824828.212492 >>>>>> SUSPEND shinfo 00076122 >>>>>> delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages >>>>>> 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent >>>>>> 3506Mb/s, dirtied 3506Mb/s 107 pages >>>>>> Total pages sent= 41531 (1.19x) >>>>>> (of which 0 were fixups) >>>>>> All memory is saved >>>>>> PROF: resumed at 1311824828.215125 >>>>>> PROF: flushed memory at 1311824828.216482 >>>>>> PROF: suspending at 1311824828.426094 >>>>>> domain 1 not shut down >>>>>> ERROR Internal error: Suspend request failed >>>>>> ERROR Internal error: Domain appears not to have suspended >>>>>> Save exit rc=1 >>>>>> >>>>>> >>>>>> Terminal 2 : >>>>>> The following error i got where i run command >>>>>> >>>>>> *remus --no-net 2 192.168.36.233* >>>>>> >>>>>> >>>>>> Had 0 unexplained entries in p2m table >>>>>> 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent >>>>>> 74Mb/s, dirtied 1Mb/s 517 pages >>>>>> 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent >>>>>> 77Mb/s, dirtied 0Mb/s 0 pages >>>>>> 3: sent 0, skipped 0, Start last iteration >>>>>> PROF: suspending at 1311824828.327395 >>>>>> domain 2 not shut down >>>>>> ERROR Internal error: Suspend request failed >>>>>> ERROR Internal error: Domain appears not to have suspended >>>>>> Save exit rc=1 >>>>>> >>>>>> >>>>>> -------------------------------------- >>>>>> >>>>>> On the host machine where i run remus command , i can see the two VMs >>>>>> in suspended state >>>>>> [root@globuscompute ~]# xm list >>>>>> Name ID Mem VCPUs >>>>>> State Time(s) >>>>>> Domain-0 0 1681 2 >>>>>> r----- 75.1 >>>>>> TestVM1 1 128 1 >>>>>> ---s-- 5.2 >>>>>> TestVM2 2 128 1 >>>>>> ---s-- 5.1 >>>>>> >>>>>> >>>>>> And also the destination machine i can see one VM in "b" state >>>>>> [root@globushead ~]# xm list >>>>>> Name ID Mem VCPUs >>>>>> State Time(s) >>>>>> Domain-0 0 1684 2 >>>>>> r----- 215.4 >>>>>> TestVM1 1 128 1 >>>>>> -b---- 0.5 >>>>>> >>>>>> >>>>>> Kindly help me in this situation.. >>>>>> >>>>>> I hereby specify the configuration file which i have used for your >>>>>> reference.. >>>>>> >>>>>> VM1: >>>>>> >>>>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>>>> memory=128 >>>>>> >>>>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>>>> disk >>>>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w''] >>>>>> name="TestVM1" >>>>>> root="/dev/xvda1 ro" >>>>>> vif=[''bridge=eth0'',] >>>>>> >>>>>> >>>>>> VM 2: >>>>>> >>>>>> kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" >>>>>> memory=128 >>>>>> >>>>>> ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" >>>>>> disk >>>>>> [''tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w''] >>>>>> name="TestVM2" >>>>>> root="/dev/xvda2 ro" >>>>>> vif=[''bridge=eth0'',] >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan < >>>>>> rshriram@cs.ubc.ca> wrote: >>>>>> >>>>>>> >>>>>>> Yes you certainly can. What kind of problem are you facing? >>>>>>> shriram >>>>>>> >>>>>>> On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@gmail.com>wrote: >>>>>>> >>>>>>>> >>>>>>>> Hi Sriram, >>>>>>>> >>>>>>>> Can we run REMUS for more than one VM simultaneously >>>>>>>> ? >>>>>>>> I have tried it and found that it is not fruitful.. >>>>>>>> Kindly help me for a solution. >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Raj, >>>>>>>> CDAC. >>>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Raj, >>>>>> CDAC. >>>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Raj, >>>> CDAC. >>>> >>> >>> >> >> >> -- >> Raj, >> CDAC. >> > > > _______________________________________________ > Xen-users mailing list > Xen-users@lists.xensource.com > http://lists.xensource.com/xen-users >_______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users
I have the following problem when start a vm in xen 4.0.1 with 2GB memory and 4GB of max-memory and run the tool sosreport -a the vm crash If before launching sosreport-a modify the memory with xm mem-set 2.5 Gb vm everything is ok anyone can help me _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users