Hello I have been experimenting with xen 2.0 and before that with xeno-unstable. It has proved interesting and promising However, the last few updates have been problematic. In particular it looks as if the ext3 filesystem is not correctly handled. I typically develop in the xen0 domain, connecting over ssh from a different machine Recent history: 20040907: occasional kernel BUGs 20040908: instant reboot loop when booting xen 20040909: mysterious filesystem problems 20040910: mysterious filesystem problems The main symptoms are: rpm -qa crashes with floating point exception or otherwise dies some way into generating a list of installed rpms. Looked like corruption of rpm database. The ''rpm --rebuilddb'' operation also failed, so I was investigating ways of reconstructing the database when I encountered other problems - eg: mv bla* dirname complained that the last argument was not a directory when it quite clearly was, as in ''mkdir newdirectory; mv bla* newdirectory;'' Neither of these problems arise when I reboot the same system using my standard kernel (2.6.7 compiled from source). In each case I am booting into the same environment (mandrake 10.0 community + updates, running in init level 3). I haven''t tried with any other filesystem, but have tried using gcc versions 3.3.2 and 3.4.0. I don''t know if anyone else has encountered this kind of problem. The kernel BUG problem looks as if it may be related to preemption in the kernel. It occurred when I had dom0 + 6 domU systems running, and would happen both in dom0 and in the domU systems. A typcial example: ------------[ cut here ]------------ kernel BUG at mm/rmap.c:251! invalid operand: 0000 [#1] PREEMPT Modules linked in: CPU: 0 EIP: 0061:[<c0149ca9>] Not tainted EFLAGS: 00010246 (2.6.8.1-xenU) EIP is at page_referenced+0x15e/0x175 eax: c2050000 ebx: c1119858 ecx: 00000f30 edx: c1c52480 esi: c1119840 edi: c1c52480 ebp: 00000000 esp: c2051e4c ds: 0069 es: 0069 ss: 0069 Process kswapd0 (pid: 18, threadinfo=c2050000 task=c204b080) Stack: c111e600 c18360cc c2051e6c 00000009 00000009 00000009 00000000 00000000 00000001 c2050000 c02bc9c0 00000000 c013f387 c1119858 c1119840 c02bc9c0 c2050000 c0141207 c1119840 c02bcb08 c02bc9dc 00000001 00000020 00000020 Call Trace: [<c013f387>] __pagevec_release+0x25/0x30 [<c0141207>] refill_inactive_zone+0x591/0x662 [<c013ffbc>] shrink_slab+0x98/0x1a9 [<c014134a>] shrink_zone+0x72/0xb6 [<c0141784>] balance_pgdat+0x1d4/0x2b1 [<c0141920>] kswapd+0xbf/0xc1 [<c011a245>] autoremove_wake_function+0x0/0x57 [<c010d7c6>] ret_from_fork+0x6/0x14 [<c011a245>] autoremove_wake_function+0x0/0x57 [<c0141861>] kswapd+0x0/0xc1 [<c010ee91>] kernel_thread_helper+0x5/0xb Code: 0f 0b fb 00 c9 2a 29 c0 e9 08 ff ff ff e8 a0 c1 13 00 e9 53 <6>note: kswapd0[18] exited with preempt_count 2 ------------[ cut here ]------------ The following instance mentioned ''scheduling while atomic''. I can send more log output if it would be useful - I was waiting to see if relevant-sounding changesets had done the trick. On a different topic, I have never yet seen the web interface work for long enough to be useful - it always crashes with the int-text type clash that was mentioned somewhere in the changelogs as having been fixed. Regards Peri Hankey ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
> I have been experimenting with xen 2.0 and before that with xeno-unstable. It > has proved interesting and promising > > However, the last few updates have been problematic. In particular it looks as > if the ext3 filesystem is not correctly handled.Can you give us an idea when it last worked reliably for you?> Recent history: > > 20040907: occasional kernel BUGs > 20040908: instant reboot loop when booting xen > 20040909: mysterious filesystem problems > 20040910: mysterious filesystem problemsI suspect that these are all a result of experiencing some virtual memory bug. It''s pretty odd, as we haven''t seen any problems in our testing, but its possible that our automated testing doesn''t run enough concurrent domains to tweak it. Can you tell us a bit more about your setup, e.g. SMP? Xeon or Opteron? What are the other domains doing when ''rpm -qa'' explodes? What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve never used mandrake, but I doubt this is significant) If you can narrow down to a simple to reproduce test case I''m sure we can find and fix this.> On a different topic, I have never yet seen the web interface work for long > enough to be useful - it always crashes with the int-text type clash that was > mentioned somewhere in the changelogs as having been fixed.The web interface is very much a work-in-progress that we''re hoping other people will contribute on. Ian ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
Ian Pratt wrote:>>I have been experimenting with xen 2.0 and before that with xeno-unstable. It >>has proved interesting and promising >> >>However, the last few updates have been problematic. In particular it looks as >>if the ext3 filesystem is not correctly handled. >> >> > >Can you give us an idea when it last worked reliably for you? > > >The previous snapshot - 20040906 - worked ok but encountered "Kernel panic: Unable to reduce memory reservation ..." as discussed elsewhere in this list.>>Recent history: >> >> 20040907: occasional kernel BUGs >> 20040908: instant reboot loop when booting xen >> 20040909: mysterious filesystem problems >> 20040910: mysterious filesystem problems >> >> > >I suspect that these are all a result of experiencing some >virtual memory bug. > >At present the snapshot for 20040911 seems to show no signs of the filesystem problem, so I probably was seeing a side effect of some of the changes you were making in the pagetable system. However I have seen at least one instance of the "Kernel panic: Unable to reduce memory reservation ..." problem.>It''s pretty odd, as we haven''t seen any problems in our testing, >but its possible that our automated testing doesn''t run enough >concurrent domains to tweak it. > >Can you tell us a bit more about your setup, e.g. SMP? Xeon or >Opteron? > > >CPU: AMD Athlon(tm) XP 2400+ stepping 01 (''uname -a'' shows architecture as i686). 512Mbytes RAM. El-cheapo-self-build(tm). The Xen kernels are built straight out of the snapshot with no local configuration.>What are the other domains doing when ''rpm -qa'' explodes? > >Nothing much (no exerciser running). But running sshd, postfix, apache, proftpd, webmin, xinetd. Each xenU domain has only 32Mbytes, so the virtual memory would have been fairly active. Each domain would like to have a swap partition but doesn''t get one as yet largely because I haven''t got round to changing the configuration from the minimum required to make untrusted domains boot and run. But I''m not clear how swap and xen are intended to interact, and whether I should configure swap devices for untrusted domains.>What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve >never used mandrake, but I doubt this is significant) > >....$ ldd `which rpm` linux-gate.so.1 => (0xfbffd000) librpm-4.2.so => /usr/lib/librpm-4.2.so (0x40029000) librpmdb-4.2.so => /usr/lib/librpmdb-4.2.so (0x40082000) librpmio-4.2.so => /usr/lib/librpmio-4.2.so (0x4016d000) libpopt.so.0 => /lib/libpopt.so.0 (0x401af000) libelf.so.1 => /usr/lib/libelf.so.1 (0x401b7000) libbeecrypt.so.6 => /usr/lib/libbeecrypt.so.6 (0x401c9000) librt.so.1 => /lib/i686/librt.so.1 (0x401e7000) libpthread.so.0 => /lib/i686/libpthread.so.0 (0x401fa000) libbz2.so.1 => /usr/lib/libbz2.so.1 (0x4024c000) libc.so.6 => /lib/i686/libc.so.6 (0x4025b000) /lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0x40000000) I very much doubt if use of Mandrake has much to do with it. My main issue with Mandrake is that the standard boot files need quite a lot of tweaking if you want to have an nfs root. But this also applies to some other distributions I''ve tried.>If you can narrow down to a simple to reproduce test case I''m >surewe can find and fix this. > >It currently looks to have been a passing glitch.> > > >>On a different topic, I have never yet seen the web interface work for long >>enough to be useful - it always crashes with the int-text type clash that was >>mentioned somewhere in the changelogs as having been fixed. >> >> > >The web interface is very much a work-in-progress that we''re >hoping other people will contribute on. > >I realised that, but hadn''t yet made very serious attempts to find my way into the guts of the python. Thanks Peri ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
>>> On a different topic, I have never yet seen the web interface work >>> for long >>> enough to be useful - it always crashes with the int-text type clash >>> that was >>> mentioned somewhere in the changelogs as having been fixed. >>> >> >> The web interface is very much a work-in-progress that we''re >> hoping other people will contribute on. >> > I realised that, but hadn''t yet made very serious attempts to find my > way into the guts of the python. >Hi If you can give me a pointer to where the problem is, I''ll fix it. I''ve just been in america for a week, and am off to europe soon, but I''ll give it a stab. The int-string clash is because we changed xend from addressing domains by id to name, but the functions where supposed to still accept ints... Also, I think the wizard is broke again. Cheers Tom ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
One simple problem: ... # xensv start Traceback (most recent call last): File "/usr/sbin/xensv", line 25, in ? from xen.xend.server.params import PID_FILE as XEND_PID_FILE ImportError: cannot import name PID_FILE Then after modifying xensv line line 25 to read "from xen.xend.server.params import XEND_PID_FILE" ... # xensv start ... # start a domain using a tried and tested sxp file .... ... try web access ... ... # Traceback (most recent call last): File "/usr/lib/python2.3/site-packages/twisted/protocols/basic.py", line 223, in dataReceived why = self.lineReceived(line) File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", line 965, in lineReceived self.allContentReceived() File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", line 1006, in allContentReceived req.requestReceived(command, path, version) File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", line 557, in requestReceived self.process() --- <exception caught here> --- File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line 165, in process self.render(resrc) File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line 172, in render body = resrc.render(self) File "/usr/lib/python2.3/site-packages/twisted/web/resource.py", line 201, in render return m(request) File "/usr/lib/python2.3/site-packages/xen/sv/HTMLBase.py", line 17, in render_GET self.write_BODY( request ) File "/usr/lib/python2.3/site-packages/xen/sv/Main.py", line 58, in write_BODY module( self.mainUrlWriter( modName ) ).write_MENU( request ) File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 16, in write_MENU return self.write_BODY( request, head=True, long=False ) File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 45, in write_BODY self.write_DOMAIN( request, getDomInfoHash( domain ), long ) File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 66, in write_DOMAIN request.write( "<img src=''images/small-pause.png'' onclick=''doOp2( \"pause\", \"%(dom)-4d\" )''>" % domInfoHash ) exceptions.TypeError: int argument required I hope that helps - as I explained, I haven''t yet dug into it, and I''m learning python as I go. Peri Tom Wilkie wrote:>>>> On a different topic, I have never yet seen the web interface work >>>> for long >>>> enough to be useful - it always crashes with the int-text type >>>> clash that was >>>> mentioned somewhere in the changelogs as having been fixed. >>>> >>> >>> The web interface is very much a work-in-progress that we''re >>> hoping other people will contribute on. >>> >> I realised that, but hadn''t yet made very serious attempts to find my >> way into the guts of the python. >> > > Hi > > If you can give me a pointer to where the problem is, I''ll fix it. > I''ve just been in america for a week, and am off to europe soon, but > I''ll give it a stab. > > The int-string clash is because we changed xend from addressing > domains by id to name, but the functions where supposed to still > accept ints... Also, I think the wizard is broke again. > > Cheers > > Tom > >------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
Easy fix - change line 66 of file "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", to: request.write( "<img src=''images/small-pause.png'' onclick=''doOp2( \"pause\", \"%(dom)s\" )''>" % domInfoHash ) be careful of the indentation. I will merge in a fix 2moro. Cheers Tom ps heres a tip: read error logs bottom to top in python. On 12 Sep 2004, at 16:07, Peri Hankey wrote:> One simple problem: > > ... # xensv start > Traceback (most recent call last): > File "/usr/sbin/xensv", line 25, in ? > from xen.xend.server.params import PID_FILE as XEND_PID_FILE > ImportError: cannot import name PID_FILE > > Then after modifying xensv line line 25 to read "from > xen.xend.server.params import XEND_PID_FILE" > > > ... # xensv start > ... # start a domain using a tried and tested sxp file .... > ... try web access ... > > ... # Traceback (most recent call last): > File "/usr/lib/python2.3/site-packages/twisted/protocols/basic.py", > line 223, in dataReceived > why = self.lineReceived(line) > File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", > line 965, in lineReceived > self.allContentReceived() > File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", > line 1006, in allContentReceived > req.requestReceived(command, path, version) > File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py", > line 557, in requestReceived > self.process() > --- <exception caught here> --- > File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line > 165, in process > self.render(resrc) > File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line > 172, in render > body = resrc.render(self) > File "/usr/lib/python2.3/site-packages/twisted/web/resource.py", line > 201, in render > return m(request) > File "/usr/lib/python2.3/site-packages/xen/sv/HTMLBase.py", line 17, > in render_GET > self.write_BODY( request ) > File "/usr/lib/python2.3/site-packages/xen/sv/Main.py", line 58, in > write_BODY > module( self.mainUrlWriter( modName ) ).write_MENU( request ) > File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 16, > in write_MENU > return self.write_BODY( request, head=True, long=False ) > File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 45, > in write_BODY > self.write_DOMAIN( request, getDomInfoHash( domain ), long ) > File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 66, > in write_DOMAIN > request.write( "<img src=''images/small-pause.png'' onclick=''doOp2( > \"pause\", \"%(dom)-4d\" )''>" % domInfoHash ) > exceptions.TypeError: int argument required > > I hope that helps - as I explained, I haven''t yet dug into it, and I''m > learning python as I go. > > Peri > > Tom Wilkie wrote: > >>>>> On a different topic, I have never yet seen the web interface work >>>>> for long >>>>> enough to be useful - it always crashes with the int-text type >>>>> clash that was >>>>> mentioned somewhere in the changelogs as having been fixed. >>>>> >>>> >>>> The web interface is very much a work-in-progress that we''re >>>> hoping other people will contribute on. >>> I realised that, but hadn''t yet made very serious attempts to find >>> my way into the guts of the python. >>> >> >> Hi >> >> If you can give me a pointer to where the problem is, I''ll fix it. >> I''ve just been in america for a week, and am off to europe soon, but >> I''ll give it a stab. >> >> The int-string clash is because we changed xend from addressing >> domains by id to name, but the functions where supposed to still >> accept ints... Also, I think the wizard is broke again. >> >> Cheers >> >> Tom >> >> >------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
A VM bug would account for the repeated glitch that I am running into. In domain0 the xenlinux ernel will eventaully enter an infinite loop of the same message. "null pointer encountered in RC ring. Skiping". (not a word for word copy, haven''t captured via serial port yet). It appears that if I leave the xen-br0 interface mis-configured so that Domain-0 can NOT pass packets destined to other domains it triggers the bug much more quickly. This has happened to snapshots taken at the following dates: 20040905, 20040908, 20040909 and did NOT happen with snapshots at or prior to 20040903. Yes, I made certain in each case that all fo the kernels and libs were not infected by prior snapshot versions.(a side benifit of the .deb packages, when replaced by a new version, all of the old version files are deleted prior to the install fo the new version.) If you need any specific tests, traces, or information, please feel free to reply to me with testing instructions and parameters. :) Hope this helps! Brian On Sun, 2004-09-12 at 02:05, Ian Pratt wrote:> > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > has proved interesting and promising > > > > However, the last few updates have been problematic. In particular it looks as > > if the ext3 filesystem is not correctly handled. > > Can you give us an idea when it last worked reliably for you? > > > Recent history: > > > > 20040907: occasional kernel BUGs > > 20040908: instant reboot loop when booting xen > > 20040909: mysterious filesystem problems > > 20040910: mysterious filesystem problems > > I suspect that these are all a result of experiencing some > virtual memory bug. > > It''s pretty odd, as we haven''t seen any problems in our testing, > but its possible that our automated testing doesn''t run enough > concurrent domains to tweak it. > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > Opteron? > > What are the other domains doing when ''rpm -qa'' explodes? > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > never used mandrake, but I doubt this is significant) > > If you can narrow down to a simple to reproduce test case I''m > sure we can find and fix this. > > > On a different topic, I have never yet seen the web interface work for long > > enough to be useful - it always crashes with the int-text type clash that was > > mentioned somewhere in the changelogs as having been fixed. > > The web interface is very much a work-in-progress that we''re > hoping other people will contribute on. > > Ian > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
What network driver are you using? The sis900 driver has a message similar to what you describe: "NULL pointer encountered in Rx ring, skipping" Looks like a path that isn''t supposed to trigger, but the driver writer notes som elatent bug(s) that perhaps Xen + high load triggers? -- Keir> A VM bug would account for the repeated glitch that I am running into. > In domain0 the xenlinux ernel will eventaully enter an infinite loop of > the same message. "null pointer encountered in RC ring. Skiping". (not a > word for word copy, haven''t captured via serial port yet). > > It appears that if I leave the xen-br0 interface mis-configured so that > Domain-0 can NOT pass packets destined to other domains it triggers the > bug much more quickly. > > This has happened to snapshots taken at the following dates: 20040905, > 20040908, 20040909 and did NOT happen with snapshots at or prior to > 20040903. > > Yes, I made certain in each case that all fo the kernels and libs were > not infected by prior snapshot versions.(a side benifit of the .deb > packages, when replaced by a new version, all of the old version files > are deleted prior to the install fo the new version.) > > If you need any specific tests, traces, or information, please feel free > to reply to me with testing instructions and parameters. :) > > Hope this helps! > > Brian > > On Sun, 2004-09-12 at 02:05, Ian Pratt wrote: > > > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > > has proved interesting and promising > > > > > > However, the last few updates have been problematic. In particular it looks as > > > if the ext3 filesystem is not correctly handled. > > > > Can you give us an idea when it last worked reliably for you? > > > > > Recent history: > > > > > > 20040907: occasional kernel BUGs > > > 20040908: instant reboot loop when booting xen > > > 20040909: mysterious filesystem problems > > > 20040910: mysterious filesystem problems > > > > I suspect that these are all a result of experiencing some > > virtual memory bug. > > > > It''s pretty odd, as we haven''t seen any problems in our testing, > > but its possible that our automated testing doesn''t run enough > > concurrent domains to tweak it. > > > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > > Opteron? > > > > What are the other domains doing when ''rpm -qa'' explodes? > > > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > > never used mandrake, but I doubt this is significant) > > > > If you can narrow down to a simple to reproduce test case I''m > > sure we can find and fix this. > > > > > On a different topic, I have never yet seen the web interface work for long > > > enough to be useful - it always crashes with the int-text type clash that was > > > mentioned somewhere in the changelogs as having been fixed. > > > > The web interface is very much a work-in-progress that we''re > > hoping other people will contribute on. > > > > Ian > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > who ports your project to Linux PPC the best. Sponsored by IBM. > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > _______________________________________________ > > Xen-devel mailing list > > Xen-devel@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
On 12 Sep 2004, at 09:35, Peri Hankey wrote:> The previous snapshot - 20040906 - worked ok but encountered "Kernel > panic: Unable to reduce memory reservation ..." as discussed elsewhere > in this list. >Just to follow up on that specific error; I had that panic from xenU domains running 20040904, but have had no problems since turning off writable pagetables in both xen0 and xenU kernels -- I''m not entirely sure what I''m missing out on, but it does seem stable enough. Chris. ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
Actually, it IS an sis900. 8-P I didn''t even stop to think of looking for the module with the error message. *thwaps self* In past kernels, the sis900 driver has behaved correctly. I DO have writeable page tables enabled though. The server was sitting idle at the time of the problem triggering each time. So I don''t think it was a load issue.... Could you point me to the location you found that path at? I''ll do a bit of debugging with it as a moduel to see if I can''t figure it out... On Sun, 2004-09-12 at 15:09, Keir Fraser wrote:> What network driver are you using? The sis900 driver has a message > similar to what you describe: > "NULL pointer encountered in Rx ring, skipping" > > Looks like a path that isn''t supposed to trigger, but the driver > writer notes som elatent bug(s) that perhaps Xen + high load triggers? > > -- Keir > > > A VM bug would account for the repeated glitch that I am running into. > > In domain0 the xenlinux ernel will eventaully enter an infinite loop of > > the same message. "null pointer encountered in RC ring. Skiping". (not a > > word for word copy, haven''t captured via serial port yet). > > > > It appears that if I leave the xen-br0 interface mis-configured so that > > Domain-0 can NOT pass packets destined to other domains it triggers the > > bug much more quickly. > > > > This has happened to snapshots taken at the following dates: 20040905, > > 20040908, 20040909 and did NOT happen with snapshots at or prior to > > 20040903. > > > > Yes, I made certain in each case that all fo the kernels and libs were > > not infected by prior snapshot versions.(a side benifit of the .deb > > packages, when replaced by a new version, all of the old version files > > are deleted prior to the install fo the new version.) > > > > If you need any specific tests, traces, or information, please feel free > > to reply to me with testing instructions and parameters. :) > > > > Hope this helps! > > > > Brian > > > > On Sun, 2004-09-12 at 02:05, Ian Pratt wrote: > > > > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > > > has proved interesting and promising > > > > > > > > However, the last few updates have been problematic. In particular it looks as > > > > if the ext3 filesystem is not correctly handled. > > > > > > Can you give us an idea when it last worked reliably for you? > > > > > > > Recent history: > > > > > > > > 20040907: occasional kernel BUGs > > > > 20040908: instant reboot loop when booting xen > > > > 20040909: mysterious filesystem problems > > > > 20040910: mysterious filesystem problems > > > > > > I suspect that these are all a result of experiencing some > > > virtual memory bug. > > > > > > It''s pretty odd, as we haven''t seen any problems in our testing, > > > but its possible that our automated testing doesn''t run enough > > > concurrent domains to tweak it. > > > > > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > > > Opteron? > > > > > > What are the other domains doing when ''rpm -qa'' explodes? > > > > > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > > > never used mandrake, but I doubt this is significant) > > > > > > If you can narrow down to a simple to reproduce test case I''m > > > sure we can find and fix this. > > > > > > > On a different topic, I have never yet seen the web interface work for long > > > > enough to be useful - it always crashes with the int-text type clash that was > > > > mentioned somewhere in the changelogs as having been fixed. > > > > > > The web interface is very much a work-in-progress that we''re > > > hoping other people will contribute on. > > > > > > Ian > > > > > > > > > ------------------------------------------------------- > > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > > who ports your project to Linux PPC the best. Sponsored by IBM. > > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > > _______________________________________________ > > > Xen-devel mailing list > > > Xen-devel@lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > who ports your project to Linux PPC the best. Sponsored by IBM. > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > _______________________________________________ > > Xen-devel mailing list > > Xen-devel@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel