Hello
I have been experimenting with xen 2.0 and before that with xeno-unstable. It
has proved interesting and promising
However, the last few updates have been problematic. In particular it looks as
if the ext3 filesystem is not correctly handled.
I typically develop in the xen0 domain, connecting over ssh from a different
machine
Recent history:
20040907: occasional kernel BUGs
20040908: instant reboot loop when booting xen
20040909: mysterious filesystem problems
20040910: mysterious filesystem problems
The main symptoms are:
rpm -qa
crashes with floating point exception or otherwise dies some way into
generating a list of installed rpms. Looked like corruption of rpm database.
The ''rpm --rebuilddb'' operation also failed, so I was
investigating ways of
reconstructing the database when I encountered other problems - eg:
mv bla* dirname
complained that the last argument was not a directory when it quite clearly was,
as in ''mkdir newdirectory; mv bla* newdirectory;''
Neither of these problems arise when I reboot the same system using my standard
kernel (2.6.7 compiled from source). In each case I am booting into the same
environment (mandrake 10.0 community + updates, running in init level 3).
I haven''t tried with any other filesystem, but have tried using gcc
versions
3.3.2 and 3.4.0.
I don''t know if anyone else has encountered this kind of problem.
The kernel BUG problem looks as if it may be related to preemption in the
kernel. It occurred when I had dom0 + 6 domU systems running, and would happen
both in dom0 and in the domU systems. A typcial example:
------------[ cut here ]------------
kernel BUG at mm/rmap.c:251!
invalid operand: 0000 [#1]
PREEMPT
Modules linked in:
CPU: 0
EIP: 0061:[<c0149ca9>] Not tainted
EFLAGS: 00010246 (2.6.8.1-xenU)
EIP is at page_referenced+0x15e/0x175
eax: c2050000 ebx: c1119858 ecx: 00000f30 edx: c1c52480
esi: c1119840 edi: c1c52480 ebp: 00000000 esp: c2051e4c
ds: 0069 es: 0069 ss: 0069
Process kswapd0 (pid: 18, threadinfo=c2050000 task=c204b080)
Stack: c111e600 c18360cc c2051e6c 00000009 00000009 00000009 00000000 00000000
00000001 c2050000 c02bc9c0 00000000 c013f387 c1119858 c1119840 c02bc9c0
c2050000 c0141207 c1119840 c02bcb08 c02bc9dc 00000001 00000020 00000020
Call Trace:
[<c013f387>] __pagevec_release+0x25/0x30
[<c0141207>] refill_inactive_zone+0x591/0x662
[<c013ffbc>] shrink_slab+0x98/0x1a9
[<c014134a>] shrink_zone+0x72/0xb6
[<c0141784>] balance_pgdat+0x1d4/0x2b1
[<c0141920>] kswapd+0xbf/0xc1
[<c011a245>] autoremove_wake_function+0x0/0x57
[<c010d7c6>] ret_from_fork+0x6/0x14
[<c011a245>] autoremove_wake_function+0x0/0x57
[<c0141861>] kswapd+0x0/0xc1
[<c010ee91>] kernel_thread_helper+0x5/0xb
Code: 0f 0b fb 00 c9 2a 29 c0 e9 08 ff ff ff e8 a0 c1 13 00 e9 53
<6>note: kswapd0[18] exited with preempt_count 2
------------[ cut here ]------------
The following instance mentioned ''scheduling while atomic''. I
can send more log
output if it would be useful - I was waiting to see if relevant-sounding
changesets had done the trick.
On a different topic, I have never yet seen the web interface work for long
enough to be useful - it always crashes with the int-text type clash that was
mentioned somewhere in the changelogs as having been fixed.
Regards
Peri Hankey
-------------------------------------------------------
This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170
Project Admins to receive an Apple iPod Mini FREE for your judgement on
who ports your project to Linux PPC the best. Sponsored by IBM.
Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xen-devel
> I have been experimenting with xen 2.0 and before that with xeno-unstable. It > has proved interesting and promising > > However, the last few updates have been problematic. In particular it looks as > if the ext3 filesystem is not correctly handled.Can you give us an idea when it last worked reliably for you?> Recent history: > > 20040907: occasional kernel BUGs > 20040908: instant reboot loop when booting xen > 20040909: mysterious filesystem problems > 20040910: mysterious filesystem problemsI suspect that these are all a result of experiencing some virtual memory bug. It''s pretty odd, as we haven''t seen any problems in our testing, but its possible that our automated testing doesn''t run enough concurrent domains to tweak it. Can you tell us a bit more about your setup, e.g. SMP? Xeon or Opteron? What are the other domains doing when ''rpm -qa'' explodes? What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve never used mandrake, but I doubt this is significant) If you can narrow down to a simple to reproduce test case I''m sure we can find and fix this.> On a different topic, I have never yet seen the web interface work for long > enough to be useful - it always crashes with the int-text type clash that was > mentioned somewhere in the changelogs as having been fixed.The web interface is very much a work-in-progress that we''re hoping other people will contribute on. Ian ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
Ian Pratt wrote:>>I have been experimenting with xen 2.0 and before that with xeno-unstable. It >>has proved interesting and promising >> >>However, the last few updates have been problematic. In particular it looks as >>if the ext3 filesystem is not correctly handled. >> >> > >Can you give us an idea when it last worked reliably for you? > > >The previous snapshot - 20040906 - worked ok but encountered "Kernel panic: Unable to reduce memory reservation ..." as discussed elsewhere in this list.>>Recent history: >> >> 20040907: occasional kernel BUGs >> 20040908: instant reboot loop when booting xen >> 20040909: mysterious filesystem problems >> 20040910: mysterious filesystem problems >> >> > >I suspect that these are all a result of experiencing some >virtual memory bug. > >At present the snapshot for 20040911 seems to show no signs of the filesystem problem, so I probably was seeing a side effect of some of the changes you were making in the pagetable system. However I have seen at least one instance of the "Kernel panic: Unable to reduce memory reservation ..." problem.>It''s pretty odd, as we haven''t seen any problems in our testing, >but its possible that our automated testing doesn''t run enough >concurrent domains to tweak it. > >Can you tell us a bit more about your setup, e.g. SMP? Xeon or >Opteron? > > >CPU: AMD Athlon(tm) XP 2400+ stepping 01 (''uname -a'' shows architecture as i686). 512Mbytes RAM. El-cheapo-self-build(tm). The Xen kernels are built straight out of the snapshot with no local configuration.>What are the other domains doing when ''rpm -qa'' explodes? > >Nothing much (no exerciser running). But running sshd, postfix, apache, proftpd, webmin, xinetd. Each xenU domain has only 32Mbytes, so the virtual memory would have been fairly active. Each domain would like to have a swap partition but doesn''t get one as yet largely because I haven''t got round to changing the configuration from the minimum required to make untrusted domains boot and run. But I''m not clear how swap and xen are intended to interact, and whether I should configure swap devices for untrusted domains.>What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve >never used mandrake, but I doubt this is significant) > >....$ ldd `which rpm` linux-gate.so.1 => (0xfbffd000) librpm-4.2.so => /usr/lib/librpm-4.2.so (0x40029000) librpmdb-4.2.so => /usr/lib/librpmdb-4.2.so (0x40082000) librpmio-4.2.so => /usr/lib/librpmio-4.2.so (0x4016d000) libpopt.so.0 => /lib/libpopt.so.0 (0x401af000) libelf.so.1 => /usr/lib/libelf.so.1 (0x401b7000) libbeecrypt.so.6 => /usr/lib/libbeecrypt.so.6 (0x401c9000) librt.so.1 => /lib/i686/librt.so.1 (0x401e7000) libpthread.so.0 => /lib/i686/libpthread.so.0 (0x401fa000) libbz2.so.1 => /usr/lib/libbz2.so.1 (0x4024c000) libc.so.6 => /lib/i686/libc.so.6 (0x4025b000) /lib/ld-linux.so.2 => /lib/ld-linux.so.2 (0x40000000) I very much doubt if use of Mandrake has much to do with it. My main issue with Mandrake is that the standard boot files need quite a lot of tweaking if you want to have an nfs root. But this also applies to some other distributions I''ve tried.>If you can narrow down to a simple to reproduce test case I''m >surewe can find and fix this. > >It currently looks to have been a passing glitch.> > > >>On a different topic, I have never yet seen the web interface work for long >>enough to be useful - it always crashes with the int-text type clash that was >>mentioned somewhere in the changelogs as having been fixed. >> >> > >The web interface is very much a work-in-progress that we''re >hoping other people will contribute on. > >I realised that, but hadn''t yet made very serious attempts to find my way into the guts of the python. Thanks Peri ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
>>> On a different topic, I have never yet seen the web interface work >>> for long >>> enough to be useful - it always crashes with the int-text type clash >>> that was >>> mentioned somewhere in the changelogs as having been fixed. >>> >> >> The web interface is very much a work-in-progress that we''re >> hoping other people will contribute on. >> > I realised that, but hadn''t yet made very serious attempts to find my > way into the guts of the python. >Hi If you can give me a pointer to where the problem is, I''ll fix it. I''ve just been in america for a week, and am off to europe soon, but I''ll give it a stab. The int-string clash is because we changed xend from addressing domains by id to name, but the functions where supposed to still accept ints... Also, I think the wizard is broke again. Cheers Tom ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
One simple problem:
... # xensv start
Traceback (most recent call last):
File "/usr/sbin/xensv", line 25, in ?
from xen.xend.server.params import PID_FILE as XEND_PID_FILE
ImportError: cannot import name PID_FILE
Then after modifying xensv line line 25 to read "from
xen.xend.server.params import XEND_PID_FILE"
... # xensv start
... # start a domain using a tried and tested sxp file ....
... try web access ...
... # Traceback (most recent call last):
File "/usr/lib/python2.3/site-packages/twisted/protocols/basic.py",
line 223, in dataReceived
why = self.lineReceived(line)
File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
line 965, in lineReceived
self.allContentReceived()
File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
line 1006, in allContentReceived
req.requestReceived(command, path, version)
File "/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
line 557, in requestReceived
self.process()
--- <exception caught here> ---
File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line
165, in process
self.render(resrc)
File "/usr/lib/python2.3/site-packages/twisted/web/server.py", line
172, in render
body = resrc.render(self)
File "/usr/lib/python2.3/site-packages/twisted/web/resource.py",
line
201, in render
return m(request)
File "/usr/lib/python2.3/site-packages/xen/sv/HTMLBase.py", line 17,
in render_GET
self.write_BODY( request )
File "/usr/lib/python2.3/site-packages/xen/sv/Main.py", line 58, in
write_BODY
module( self.mainUrlWriter( modName ) ).write_MENU( request )
File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 16,
in
write_MENU
return self.write_BODY( request, head=True, long=False )
File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 45,
in
write_BODY
self.write_DOMAIN( request, getDomInfoHash( domain ), long )
File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line 66,
in
write_DOMAIN
request.write( "<img src=''images/small-pause.png''
onclick=''doOp2(
\"pause\", \"%(dom)-4d\" )''>" % domInfoHash
)
exceptions.TypeError: int argument required
I hope that helps - as I explained, I haven''t yet dug into it, and
I''m
learning python as I go.
Peri
Tom Wilkie wrote:
>>>> On a different topic, I have never yet seen the web interface
work
>>>> for long
>>>> enough to be useful - it always crashes with the int-text type
>>>> clash that was
>>>> mentioned somewhere in the changelogs as having been fixed.
>>>>
>>>
>>> The web interface is very much a work-in-progress that
we''re
>>> hoping other people will contribute on.
>>>
>> I realised that, but hadn''t yet made very serious attempts to
find my
>> way into the guts of the python.
>>
>
> Hi
>
> If you can give me a pointer to where the problem is, I''ll fix it.
> I''ve just been in america for a week, and am off to europe soon,
but
> I''ll give it a stab.
>
> The int-string clash is because we changed xend from addressing
> domains by id to name, but the functions where supposed to still
> accept ints... Also, I think the wizard is broke again.
>
> Cheers
>
> Tom
>
>
-------------------------------------------------------
This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170
Project Admins to receive an Apple iPod Mini FREE for your judgement on
who ports your project to Linux PPC the best. Sponsored by IBM.
Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xen-devel
Easy fix - change line 66 of file
"/usr/lib/python2.3/site-packages/xen/sv/DomList.py", to:
request.write( "<img src=''images/small-pause.png''
onclick=''doOp2(
\"pause\", \"%(dom)s\" )''>" % domInfoHash )
be careful of the indentation. I will merge in a fix 2moro.
Cheers
Tom
ps heres a tip: read error logs bottom to top in python.
On 12 Sep 2004, at 16:07, Peri Hankey wrote:
> One simple problem:
>
> ... # xensv start
> Traceback (most recent call last):
> File "/usr/sbin/xensv", line 25, in ?
> from xen.xend.server.params import PID_FILE as XEND_PID_FILE
> ImportError: cannot import name PID_FILE
>
> Then after modifying xensv line line 25 to read "from
> xen.xend.server.params import XEND_PID_FILE"
>
>
> ... # xensv start
> ... # start a domain using a tried and tested sxp file ....
> ... try web access ...
>
> ... # Traceback (most recent call last):
> File
"/usr/lib/python2.3/site-packages/twisted/protocols/basic.py",
> line 223, in dataReceived
> why = self.lineReceived(line)
> File
"/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
> line 965, in lineReceived
> self.allContentReceived()
> File
"/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
> line 1006, in allContentReceived
> req.requestReceived(command, path, version)
> File
"/usr/lib/python2.3/site-packages/twisted/protocols/http.py",
> line 557, in requestReceived
> self.process()
> --- <exception caught here> ---
> File "/usr/lib/python2.3/site-packages/twisted/web/server.py",
line
> 165, in process
> self.render(resrc)
> File "/usr/lib/python2.3/site-packages/twisted/web/server.py",
line
> 172, in render
> body = resrc.render(self)
> File "/usr/lib/python2.3/site-packages/twisted/web/resource.py",
line
> 201, in render
> return m(request)
> File "/usr/lib/python2.3/site-packages/xen/sv/HTMLBase.py", line
17,
> in render_GET
> self.write_BODY( request )
> File "/usr/lib/python2.3/site-packages/xen/sv/Main.py", line 58,
in
> write_BODY
> module( self.mainUrlWriter( modName ) ).write_MENU( request )
> File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line
16,
> in write_MENU
> return self.write_BODY( request, head=True, long=False )
> File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line
45,
> in write_BODY
> self.write_DOMAIN( request, getDomInfoHash( domain ), long )
> File "/usr/lib/python2.3/site-packages/xen/sv/DomList.py", line
66,
> in write_DOMAIN
> request.write( "<img
src=''images/small-pause.png'' onclick=''doOp2(
> \"pause\", \"%(dom)-4d\" )''>" %
domInfoHash )
> exceptions.TypeError: int argument required
>
> I hope that helps - as I explained, I haven''t yet dug into it, and
I''m
> learning python as I go.
>
> Peri
>
> Tom Wilkie wrote:
>
>>>>> On a different topic, I have never yet seen the web
interface work
>>>>> for long
>>>>> enough to be useful - it always crashes with the int-text
type
>>>>> clash that was
>>>>> mentioned somewhere in the changelogs as having been fixed.
>>>>>
>>>>
>>>> The web interface is very much a work-in-progress that
we''re
>>>> hoping other people will contribute on.
>>> I realised that, but hadn''t yet made very serious attempts
to find
>>> my way into the guts of the python.
>>>
>>
>> Hi
>>
>> If you can give me a pointer to where the problem is, I''ll fix
it.
>> I''ve just been in america for a week, and am off to europe
soon, but
>> I''ll give it a stab.
>>
>> The int-string clash is because we changed xend from addressing
>> domains by id to name, but the functions where supposed to still
>> accept ints... Also, I think the wizard is broke again.
>>
>> Cheers
>>
>> Tom
>>
>>
>
-------------------------------------------------------
This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170
Project Admins to receive an Apple iPod Mini FREE for your judgement on
who ports your project to Linux PPC the best. Sponsored by IBM.
Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/xen-devel
A VM bug would account for the repeated glitch that I am running into. In domain0 the xenlinux ernel will eventaully enter an infinite loop of the same message. "null pointer encountered in RC ring. Skiping". (not a word for word copy, haven''t captured via serial port yet). It appears that if I leave the xen-br0 interface mis-configured so that Domain-0 can NOT pass packets destined to other domains it triggers the bug much more quickly. This has happened to snapshots taken at the following dates: 20040905, 20040908, 20040909 and did NOT happen with snapshots at or prior to 20040903. Yes, I made certain in each case that all fo the kernels and libs were not infected by prior snapshot versions.(a side benifit of the .deb packages, when replaced by a new version, all of the old version files are deleted prior to the install fo the new version.) If you need any specific tests, traces, or information, please feel free to reply to me with testing instructions and parameters. :) Hope this helps! Brian On Sun, 2004-09-12 at 02:05, Ian Pratt wrote:> > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > has proved interesting and promising > > > > However, the last few updates have been problematic. In particular it looks as > > if the ext3 filesystem is not correctly handled. > > Can you give us an idea when it last worked reliably for you? > > > Recent history: > > > > 20040907: occasional kernel BUGs > > 20040908: instant reboot loop when booting xen > > 20040909: mysterious filesystem problems > > 20040910: mysterious filesystem problems > > I suspect that these are all a result of experiencing some > virtual memory bug. > > It''s pretty odd, as we haven''t seen any problems in our testing, > but its possible that our automated testing doesn''t run enough > concurrent domains to tweak it. > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > Opteron? > > What are the other domains doing when ''rpm -qa'' explodes? > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > never used mandrake, but I doubt this is significant) > > If you can narrow down to a simple to reproduce test case I''m > sure we can find and fix this. > > > On a different topic, I have never yet seen the web interface work for long > > enough to be useful - it always crashes with the int-text type clash that was > > mentioned somewhere in the changelogs as having been fixed. > > The web interface is very much a work-in-progress that we''re > hoping other people will contribute on. > > Ian > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
What network driver are you using? The sis900 driver has a message similar to what you describe: "NULL pointer encountered in Rx ring, skipping" Looks like a path that isn''t supposed to trigger, but the driver writer notes som elatent bug(s) that perhaps Xen + high load triggers? -- Keir> A VM bug would account for the repeated glitch that I am running into. > In domain0 the xenlinux ernel will eventaully enter an infinite loop of > the same message. "null pointer encountered in RC ring. Skiping". (not a > word for word copy, haven''t captured via serial port yet). > > It appears that if I leave the xen-br0 interface mis-configured so that > Domain-0 can NOT pass packets destined to other domains it triggers the > bug much more quickly. > > This has happened to snapshots taken at the following dates: 20040905, > 20040908, 20040909 and did NOT happen with snapshots at or prior to > 20040903. > > Yes, I made certain in each case that all fo the kernels and libs were > not infected by prior snapshot versions.(a side benifit of the .deb > packages, when replaced by a new version, all of the old version files > are deleted prior to the install fo the new version.) > > If you need any specific tests, traces, or information, please feel free > to reply to me with testing instructions and parameters. :) > > Hope this helps! > > Brian > > On Sun, 2004-09-12 at 02:05, Ian Pratt wrote: > > > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > > has proved interesting and promising > > > > > > However, the last few updates have been problematic. In particular it looks as > > > if the ext3 filesystem is not correctly handled. > > > > Can you give us an idea when it last worked reliably for you? > > > > > Recent history: > > > > > > 20040907: occasional kernel BUGs > > > 20040908: instant reboot loop when booting xen > > > 20040909: mysterious filesystem problems > > > 20040910: mysterious filesystem problems > > > > I suspect that these are all a result of experiencing some > > virtual memory bug. > > > > It''s pretty odd, as we haven''t seen any problems in our testing, > > but its possible that our automated testing doesn''t run enough > > concurrent domains to tweak it. > > > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > > Opteron? > > > > What are the other domains doing when ''rpm -qa'' explodes? > > > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > > never used mandrake, but I doubt this is significant) > > > > If you can narrow down to a simple to reproduce test case I''m > > sure we can find and fix this. > > > > > On a different topic, I have never yet seen the web interface work for long > > > enough to be useful - it always crashes with the int-text type clash that was > > > mentioned somewhere in the changelogs as having been fixed. > > > > The web interface is very much a work-in-progress that we''re > > hoping other people will contribute on. > > > > Ian > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > who ports your project to Linux PPC the best. Sponsored by IBM. > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > _______________________________________________ > > Xen-devel mailing list > > Xen-devel@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
On 12 Sep 2004, at 09:35, Peri Hankey wrote:> The previous snapshot - 20040906 - worked ok but encountered "Kernel > panic: Unable to reduce memory reservation ..." as discussed elsewhere > in this list. >Just to follow up on that specific error; I had that panic from xenU domains running 20040904, but have had no problems since turning off writable pagetables in both xen0 and xenU kernels -- I''m not entirely sure what I''m missing out on, but it does seem stable enough. Chris. ------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel
Actually, it IS an sis900. 8-P I didn''t even stop to think of looking for the module with the error message. *thwaps self* In past kernels, the sis900 driver has behaved correctly. I DO have writeable page tables enabled though. The server was sitting idle at the time of the problem triggering each time. So I don''t think it was a load issue.... Could you point me to the location you found that path at? I''ll do a bit of debugging with it as a moduel to see if I can''t figure it out... On Sun, 2004-09-12 at 15:09, Keir Fraser wrote:> What network driver are you using? The sis900 driver has a message > similar to what you describe: > "NULL pointer encountered in Rx ring, skipping" > > Looks like a path that isn''t supposed to trigger, but the driver > writer notes som elatent bug(s) that perhaps Xen + high load triggers? > > -- Keir > > > A VM bug would account for the repeated glitch that I am running into. > > In domain0 the xenlinux ernel will eventaully enter an infinite loop of > > the same message. "null pointer encountered in RC ring. Skiping". (not a > > word for word copy, haven''t captured via serial port yet). > > > > It appears that if I leave the xen-br0 interface mis-configured so that > > Domain-0 can NOT pass packets destined to other domains it triggers the > > bug much more quickly. > > > > This has happened to snapshots taken at the following dates: 20040905, > > 20040908, 20040909 and did NOT happen with snapshots at or prior to > > 20040903. > > > > Yes, I made certain in each case that all fo the kernels and libs were > > not infected by prior snapshot versions.(a side benifit of the .deb > > packages, when replaced by a new version, all of the old version files > > are deleted prior to the install fo the new version.) > > > > If you need any specific tests, traces, or information, please feel free > > to reply to me with testing instructions and parameters. :) > > > > Hope this helps! > > > > Brian > > > > On Sun, 2004-09-12 at 02:05, Ian Pratt wrote: > > > > I have been experimenting with xen 2.0 and before that with xeno-unstable. It > > > > has proved interesting and promising > > > > > > > > However, the last few updates have been problematic. In particular it looks as > > > > if the ext3 filesystem is not correctly handled. > > > > > > Can you give us an idea when it last worked reliably for you? > > > > > > > Recent history: > > > > > > > > 20040907: occasional kernel BUGs > > > > 20040908: instant reboot loop when booting xen > > > > 20040909: mysterious filesystem problems > > > > 20040910: mysterious filesystem problems > > > > > > I suspect that these are all a result of experiencing some > > > virtual memory bug. > > > > > > It''s pretty odd, as we haven''t seen any problems in our testing, > > > but its possible that our automated testing doesn''t run enough > > > concurrent domains to tweak it. > > > > > > Can you tell us a bit more about your setup, e.g. SMP? Xeon or > > > Opteron? > > > > > > What are the other domains doing when ''rpm -qa'' explodes? > > > > > > What version of libc are you using i.e. ''ldd /bin/rpm'' ? (We''ve > > > never used mandrake, but I doubt this is significant) > > > > > > If you can narrow down to a simple to reproduce test case I''m > > > sure we can find and fix this. > > > > > > > On a different topic, I have never yet seen the web interface work for long > > > > enough to be useful - it always crashes with the int-text type clash that was > > > > mentioned somewhere in the changelogs as having been fixed. > > > > > > The web interface is very much a work-in-progress that we''re > > > hoping other people will contribute on. > > > > > > Ian > > > > > > > > > ------------------------------------------------------- > > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > > who ports your project to Linux PPC the best. Sponsored by IBM. > > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > > _______________________________________________ > > > Xen-devel mailing list > > > Xen-devel@lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > > Project Admins to receive an Apple iPod Mini FREE for your judgement on > > who ports your project to Linux PPC the best. Sponsored by IBM. > > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > > _______________________________________________ > > Xen-devel mailing list > > Xen-devel@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/xen-devel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 > Project Admins to receive an Apple iPod Mini FREE for your judgement on > who ports your project to Linux PPC the best. Sponsored by IBM. > Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/xen-devel------------------------------------------------------- This SF.Net email is sponsored by: YOU BE THE JUDGE. Be one of 170 Project Admins to receive an Apple iPod Mini FREE for your judgement on who ports your project to Linux PPC the best. Sponsored by IBM. Deadline: Sept. 13. Go here: http://sf.net/ppc_contest.php _______________________________________________ Xen-devel mailing list Xen-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/xen-devel