John Byrne
2006-Jun-22 18:58 UTC
[Xen-devel] x86_64 live migration problems: help needed with shadow page table
Hi,

I've been doing most of my testing with Novell's SLES10 3.0.2 9742c; however, I did verify that the problem existed in xen-unstable as of last week.

x86_64 live migration is unreliable: if the domain is under stress (I used a kernel make), the domU frequently OOPSes or the compile gets a segfault afterwards. The primary issue seems to be that L1 and L2 page table pages are not getting marked dirty in the shadow_dirty_bitmap. (It is running 4-level page tables, and I have yet to see a verify problem with the L3 and L4 page tables.)

I have shown this to my satisfaction by adding code in xc_linux_save.c to mark all L1 and L2 pages for fixups on the last iteration. With that change I've managed to migrate 10 times without obvious problems so far. (I also found that the clear_bit() and set_bit() routines were broken on x86_64; a patch against the latest xen-unstable is attached.)

I cannot call what I've done a fix, since it causes the transfer of 2000 extra, and mostly unnecessary, pages on the last iteration. So what I'd like help with is where to fix the shadow page table code so that the L1 and L2 pages get marked dirty properly. It is not immediately obvious to me where this needs to be done, and I'm hoping someone can save me a lot of time.

Thanks,

John Byrne
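
For readers following along, the workaround described above can be sketched roughly as follows. This is a minimal illustration, not the code from xc_linux_save.c and not the attached patch: the names bitmap_set(), bitmap_clear(), bitmap_test(), enum page_type and mark_pagetables_for_resend() are all hypothetical. The post does not say what was wrong with set_bit() and clear_bit(); the sketch only assumes a bitmap stored as an array of unsigned long and uses a 1UL mask so that bit numbers of 32 and above work when long is 64 bits wide.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>

/* Number of bits in one bitmap word (64 on x86_64 Linux, 32 on i386). */
#define BITS_PER_LONG (sizeof(unsigned long) * CHAR_BIT)

/* Hypothetical helpers: one bit per page frame number (pfn). */
static inline void bitmap_set(unsigned long *map, unsigned long nr)
{
    assert(map != NULL);
    /* 1UL keeps the shift in unsigned long, so nr % 64 >= 32 is safe. */
    map[nr / BITS_PER_LONG] |= 1UL << (nr % BITS_PER_LONG);
}

static inline void bitmap_clear(unsigned long *map, unsigned long nr)
{
    assert(map != NULL);
    map[nr / BITS_PER_LONG] &= ~(1UL << (nr % BITS_PER_LONG));
}

static inline int bitmap_test(const unsigned long *map, unsigned long nr)
{
    assert(map != NULL);
    return (int)((map[nr / BITS_PER_LONG] >> (nr % BITS_PER_LONG)) & 1UL);
}

/* Hypothetical page classification; the real save code tracks this differently. */
enum page_type { PT_NONE, PT_L1, PT_L2, PT_L3, PT_L4 };

/*
 * Workaround sketch: before the final iteration, force every L1 and L2
 * page table page into the "resend" bitmap, whether or not the dirty
 * log reported it. Returns the number of pages newly marked.
 */
static unsigned long mark_pagetables_for_resend(unsigned long *resend,
                                                const enum page_type *types,
                                                unsigned long nr_pfns)
{
    unsigned long pfn, marked = 0;

    for (pfn = 0; pfn < nr_pfns; pfn++) {
        if (types[pfn] != PT_L1 && types[pfn] != PT_L2)
            continue;
        if (!bitmap_test(resend, pfn))
            marked++;
        bitmap_set(resend, pfn);
    }
    return marked;
}
```

The return value corresponds to the extra pages the workaround adds to the last iteration, which is why it is a diagnostic aid rather than a fix: most of those pages were already transferred correctly.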