Christopher J.Walker
2012-Jul-16 11:23 UTC
[Lustre-discuss] Recommended failover software for Lustre
The "configuring failover" section in the Whamcloud release of the Lustre manual seems rather out of date: http://build.whamcloud.com/job/lustre-manual/lastSuccessfulBuild/artifact/lustre_manual.html#configuringfailover The Oracle release says much the same thing: http://wiki.lustre.org/manual/LustreManual20_HTML/ConfiguringFailover.html#50540588_50628 In section 11.1.1 "Power management software", it says: "For more information about PowerMan, go to: https://computing.llnl.gov/linux/powerman.html" Which no longer exists. It should probably point at http://code.google.com/p/powerman/ Then in section 11.2. "Setting up High-Availability (HA) Software with Lustre" it mentions "Red Hat Cluster Manager" and "Pacemaker". "Red Hat Cluster Manager" points to http://wiki.lustre.org/index.php/Using_Red_Hat_Cluster_Manager_with_Lustre which says "In comparison with other HA solutions, RedHat Cluster as in RHEL 5.5 is an old HA solution. We recommend using other HA solutions like Pacemaker, if possible. " The pacemaker link: http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre Although the title of this is "Using Pacemaker with Lustre", it starts off by saying "In modern clusters, OpenAIS, or more specifically, its communication stack corosync, is used for this task". In summary: 1) The manual could do with some updating here. 2) I suspect I should be using corosync. Chris
Cliff White
2012-Jul-16 18:43 UTC
[Lustre-discuss] Recommended failover software for Lustre
Thanks, we''ve created http://jira.whamcloud.com/browse/LUDOC-69 to track the fixes to the manual. cliffw On Mon, Jul 16, 2012 at 4:23 AM, Christopher J.Walker <C.J.Walker at qmul.ac.uk> wrote:> The "configuring failover" section in the Whamcloud release of the > Lustre manual seems rather out of date: > > > http://build.whamcloud.com/job/lustre-manual/lastSuccessfulBuild/artifact/lustre_manual.html#configuringfailover > > The Oracle release says much the same thing: > > http://wiki.lustre.org/manual/LustreManual20_HTML/ConfiguringFailover.html#50540588_50628 > > In section 11.1.1 "Power management software", it says: > > "For more information about PowerMan, go to: > https://computing.llnl.gov/linux/powerman.html" > > Which no longer exists. It should probably point at > http://code.google.com/p/powerman/ > > > Then in section 11.2. "Setting up High-Availability (HA) Software with > Lustre" it mentions "Red Hat Cluster Manager" and "Pacemaker". > > "Red Hat Cluster Manager" points to > http://wiki.lustre.org/index.php/Using_Red_Hat_Cluster_Manager_with_Lustre > > which says "In comparison with other HA solutions, RedHat Cluster as in > RHEL 5.5 is an old HA solution. We recommend using other HA solutions > like Pacemaker, if possible. " > > The pacemaker link: > http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre > > Although the title of this is "Using Pacemaker with Lustre", it starts > off by saying "In modern clusters, OpenAIS, or more specifically, its > communication stack corosync, is used for this task". > > > In summary: > > 1) The manual could do with some updating here. > > 2) I suspect I should be using corosync. > > Chris > > > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss at lists.lustre.org > http://lists.lustre.org/mailman/listinfo/lustre-discuss >-- cliffw Support Guy WhamCloud, Inc. www.whamcloud.com -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20120716/7d891381/attachment.html