Jürgen Kabelitz
2007-Nov-12 13:22 UTC
[Lustre-discuss] Install Lustre on SLES 10 with Infiniband
Hello Folks, I need some help. We have a customer he will setup a lustre System over infiniband. His OS are SuSE Enterprise 10 with SP1 and the infiniband software is from OFED. First I take the kernel source from the download site - http://www.clusterfs.com/download.html - for sles10 : 2.6.16-46-0.14_lustre.1.6.3smp>From the openfabrics.org I take the ofed 1.2.5.1This version is unable to compile against the kernel source from clusterfs. I try it with the version ofed 1.0, ofed 1.2. All have the same error. I have the questions: Is there any system with lustre FS over Infiniband based on the Open Fabrics SW? When there is a system, which software packets are taken? When it is possible, give a short description how do make the software. With regards J. Kabelitz sysGen GmbH Support und Technik Clustersysteme Am Hallacker 48 28327 Bremen Tel.: +49 421 40966-28 Fax: +49 421 40966-33 jkabelitz at sysgen.de<mailto:jkabelitz at sysgen.de> www.sysgen.de<http://www.sysgen.de> Geschaeftsfuehrerin Gabriele Nikisch Eingetragen beim Amtsgericht Walsrode HRB 121943 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20071112/e5f4a6d0/attachment-0002.html
Liang Zhen
2007-Nov-12 14:36 UTC
[Lustre-discuss] Install Lustre on SLES 10 with Infiniband
Hi, Build Lustre against OFED 1.2.x 1. Lustre supports OFED 1.2.0 since 1.6.1 2. 1.6.1 - 1.6.3 can only support OFED 1.2.0 3. To run lustre over 1.2.5, two patches are needed: https://bugzilla.lustre.org/attachment.cgi?id=12508 https://bugzilla.lustre.org/attachment.cgi?id=12629 These patches have been landed to LNet HEAD 4. If got problem like "disagrees about version of symbol ib_create_cq" while loading ko2iblnd.ko: a) If <prefix>/src/openib/Module.symvers exists copy this file to <lustre>/lnet/klnds/o2iblnd and build lustre b) If <prefix>/src/openib/Module.symvers doesn''t exist Run this script in /lib/modules/<kernel version>/build/ and build lustre https://bugzilla.lustre.org/attachment.cgi?id=10827 To get more details about this, please read: https://bugs.openfabrics.org/show_bug.cgi?id=355 https://bugzilla.lustre.org/show_bug.cgi?id=12316 Regards Liang J?rgen Kabelitz wrote:> > Hello Folks, > > > > I need some help. We have a customer he will setup a lustre System > over infiniband. His OS are SuSE Enterprise 10 with SP1 and the > infiniband software is from OFED. > > First I take the kernel source from the download site - > http://www.clusterfs.com/download.html - for sles10 : > 2.6.16-46-0.14_lustre.1.6.3smp > > From the openfabrics.org I take the ofed 1.2.5.1 > > This version is unable to compile against the kernel source from > clusterfs. I try it with the version ofed 1.0, ofed 1.2. All have the > same error. > > > > I have the questions: > > > > Is there any system with lustre FS over Infiniband based on the Open > Fabrics SW? > > > > When there is a system, which software packets are taken? > > > > When it is possible, give a short description how do make the software. > > > > With regards > > > > > > J. Kabelitz > > sysGen GmbH > > Support und Techni k Clustersysteme > > Am Hallacker 48 > > 28327 Bremen > > Tel.: +49 421 40966-28 > > Fax: +49 421 40966-33 > > jkabelitz at sysgen.de <mailto:jkabelitz at sysgen.de> > > www.sysgen.de <http://www.sysgen.de> > > Geschaeftsfuehrerin Gabriele Nikisch > > Eingetragen beim Amtsgericht > > Walsrode HRB 121943 > > > > > > ------------------------------------------------------------------------ > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss at clusterfs.com > https://mail.clusterfs.com/mailman/listinfo/lustre-discuss >
Jürgen Kabelitz
2007-Nov-16 11:43 UTC
[Lustre-discuss] Install Lustre on SLES 10 with Infiniband
when I build the OPENIB against your SLES 10 kernel, I got the following error message: Is this a problem from the lustre kernel or a problem from the ofed part? gcc -Wp,-MD,/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/.addr.o.d -nostdinc -isystem /usr/lib64/gcc/x86_64-suse-linux/ 4.1.2/include -D__KERNEL__ -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/ -I/var/tmp/OFEDRPM/BUILD/ofa_ker nel-1.2/include -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/include -Iinclude -Iinclude2 -I/usr/src/linux-2.6.16-46-0.14_ lustre.1.6.3/include -include include/linux/autoconf.h -include /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/include/linux/autoconf.h -I/var/ tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core -Wall -Wundef -Wstrict-prototypes -Wno-trigraphs -Werror-implicit-function-declar ation -fno-strict-aliasing -fno-common -ffreestanding -Os -mtune=generic -m64 -mno-red-zone -mcmodel=kernel -pipe -fno-reorder-blocks -Wno- sign-compare -fno-asynchronous-unwind-tables -funit-at-a-time -mno-sse -mno-mmx -mno-sse2 -mno-3dnow -fomit-frame-pointer -fno-stack-protec tor -Wdeclaration-after-statement -Wno-pointer-sign -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/include -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1. 2/drivers/infiniband/include -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/ulp/ipoib -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2 /drivers/infiniband/debug -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/hw/cxgb3/core -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1. 2/drivers/net/cxgb3 -I/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/net/rds -DMODULE -D"KBUILD_STR(s)=#s" -D"KBUILD_BASENAME=KBUILD_STR(a ddr)" -D"KBUILD_MODNAME=KBUILD_STR(ib_addr)" -c -o /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/.tmp_addr.o /var/tmp/OFED RPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c In file included from include2/asm/processor.h:23, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/prefetch.h:14, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/list.h:7, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/mutex.h:13, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/mutex.h:5, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c:31: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/cpumask.h:6:1: warning: "for_each_possible_cpu" redefined In file included from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/cpumask.h:4, from include2/asm/processor.h:23, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/prefetch.h:14, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/list.h:7, from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/mutex.h:13, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/mutex.h:5, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c:31: /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/cpumask.h:411:1: warning: this is the location of the previous definition In file included from /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/inetdevice.h:7, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/inetdevice.h:4, from /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c:32: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h:7: error: redefinition of ?netif_tx_lock? /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/netdevice.h:927: error: previous definition of ?netif_tx_lock? was here /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h: In function ?netif_tx_lock?: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h:8: error: ?struct net_device? has no member n amed ?xmit_lock? /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h: At top level: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h:13: error: redefinition of ?netif_tx_unlock? /usr/src/linux-2.6.16-46-0.14_lustre.1.6.3/include/linux/netdevice.h:947: error: previous definition of ?netif_tx_unlock? was here /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h: In function ?netif_tx_unlock?: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/kernel_addons/backport/2.6.16/include/linux/netdevice.h:15: error: ?struct net_device? has no member named ?xmit_lock? /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c: At top level: /var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.c:61: warning: initialization from incompatible pointer type make[6]: *** [/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core/addr.o] Error 1 make[5]: *** [/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband/core] Error 2 make[4]: *** [/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2/drivers/infiniband] Error 2 make[3]: *** [_module_/var/tmp/OFEDRPM/BUILD/ofa_kernel-1.2] Error 2 make[2]: *** [modules] Error 2 make[1]: *** [modules] Error 2 make[1]: Leaving directory `/usr/src/linux-2.6.16-46-0.14_lustre.1.6.3-obj/x86_64/smp'' make: *** [kernel] Error 2 error: Bad exit status from /var/tmp/rpm-tmp.67752 (%install) RPM build errors: user vlad does not exist - using root group vlad does not exist - using root user vlad does not exist - using root group vlad does not exist - using root Bad exit status from /var/tmp/rpm-tmp.67752 (%install) ERROR: Failed executing "rpmbuild --rebuild --define ''_topdir /var/tmp/OFEDRPM'' --define ''_prefix /usr'' --define ''build_root /var/tmp/OFED'' --define ''configure_options --with-cxgb3-mod --with-ipoib-mod --with-mthca-mod --with-sdp-mod --with-srp-mod --with-core-mod --with-user_m ad-mod --with-user_access-mod --with-addr_trans-mod --with-rds-mod '' --define ''KVERSION 2.6.16-46-0.14_lustre.1.6.3smp'' --define ''KSRC /lib /modules/2.6.16-46-0.14_lustre.1.6.3smp/build'' --define ''build_kernel_ib 1'' --define ''build_kernel_ib_devel 1'' --define ''NETWORK_CONF_DIR / etc/sysconfig/network'' --define ''modprobe_update 1'' --define ''include_ipoib_conf 1'' /usr/src/OFED-1.2/SRPMS/ofa_kernel-1.2-0.src.rpm" ~ J. Kabelitz sysGen GmbH Support und Technik Clustersysteme Am Hallacker 48 28327 Bremen Tel.: +49 421 40966-28 Fax: +49 421 40966-33 jkabelitz at sysgen.de<mailto:jkabelitz at sysgen.de> www.sysgen.de<http://www.sysgen.de> Geschaeftsfuehrerin Gabriele Nikisch Eingetragen beim Amtsgericht Walsrode HRB 121943 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.lustre.org/pipermail/lustre-discuss/attachments/20071116/fa215591/attachment-0002.html