Hi All, We looking for suggestions on dealing with mellanox drivers in CentOS 6.7 We tried installing mellanox drivers (MLNX_OFED_LINUX-3.2-2.0.0.0-rhel6.7-x86_64) on a Quanta Cirrascale server running Centos 6.7 - 2.6.32-573.22.1.el6.x86_64. When we rebooted the machine after installing the drivers, it went into a kernel panic for every installed kernel except for Centos 6.7 2.6.32-573.22.1.el6.x86_64.debug. After we uninstalled the drivers, the machine failed to boot for any installed kernel. Any suggestions on how to proceed would be greatly appreciated. Thanks -- -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Pat Haley Email: phaley at mit.edu Center for Ocean Engineering Phone: (617) 253-6824 Dept. of Mechanical Engineering Fax: (617) 253-8125 MIT, Room 5-213 http://web.mit.edu/phaley/www/ 77 Massachusetts Avenue Cambridge, MA 02139-4301
Peter Kjellström
2016-May-25 09:42 UTC
[CentOS] Recommendations for Infiniband with CentOS 6.7
On Tue, 24 May 2016 21:08:27 -0400 Pat Haley <phaley at mit.edu> wrote:> Hi All, > > We looking for suggestions on dealing with mellanox drivers in CentOS > 6.7Unless you really need a specific feature in MOFED I'd recommend you stay with the, so called, in-box drivers already in CentOS-6.7. We run >2000 HPC nodes on CentOS-6.7 with stock ib drivers.> We tried installing mellanox drivers > (MLNX_OFED_LINUX-3.2-2.0.0.0-rhel6.7-x86_64) on a Quanta Cirrascale > server running Centos 6.7 - 2.6.32-573.22.1.el6.x86_64. When we > rebooted the machine after installing the drivers, it went into a > kernel panic for every installed kernel except for Centos 6.7Sounds as if your initramfs got assembled incorrectly. Kernel panic is usually due to the initramfs not having what it takes to find and mount the root-fs. You can try to manually update the initrd for a specific installed kernel. /Peter K
Fabian Arrotin
2016-May-25 11:32 UTC
[CentOS] Recommendations for Infiniband with CentOS 6.7
On 25/05/16 03:08, Pat Haley wrote:> > Hi All, > > We looking for suggestions on dealing with mellanox drivers in CentOS 6.7 > > We tried installing mellanox drivers > (MLNX_OFED_LINUX-3.2-2.0.0.0-rhel6.7-x86_64) on a Quanta Cirrascale > server running Centos 6.7 - 2.6.32-573.22.1.el6.x86_64. When we > rebooted the machine after installing the drivers, it went into a kernel > panic for every installed kernel except for Centos 6.7 > 2.6.32-573.22.1.el6.x86_64.debug. After we uninstalled the drivers, the > machine failed to boot for any installed kernel. > > Any suggestions on how to proceed would be greatly appreciated. > > Thanks >Well, we (CentOS) are using a gluster setup on top of Infiniband, but we're just using the default mlx4_ib kernel module that is included with the kernel shipped with 6.7 (/lib/modules/2.6.32-573.22.1.el6.x86_64/kernel/drivers/infiniband/hw/mlx4/mlx4_ib.ko) so nothing to be done at the kernel/initrd level. Is there a reason why you needed a different version ? PS : the IB HBA model we have in those servers is the following one : 81:00.0 InfiniBand: Mellanox Technologies MT25418 [ConnectX VPI PCIe 2.0 2.5GT/s - IB DDR / 10GigE] (rev a0) -- Fabian Arrotin The CentOS Project | http://www.centos.org gpg key: 56BEC54E | twitter: @arrfab -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 198 bytes Desc: OpenPGP digital signature URL: <http://lists.centos.org/pipermail/centos/attachments/20160525/88b64460/attachment-0001.sig>
We have a new install of CentOS 6.7 with infiniband support installed. We can see the card in hardware and we can see the mlx4 drivers loaded in the kernel but cannot see the card as an ethernet interface, using ifconfig -a. Can you recommend an install procedure to see this as an ethernet interface? Thanks On 05/25/2016 07:32 AM, Fabian Arrotin wrote:> On 25/05/16 03:08, Pat Haley wrote: >> Hi All, >> >> We looking for suggestions on dealing with mellanox drivers in CentOS 6.7 >> >> We tried installing mellanox drivers >> (MLNX_OFED_LINUX-3.2-2.0.0.0-rhel6.7-x86_64) on a Quanta Cirrascale >> server running Centos 6.7 - 2.6.32-573.22.1.el6.x86_64. When we >> rebooted the machine after installing the drivers, it went into a kernel >> panic for every installed kernel except for Centos 6.7 >> 2.6.32-573.22.1.el6.x86_64.debug. After we uninstalled the drivers, the >> machine failed to boot for any installed kernel. >> >> Any suggestions on how to proceed would be greatly appreciated. >> >> Thanks >> > Well, we (CentOS) are using a gluster setup on top of Infiniband, but > we're just using the default mlx4_ib kernel module that is included with > the kernel shipped with 6.7 > (/lib/modules/2.6.32-573.22.1.el6.x86_64/kernel/drivers/infiniband/hw/mlx4/mlx4_ib.ko) > so nothing to be done at the kernel/initrd level. > > Is there a reason why you needed a different version ? > > PS : the IB HBA model we have in those servers is the following one : > 81:00.0 InfiniBand: Mellanox Technologies MT25418 [ConnectX VPI PCIe 2.0 > 2.5GT/s - IB DDR / 10GigE] (rev a0) > > > > > _______________________________________________ > CentOS mailing list > CentOS at centos.org > https://lists.centos.org/mailman/listinfo/centos-- -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Pat Haley Email: phaley at mit.edu Center for Ocean Engineering Phone: (617) 253-6824 Dept. of Mechanical Engineering Fax: (617) 253-8125 MIT, Room 5-213 http://web.mit.edu/phaley/www/ 77 Massachusetts Avenue Cambridge, MA 02139-4301