Got a HBS (y'know, Honkin' Big Server, one o' them technical terms), a Dell 720 with two Tesla GPUs. I updated the o/s, 6.5, and I cannot get the GPUs recognized. As a last resort, I d/l NVidia's proprietary driver/installer, 325, and it builds fine... I've yum removed the kmod-nvidia I had on the system, nouveau is blacklisted, and when I reboot, lsmod shows me nvidia loaded, which modinfo tells me looks like the one I built.... but enum_gpu, which is from a CUDA group, builds... but can't enumerate the GPUs (how we wake them up for the users). I see the /dev/nvidia*, and they're a+r, a+w.... Oh, and selinux is permissive. Anyone got a clue? If I can't get this working, I'm going to have to downgrade the system several kernels. mark
On 03/25/2014 10:36 PM, m.roth at 5-cent.us wrote:> Got a HBS (y'know, Honkin' Big Server, one o' them technical terms), a > Dell 720 with two Tesla GPUs. I updated the o/s, 6.5, and I cannot get the > GPUs recognized. As a last resort, I d/l NVidia's proprietary > driver/installer, 325, and it builds fine... I've yum removed the > kmod-nvidia I had on the system, nouveau is blacklisted, and when I > reboot, lsmod shows me nvidia loaded, which modinfo tells me looks like > the one I built.... but enum_gpu, which is from a CUDA group, builds... > but can't enumerate the GPUs (how we wake them up for the users). I see > the /dev/nvidia*, and they're a+r, a+w.... Oh, and selinux is permissive. > > Anyone got a clue? If I can't get this working, I'm going to have to > downgrade the system several kernels. > > markElrepo kmod drivers are not an option? First nvidia-detect then selected packages... -- Ljubomir Ljubojevic (Love is in the Air) PL Computers Serbia, Europe StarOS, Mikrotik and CentOS/RHEL/Linux consultant
On 03/25/2014 04:36 PM, m.roth at 5-cent.us wrote:> Got a HBS (y'know, Honkin' Big Server, one o' them technical terms), a > Dell 720 with two Tesla GPUs. I updated the o/s, 6.5, and I cannot get the > GPUs recognized. As a last resort, I d/l NVidia's proprietary > driver/installer, 325, and it builds fine... I've yum removed the > kmod-nvidia I had on the system, nouveau is blacklisted, and when I > reboot, lsmod shows me nvidia loaded, which modinfo tells me looks like > the one I built.... but enum_gpu, which is from a CUDA group, builds... > but can't enumerate the GPUs (how we wake them up for the users). I see > the /dev/nvidia*, and they're a+r, a+w.... Oh, and selinux is permissive. > > Anyone got a clue? If I can't get this working, I'm going to have to > downgrade the system several kernels.Do you have an /etc/X11/xorg.conf file or something in /etc/X11/xorg.conf.d/ that actually name nvidia and not nv as the driver? -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 198 bytes Desc: OpenPGP digital signature URL: <http://lists.centos.org/pipermail/centos/attachments/20140326/79302875/attachment-0001.sig>
On 03/26/14 03:01, Johnny Hughes wrote:> On 03/25/2014 04:36 PM, m.roth at 5-cent.us wrote: >> Got a HBS (y'know, Honkin' Big Server, one o' them technical terms), a >> Dell 720 with two Tesla GPUs. I updated the o/s, 6.5, and I cannot get the >> GPUs recognized. As a last resort, I d/l NVidia's proprietary >> driver/installer, 325, and it builds fine... I've yum removed the >> kmod-nvidia I had on the system, nouveau is blacklisted, and when I >> reboot, lsmod shows me nvidia loaded, which modinfo tells me looks like >> the one I built.... but enum_gpu, which is from a CUDA group, builds... >> but can't enumerate the GPUs (how we wake them up for the users). I see >> the /dev/nvidia*, and they're a+r, a+w.... Oh, and selinux is permissive. >> >> Anyone got a clue? If I can't get this working, I'm going to have to >> downgrade the system several kernels. > > Do you have an /etc/X11/xorg.conf file or something in > /etc/X11/xorg.conf.d/ that actually name nvidia and not nv as the driver?Nope - nothing there. mark