Hi,

I am trying to use the Lustre SNMP module to see what is going on with my Lustre cluster.

I compiled the lustresnmp.so module from source and put it in /usr/lib64/lustre/snmp/. I added the line

    dlmod lustresnmp /usr/lib64/lustre/snmp/lustresnmp.so

to /etc/snmp/snmpd.conf, put the Lustre MIB file into /usr/share/snmp/mibs, and started the SNMP daemons.

The docs on using SNMP with Lustre are somewhat poor, and I wasn't able to find out how to query the SNMP server for Lustre info.

I tried the following from localhost:

    snmpget -v2c -c public localhost LUSTRE-MIB::systemInformation.3
    LUSTRE-MIB::sysHealthCheck = No Such Object available on this agent at this OID

    snmpget localhost -c public -v 1 -m LUSTRE-MIB sysHealthCheck
    Error in packet
    Reason: (noSuchName) There is no such variable name in this MIB.
    Failed object: LUSTRE-MIB::sysHealthCheck

Am I doing something wrong? Please help.
George wrote:
> Hi,
>
> I am trying to use the lustre snmp module to see what is going on with
> my lustre cluster.
>
> I compiled the lustresnmp.so module from sources and put it in
> /usr/lib64/lustre/snmp/. I added the:
> dlmod lustresnmp /usr/lib64/lustre/snmp/lustresnmp.so
> line to /etc/snmp/snmpd.conf, put the lustre mib file into
> /usr/share/snmp/mibs and started the snmp daemons.
>
> The docs on using snmp with lustre are somewhat poor and I wasn't able
> to find how I can query the snmp server for lustre info.
>
> I tried the following from localhost:
> snmpget -v2c -c public localhost LUSTRE-MIB::systemInformation.3
> LUSTRE-MIB::sysHealthCheck = No Such Object available on this agent at this OID
>
> snmpget localhost -c public -v 1 -m LUSTRE-MIB sysHealthCheck
> Error in packet
> Reason: (noSuchName) There is no such variable name in this MIB.
> Failed object: LUSTRE-MIB::sysHealthCheck
>
> Am I doing something wrong?

Probably not. The Lustre SNMP module is pretty old and stale.
Do you have a Lustre server running on that machine? The 'health check'
basically reads a /proc file, which won't exist if the machine isn't a
Lustre server.

cliffw

> Please help
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
Hi,

Here's what I get:

    [root@lustre1-centos50145 ~]# ls /proc/fs/lustre/
    devices health_check ldlm llite lov lquota mdc mds mdt mgc mgs osc pinger version
    [root@lustre1-centos50145 ~]# cat /proc/fs/lustre/health_check
    healthy

So the Lustre server is running. Please advise.

> Probably not. the lustre snmp module is pretty old and stale.
> Do you have a lustre server running on that machine? 'health check'
> basically reads a /proc file, which won't exist if the machine isn't a
> lustre server.
> cliffw
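
For reference, the health check described in the reply can be reproduced without SNMP by reading the /proc file directly. This is a minimal Python sketch, assuming the path /proc/fs/lustre/health_check seen in the listing above. The function name read_lustre_health is hypothetical and is not part of Lustre or Net-SNMP, and the sketch does not show how lustresnmp.so itself reads the file.

```python
from pathlib import Path

# Path taken from the thread; assumed to exist only where Lustre is loaded.
HEALTH_CHECK = Path("/proc/fs/lustre/health_check")


def read_lustre_health(path=HEALTH_CHECK):
    """Return the stripped contents of the health_check file,
    or None if the file does not exist (hypothetical helper)."""
    try:
        return Path(path).read_text().strip()
    except FileNotFoundError:
        return None


if __name__ == "__main__":
    status = read_lustre_health()
    if status is None:
        print("health_check file not found; Lustre may not be running on this host")
    else:
        print(f"health_check: {status}")
```

If this reports a status while the SNMP query still fails, the problem is more likely in how the agent loads or exposes the module than in Lustre itself.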