Charles Lepple
2017-Apr-11 01:36 UTC
[Nut-upsuser] battery not installed, but battery still 100% and NUT 2.7.2-4 does not catch this and report a error
On Apr 10, 2017, at 10:42 AM, Jon Bendtsen wrote:
>
> Actually maybe it is within NUT's control. Maybe NUT should only claim that a UPS is ONLINE if ONLINE is the only thing it is?

The problem is that a lot of the UPS status values are more of a de-facto standard, but they have been generally defined in such a way that simpler UPSes only need to report basic status. We don't have an equivalent of "no ALARM", just the absence of "ALARM" in the status line.

Also, I think you are reading more into the OL status than is intended. OL simply means that AC power is available and passing through the UPS (contrast with OB and OFF). Again, because of the de-facto nature of this, we would have to consult the ups.type value to accurately reflect whether the UPS is truly an online (double-conversion) system, or "offline" with the relay feeding power directly from line to load (different than an online UPS in bypass). Unfortunately, ups.type is marked as opaque, and is not available everywhere.

The short answer here is that if a monitoring system wants to represent the overall health of the system, ALARM needs to be taken into account. (Never mind the fact that a basic Back-UPS LS 500 uses a more common HID PDC Usage that maps to RB when its battery test fails...)

I think we have established that the monitoring in upsmon was not sufficient, but by extension, that means the Nagios plugin probably needs a change to expose the ALARM bit and message.

Not sure if this got answered already, but is the "No battery installed" alarm accurate, or is it just an old battery? If old, does the battery.runtime value get adjusted downwards after a battery test? Either way, we would need to establish which reading should take priority, and I don't think this is straightforward.

I almost think we need another layer of logic to handle priority logic like this, as well as scale values.
It irks me that we add scale values to the driver without knowing the extent of the error (is it only for one firmware revision? for a whole line?) This would offer some hope of being able to silence false alarms (I vaguely remember some "life cycle alarm" in one UPS that contradicted another, more direct, status bit). But this is the sort of thing that should be designed, rather than slapped together, and it shouldn't get in the way of a UPS that behaves predictably. And I think it should be a separate layer so that we can always go directly to the driver to see the raw values that the UPS is returning.
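To illustrate the point about ALARM, here is a rough Python sketch of how a monitoring check might treat the ups.status line. It is not upsmon or Nagios plugin code: the function name `overall_health` and the "OK"/"WARNING"/"UNKNOWN" levels are made up for this example, and it only looks at the status flags mentioned in this thread (OL, OB, OFF, ALARM, RB).

```python
# Hypothetical helper, not part of NUT: the name and the severity levels
# are assumptions chosen for illustration only.
def overall_health(ups_status):
    """Return (level, reasons) for a space-separated ups.status string."""
    flags = set(ups_status.split())

    # ALARM or RB means something is wrong even if OL is also present,
    # so these are checked before OL.
    problems = sorted(flags & {"ALARM", "RB"})
    if problems:
        return "WARNING", problems

    # OL with no problem flag: AC power is available and passing through.
    if "OL" in flags:
        return "OK", []

    # Not on line power: on battery or output off.
    not_online = sorted(flags & {"OB", "OFF"})
    if not_online:
        return "WARNING", not_online

    # Nothing recognized; report the raw flags rather than guess.
    return "UNKNOWN", sorted(flags)
```

With this ordering, a status of "OL ALARM" is reported as a warning instead of being treated the same as plain "OL", which is the case that went unnoticed here. Which flag should win when readings contradict each other is still the open design question.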
Jon Bendtsen
2017-Apr-11 06:18 UTC
[Nut-upsuser] battery not installed, but battery still 100% and NUT 2.7.2-4 does not catch this and report a error
On 11/04/17 03.36, Charles Lepple wrote:

[cuuuuuut]

good points.

> Not sure if this got answered already, but is the "No battery installed" alarm
> accurate, or is it just an old battery? If old, does the battery.runtime value
> get adjusted downwards after a battery test? Either way, we would need to
> establish which reading should take priority, and I don't think this is
> straightforward.

I think the battery is installed, but it is a really really old battery. I will try to do a battery test now that it doesn't power anything at all, hopefully a monitor is enough to do the load test. I could also try to open it and see if the battery is connected.

JonB

ps: in my new UPS Eaton 5SC I could not see a battery manufacture date in the extended tree data from nut-cgi :-/ so I wrote the install date on a sticker and put it on it.
Charles Lepple
2017-Apr-11 12:52 UTC
[Nut-upsuser] battery not installed, but battery still 100% and NUT 2.7.2-4 does not catch this and report a error
On Apr 11, 2017, at 2:18 AM, Jon Bendtsen wrote:
>
> ps: in my new UPS Eaton 5SC I could not see a battery manufacture date in the extended tree data from nut-cgi :-/ so I wrote the install date on a sticker and put it on it.

You could also do something like this in ups.conf:

    override.battery.mfr.date = 2017-04-11

Reference: http://networkupstools.org/docs/man/ups.conf.html
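Once the override is in place, the date shows up alongside the other variables, so a small script could track the battery's age instead of the sticker. The following is only a sketch: it assumes the text passed in is `upsc` output in its usual one "name: value" pair per line form, and that the date is written as YYYY-MM-DD as in the override above (slashes are tolerated). The function names `parse_upsc` and `battery_age_days` are invented for this example.

```python
from datetime import date


def parse_upsc(text):
    """Parse 'name: value' lines (as printed by upsc) into a dict."""
    values = {}
    for line in text.splitlines():
        name, sep, value = line.partition(": ")
        if sep:  # skip lines that are not 'name: value'
            values[name.strip()] = value.strip()
    return values


def battery_age_days(values, today):
    """Days since battery.mfr.date, or None if the variable is absent.

    Assumes a YYYY-MM-DD (or YYYY/MM/DD) date; any other format
    raises ValueError rather than being guessed at.
    """
    raw = values.get("battery.mfr.date")
    if raw is None:
        return None
    mfr = date.fromisoformat(raw.replace("/", "-"))
    return (today - mfr).days
```

For example, feeding it the lines "battery.mfr.date: 2017-04-11" and "ups.status: OL" with `today=date(2017, 4, 21)` gives an age of 10 days.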