search for: edac

Displaying 20 results from an estimated 143 matches for "edac".

2016 Jan 22
2
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
...t 16:01, <andreas.pflug at web.de> wrote: >> Initially reported to debian >> (http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=810964), redirected here: >> >> With AMD Opteron 6xxx processors, half of the memory controllers are >> missing from /sys/devices/system/edac/mc >> Checked with single 6120 (dual memory controller) and twin 6344 (2x dual >> MC), other dual-module CPUs might be affected too. >> >> Booting plain Linux (3.2, 3.16, 4.1, 4.3), all memory controllers are >> listed under /sys/devices/system/edac/mc as expected. Sam...
2016 Jan 20
2
Bug#810964: [BUG] EDAC infomation partially missing
Initially reported to debian (http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=810964), redirected here: With AMD Opteron 6xxx processors, half of the memory controllers are missing from /sys/devices/system/edac/mc Checked with single 6120 (dual memory controller) and twin 6344 (2x dual MC), other dual-module CPUs might be affected too. Booting plain Linux (3.2, 3.16, 4.1, 4.3), all memory controllers are listed under /sys/devices/system/edac/mc as expected. Same happens, when Xen 4.1 is used: all MCs pre...
2017 May 13
2
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
I haven't yet done as much experimentation as Andreas Pflug has, but I can confirm I'm also running into this bug with Xen 4.4.1. I've only tried Linux kernel 3.16.43, but as Dom0: EDAC MC: Ver: 3.0.0 AMD64 EDAC driver v3.4.0 EDAC amd64: DRAM ECC enabled. EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable. EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load. AMD64 EDAC driver v3.4.0 EDAC amd64: DRAM ECC enabled. EDAC amd64: NB M...
2006 Mar 10
1
CESA-2006:0132 Update CentOS 4 i386 kernel
New kernel install spits out a lot of warnings about unknown symbols in the edac drivers: Installing: kernel i586 2.6.9-34.EL update 10 M [ ... ] Installing: kernel ####################### [10/19] WARNING: /lib/modules/2.6.9-34.EL/kernel/drivers/edac/i82860_edac.ko needs unknown symbol edac_mc_del_mc WARNING: /...
2014 Jun 25
2
How to enable EDAC kernel module for checking ECC memory?
In order to support ZFS, we upgraded a backups server with a new, ECC motherboard. We're running CentOS 6 with ZFS on Linux, recently patched. Now, I want to enable EDAC so we can check for memory errors (and maybe PCI errors as well) but so far, repeatedly pounding on the Google hasn't yielded exactly what I need to do to enable EDAC. One howto was covering PCI and edac, but "modprobe edac_mc" didn't work. Here's some information below, H...
2006 Mar 12
1
2.6.9-34.EL kernel broken on i586?
...hree basement machines got upgraded to 2.6.9-34.EL by nightly yum cron job. The two i686 seem to be fine. However, on the old i586 machine I got hole bunch of warnings. Haven't attempted rebooting it with new kernel yet. The warnings are: WARNING: /lib/modules/2.6.9-34.EL/kernel/drivers/edac/amd76x_edac.ko needs unknown symbol edac_mc_del_mc WARNING: /lib/modules/2.6.9-34.EL/kernel/drivers/edac/amd76x_edac.ko needs unknown symbol edac_mc_find_mci_by_pdev WARNING: /lib/modules/2.6.9-34.EL/kernel/drivers/edac/amd76x_edac.ko needs unknown symbol edac_mc_add_mc WARNING: /lib/modules/2.6...
2017 May 16
3
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
...00:36, <ehem+debian at m5p.com> wrote: > > I haven't yet done as much experimentation as Andreas Pflug has, but I > > can confirm I'm also running into this bug with Xen 4.4.1. > > > > I've only tried Linux kernel 3.16.43, but as Dom0: > > > > EDAC MC: Ver: 3.0.0 > > AMD64 EDAC driver v3.4.0 > > EDAC amd64: DRAM ECC enabled. > > EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable. > > EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not > > load. > > AMD64 EDAC...
2015 Nov 21
0
[Bug 92971] [GF110] KDE plasma locks randomly due to crash of nouveau driver
...: fifo: read fault at 0000000000 engine 07 [PFIFO] client 06 [PFIFO] reason 00 [PT_NOT_PRESENT] on channel 30 [007e6ab000 kwin_x11[2097]] Nov 20 22:39:12 hpprol2 kernel: nouveau 0000:0a:00.0: fifo: fifo engine fault on channel 30, recovering... during each boot i see two error messages related to edac (at the end) Nov 20 22:58:07 hpprol2 kernel: EDAC MC: Ver: 3.0.0 Nov 20 22:58:07 hpprol2 kernel: EDAC sbridge: Seeking for: PCI ID 8086:3ca0 Nov 20 22:58:07 hpprol2 kernel: EDAC sbridge: Seeking for: PCI ID 8086:3ca0 Nov 20 22:58:07 hpprol2 kernel: EDAC sbridge: Seeking for: PCI ID 8086:3ca8 Nov 20...
2011 Aug 17
2
Strange Kernel Warning.
...s going bad or I am having problem with the actual board. Thank you in advace. I am getting the following error via stdout and also in /var/log/messages Aug 15 20:37:10 saturn kernel: Northbridge Error, node 0 Aug 15 20:37:10 saturn kernel: ECC/ChipKill ECC error. Aug 15 20:37:10 saturn kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x1b9e740 Aug 15 20:37:10 saturn kernel: EDAC MC0: CE page 0x1b9e, offset 0x740, grain 0, syndrome 0x1cc8, row 2, channel 0, label "": amd64_edac Aug 15 20:37:10 saturn kernel: EDAC MC0: CE - no information available: amd64_edacError Overflow Aug 15 23:33:41 s...
2008 Jan 19
2
EDAC error
Hello, I upgraded to CentOS 5.1 and everything went smoothly (Thanks for the awesome work!). But after rebooting, I get the following error: EDAC MC: Ver: 2.0.1 Nov 30 2007 EDAC e7xxx: error reporting device not found:vendor 8086 device 0x2541 (broken BIOS?) I found http://edacbugs.buttersideup.com/show_bug.cgi?id=21 with google but no solution. Is it safe to ignore the error or remove the EDAC module? I read their wiki but I'm ne...
2013 Apr 29
4
ECC memory errors
I started to receive this kind of messages a few days ago on one of my servers: Message from syslogd@ at Mon Apr 29 08:02:55 2013 ... server1 kernel: EDAC MC0: UE row 0, channel-a= 0 channel-b= 1 labels "-": (Branch=0 DRAM-Bank=0 RDWR=Read RAS=0 CAS=0, UE Err=0x2 (Aliased Uncorrectable Non-Mirrored Demand Data ECC)) I've never had ECC memory to fail on me before, so now I am wondering the following: * The server is running CentOS 5.7...
2007 Aug 03
0
Strange kernel error message: EDAC GART TLB blahblah..
What does the following EDAC problem means? The machine is a AMD 64bit box running Centos 5. It looks like some problems aroung AMD DRAM Memory controller. But what does it really mean b/c most of my AMD boxes has these messages in /var/log/messages. Please help. ... Aug 1 23:29:40 ccn128 kernel: EDAC MC: Ver: 2.0.1 Jun 1...
2016 May 03
2
Centos 6.7: kernel: EDAC MC0: CE row 2, channel 1, label "": (..... (Correctable Patrol Data ECC))
After update from centos 6.6 to centos 6.7 and reboot it, I have get a lot of this error into /var/log/messages: > May??3 11:27:20 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "": (Branch=0 DRAM-Bank=2 RDWR=Read RAS=6093 CAS=896, CE Err=0x10000 > (Correctable Patrol Data ECC)) > May??3 11:27:21 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "": (Branch=0 DRAM-Bank=1 RDWR=Read RAS=1330 CAS=4, CE Err=...
2008 Oct 13
1
"EDAC i5000 MC0: FATAL ERRORS Found!!!" error message?
...emtester, just to check, didn't find anything; and the box has been running for months before this without issue. I'm wondering if anyone has run across this before, and if so, if it was software (CentOS) or hardware (PowerEdge / PowerVault) related? Oct 8 12:19:35 someServer kernel: EDAC i5000 MC0: FATAL ERRORS Found!!! 1st FATAL Err Reg= 0x4 Oct 8 12:19:35 someServer kernel: EDAC i5000 MC0: >Tmid Thermal event with intelligent throttling disabled Oct 8 12:19:35 someServer kernel: EDAC MC0: UE row 1, channel-a= 2 channel-b= 3 labels "-": (Branch=1 DRAM-Bank=0 R...
2014 Jun 19
0
CEBA-2014:0768 CentOS 6 edac-utils FASTTRACK Update
...Errata and Bugfix Advisory 2014:0768 Upstream details at : https://rhn.redhat.com/errata/RHBA-2014-0768.html The following updated files have been uploaded and are currently syncing to the mirrors: ( sha256sum Filename ) i386: f9238919a8e55753462b1690cc36a16b0a0c29260663dd5fe3e9ee0ae7a187c9 edac-utils-0.9-15.el6.i686.rpm 91b52fdaf78f24484edef2e78096ba684ed88bd392f81f93d869c502a572d90d edac-utils-devel-0.9-15.el6.i686.rpm x86_64: f9238919a8e55753462b1690cc36a16b0a0c29260663dd5fe3e9ee0ae7a187c9 edac-utils-0.9-15.el6.i686.rpm 52d7bb5b647fba78ada71901b02330ba8257b622655fbe462c624ca2e639960c...
2009 Jul 04
2
x86_64 EDAC throwing error
Hi All, We have installed CentOS 5.3 x86_64 in an HP DL585 server with AMD Opteron 64 bit processor and 16 GB RAM. The kernel version is 2.6.18-128.el5 . Now this has thrown an error message in /var/log/message, Jul 3 21:41:11 db1 kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic) Jul 3 21:41:11 db1 kernel: EDAC MC0: CE page 0x65bc7, offset 0x6a0, grain 8, syndrome 0x6e1a, row 0, channel 0, label &quo...
2016 Jan 14
6
Bug#810964: only partial EDAC information with Xen
Package: xen-hypervisor-4.4-amd64 Version: 4.4.1-9+deb8u3 Debian 8.2 installed on a supermicro H8SGL Board, AMD 6128 with 4x4GB ECC RAM. When booting the plain kernel (stock Jessie 3.16 or backport 4.1 or 4.3), both memory controllers (mc0 and mc1) appear under /sys/devices/system/edac/mc with two csrow* each as expected. Same happens, when booted with Xen 4.1.4-3+deb7u1. When booted with Xen 4.4.1, only mc1 with two RAM modules is visible, although all 16GB RAM is available in the OS (xl info).
2009 Oct 19
2
EDAC Kernel Panic 2.6.9-78 and above
I've got a production system running CentOS 4 that was rock solid until I upgraded from 2.6.9-55 to 2.6.9-78.0.13 (now running 2.6.9-89.0.11). The system now crashes intermittently after a few weeks. I finally caught the panic message : EDAC MC0: INTERNAL ERROR: channel-b out of range (4 >= 4) Kernel panic - not syncing: MC0: Uncorrected Error Looking at the kernel changelog, I see that EDAC support was added for the Intel 5000 chipset in 2.6.9-68.20.EL which this server runs. I'm trying to determine if this is a potential mem...
2016 Jan 22
0
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
>>> On 22.01.16 at 10:09, <pgadmin at pse-consulting.de> wrote: > When booting with Xen 4.4.1: > > AMD64 EDAC driver v3.4.0 > EDAC amd64: DRAM ECC enabled. > EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable. I wonder how valid his message is. We actually write this MSR with all ones during boot. However, considering involved functions like nb_mce_bank_enabled_on_node() or...
2016 Jan 21
0
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
...;> On 20.01.16 at 16:01, <andreas.pflug at web.de> wrote: > Initially reported to debian > (http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=810964), redirected here: > > With AMD Opteron 6xxx processors, half of the memory controllers are > missing from /sys/devices/system/edac/mc > Checked with single 6120 (dual memory controller) and twin 6344 (2x dual > MC), other dual-module CPUs might be affected too. > > Booting plain Linux (3.2, 3.16, 4.1, 4.3), all memory controllers are > listed under /sys/devices/system/edac/mc as expected. Same happens, when &gt...