similar to: mcelog

Displaying 20 results from an estimated 10000 matches similar to: "mcelog"

2010 Jun 22
4
New kernel causes hardware error?
I have recently upgraded to 2.6.18-194.3.1.el5 and within several days the machine crashed with the following error (repeating in mcelog): MCE 0 HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware vendor CPU 2 BANK 8 MISC 41 MCG status: MCi status: Error overflow Uncorrected error MCi_MISC register valid Processor context corrupt MCA: MEMORY CONTROLLER AC_CHANNEL0_ERR
2014 Feb 11
1
odd mcelogd problem
CentOS 6.4, 2.6.32-358.11.1.el6.x86_64 (And no, I can't just upgrade - the users have to be sure that the computational results will be correct....) It's throwing ECC errors. Trying to start mcelogd, first it said nothing. Restart told me "Please load edac_mce_amd module." I did a modprobe edac_mce_amd, and lsmod tells me it's in. But now service mcelogd restart Stopping
2011 Mar 21
1
Cant find out MCE reason (CPU 35 BANK 8)
Hello community. We are running, Centos 4.8 on SuperMicro SYS-6026T-3RF with 2xIntel Xeon E5630 and 8xKingston KVR1333D3D4R9S/4G For some time we have lots of MCE in mcelog and we cant find out the reason. "Ordinary" mce message looks like: CPU 51 BANK 8 TSC 8511e3ca77dc MISC 274d587f00006141 ADDR 807044840 STATUS cc0055000001009f MCGSTATUS 0 decode with mcelog --ascii --cpu p4(cause
2012 Nov 16
5
[ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus
Hi Konrad, Sometime ago i reported this one at boot up: [ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus [ 3009.788570] ------------[ cut here ]------------ [ 3009.798175] WARNING: at arch/x86/mm/pat.c:774 untrack_pfn+0xa1/0xb0() [ 3009.807966] Hardware name: MS-7640 [ 3009.817677] Modules linked in: [ 3009.827524] Pid:
2009 Feb 13
3
After electric breaking: HARDWARE ERROR Kernel panic
Hi all, After an electric breaking, my server (Centos 5.2 x86_64 with all updates) can not boot. The error message on screen is: ----------------------------------------------------------------------------------------------------------- Memory for crash kernel (0x0 to 0x0) notwithin permissible range <0> HARDWARE ERROR CPU 1: Machine Check Exception: 7 Bank 4: .... RIP 10:<.....>
2011 Feb 03
2
CentOS Digest, Vol 73, Issue 3
On 02/03/2011 09:00 AM, Lamar Owen wrote: > ------------------------------ > > On Wednesday, February 02, 2011 08:04:43 pm Les Mikesell wrote: >> > I think there are ways that drives can fail that would make them not be detected >> > at all - and for an autodetected raid member in a system that has been rebooted, >> > not leave much evidence of where it was
2012 May 28
0
mcelog SELinux errors
Prowling around in the system logs this morning I discover the following entries: May 27 09:48:27 vhost01 mcelog: Cannot open logfile /var/log/mcelog: Permission denied May 27 09:48:27 vhost01 mcelog: failed to prefill DIMM database from DMI data May 27 09:48:27 vhost01 mcelog: Cannot bind to client unix socket `/var/run/mcel og-client': Permission denied and later: vhost01 setroubleshoot:
2009 Nov 17
2
High load averages with latest kernel and USB drives?
I'm having a server report a high load average when backing up Postgres database files to an external USB drive. This is driving my loadbalancers all out of kilter and causing a large volume of network monitor alerts. I have a 1TB USB drive plugged into a USB2 port that I use to back up the production drives (which are SCSI). It's working fine, but while doing backups (hourly) the
2008 Nov 14
23
Still more questions WRT selecting a mobo for small ZFS RAID
Like many others, I am looking to put together a SOHO NAS based on ZFS/CIFS. The plan is 6 x 1TB drives in RAIDZ2 configuration, driven via mobo with 6 SATA ports. I''ve read most, if not all, of the threads here, as well as sbredon''s excellent article on building a home NAS, yet I still have a number of unanswered questions. I was leaning heavily towards the M2N-E for a while,
2007 Dec 07
0
Cannot open /dev/mcelog
Hi there, I'am running CentOS 5.1 on a Dual AMD Opteron QuadCore System, without major issues so far. But if I call mcelog I'am getting the message "Cannot open /dev/mcelog" ... because it's not there. I'am running 2.6.18-53.1.4.el5xen kernel, is this related in any form? I found something similar for the debian universe: http://www.mail-archive.com/debian-bugs-closed
2011 May 18
0
CEBA-2011:0512 CentOS 5 x86_64 mcelog Update
CentOS Errata and Bugfix Advisory 2011:0512 Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) x86_64: 770b7960cc4a775b21bdc7c282c90a42 mcelog-0.9pre-1.32.el5.x86_64.rpm Source: 4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm -- Johnny Hughes
2017 May 13
2
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
I haven't yet done as much experimentation as Andreas Pflug has, but I can confirm I'm also running into this bug with Xen 4.4.1. I've only tried Linux kernel 3.16.43, but as Dom0: EDAC MC: Ver: 3.0.0 AMD64 EDAC driver v3.4.0 EDAC amd64: DRAM ECC enabled. EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable. EDAC amd64: ECC disabled in the BIOS or no ECC
2017 May 16
3
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
On Mon, May 15, 2017 at 02:02:53AM -0600, Jan Beulich wrote: > >>> On 14.05.17 at 00:36, <ehem+debian at m5p.com> wrote: > > I haven't yet done as much experimentation as Andreas Pflug has, but I > > can confirm I'm also running into this bug with Xen 4.4.1. > > > > I've only tried Linux kernel 3.16.43, but as Dom0: > > > > EDAC
2016 May 03
2
Centos 6.7: kernel: EDAC MC0: CE row 2, channel 1, label "": (..... (Correctable Patrol Data ECC))
After update from centos 6.6 to centos 6.7 and reboot it, I have get a lot of this error into /var/log/messages: > May??3 11:27:20 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "": (Branch=0 DRAM-Bank=2 RDWR=Read RAS=6093 CAS=896, CE Err=0x10000 > (Correctable Patrol Data ECC)) > May??3 11:27:21 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "":
2011 May 18
0
CEBA-2011:0512 CentOS 5 i386 mcelog Update
CentOS Errata and Bugfix Advisory 2011:0512 Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) i386: Source: 4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm -- Johnny Hughes CentOS Project { http://www.centos.org/ } irc: hughesjr, #centos at
2009 Feb 24
44
Motherboard for home zfs/solaris file server
Hello, I am building a home file server and am looking for an ATX mother board that will be supported well with OpenSolaris (onboard SATA controller, network, graphics if any, audio, etc). I decided to go for Intel based boards (socket LGA 775) since it seems like power management is better supported with Intel processors and power efficiency is an important factor. After reading several
2015 Dec 21
1
Supermicro CentOS 7 install failure
My workhorse server is a SuperMicro with their H8DM8-2 motherboard. For many years it ran CentOS 5.x and 6.x until the boot drive failed last year. I installed a 1TB SSD as /dev/sda and planned to install CentOS 7 on it, replacing CentOS 6.5 on the failed drive. Unfortunately every CentOS 7 media I tried, either optical disk or USB thumb drive, breaks down just a few seconds after selecting
2013 Apr 24
3
DIMM problem
Hey, folks, I've got an HP Proliant DL580 G5 throwing ECC errors. This is annoying, since a) it's all new as of a few months ago, and b) it's *fully* populated. The two things I need to figure out are a) *which* DIMM it is, and b) is it mirrored; if so, which *other* DIMM needs to come out until we get replacements from the OEM. Here's one of many, all identical, from dmesg:
2014 Mar 04
1
Xen4CentOS installation strangeness
Hi, I have a server with Supermicro X7DVL-3 (P9) motherboard, 16G ECC RAM and LSI SAS 1068e RAID controller. I installed CentOS 6.5 64bit on the machine without any problems, but after following the Xen setup steps at http://wiki.centos.org/HowTos/Xen/Xen4QuickStart which installed me the kernel 3.10.32-11.el6.centos.alt.x86_64, I encountered a problem: After "Starting certmonger
2014 Jun 25
2
How to enable EDAC kernel module for checking ECC memory?
In order to support ZFS, we upgraded a backups server with a new, ECC motherboard. We're running CentOS 6 with ZFS on Linux, recently patched. Now, I want to enable EDAC so we can check for memory errors (and maybe PCI errors as well) but so far, repeatedly pounding on the Google hasn't yielded exactly what I need to do to enable EDAC. One howto was covering PCI and edac, but