thr3ads.net - similar to: "mcelog"

Displaying 20 results from an estimated 10000 matches similar to: "mcelog"

2010 Jun 22

New kernel causes hardware error?

I have recently upgraded to 2.6.18-194.3.1.el5 and within several days the machine crashed with the following error (repeating in mcelog): MCE 0 HARDWARE ERROR. This is *NOT* a software problem! Please contact your hardware vendor CPU 2 BANK 8 MISC 41 MCG status: MCi status: Error overflow Uncorrected error MCi_MISC register valid Processor context corrupt MCA: MEMORY CONTROLLER AC_CHANNEL0_ERR

odd mcelogd problem

2014 Feb 11

odd mcelogd problem

CentOS 6.4, 2.6.32-358.11.1.el6.x86_64 (And no, I can't just upgrade - the users have to be sure that the computational results will be correct....) It's throwing ECC errors. Trying to start mcelogd, first it said nothing. Restart told me "Please load edac_mce_amd module." I did a modprobe edac_mce_amd, and lsmod tells me it's in. But now service mcelogd restart Stopping

Cant find out MCE reason (CPU 35 BANK 8)

2011 Mar 21

Cant find out MCE reason (CPU 35 BANK 8)

Hello community. We are running, Centos 4.8 on SuperMicro SYS-6026T-3RF with 2xIntel Xeon E5630 and 8xKingston KVR1333D3D4R9S/4G For some time we have lots of MCE in mcelog and we cant find out the reason. "Ordinary" mce message looks like: CPU 51 BANK 8 TSC 8511e3ca77dc MISC 274d587f00006141 ADDR 807044840 STATUS cc0055000001009f MCGSTATUS 0 decode with mcelog --ascii --cpu p4(cause

[ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus

2012 Nov 16

[ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus

Hi Konrad, Sometime ago i reported this one at boot up: [ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus [ 3009.788570] ------------[ cut here ]------------ [ 3009.798175] WARNING: at arch/x86/mm/pat.c:774 untrack_pfn+0xa1/0xb0() [ 3009.807966] Hardware name: MS-7640 [ 3009.817677] Modules linked in: [ 3009.827524] Pid:

After electric breaking: HARDWARE ERROR Kernel panic

2009 Feb 13

After electric breaking: HARDWARE ERROR Kernel panic

Hi all, After an electric breaking, my server (Centos 5.2 x86_64 with all updates) can not boot. The error message on screen is: ----------------------------------------------------------------------------------------------------------- Memory for crash kernel (0x0 to 0x0) notwithin permissible range <0> HARDWARE ERROR CPU 1: Machine Check Exception: 7 Bank 4: .... RIP 10:<.....>

CentOS Digest, Vol 73, Issue 3

2011 Feb 03

CentOS Digest, Vol 73, Issue 3

On 02/03/2011 09:00 AM, Lamar Owen wrote: > ------------------------------ > > On Wednesday, February 02, 2011 08:04:43 pm Les Mikesell wrote: >> > I think there are ways that drives can fail that would make them not be detected >> > at all - and for an autodetected raid member in a system that has been rebooted, >> > not leave much evidence of where it was

mcelog SELinux errors

2012 May 28

mcelog SELinux errors

Prowling around in the system logs this morning I discover the following entries: May 27 09:48:27 vhost01 mcelog: Cannot open logfile /var/log/mcelog: Permission denied May 27 09:48:27 vhost01 mcelog: failed to prefill DIMM database from DMI data May 27 09:48:27 vhost01 mcelog: Cannot bind to client unix socket `/var/run/mcel og-client': Permission denied and later: vhost01 setroubleshoot:

High load averages with latest kernel and USB drives?

2009 Nov 17

High load averages with latest kernel and USB drives?

I'm having a server report a high load average when backing up Postgres database files to an external USB drive. This is driving my loadbalancers all out of kilter and causing a large volume of network monitor alerts. I have a 1TB USB drive plugged into a USB2 port that I use to back up the production drives (which are SCSI). It's working fine, but while doing backups (hourly) the

Still more questions WRT selecting a mobo for small ZFS RAID

2008 Nov 14

Still more questions WRT selecting a mobo for small ZFS RAID

Like many others, I am looking to put together a SOHO NAS based on ZFS/CIFS. The plan is 6 x 1TB drives in RAIDZ2 configuration, driven via mobo with 6 SATA ports. I''ve read most, if not all, of the threads here, as well as sbredon''s excellent article on building a home NAS, yet I still have a number of unanswered questions. I was leaning heavily towards the M2N-E for a while,

Cannot open /dev/mcelog

2007 Dec 07

Cannot open /dev/mcelog

Hi there, I'am running CentOS 5.1 on a Dual AMD Opteron QuadCore System, without major issues so far. But if I call mcelog I'am getting the message "Cannot open /dev/mcelog" ... because it's not there. I'am running 2.6.18-53.1.4.el5xen kernel, is this related in any form? I found something similar for the debian universe: http://www.mail-archive.com/debian-bugs-closed

CEBA-2011:0512 CentOS 5 x86_64 mcelog Update

2011 May 18

CEBA-2011:0512 CentOS 5 x86_64 mcelog Update

CentOS Errata and Bugfix Advisory 2011:0512 Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) x86_64: 770b7960cc4a775b21bdc7c282c90a42 mcelog-0.9pre-1.32.el5.x86_64.rpm Source: 4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm -- Johnny Hughes

Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing

2017 May 13

Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing

I haven't yet done as much experimentation as Andreas Pflug has, but I can confirm I'm also running into this bug with Xen 4.4.1. I've only tried Linux kernel 3.16.43, but as Dom0: EDAC MC: Ver: 3.0.0 AMD64 EDAC driver v3.4.0 EDAC amd64: DRAM ECC enabled. EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable. EDAC amd64: ECC disabled in the BIOS or no ECC

Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing

2017 May 16

Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing

On Mon, May 15, 2017 at 02:02:53AM -0600, Jan Beulich wrote: > >>> On 14.05.17 at 00:36, <ehem+debian at m5p.com> wrote: > > I haven't yet done as much experimentation as Andreas Pflug has, but I > > can confirm I'm also running into this bug with Xen 4.4.1. > > > > I've only tried Linux kernel 3.16.43, but as Dom0: > > > > EDAC

Centos 6.7: kernel: EDAC MC0: CE row 2, channel 1, label "": (..... (Correctable Patrol Data ECC))

2016 May 03

Centos 6.7: kernel: EDAC MC0: CE row 2, channel 1, label "": (..... (Correctable Patrol Data ECC))

After update from centos 6.6 to centos 6.7 and reboot it, I have get a lot of this error into /var/log/messages: > May??3 11:27:20 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "": (Branch=0 DRAM-Bank=2 RDWR=Read RAS=6093 CAS=896, CE Err=0x10000 > (Correctable Patrol Data ECC)) > May??3 11:27:21 s-virt kernel: EDAC MC0: CE row 2, channel 1, label > "":

CEBA-2011:0512 CentOS 5 i386 mcelog Update

2011 May 18

CEBA-2011:0512 CentOS 5 i386 mcelog Update

CentOS Errata and Bugfix Advisory 2011:0512 Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html The following updated files have been uploaded and are currently syncing to the mirrors: ( md5sum Filename ) i386: Source: 4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm -- Johnny Hughes CentOS Project { http://www.centos.org/ } irc: hughesjr, #centos at

Motherboard for home zfs/solaris file server

2009 Feb 24

Motherboard for home zfs/solaris file server

Hello, I am building a home file server and am looking for an ATX mother board that will be supported well with OpenSolaris (onboard SATA controller, network, graphics if any, audio, etc). I decided to go for Intel based boards (socket LGA 775) since it seems like power management is better supported with Intel processors and power efficiency is an important factor. After reading several

Supermicro CentOS 7 install failure

2015 Dec 21

Supermicro CentOS 7 install failure

My workhorse server is a SuperMicro with their H8DM8-2 motherboard. For many years it ran CentOS 5.x and 6.x until the boot drive failed last year. I installed a 1TB SSD as /dev/sda and planned to install CentOS 7 on it, replacing CentOS 6.5 on the failed drive. Unfortunately every CentOS 7 media I tried, either optical disk or USB thumb drive, breaks down just a few seconds after selecting

DIMM problem

2013 Apr 24

DIMM problem

Hey, folks, I've got an HP Proliant DL580 G5 throwing ECC errors. This is annoying, since a) it's all new as of a few months ago, and b) it's *fully* populated. The two things I need to figure out are a) *which* DIMM it is, and b) is it mirrored; if so, which *other* DIMM needs to come out until we get replacements from the OEM. Here's one of many, all identical, from dmesg:

Xen4CentOS installation strangeness

2014 Mar 04

Xen4CentOS installation strangeness

Hi, I have a server with Supermicro X7DVL-3 (P9) motherboard, 16G ECC RAM and LSI SAS 1068e RAID controller. I installed CentOS 6.5 64bit on the machine without any problems, but after following the Xen setup steps at http://wiki.centos.org/HowTos/Xen/Xen4QuickStart which installed me the kernel 3.10.32-11.el6.centos.alt.x86_64, I encountered a problem: After "Starting certmonger

How to enable EDAC kernel module for checking ECC memory?

2014 Jun 25

How to enable EDAC kernel module for checking ECC memory?

In order to support ZFS, we upgraded a backups server with a new, ECC motherboard. We're running CentOS 6 with ZFS on Linux, recently patched. Now, I want to enable EDAC so we can check for memory errors (and maybe PCI errors as well) but so far, repeatedly pounding on the Google hasn't yielded exactly what I need to do to enable EDAC. One howto was covering PCI and edac, but

similar to: mcelog