Displaying 20 results from an estimated 10000 matches similar to: "mcelog"
2010 Jun 22
4
New kernel causes hardware error?
I have recently upgraded to 2.6.18-194.3.1.el5 and within several days
the machine crashed with the following error (repeating in mcelog):
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 2 BANK 8 MISC 41
MCG status:
MCi status:
Error overflow
Uncorrected error
MCi_MISC register valid
Processor context corrupt
MCA: MEMORY CONTROLLER AC_CHANNEL0_ERR
2014 Feb 11
1
odd mcelogd problem
CentOS 6.4, 2.6.32-358.11.1.el6.x86_64
(And no, I can't just upgrade - the users have to be sure that the
computational results will be correct....)
It's throwing ECC errors. Trying to start mcelogd, first it said nothing.
Restart told me "Please load edac_mce_amd module." I did a modprobe
edac_mce_amd, and lsmod tells me it's in. But now
service mcelogd restart
Stopping
2011 Mar 21
1
Can't find out MCE reason (CPU 35 BANK 8)
Hello community.
We are running CentOS 4.8 on a SuperMicro SYS-6026T-3RF with 2x Intel Xeon
E5630 and 8x Kingston KVR1333D3D4R9S/4G.
For some time we have had lots of MCEs in mcelog and we can't find out the reason.
An "ordinary" MCE message looks like:
CPU 51 BANK 8 TSC 8511e3ca77dc
MISC 274d587f00006141 ADDR 807044840
STATUS cc0055000001009f MCGSTATUS 0
decode with mcelog --ascii --cpu p4(cause
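As a rough illustration of what the STATUS value in this message encodes, the sketch below splits it into the architectural flag bits and error-code fields of the IA32_MCi_STATUS register. The names `FLAG_BITS` and `decode_status` are hypothetical, and only the architecturally defined fields are covered; the model-specific interpretation that mcelog performs is not attempted here.

```python
# Architectural flag bits of an IA32_MCi_STATUS value (bit number -> short name).
# Illustrative helper only; it does not replace mcelog's own decoding.
FLAG_BITS = {
    63: "VAL",    # register contains valid data
    62: "OVER",   # error overflow
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting enabled
    59: "MISCV",  # MISC register valid
    58: "ADDRV",  # ADDR register valid
    57: "PCC",    # processor context corrupt
}


def decode_status(status: int) -> dict:
    """Split a 64-bit machine-check STATUS value into its basic fields."""
    return {
        "flags": [name for bit, name in FLAG_BITS.items() if (status >> bit) & 1],
        "mca_error_code": status & 0xFFFF,              # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


if __name__ == "__main__":
    fields = decode_status(0xCC0055000001009F)  # STATUS value quoted above
    print(fields["flags"])
    print(hex(fields["mca_error_code"]))
```

For the value quoted above, the UC and PCC bits are clear, which suggests a corrected rather than a fatal error.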
2012 Nov 16
5
[ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus
Hi Konrad,
Some time ago I reported this one at boot-up:
[ 3009.778974] mcelog:16842 map pfn expected mapping type write-back for [mem 0x0009f000-0x000a0fff], got uncached-minus
[ 3009.788570] ------------[ cut here ]------------
[ 3009.798175] WARNING: at arch/x86/mm/pat.c:774 untrack_pfn+0xa1/0xb0()
[ 3009.807966] Hardware name: MS-7640
[ 3009.817677] Modules linked in:
[ 3009.827524] Pid:
2009 Feb 13
3
After electric breaking: HARDWARE ERROR Kernel panic
Hi all,
After a power outage, my server (CentOS 5.2 x86_64 with all
updates) cannot boot. The error message on screen is:
-----------------------------------------------------------------------------------------------------------
Memory for crash kernel (0x0 to 0x0) notwithin permissible range
<0>
HARDWARE ERROR
CPU 1: Machine Check Exception: 7 Bank 4: ....
RIP 10:<.....>
2011 Feb 03
2
CentOS Digest, Vol 73, Issue 3
On 02/03/2011 09:00 AM, Lamar Owen wrote:
> ------------------------------
>
> On Wednesday, February 02, 2011 08:04:43 pm Les Mikesell wrote:
>> > I think there are ways that drives can fail that would make them not be detected
>> > at all - and for an autodetected raid member in a system that has been rebooted,
>> > not leave much evidence of where it was
2012 May 28
0
mcelog SELinux errors
Prowling around in the system logs this morning I discovered the
following entries:
May 27 09:48:27 vhost01 mcelog: Cannot open logfile /var/log/mcelog: Permission denied
May 27 09:48:27 vhost01 mcelog: failed to prefill DIMM database from DMI data
May 27 09:48:27 vhost01 mcelog: Cannot bind to client unix socket `/var/run/mcelog-client': Permission denied
and later:
vhost01 setroubleshoot:
2009 Nov 17
2
High load averages with latest kernel and USB drives?
I'm having a server report a high load average when backing up Postgres
database files to an external USB drive. This is driving my loadbalancers all
out of kilter and causing a large volume of network monitor alerts.
I have a 1TB USB drive plugged into a USB2 port that I use to back up the
production drives (which are SCSI). It's working fine, but while doing backups
(hourly) the
2008 Nov 14
23
Still more questions WRT selecting a mobo for small ZFS RAID
Like many others, I am looking to put together a SOHO NAS based on ZFS/CIFS. The plan is 6 x 1TB drives in RAIDZ2 configuration, driven via mobo with 6 SATA ports.
I've read most, if not all, of the threads here, as well as sbredon's excellent article on building a home NAS, yet I still have a number of unanswered questions.
I was leaning heavily towards the M2N-E for a while,
2007 Dec 07
0
Cannot open /dev/mcelog
Hi there,
I'm running CentOS 5.1 on a dual AMD Opteron quad-core system, without
major issues so far.
But if I call mcelog I get the message "Cannot open /dev/mcelog"
... because it's not there.
I'm running the 2.6.18-53.1.4.el5xen kernel; is this related in any way? I
found something similar for the Debian universe:
http://www.mail-archive.com/debian-bugs-closed
2011 May 18
0
CEBA-2011:0512 CentOS 5 x86_64 mcelog Update
CentOS Errata and Bugfix Advisory 2011:0512
Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html
The following updated files have been uploaded and are currently
syncing to the mirrors: ( md5sum Filename )
x86_64:
770b7960cc4a775b21bdc7c282c90a42 mcelog-0.9pre-1.32.el5.x86_64.rpm
Source:
4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm
--
Johnny Hughes
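The checksums listed in this advisory can be compared against a downloaded package. The following is a minimal sketch, assuming the RPM has already been fetched to a local path; the function name `md5_matches` is hypothetical and not part of any CentOS tooling.

```python
import hashlib


def md5_matches(path: str, expected_hex: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the given MD5 digest."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in chunks so large packages are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.strip().lower()


# Example call (the local path is an assumption):
# md5_matches("mcelog-0.9pre-1.32.el5.x86_64.rpm",
#             "770b7960cc4a775b21bdc7c282c90a42")
```

MD5 is used here only because it is what the advisory publishes; it detects a corrupted download but is not a strong guarantee of authenticity.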
2017 May 13
2
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
I haven't yet done as much experimentation as Andreas Pflug has, but I
can confirm I'm also running into this bug with Xen 4.4.1.
I've only tried Linux kernel 3.16.43, but as Dom0:
EDAC MC: Ver: 3.0.0
AMD64 EDAC driver v3.4.0
EDAC amd64: DRAM ECC enabled.
EDAC amd64: NB MCE bank disabled, set MSR 0x0000017b[4] on node 0 to enable.
EDAC amd64: ECC disabled in the BIOS or no ECC
2017 May 16
3
Bug#810964: [Xen-devel] [BUG] EDAC infomation partially missing
On Mon, May 15, 2017 at 02:02:53AM -0600, Jan Beulich wrote:
> >>> On 14.05.17 at 00:36, <ehem+debian at m5p.com> wrote:
> > I haven't yet done as much experimentation as Andreas Pflug has, but I
> > can confirm I'm also running into this bug with Xen 4.4.1.
> >
> > I've only tried Linux kernel 3.16.43, but as Dom0:
> >
> > EDAC
2016 May 03
2
Centos 6.7: kernel: EDAC MC0: CE row 2, channel 1, label "": (..... (Correctable Patrol Data ECC))
After updating from CentOS 6.6 to CentOS 6.7 and rebooting, I get a
lot of these errors in /var/log/messages:
> May  3 11:27:20 s-virt kernel: EDAC MC0: CE row 2, channel 1, label
> "": (Branch=0 DRAM-Bank=2 RDWR=Read RAS=6093 CAS=896, CE Err=0x10000
> (Correctable Patrol Data ECC))
> May  3 11:27:21 s-virt kernel: EDAC MC0: CE row 2, channel 1, label
> "":
2011 May 18
0
CEBA-2011:0512 CentOS 5 i386 mcelog Update
CentOS Errata and Bugfix Advisory 2011:0512
Upstream details at : https://rhn.redhat.com/errata/RHBA-2011-0512.html
The following updated files have been uploaded and are currently
syncing to the mirrors: ( md5sum Filename )
i386:
Source:
4e4826b260464bf1ae5f5c9311c76557 mcelog-0.9pre-1.32.el5.src.rpm
--
Johnny Hughes
CentOS Project { http://www.centos.org/ }
irc: hughesjr, #centos at
2009 Feb 24
44
Motherboard for home zfs/solaris file server
Hello,
I am building a home file server and am looking for an ATX mother board
that will be supported well with OpenSolaris (onboard SATA controller,
network, graphics if any, audio, etc). I decided to go for Intel based
boards (socket LGA 775) since it seems like power management is better
supported with Intel processors and power efficiency is an important
factor. After reading several
2015 Dec 21
1
Supermicro CentOS 7 install failure
My workhorse server is a SuperMicro with their H8DM8-2 motherboard. For
many years it ran CentOS 5.x and 6.x until the boot drive failed last
year. I installed a 1TB SSD as /dev/sda and planned to install CentOS 7 on
it, replacing CentOS 6.5 on the failed drive. Unfortunately every CentOS 7
media I tried, either optical disk or USB thumb drive, breaks down just a
few seconds after selecting
2013 Apr 24
3
DIMM problem
Hey, folks,
I've got an HP Proliant DL580 G5 throwing ECC errors. This is annoying,
since a) it's all new as of a few months ago, and b) it's *fully*
populated. The two things I need to figure out are a) *which* DIMM it
is, and b) is it mirrored; if so, which *other* DIMM needs to come out
until we get replacements from the OEM.
Here's one of many, all identical, from dmesg:
2014 Mar 04
1
Xen4CentOS installation strangeness
Hi,
I have a server with Supermicro X7DVL-3 (P9) motherboard, 16G ECC RAM and
LSI SAS 1068e RAID controller. I installed CentOS 6.5 64bit on the machine
without any problems, but after following the Xen setup steps at
http://wiki.centos.org/HowTos/Xen/Xen4QuickStart
which installed the kernel 3.10.32-11.el6.centos.alt.x86_64, I
encountered a problem: After "Starting certmonger
2014 Jun 25
2
How to enable EDAC kernel module for checking ECC memory?
In order to support ZFS, we upgraded a backup server with a new ECC
motherboard. We're running CentOS 6 with ZFS on Linux, recently patched.
Now, I want to enable EDAC so we can check for memory errors (and maybe
PCI errors as well) but so far, repeatedly pounding on the Google hasn't
yielded exactly what I need to do to enable EDAC.
One howto was covering PCI and edac, but