Volker Poplawski
2011-Sep-23 10:54 UTC
[CentOS] Problems with Intel Ethernet and module e1000e
Hi all,

I'm facing a serious problem with the e1000e kernel module for Intel 82574L gigabit NICs on CentOS 6.

The device eth0 suddenly stops working, i.e. no more networking. When I run ifconfig from the console I get:

    eth0      Link encap:Ethernet  HWaddr 00:xx:xx:xx:xx:EA
              inet6 addr: fe80::225:90ff:fe50:8fea/64 Scope:Link
              UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
              RX packets:37984 errors:54245436935850 dropped:9040906155975 overruns:0 frame:36163624623900
              TX packets:20884 errors:0 dropped:0 overruns:0 carrier:0
              collisions:0 txqueuelen:1000
              RX bytes:4431149 (4.2 MiB)  TX bytes:4628666 (4.4 MiB)
              Memory:fb900000-fb920000

The reported byte counts and RX/TX packet counts are reasonable. However, the incredibly large number of errors is not. Also, the errors don't pile up gradually; they simply appear at the moment the device stops working.

In /var/log/messages I get:

    Sep 23 12:21:09 wader2 kernel: ------------[ cut here ]------------
    Sep 23 12:21:09 wader2 kernel: WARNING: at net/sched/sch_generic.c:261 dev_watchdog+0x26d/0x280() (Not tainted)
    Sep 23 12:21:09 wader2 kernel: Hardware name: X9SCL/X9SCM
    Sep 23 12:21:09 wader2 kernel: NETDEV WATCHDOG: eth0 (e1000e): transmit queue 0 timed out
    Sep 23 12:21:09 wader2 kernel: Modules linked in: tun ebtable_nat ebtables xt_CHECKSUM iptable_mangle ipt_MASQUERADE iptable_nat nf_nat sunrpc bridge stp llc xt_physdev ipt_REJECT nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter ip_tables ip6t_REJECT nf_conntrack_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6 kvm_intel kvm serio_raw i2c_i801 i2c_core iTCO_wdt iTCO_vendor_support sg e1000e ext4 mbcache jbd2 sd_mod crc_t10dif ahci megaraid_sas dm_mod [last unloaded: scsi_wait_scan]
    Sep 23 12:21:09 wader2 kernel: Pid: 0, comm: swapper Not tainted 2.6.32-71.29.1.el6.x86_64 #1
    Sep 23 12:21:09 wader2 kernel: Call Trace:
    Sep 23 12:21:09 wader2 kernel: <IRQ>  [<ffffffff8106b947>] warn_slowpath_common+0x87/0xc0
    Sep 23 12:21:09 wader2 kernel: [<ffffffff8106ba36>] warn_slowpath_fmt+0x46/0x50
    Sep 23 12:21:09 wader2 kernel: [<ffffffff8142a07d>] dev_watchdog+0x26d/0x280
    Sep 23 12:21:09 wader2 kernel: [<ffffffff8107d3c5>] ? internal_add_timer+0xb5/0x110
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81429e10>] ? dev_watchdog+0x0/0x280
    Sep 23 12:21:09 wader2 kernel: [<ffffffff8107dfc7>] run_timer_softirq+0x197/0x340
    Sep 23 12:21:09 wader2 kernel: [<ffffffff810a0e90>] ? tick_sched_timer+0x0/0xc0
    Sep 23 12:21:09 wader2 kernel: [<ffffffff8102f52d>] ? lapic_next_event+0x1d/0x30
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81073d67>] __do_softirq+0xb7/0x1e0
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81095c50>] ? hrtimer_interrupt+0x140/0x250
    Sep 23 12:21:09 wader2 kernel: [<ffffffff810142cc>] call_softirq+0x1c/0x30
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81015f35>] do_softirq+0x65/0xa0
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81073b65>] irq_exit+0x85/0x90
    Sep 23 12:21:09 wader2 kernel: [<ffffffff814d0a31>] smp_apic_timer_interrupt+0x71/0x9c
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81013c93>] apic_timer_interrupt+0x13/0x20
    Sep 23 12:21:09 wader2 kernel: <EOI>  [<ffffffff812dac0f>] ? acpi_idle_enter_bm+0x28f/0x2c3
    Sep 23 12:21:09 wader2 kernel: [<ffffffff812dac08>] ? acpi_idle_enter_bm+0x288/0x2c3
    Sep 23 12:21:09 wader2 kernel: [<ffffffff813df687>] cpuidle_idle_call+0xa7/0x140
    Sep 23 12:21:09 wader2 kernel: [<ffffffff81011e96>] cpu_idle+0xb6/0x110
    Sep 23 12:21:09 wader2 kernel: [<ffffffff814b1a0a>] rest_init+0x7a/0x80
    Sep 23 12:21:09 wader2 kernel: [<ffffffff818c3f19>] start_kernel+0x413/0x41f
    Sep 23 12:21:09 wader2 kernel: [<ffffffff818c333a>] x86_64_start_reservations+0x125/0x129
    Sep 23 12:21:09 wader2 kernel: [<ffffffff818c3438>] x86_64_start_kernel+0xfa/0x109
    Sep 23 12:21:09 wader2 kernel: ---[ end trace 69b6c5e494cffe4d ]---
    Sep 23 12:21:10 wader2 kernel: 0000:04:00.0: eth0: Error reading PHY register
    Sep 23 12:21:10 wader2 kernel: e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX

The last line falsely reports the link as 1000 Mbit, but it is actually 100 Mbit. ethtool does the same.

Bringing the interface down with ifconfig eth0 down and then back up with ifconfig eth0 up does not help. A reboot gets the interface back to normal. The problem returns after some minutes, hours or a day.

Any ideas?

Regards
.......Volker
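For anyone wanting to catch this state automatically, here is a minimal Python sketch that parses the RX line of classic net-tools ifconfig output, as pasted above, and flags counters that are wildly out of proportion to the packet count. The function names and the 1000x threshold are assumptions made for illustration; they are not part of any tool mentioned in this thread.

```python
import re

# Matches the RX counter line printed by classic net-tools ifconfig.
RX_LINE = re.compile(
    r"RX packets:(\d+) errors:(\d+) dropped:(\d+) overruns:(\d+) frame:(\d+)"
)


def rx_counters(ifconfig_text):
    """Return the RX counters as a dict, or None if no RX line is found."""
    match = RX_LINE.search(ifconfig_text)
    if match is None:
        return None
    names = ("packets", "errors", "dropped", "overruns", "frame")
    return dict(zip(names, (int(g) for g in match.groups())))


def looks_bogus(counters, factor=1000):
    """Heuristic (an assumption, not a driver rule): an error-type counter
    that exceeds the packet count by `factor` is treated as garbage."""
    limit = factor * max(counters["packets"], 1)
    return any(counters[key] > limit for key in ("errors", "dropped", "frame"))


# The RX line from the ifconfig output quoted in this post.
SAMPLE = (
    "RX packets:37984 errors:54245436935850 dropped:9040906155975 "
    "overruns:0 frame:36163624623900"
)

if __name__ == "__main__":
    counters = rx_counters(SAMPLE)
    print(counters, "bogus" if looks_bogus(counters) else "plausible")
```

A check like this only detects the symptom; it says nothing about whether the driver, the NIC or the switch is at fault.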
m.roth at 5-cent.us
2011-Sep-23 14:35 UTC
[CentOS] Problems with Intel Ethernet and module e1000e
Volker Poplawski wrote:
> Hi all,
>
> I'm facing a serious problem with the e100e kernel module for Intel
> 82574L gigabit nics on Centos 6.
>
> The device eth0 suddenly stops working i.e. no more networking. When I
> do ifconfig from console I get

Do you have either another port on the box, or can you swap in another NIC?

mark
Emmanuel Noobadmin
2011-Sep-23 16:35 UTC
[CentOS] Problems with Intel Ethernet and module e1000e
On 9/23/11, Volker Poplawski <volker at openbios.org> wrote:
> Hi all,
>
> I'm facing a serious problem with the e100e kernel module for Intel
> 82574L gigabit nics on Centos 6.
>
> The device eth0 suddenly stops working i.e. no more networking. When I
> do ifconfig from console I get
. . .
> Bringing down the interface with ifconfig eth0 down and then ifconfig
> eth0 up does not help. A reboot gets the interface back to normal. The
> problem returns after some minutes, hours or a day.

I had the same problem with an Intel Gigabit NIC and the e1000 module just last week on a fresh CentOS 6.0 install. The network would die for no explicable reason and could be fixed only by a reboot. The problem was solved by following one of the threads in the CentOS forum, IIRC, and using the elrepo kmod drivers.
From: Volker Poplawski <volker at openbios.org>
> I'm facing a serious problem with the e100e kernel module for Intel
> 82574L gigabit nics on Centos 6.

I had problems with my Intel 1000e too. Installed elrepo's kmod-e1000e and so far so good...

http://elrepo.org/tiki/kmod-e1000e

JD
Patrick Lists
2011-Sep-26 12:54 UTC
[CentOS] Problems with Intel Ethernet and module e1000e
On 09/23/2011 12:54 PM, Volker Poplawski wrote:
> Hi all,
>
> I'm facing a serious problem with the e100e kernel module for Intel
> 82574L gigabit nics on Centos 6.

I have also had problems with an Intel Gbit NIC in a Dell Optiplex 760 using the e1000 module. When it came up at all, it came up at 100Mbit speed. Only unplugging the LAN cable and reinserting it in the switch would sometimes result in the proper 1Gb speed. After replacing the 3Com Gbit switch with a DLink Gbit switch, things started to work OK.

As suggested, you could try out the e1000 kernel module from elrepo (if it works, good for you), but you may want to look into the switch too.

Regards,
Patrick
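To watch for a link that has negotiated down to 100Mbit, a small Python sketch along the following lines may help. It assumes a kernel that exposes the negotiated speed in /sys/class/net/<iface>/speed (not every kernel of this era necessarily does), and the function names and the 1000 Mbit default are hypothetical choices for illustration. Note that it reports whatever the driver believes, so it would not catch the case described earlier in the thread where the driver itself claims 1000 Mbps incorrectly.

```python
from pathlib import Path


def link_speed_mbps(iface, sysfs_root="/sys/class/net"):
    """Return the speed in Mbit/s reported by sysfs, or None if unavailable.

    Reading the file can fail (no such interface, link down) or yield a
    non-positive placeholder; all of those cases are mapped to None.
    """
    try:
        value = int((Path(sysfs_root) / iface / "speed").read_text().strip())
    except (OSError, ValueError):
        return None
    return value if value > 0 else None


def below_expected(iface, expected_mbps=1000, sysfs_root="/sys/class/net"):
    """True only when a speed is reported and it is lower than expected."""
    speed = link_speed_mbps(iface, sysfs_root)
    return speed is not None and speed < expected_mbps


if __name__ == "__main__":
    print("eth0 speed:", link_speed_mbps("eth0"))
    print("eth0 below 1000 Mbit:", below_expected("eth0"))
```

The sysfs_root parameter exists only so the functions can be pointed at a fake directory tree for testing.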