NFS server: Dell PowerEdge 1950, with version 1.2.2.6 of if_bce.c:

bce0: <Broadcom NetXtreme II BCM5708 1000Base-T (B1), v0.9.6> mem 0xf4000000-0xf5ffffff irq 16 at device 0.0 on pci9
bce0: ASIC ID 0x57081010; Revision (B1); PCI-X 64-bit 133MHz
miibus0: <MII bus> on bce0
bce0: Ethernet address: 00:13:72:f8:6f:8c
bce0: link state changed to UP

NFS client: Dell PowerEdge 1750, with:

bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2002>

Both are running an up-to-date setup of FreeBSD 6.1.

Situation:

- Copying gigabytes of data via rsync or ftp: no problem
- Mounting a test directory from the 1950 via NFS on the 1750 and creating some files/directories: ok
- Mounting a directory with many files via NFS (/var/tmp, 14000 php_sess): ok
- Starting a directory listing on it: immediate (network) crash of the NFS server (reproduced 3 times)

I no longer have a remote console display (only power on/off), so at the moment I can only see the server drop off the network (it is no longer pingable), but it is most probably still running. The only thing I found in /var/log/messages:

Nov 6 23:09:44 gemini kernel: bce0: /usr/src/sys/dev/bce/if_bce.c(5000): Watchdog timeout occurred, resetting!
Nov 6 23:09:44 gemini kernel: bce0: link state changed to DOWN
Nov 6 23:09:46 gemini kernel: bce0: link state changed to UP
Nov 6 23:10:49 gemini kernel: bce0: /usr/src/sys/dev/bce/if_bce.c(5000): Watchdog timeout occurred, resetting!
Nov 6 23:10:49 gemini kernel: bce0: link state changed to DOWN
Nov 6 23:10:51 gemini kernel: bce0: link state changed to UP

What could/should I try next? The 1.18 version of if_bce.c? (If the Dell PE1950 uses the same NIC as the PE1955; cf. http://www.mail-archive.com/freebsd-stable@freebsd.org/msg83981.html )

Is anyone using a 1950 without any NIC problems (tcp && udp)? If yes, please tell me which CVS version of the driver you are using.

Thanks & regards,
Olivier
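To compare runs before and after a change, it can help to count how often the watchdog fired. The following is a minimal sketch, not something taken from the thread: the function name `count_watchdog_resets` is hypothetical, and it assumes the log lines keep the format shown in the message above.

```shell
#!/bin/sh
# Count bce watchdog resets recorded in a syslog file.
# Usage: count_watchdog_resets [logfile] [interface]
# Defaults (/var/log/messages, bce0) are taken from the report above.
count_watchdog_resets() {
    logfile="${1:-/var/log/messages}"
    ifname="${2:-bce0}"
    # grep -c prints the number of matching lines (0 if none match).
    grep -c "${ifname}: .*Watchdog timeout occurred" "$logfile"
}
```

Calling `count_watchdog_resets` with no arguments on the server would read /var/log/messages; a count that stops growing during an NFS directory listing suggests the resets are gone.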
Olivier Mueller wrote:
> NFS Server: dell poweredge 1950, with the 1.2.2.6 version of if_bce.c:
> [...]
> - Start a directory listing on it: immediate (network) crash of the NFS
> server. (reproduced 3 times)
> [...]
> What could/should I try next? The 1.18 version of if_bce.c ?

Do the following, then retry your test:

ifconfig bce0 -txcsum

Scott
On 7 Nov 06 at 01:15, Scott Long wrote:
> Olivier Mueller wrote:
>> - Start a directory listing on it: immediate (network) crash of
>> the NFS server. (reproduced 3 times)
>
> Do the following, then retry your test:
> ifconfig bce0 -txcsum

Oh, this way it looks much better, thanks. The directory listing was fine, and copying files over NFS for 2-3 minutes worked as well. More tests will follow tomorrow.

Next step? :-) Should I put that command somewhere in my init scripts, or even directly in rc.conf, or wait for a new version of if_bce.c? I am available for any other PE1950-related tests if this may help.

Regards,
Olivier
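For the rc.conf option raised above, one possible approach is to append the flag to the interface's ifconfig line, since rc.conf passes that string to ifconfig at boot. This is only a sketch: the address and netmask are placeholders, not values from this thread, and the existing line on the server should be kept and only extended with `-txcsum`.

```shell
# /etc/rc.conf fragment (sh syntax).
# The address and netmask below are placeholders (assumptions);
# only the trailing -txcsum comes from the advice in this thread.
ifconfig_bce0="inet 192.0.2.10 netmask 255.255.255.0 -txcsum"
```

This keeps the workaround across reboots without touching the driver source.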
Olivier Mueller wrote:
> Next step? :-) Should I put that command somewhere in my init
> scripts, or even directly in rc.conf [...]?

Change /sys/dev/bce/if_bcereg.h so that BCE_IF_HWASSIST is defined to 0. Then recompile.

Scott
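The header change could be prepared along these lines. This is a hedged sketch only: the helper name `disable_hwassist` is hypothetical, it assumes the macro sits on a single `#define` line (the thread does not show the current definition), and the kernel configuration name in the comments is an assumption.

```shell
#!/bin/sh
# Print a copy of a header with BCE_IF_HWASSIST redefined to 0.
# Assumes the macro is defined on a single "#define" line; this is not
# confirmed by the thread, so review the output before using it.
disable_hwassist() {
    sed -E 's/^(#define[[:space:]]+BCE_IF_HWASSIST)[[:space:]].*$/\1 0/' "$1"
}

# Possible usage on the server (header path from the thread; KERNCONF is an assumption):
#   disable_hwassist /sys/dev/bce/if_bcereg.h > /tmp/if_bcereg.h
#   diff -u /sys/dev/bce/if_bcereg.h /tmp/if_bcereg.h   # review the change
#   cp /tmp/if_bcereg.h /sys/dev/bce/if_bcereg.h
#   cd /usr/src && make buildkernel KERNCONF=GENERIC && make installkernel KERNCONF=GENERIC
```

The function writes to standard output instead of editing in place, so the result can be diffed against the original before anything is overwritten.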
Hi,

Will a fix (or this fix) be part of the 6.2 release? We will be relying heavily on FreeBSD 6.2 and the PE1950, and we are worried about the stability of the bce driver.

Kind regards,
Fredrik Widlund

Scott Long wrote:
> Change /sys/dev/bce/if_bcereg.h so that BCE_IF_HWASSIST is defined to 0.
> Then recompile.
>
> Scott
>
> _______________________________________________
> freebsd-stable@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"
Hi,

This is still an issue: we are experiencing hangs and loss of connectivity on Dell PE1950 machines running 6.2-RELEASE without debug.mpsafenet=0. The hangs last about a minute, then the machines come alive again. Needless to say, this is impossible to tolerate in a production environment.

Kind regards,
Fredrik Widlund

Fredrik Widlund wrote:
> Will a fix/this fix be part of the 6.2 Release? We will be relying
> heavily on fbsd6.2 and pe1950 and are worried about the BCE stability?
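For anyone wanting to try the debug.mpsafenet=0 setting mentioned above: on FreeBSD 6.x it is a boot-time tunable, normally set in /boot/loader.conf and applied at the next reboot. A minimal sketch for adding it idempotently follows; the function name `set_mpsafenet_off` is hypothetical, and whether the setting is an acceptable workaround for a given machine is not something the thread settles.

```shell
#!/bin/sh
# Append debug.mpsafenet="0" to a loader.conf-style file unless the
# tunable is already set there. Usage: set_mpsafenet_off [file]
set_mpsafenet_off() {
    conf="${1:-/boot/loader.conf}"
    if ! grep -q '^debug\.mpsafenet=' "$conf" 2>/dev/null; then
        printf 'debug.mpsafenet="0"\n' >> "$conf"
    fi
}
```

Running it twice leaves a single entry; an existing line with a different value is left untouched and would need to be edited by hand.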