network interface zombies randomly [was Re: interface dies under network load on SMP machines]
david
david@kalifornia.com
Thu Aug 27 20:51:08 1998
Reply to mail from Mike Simons about interface dies under network load on SMP machines
-----------------
>> The interface goes dead from time to time without leaving any log
>> messages - a simple ifconfig down/up brings them back to live. To
>> stabilise the systems I've written a small program that checks the
>> network and restarts the interface if required. So everything is
>> nearly perfectly fine...
[some parts snipped]
stipulations:
- it may be the tulip driver
- it may be the [xxx] driver
- it may be the kernel
it may be a subtle bug somewhere in the network/device path that's not
directly related
I used to feel that it was the tulip driver. Now I am not sure. The
default version of tulip.c that comes in the 2.1 tarball works fine. If I
get any of the most recent tulip.c files from cesdis, then the above
anomally appears.
I.e., random dying of the interface. It can happen within minutes, it can
take days. I have not yet found a pattern.
As is mentioned, ifconfig down/up fixes it.
If I recall correctly, packets aren't lost by doing ifconfig down/up. For
example, ping <host> will flood your screen with all the latent packets
finally getting through.
comments?
-d
--
Look, look, see Windows 98. Buy, lemmings, buy!
(c) 1998 David Ford. Redistribution via the Microsoft Network is prohibited.
for linux-kernel: please read linux/Documentation/* before posting problems