network connections are suddenly dropped (arp problem)?


Subject: network connections are suddenly dropped (arp problem)?
From: Markus Koch (koch@mpiib-berlin.mpg.de)
Date: Tue Aug 14 2001 - 17:33:58 EDT


Hi

One of my servers (PII400/440BX/Intel EtherexpressPro and SCSI on board /
2.2.16-RAID) suddenly drops all connections to it (appletalk and samba). The
server is not even reachable by pings and all afpd's time out. This persists
for 1 to 5 min and then everything is fine again. After reading around I
found that the eepro driver is buggy in this kernel version. I got the
newest intel driver e100 compiled it and installed it as a module, I made
sure it is loaded at boot time (lsmod). At first this seemed to solve the
problem but it turns out the the problem persists although less often. I
sniffed all broadcast and server specific traffic from a second linux
computer within the same network (logs are attached). It looks to me as if
the server suddenly forgets all arp entries and sends out arp requests. The
weird thing is, that he does not seem to recognize the arp answers but keeps
on asking.
Is this a known problem even with the new intel etherexpress pro driver and
appletalk ? Is there any possibility to solve it?

thanks

markus

sniffed from 192.168.1.1
(server in question is 192.168.1.2 192.168.1.104 is a win95 that is active
but does not answer)

159447 16:58:38.2394 00:e0:18:90:57:31 -> 00:05:02:e1:a9:5b ARP Who has
192.168.1.120? Tell 192.168.1.2
159448 16:58:38.2398 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159450 16:58:39.2395 00:e0:18:90:57:31 -> 00:05:02:e1:a9:5b ARP Who has
192.168.1.120? Tell 192.168.1.2
159451 16:58:39.2398 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159452 16:58:40.2396 00:e0:18:90:57:31 -> 00:05:02:e1:a9:5b ARP Who has
192.168.1.120? Tell 192.168.1.2
159453 16:58:40.2399 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159454 16:58:41.2396 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159455 16:58:41.2399 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159457 16:58:42.2397 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159458 16:58:42.2400 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159459 16:58:43.2398 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159460 16:58:43.2400 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159461 16:58:43.7198 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159462 16:58:43.7209 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159463 16:58:44.7199 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159464 16:58:44.7209 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159466 16:58:45.7199 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159467 16:58:45.7210 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159468 16:58:45.9900 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.122? Tell 192.168.1.2
159469 16:58:46.0084 00:0a:27:b6:0d:5c -> 00:e0:18:90:57:31 ARP
192.168.1.122 is at 00:0a:27:b6:0d:5c
159471 16:58:46.9900 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.122? Tell 192.168.1.2
159472 16:58:46.9902 00:0a:27:b6:0d:5c -> 00:e0:18:90:57:31 ARP
192.168.1.122 is at 00:0a:27:b6:0d:5c
1159475 16:58:47.9901 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.122? Tell 192.168.1.2
159476 16:58:47.9903 00:0a:27:b6:0d:5c -> 00:e0:18:90:57:31 ARP
192.168.1.122 is at 00:0a:27:b6:0d:5c159574
(snipped)
159579 17:01:03.2613 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159580 17:01:03.2614 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159581 17:01:03.7204 00:30:84:0b:27:c3 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.1
159582 17:01:04.2592 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159583 17:01:04.2593 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159584 17:01:05.2593 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159585 17:01:05.2593 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159586 17:01:06.8494 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159587 17:01:06.8497 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159588 17:01:07.8495 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159589 17:01:07.8497 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159590 17:01:08.2204 00:30:84:0b:27:c3 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.1
159591 17:01:08.8495 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.120? Tell 192.168.1.2
159592 17:01:08.8498 00:05:02:e1:a9:5b -> 00:e0:18:90:57:31 ARP
192.168.1.120 is at 00:05:02:e1:a9:5b
159593 17:01:09.2204 00:30:84:0b:27:c3 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.1
159594 17:01:10.2204 00:30:84:0b:27:c3 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.1
159595 17:01:13.2681 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159596 17:01:13.2682 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159597 17:01:14.2599 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159598 17:01:14.2599 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159599 17:01:15.2599 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.1? Tell 192.168.1.2
159600 17:01:15.2600 00:30:84:0b:27:c3 -> 00:e0:18:90:57:31 ARP 192.168.1.1
is at 00:30:84:0b:27:c3
159601 17:01:16.6801 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.2
159602 17:01:17.3301 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159603 17:01:17.3311 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159604 17:01:17.6801 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.2
159605 17:01:18.3302 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159606 17:01:18.3312 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159607 17:01:18.6802 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.104? Tell 192.168.1.2
159608 17:01:19.3302 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.105? Tell 192.168.1.2
159609 17:01:19.3313 Apple_d7:be:26 -> 00:e0:18:90:57:31 ARP 192.168.1.105
is at 08:00:07:d7:be:26
159610 17:01:19.6003 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.122? Tell 192.168.1.2
159611 17:01:19.6005 00:0a:27:b6:0d:5c -> 00:e0:18:90:57:31 ARP
192.168.1.122 is at 00:0a:27:b6:0d:5c
159612 17:01:20.6003 00:e0:18:90:57:31 -> ff:ff:ff:ff:ff:ff ARP Who has
192.168.1.122? Tell 192.168.1.2

ifconfig
eth0 Link encap:Ethernet HWaddr 00:E0:18:90:57:31
          inet addr:192.168.1.2 Bcast:192.168.1.255 Mask:255.255.255.0
          EtherTalk Phase 2 addr:65280/94
          UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
          RX packets:349589 errors:0 dropped:0 overruns:0 frame:0
          TX packets:208378 errors:0 dropped:0 overruns:0 carrier:0
          collisions:132288



This archive was generated by hypermail 2b28 : Sun Oct 14 2001 - 03:04:48 EDT