Dangerous ARPs

Noah Davids

The ARP protocol works quietly in the background, and most of the time it works flawlessly. Unfortunately, such reliability can lead to complacency; when TCP connection problems occur, no one ever thinks of the ARP protocol. In this article, I'll provide a quick review of the ARP protocol and describe some ways in which ARP can create problems. I'll also show how to diagnose these problems using a protocol analyzer like tcpdump.

The ARP Protocol

For a TCP/IP packet to be transmitted over an Ethernet, the packet must be encapsulated in an Ethernet Frame. The first 6 bytes of that frame are the Ethernet media access control (MAC) address of the destination host. This will be either the MAC address of the destination host (if the source and destination hosts are on the same subnet), or the address of a router (if the source and destination are on different subnets).

The ARP protocol is used to find the MAC address of the destination. The sender broadcasts an "ARP Request" packet containing the IP address of the destination (or the IP address of the router) and the owner of that IP address sends an "ARP Reply" packet containing its MAC address. The sender then places the IP to MAC address association in an ARP cache, so it does not have to go through this ARP Request/Reply for every packet. Because the size of the ARP cache is limited and the IP to MAC address mappings can be changed by many things (e.g., hardware failures and DHCP renewals), entries in the ARP cache eventually time out. How long an entry stays in the ARP cache depends on the operating system and its configuration parameters.

You can display the ARP cache table with the ARP command. Note that all examples were done on a system running Red Hat 7.2 (Linux 2.4 kernel). The "?" indicates that the IP address does not resolve to a name. Name resolution, especially name resolution failure, can take time. So if the command appears to hang, try it with the -n option, which will cause the command to skip the name resolution step. See Listings 1a and 1b.

Remember that there is no provision in the ARP protocol to confirm that the host sending the ARP Reply is actually the owner of the requested IP address. Generally, this is a good thing. Many routers and hosts can provide something called "Proxy ARP". Basically, if one host sends out an ARP Request for another host on a remote network, a router can respond with an ARP Reply as if it were the destination host. This allows the source host to communicate when, for whatever reason, it believes that it and the destination host are on the same subnet but in fact they are not. One common cause of this situation is a subnet mask that has fewer bits then it should (e.g., 255.255.0.0 instead of 255.255.255.0). If you ever see an ARP cache where lots of hosts have the same MAC address, you can be pretty sure that the MAC address belongs to a router doing proxy ARPs. In Listing 2, note the entries for 192.168.2.1, 2 and 8. You should also investigate why the system you are on is ARP-ing hosts on a different subnet.

Of course, if something is generally good, there are also times when it is bad. If a router (or any host) is doing proxy ARPs when it shouldn't be, then you have a problem. As a network support specialist, I see this kind of problem once or twice a quarter. That's not very often, but if you don't look in the right place, it can take much longer to solve than typical problems and can create months' worth of headaches all on its own. On the other hand, if you know what to look for, things become much easier.

The following traces were captured in a test environment (hence the similar names and MAC and IP addresses between the two problems), but the traces accurately represent what I saw at my customer locations.

Problem #1: On Again, Off Again Connections

In the first case, the symptom is an intermittent connection between two hosts -- Top and RoadRunner. Top is running Red Hat 7.2, and its IP address is 172.16.1.2. RoadRunner is running Microsoft Windows 2000 Server and has an IP address of 192.168.1.100. Top is running the client side of the application, and RoadRunner has the server side. The connection will work for a time, and then Top will get a "Connection closed by foreign host" error message after sending some data. A new connection can be established but will eventually get the same "Connection closed by foreign host" error.

Listing 3 shows a partial trace taken on Top with tcpdump. It is unremarkable until frame #8. Frame #8 is a retransmission (from Top) of frame #7 and is immediately followed by a reset from RoadRunner in frame #9. Based on this trace, it appears that RoadRunner reset the connection for some reason.

Listing 4 shows a trace taken from another Red Hat system (Spy1) connected to the same switch as RoadRunner. The switch was configured to mirror all traffic to/from RoadRunner to the switch port connected to Spy1. Except for the timestamps, which are not in sync, the two traces are the same through frame #7. At that point, Spy1's frame #8 shows RoadRunner sending an acknowledgement frame to Top and Top sending the reset to RoadRunner in frame #9. These frames do not show up in Top's trace. The traces get back in sync at frames #10 and #11, which shows Top's retransmission frame and the reset from Roadrunner. Based on this trace, it appears that Top has reset the connection.

So, at first glance, it appears that Top has reset the connection. There are a few unusual things about this trace. First, frame #8 is sent to and frame #9 comes from a different MAC address than all the other frames. Second, the time to live (TTL) value in frame #9 is completely different from the TTL value in all the other frames that came from Top. The difference is so large (192) that it is not reasonable to assume that the packet just took another path. Third, the value in frame #9 is 255, which indicates that it did not pass through any routers (255 is the maximum TTL value). Clearly, this should not be possible. Finally, why would Top reset the connection in frame #9 and then send data in frame #10? Top should not be sending any data after resetting the connection.

It also appears that RoadRunner sent an acknowledgement packet (frame #8) to the wrong host (the different MAC address) and that that host responded with a reset (frame #9). RoadRunner then closed its connection and, when Top retransmitted frame #7 with frame #10, RoadRunner responded with a reset (frame #11) because it had already closed its connection. The trace taken on Top sees only frames #7, #10, and #11.

We still need to know why RoadRunner is sending frame #8 to the wrong MAC address. The key is to look at the entire trace, not just the connection between RoadRunner and Top. Between frame #7 and #8, RoadRunner sends an ARP request looking for the MAC address of 192.168.1.99 (see Figure 5). This is the router that connects RoadRunner's subnet to Top's subnet. A reply is received from 0:50:bf:2e:1e:56. This is the MAC address used in frame #8. After frame #9, there is another reply -- this time with the address 0:e0:1e:60:8e:c7. This is the address used in the rest of the frames.

What's happening is that, on RoadRunner, the ARP cache entry for the router times out and is purged (by default, Microsoft Windows 2000 will do this every 10 minutes). The next time a packet needs to be sent to the router, an ARP request is sent; as soon as an answer is received, the ARP cache is updated and the packet sent. When the second ARP reply is received, the ARP cache is again updated but it is too late to save the connection (because of the reset). The next connection, however, will use the updated ARP cache and work fine -- for a while.

At this point we had to identify what host had a MAC address of 0:50:bf:2e:1e:56. It turned out to belong to a firewall, which explained why it was responding with the reset. It had no knowledge of a connection between RoadRunner and Top so it shut down the connection. Unfortunately, we could not determine why it was responding to the ARP for 192.168.1.99 in the first place. To work around the problem, I set a static entry for the router in the ARP cache of RoadRunner. Setting a static ARP cache entry should be considered only a stopgap measure. Once the firewall is fixed, the entry should be deleted; otherwise, if the router NIC card were changed, we would be unable to communicate with it.

It is very important to get a complete trace of all frames when debugging a problem. If I had just done a "tcpdump host RoadRunner", I would not have seen the MAC addresses and would not have known that the packets were sent to different hosts. I also had to remove the "host RoadRunner" argument in order to see the ARP packets and learn what was really happening. Finally, getting a trace from both ends of the network at the same time can significantly cut down on finger pointing. With just the trace from Top, the problem appeared to be on RoadRunner. With just the trace from Spy1, we could easily have assumed that the problem was with Top. With both traces, however, we could see the inconsistencies in Spy1's trace.

Problem #2: One-Way Connections/Pings

In the second example, connections between hosts on the same subnet (Daffy at 192.168.1.2 and RoadRunner at 192.168.1.100) can sometimes be made. Daffy is running Red Hat 7.2, and RoadRunner is still running Microsoft Windows 2000 Server. After careful questioning, I learn that pings from Daffy to RoadRunner usually succeed, and pings from RoadRunner to Daffy usually fail. Again, tcpdump on Spy1 shows what is happening.

In Listing 6, Daffy has initiated the ping. Note that in frame #1 Daffy has sent an ARP request looking for RoadRunner; RoadRunner replies in frame #2, and Daffy sends a ping in frame #3 and gets a reply in frame #4. In frame #5, we get another ARP reply claiming to be RoadRunner but with a different Ethernet MAC address. Daffy however, ignores this reply and continues to use the address from the first reply (see frames #8 and #10). Note also that RoadRunner never sends an ARP query looking for Daffy's address; it has already learned the address from the packets that Daffy has sent.

The trace in Listing 7 starts out very much like the trace in Listing 6. Frame #1 shows RoadRunner sending an ARP query for Daffy, and Daffy responds in frame #2. Frame #3 contains RoadRunner's ping, and frame #4 contains Daffy's response. Frame #5 contains another ARP reply, but Microsoft Windows 2000 immediately starts using the MAC address in the new reply in frames #6, #7, and #8, which now go unanswered. This is a difference in how Microsoft Windows and Linux deal with gratuitous ARPs. If the ARP reply in frame #5 had arrived first, Daffy would not have been able to make a connection, but RoadRoader would have (at least until its ARP cache timed out (see problem #1)).

Again, a static ARP entry on RoadRunner resolved the problem but we were still wondering what sent the second ARP reply. We were unable to identify the second MAC address 0:50:bf:2e:1e:56, but another trace taken over a much longer time frame turned up some ARP frames looking for hosts on a completely different subnet, see Listing 8.

When we looked at the 172.16.2.0 network for the second MAC address, it was identified as a router. A check of the router showed that it thought it had a route back to our 192.168.1.0 network. Someone had bridged the two networks 172.16.2.0 and 192.168.1.0, and the router was doing proxy ARPs. The route was deleted and the proxy ARPs stopped; however, it took most of a day to trace where the two networks had been accidentally bridged and to break the connection.

Summary

The first thing to remember if you are having communications problems is to check the ARP cache of both systems (and any routers). The process is quick and easy and can prevent hours spent looking for the wrong thing. However, as we saw in example 1, the ARP cache on RoadRunner was wrong for only a very brief time. So, just because the cache is correct when you check doesn't mean it was correct when the packet was sent. Nevertheless, if it is wrong when you look, find out why.

Second, if possible, get simultaneous traces from each host participating in the connection. If you can't get a trace from both ends, remember that what you are seeing in the trace is not necessarily what the other side of the connection is seeing.

Third, filtering packets in a trace is usually necessary because of the volume of packets on a network, but sometimes the key packets don't show up in the trace because they have been filtered out. Be prepared to change the filter or, if that doesn't help, eliminate the filter altogether.

Fourth, pay attention to all the fields in the trace -- not just the IP addresses and TCP flags. Some of the secondary fields, like the TTL field, can provide critical clues to diagnose the problem.

Noah Davids ([email protected]) has worked as a LAN technical support specialist for a mid-sized computer company for the last 10 years. He has published numerous articles on LAN programming and troubleshooting. He has CNX, Network+, and MCSE certifications as well as a Master of Science in Computer Science from Arizona State University.