1*ed775ee7SAntonio Huete Jimenez# TCPDUMP 4.x.y by [The Tcpdump Group](https://www.tcpdump.org/) 2411677aeSAaron LI 3*ed775ee7SAntonio Huete Jimenez**To report a security issue please send an e-mail to security@tcpdump.org.** 4411677aeSAaron LI 5411677aeSAaron LITo report bugs and other problems, contribute patches, request a 6*ed775ee7SAntonio Huete Jimenezfeature, provide generic feedback etc please see the 7*ed775ee7SAntonio Huete Jimenez[guidelines for contributing](CONTRIBUTING) in the tcpdump source tree root. 8411677aeSAaron LI 9*ed775ee7SAntonio Huete JimenezAnonymous Git is available via 10411677aeSAaron LI 11*ed775ee7SAntonio Huete Jimenez https://github.com/the-tcpdump-group/tcpdump.git 12411677aeSAaron LI 13411677aeSAaron LIThis directory contains source code for tcpdump, a tool for network 14*ed775ee7SAntonio Huete Jimenezmonitoring and data acquisition. 15411677aeSAaron LI 16*ed775ee7SAntonio Huete JimenezOver the past few years, tcpdump has been steadily improved by the 17*ed775ee7SAntonio Huete Jimenezexcellent contributions from the Internet community (just browse 18*ed775ee7SAntonio Huete Jimenezthrough the [change log](CHANGES)). We are grateful for all the input. 19*ed775ee7SAntonio Huete Jimenez 20*ed775ee7SAntonio Huete Jimenez### Supported platforms 21*ed775ee7SAntonio Huete JimenezIn many operating systems tcpdump is available as a native package or port, 22*ed775ee7SAntonio Huete Jimenezwhich simplifies installation of updates and long-term maintenance. However, 23*ed775ee7SAntonio Huete Jimenezthe native packages are sometimes a few versions behind and to try a more 24*ed775ee7SAntonio Huete Jimenezrecent snapshot it will take to compile tcpdump from the source code. 25*ed775ee7SAntonio Huete Jimenez 26*ed775ee7SAntonio Huete Jimeneztcpdump compiles and works on at least the following platforms: 27*ed775ee7SAntonio Huete Jimenez 28*ed775ee7SAntonio Huete Jimenez* AIX 29*ed775ee7SAntonio Huete Jimenez* DragonFly BSD 30*ed775ee7SAntonio Huete Jimenez* FreeBSD 31*ed775ee7SAntonio Huete Jimenez* Haiku 32*ed775ee7SAntonio Huete Jimenez* HP-UX 11i 33*ed775ee7SAntonio Huete Jimenez* GNU/Linux 34*ed775ee7SAntonio Huete Jimenez* {Mac} OS X / macOS 35*ed775ee7SAntonio Huete Jimenez* NetBSD 36*ed775ee7SAntonio Huete Jimenez* OpenBSD 37*ed775ee7SAntonio Huete Jimenez* OpenWrt 38*ed775ee7SAntonio Huete Jimenez* Solaris 39*ed775ee7SAntonio Huete Jimenez* Windows (requires WinPcap or Npcap, and Visual Studio with CMake) 40*ed775ee7SAntonio Huete Jimenez 41*ed775ee7SAntonio Huete Jimenez### Dependency on libpcap 42411677aeSAaron LITcpdump uses libpcap, a system-independent interface for user-level 43411677aeSAaron LIpacket capture. Before building tcpdump, you must first retrieve and 44*ed775ee7SAntonio Huete Jimenezbuild libpcap. 45411677aeSAaron LI 46411677aeSAaron LIOnce libpcap is built (either install it or make sure it's in 47*ed775ee7SAntonio Huete Jimenez`../libpcap`), you can build tcpdump using the procedure in the 48*ed775ee7SAntonio Huete Jimenez[installation guide](INSTALL.txt). 49411677aeSAaron LI 50*ed775ee7SAntonio Huete Jimenez### Origins of tcpdump 51411677aeSAaron LIThe program is loosely based on SMI's "etherfind" although none of the 52411677aeSAaron LIetherfind code remains. It was originally written by Van Jacobson as 53*ed775ee7SAntonio Huete Jimenezpart of an ongoing research project to investigate and improve TCP and 54*ed775ee7SAntonio Huete JimenezInternet gateway performance. The parts of the program originally 55411677aeSAaron LItaken from Sun's etherfind were later re-written by Steven McCanne of 56411677aeSAaron LILBL. To insure that there would be no vestige of proprietary code in 57411677aeSAaron LItcpdump, Steve wrote these pieces from the specification given by the 58411677aeSAaron LImanual entry, with no access to the source of tcpdump or etherfind. 59*ed775ee7SAntonio Huete Jimenez```text 60*ed775ee7SAntonio Huete Jimenezformerly from Lawrence Berkeley National Laboratory 61*ed775ee7SAntonio Huete Jimenez Network Research Group <tcpdump@ee.lbl.gov> 62*ed775ee7SAntonio Huete Jimenez ftp://ftp.ee.lbl.gov/old/tcpdump.tar.Z (3.4) 63*ed775ee7SAntonio Huete Jimenez``` 64411677aeSAaron LI 65*ed775ee7SAntonio Huete Jimenez### See also 66411677aeSAaron LIRichard Stevens gives an excellent treatment of the Internet protocols 67411677aeSAaron LIin his book *"TCP/IP Illustrated, Volume 1"*. If you want to learn more 68411677aeSAaron LIabout tcpdump and how to interpret its output, pick up this book. 69411677aeSAaron LI 70*ed775ee7SAntonio Huete JimenezAnother tool that tcpdump users might find useful is 71*ed775ee7SAntonio Huete Jimenez[tcpslice](https://github.com/the-tcpdump-group/tcpslice). 72411677aeSAaron LIIt is a program that can be used to extract portions of tcpdump binary 73*ed775ee7SAntonio Huete Jimeneztrace files. 74411677aeSAaron LI 75*ed775ee7SAntonio Huete Jimenez### The original LBL README by Steve McCanne, Craig Leres and Van Jacobson 76411677aeSAaron LI``` 77411677aeSAaron LIThis directory also contains some short awk programs intended as 78411677aeSAaron LIexamples of ways to reduce tcpdump data when you're tracking 79411677aeSAaron LIparticular network problems: 80411677aeSAaron LI 81411677aeSAaron LIsend-ack.awk 82411677aeSAaron LI Simplifies the tcpdump trace for an ftp (or other unidirectional 83411677aeSAaron LI tcp transfer). Since we assume that one host only sends and 84411677aeSAaron LI the other only acks, all address information is left off and 85411677aeSAaron LI we just note if the packet is a "send" or an "ack". 86411677aeSAaron LI 87411677aeSAaron LI There is one output line per line of the original trace. 88411677aeSAaron LI Field 1 is the packet time in decimal seconds, relative 89411677aeSAaron LI to the start of the conversation. Field 2 is delta-time 90411677aeSAaron LI from last packet. Field 3 is packet type/direction. 91411677aeSAaron LI "Send" means data going from sender to receiver, "ack" 92411677aeSAaron LI means an ack going from the receiver to the sender. A 93411677aeSAaron LI preceding "*" indicates that the data is a retransmission. 94411677aeSAaron LI A preceding "-" indicates a hole in the sequence space 95411677aeSAaron LI (i.e., missing packet(s)), a "#" means an odd-size (not max 96411677aeSAaron LI seg size) packet. Field 4 has the packet flags 97411677aeSAaron LI (same format as raw trace). Field 5 is the sequence 98411677aeSAaron LI number (start seq. num for sender, next expected seq number 99411677aeSAaron LI for acks). The number in parens following an ack is 100411677aeSAaron LI the delta-time from the first send of the packet to the 101411677aeSAaron LI ack. A number in parens following a send is the 102411677aeSAaron LI delta-time from the first send of the packet to the 103411677aeSAaron LI current send (on duplicate packets only). Duplicate 104411677aeSAaron LI sends or acks have a number in square brackets showing 105411677aeSAaron LI the number of duplicates so far. 106411677aeSAaron LI 107411677aeSAaron LI Here is a short sample from near the start of an ftp: 108411677aeSAaron LI 3.00 0.20 send . 512 109411677aeSAaron LI 3.20 0.20 ack . 1024 (0.20) 110411677aeSAaron LI 3.20 0.00 send P 1024 111411677aeSAaron LI 3.40 0.20 ack . 1536 (0.20) 112411677aeSAaron LI 3.80 0.40 * send . 0 (3.80) [2] 113411677aeSAaron LI 3.82 0.02 * ack . 1536 (0.62) [2] 114411677aeSAaron LI Three seconds into the conversation, bytes 512 through 1023 115411677aeSAaron LI were sent. 200ms later they were acked. Shortly thereafter 116411677aeSAaron LI bytes 1024-1535 were sent and again acked after 200ms. 117411677aeSAaron LI Then, for no apparent reason, 0-511 is retransmitted, 3.8 118411677aeSAaron LI seconds after its initial send (the round trip time for this 119411677aeSAaron LI ftp was 1sec, +-500ms). Since the receiver is expecting 120411677aeSAaron LI 1536, 1536 is re-acked when 0 arrives. 121411677aeSAaron LI 122411677aeSAaron LIpacketdat.awk 123411677aeSAaron LI Computes chunk summary data for an ftp (or similar 124411677aeSAaron LI unidirectional tcp transfer). [A "chunk" refers to 125411677aeSAaron LI a chunk of the sequence space -- essentially the packet 126411677aeSAaron LI sequence number divided by the max segment size.] 127411677aeSAaron LI 128411677aeSAaron LI A summary line is printed showing the number of chunks, 129411677aeSAaron LI the number of packets it took to send that many chunks 130411677aeSAaron LI (if there are no lost or duplicated packets, the number 131411677aeSAaron LI of packets should equal the number of chunks) and the 132411677aeSAaron LI number of acks. 133411677aeSAaron LI 134411677aeSAaron LI Following the summary line is one line of information 135411677aeSAaron LI per chunk. The line contains eight fields: 136411677aeSAaron LI 1 - the chunk number 137411677aeSAaron LI 2 - the start sequence number for this chunk 138411677aeSAaron LI 3 - time of first send 139411677aeSAaron LI 4 - time of last send 140411677aeSAaron LI 5 - time of first ack 141411677aeSAaron LI 6 - time of last ack 142411677aeSAaron LI 7 - number of times chunk was sent 143411677aeSAaron LI 8 - number of times chunk was acked 144411677aeSAaron LI (all times are in decimal seconds, relative to the start 145411677aeSAaron LI of the conversation.) 146411677aeSAaron LI 147411677aeSAaron LI As an example, here is the first part of the output for 148411677aeSAaron LI an ftp trace: 149411677aeSAaron LI 150411677aeSAaron LI # 134 chunks. 536 packets sent. 508 acks. 151411677aeSAaron LI 1 1 0.00 5.80 0.20 0.20 4 1 152411677aeSAaron LI 2 513 0.28 6.20 0.40 0.40 4 1 153411677aeSAaron LI 3 1025 1.16 6.32 1.20 1.20 4 1 154411677aeSAaron LI 4 1561 1.86 15.00 2.00 2.00 6 1 155411677aeSAaron LI 5 2049 2.16 15.44 2.20 2.20 5 1 156411677aeSAaron LI 6 2585 2.64 16.44 2.80 2.80 5 1 157411677aeSAaron LI 7 3073 3.00 16.66 3.20 3.20 4 1 158411677aeSAaron LI 8 3609 3.20 17.24 3.40 5.82 4 11 159411677aeSAaron LI 9 4097 6.02 6.58 6.20 6.80 2 5 160411677aeSAaron LI 161411677aeSAaron LI This says that 134 chunks were transferred (about 70K 162411677aeSAaron LI since the average packet size was 512 bytes). It took 163411677aeSAaron LI 536 packets to transfer the data (i.e., on the average 164411677aeSAaron LI each chunk was transmitted four times). Looking at, 165411677aeSAaron LI say, chunk 4, we see it represents the 512 bytes of 166411677aeSAaron LI sequence space from 1561 to 2048. It was first sent 167411677aeSAaron LI 1.86 seconds into the conversation. It was last 168411677aeSAaron LI sent 15 seconds into the conversation and was sent 169411677aeSAaron LI a total of 6 times (i.e., it was retransmitted every 170411677aeSAaron LI 2 seconds on the average). It was acked once, 140ms 171411677aeSAaron LI after it first arrived. 172411677aeSAaron LI 173411677aeSAaron LIstime.awk 174411677aeSAaron LIatime.awk 175411677aeSAaron LI Output one line per send or ack, respectively, in the form 176411677aeSAaron LI <time> <seq. number> 177411677aeSAaron LI where <time> is the time in seconds since the start of the 178411677aeSAaron LI transfer and <seq. number> is the sequence number being sent 179411677aeSAaron LI or acked. I typically plot this data looking for suspicious 180411677aeSAaron LI patterns. 181411677aeSAaron LI 182411677aeSAaron LI 183411677aeSAaron LIThe problem I was looking at was the bulk-data-transfer 184411677aeSAaron LIthroughput of medium delay network paths (1-6 sec. round trip 185411677aeSAaron LItime) under typical DARPA Internet conditions. The trace of the 186411677aeSAaron LIftp transfer of a large file was used as the raw data source. 187411677aeSAaron LIThe method was: 188411677aeSAaron LI 189411677aeSAaron LI - On a local host (but not the Sun running tcpdump), connect to 190411677aeSAaron LI the remote ftp. 191411677aeSAaron LI 192411677aeSAaron LI - On the monitor Sun, start the trace going. E.g., 193411677aeSAaron LI tcpdump host local-host and remote-host and port ftp-data >tracefile 194411677aeSAaron LI 195411677aeSAaron LI - On local, do either a get or put of a large file (~500KB), 196411677aeSAaron LI preferably to the null device (to minimize effects like 197411677aeSAaron LI closing the receive window while waiting for a disk write). 198411677aeSAaron LI 199411677aeSAaron LI - When transfer is finished, stop tcpdump. Use awk to make up 200411677aeSAaron LI two files of summary data (maxsize is the maximum packet size, 201411677aeSAaron LI tracedata is the file of tcpdump tracedata): 202411677aeSAaron LI awk -f send-ack.awk packetsize=avgsize tracedata >sa 203411677aeSAaron LI awk -f packetdat.awk packetsize=avgsize tracedata >pd 204411677aeSAaron LI 205411677aeSAaron LI - While the summary data files are printing, take a look at 206411677aeSAaron LI how the transfer behaved: 207411677aeSAaron LI awk -f stime.awk tracedata | xgraph 208411677aeSAaron LI (90% of what you learn seems to happen in this step). 209411677aeSAaron LI 210411677aeSAaron LI - Do all of the above steps several times, both directions, 211411677aeSAaron LI at different times of day, with different protocol 212411677aeSAaron LI implementations on the other end. 213411677aeSAaron LI 214411677aeSAaron LI - Using one of the Unix data analysis packages (in my case, 215411677aeSAaron LI S and Gary Perlman's Unix|Stat), spend a few months staring 216411677aeSAaron LI at the data. 217411677aeSAaron LI 218411677aeSAaron LI - Change something in the local protocol implementation and 219411677aeSAaron LI redo the steps above. 220411677aeSAaron LI 221411677aeSAaron LI - Once a week, tell your funding agent that you're discovering 222411677aeSAaron LI wonderful things and you'll write up that research report 223411677aeSAaron LI "real soon now". 224411677aeSAaron LI``` 225