xref: /dflybsd-src/contrib/tcpdump/README.md (revision 59c07fbdf8168fa08c76c515186d561b5a92690c)
1*ed775ee7SAntonio Huete Jimenez# TCPDUMP 4.x.y by [The Tcpdump Group](https://www.tcpdump.org/)
2411677aeSAaron LI
3*ed775ee7SAntonio Huete Jimenez**To report a security issue please send an e-mail to security@tcpdump.org.**
4411677aeSAaron LI
5411677aeSAaron LITo report bugs and other problems, contribute patches, request a
6*ed775ee7SAntonio Huete Jimenezfeature, provide generic feedback etc please see the
7*ed775ee7SAntonio Huete Jimenez[guidelines for contributing](CONTRIBUTING) in the tcpdump source tree root.
8411677aeSAaron LI
9*ed775ee7SAntonio Huete JimenezAnonymous Git is available via
10411677aeSAaron LI
11*ed775ee7SAntonio Huete Jimenez	https://github.com/the-tcpdump-group/tcpdump.git
12411677aeSAaron LI
13411677aeSAaron LIThis directory contains source code for tcpdump, a tool for network
14*ed775ee7SAntonio Huete Jimenezmonitoring and data acquisition.
15411677aeSAaron LI
16*ed775ee7SAntonio Huete JimenezOver the past few years, tcpdump has been steadily improved by the
17*ed775ee7SAntonio Huete Jimenezexcellent contributions from the Internet community (just browse
18*ed775ee7SAntonio Huete Jimenezthrough the [change log](CHANGES)).  We are grateful for all the input.
19*ed775ee7SAntonio Huete Jimenez
20*ed775ee7SAntonio Huete Jimenez### Supported platforms
21*ed775ee7SAntonio Huete JimenezIn many operating systems tcpdump is available as a native package or port,
22*ed775ee7SAntonio Huete Jimenezwhich simplifies installation of updates and long-term maintenance. However,
23*ed775ee7SAntonio Huete Jimenezthe native packages are sometimes a few versions behind and to try a more
24*ed775ee7SAntonio Huete Jimenezrecent snapshot it will take to compile tcpdump from the source code.
25*ed775ee7SAntonio Huete Jimenez
26*ed775ee7SAntonio Huete Jimeneztcpdump compiles and works on at least the following platforms:
27*ed775ee7SAntonio Huete Jimenez
28*ed775ee7SAntonio Huete Jimenez* AIX
29*ed775ee7SAntonio Huete Jimenez* DragonFly BSD
30*ed775ee7SAntonio Huete Jimenez* FreeBSD
31*ed775ee7SAntonio Huete Jimenez* Haiku
32*ed775ee7SAntonio Huete Jimenez* HP-UX 11i
33*ed775ee7SAntonio Huete Jimenez* GNU/Linux
34*ed775ee7SAntonio Huete Jimenez* {Mac} OS X / macOS
35*ed775ee7SAntonio Huete Jimenez* NetBSD
36*ed775ee7SAntonio Huete Jimenez* OpenBSD
37*ed775ee7SAntonio Huete Jimenez* OpenWrt
38*ed775ee7SAntonio Huete Jimenez* Solaris
39*ed775ee7SAntonio Huete Jimenez* Windows (requires WinPcap or Npcap, and Visual Studio with CMake)
40*ed775ee7SAntonio Huete Jimenez
41*ed775ee7SAntonio Huete Jimenez### Dependency on libpcap
42411677aeSAaron LITcpdump uses libpcap, a system-independent interface for user-level
43411677aeSAaron LIpacket capture.  Before building tcpdump, you must first retrieve and
44*ed775ee7SAntonio Huete Jimenezbuild libpcap.
45411677aeSAaron LI
46411677aeSAaron LIOnce libpcap is built (either install it or make sure it's in
47*ed775ee7SAntonio Huete Jimenez`../libpcap`), you can build tcpdump using the procedure in the
48*ed775ee7SAntonio Huete Jimenez[installation guide](INSTALL.txt).
49411677aeSAaron LI
50*ed775ee7SAntonio Huete Jimenez### Origins of tcpdump
51411677aeSAaron LIThe program is loosely based on SMI's "etherfind" although none of the
52411677aeSAaron LIetherfind code remains.  It was originally written by Van Jacobson as
53*ed775ee7SAntonio Huete Jimenezpart of an ongoing research project to investigate and improve TCP and
54*ed775ee7SAntonio Huete JimenezInternet gateway performance.  The parts of the program originally
55411677aeSAaron LItaken from Sun's etherfind were later re-written by Steven McCanne of
56411677aeSAaron LILBL.  To insure that there would be no vestige of proprietary code in
57411677aeSAaron LItcpdump, Steve wrote these pieces from the specification given by the
58411677aeSAaron LImanual entry, with no access to the source of tcpdump or etherfind.
59*ed775ee7SAntonio Huete Jimenez```text
60*ed775ee7SAntonio Huete Jimenezformerly from 	Lawrence Berkeley National Laboratory
61*ed775ee7SAntonio Huete Jimenez		Network Research Group <tcpdump@ee.lbl.gov>
62*ed775ee7SAntonio Huete Jimenez		ftp://ftp.ee.lbl.gov/old/tcpdump.tar.Z (3.4)
63*ed775ee7SAntonio Huete Jimenez```
64411677aeSAaron LI
65*ed775ee7SAntonio Huete Jimenez### See also
66411677aeSAaron LIRichard Stevens gives an excellent treatment of the Internet protocols
67411677aeSAaron LIin his book *"TCP/IP Illustrated, Volume 1"*. If you want to learn more
68411677aeSAaron LIabout tcpdump and how to interpret its output, pick up this book.
69411677aeSAaron LI
70*ed775ee7SAntonio Huete JimenezAnother tool that tcpdump users might find useful is
71*ed775ee7SAntonio Huete Jimenez[tcpslice](https://github.com/the-tcpdump-group/tcpslice).
72411677aeSAaron LIIt is a program that can be used to extract portions of tcpdump binary
73*ed775ee7SAntonio Huete Jimeneztrace files.
74411677aeSAaron LI
75*ed775ee7SAntonio Huete Jimenez### The original LBL README by Steve McCanne, Craig Leres and Van Jacobson
76411677aeSAaron LI```
77411677aeSAaron LIThis directory also contains some short awk programs intended as
78411677aeSAaron LIexamples of ways to reduce tcpdump data when you're tracking
79411677aeSAaron LIparticular network problems:
80411677aeSAaron LI
81411677aeSAaron LIsend-ack.awk
82411677aeSAaron LI	Simplifies the tcpdump trace for an ftp (or other unidirectional
83411677aeSAaron LI	tcp transfer).  Since we assume that one host only sends and
84411677aeSAaron LI	the other only acks, all address information is left off and
85411677aeSAaron LI	we just note if the packet is a "send" or an "ack".
86411677aeSAaron LI
87411677aeSAaron LI	There is one output line per line of the original trace.
88411677aeSAaron LI	Field 1 is the packet time in decimal seconds, relative
89411677aeSAaron LI	to the start of the conversation.  Field 2 is delta-time
90411677aeSAaron LI	from last packet.  Field 3 is packet type/direction.
91411677aeSAaron LI	"Send" means data going from sender to receiver, "ack"
92411677aeSAaron LI	means an ack going from the receiver to the sender.  A
93411677aeSAaron LI	preceding "*" indicates that the data is a retransmission.
94411677aeSAaron LI	A preceding "-" indicates a hole in the sequence space
95411677aeSAaron LI	(i.e., missing packet(s)), a "#" means an odd-size (not max
96411677aeSAaron LI	seg size) packet.  Field 4 has the packet flags
97411677aeSAaron LI	(same format as raw trace).  Field 5 is the sequence
98411677aeSAaron LI	number (start seq. num for sender, next expected seq number
99411677aeSAaron LI	for acks).  The number in parens following an ack is
100411677aeSAaron LI	the delta-time from the first send of the packet to the
101411677aeSAaron LI	ack.  A number in parens following a send is the
102411677aeSAaron LI	delta-time from the first send of the packet to the
103411677aeSAaron LI	current send (on duplicate packets only).  Duplicate
104411677aeSAaron LI	sends or acks have a number in square brackets showing
105411677aeSAaron LI	the number of duplicates so far.
106411677aeSAaron LI
107411677aeSAaron LI	Here is a short sample from near the start of an ftp:
108411677aeSAaron LI		3.00    0.20   send . 512
109411677aeSAaron LI		3.20    0.20    ack . 1024  (0.20)
110411677aeSAaron LI		3.20    0.00   send P 1024
111411677aeSAaron LI		3.40    0.20    ack . 1536  (0.20)
112411677aeSAaron LI		3.80    0.40 * send . 0  (3.80) [2]
113411677aeSAaron LI		3.82    0.02 *  ack . 1536  (0.62) [2]
114411677aeSAaron LI	Three seconds into the conversation, bytes 512 through 1023
115411677aeSAaron LI	were sent.  200ms later they were acked.  Shortly thereafter
116411677aeSAaron LI	bytes 1024-1535 were sent and again acked after 200ms.
117411677aeSAaron LI	Then, for no apparent reason, 0-511 is retransmitted, 3.8
118411677aeSAaron LI	seconds after its initial send (the round trip time for this
119411677aeSAaron LI	ftp was 1sec, +-500ms).  Since the receiver is expecting
120411677aeSAaron LI	1536, 1536 is re-acked when 0 arrives.
121411677aeSAaron LI
122411677aeSAaron LIpacketdat.awk
123411677aeSAaron LI	Computes chunk summary data for an ftp (or similar
124411677aeSAaron LI	unidirectional tcp transfer). [A "chunk" refers to
125411677aeSAaron LI	a chunk of the sequence space -- essentially the packet
126411677aeSAaron LI	sequence number divided by the max segment size.]
127411677aeSAaron LI
128411677aeSAaron LI	A summary line is printed showing the number of chunks,
129411677aeSAaron LI	the number of packets it took to send that many chunks
130411677aeSAaron LI	(if there are no lost or duplicated packets, the number
131411677aeSAaron LI	of packets should equal the number of chunks) and the
132411677aeSAaron LI	number of acks.
133411677aeSAaron LI
134411677aeSAaron LI	Following the summary line is one line of information
135411677aeSAaron LI	per chunk.  The line contains eight fields:
136411677aeSAaron LI	   1 - the chunk number
137411677aeSAaron LI	   2 - the start sequence number for this chunk
138411677aeSAaron LI	   3 - time of first send
139411677aeSAaron LI	   4 - time of last send
140411677aeSAaron LI	   5 - time of first ack
141411677aeSAaron LI	   6 - time of last ack
142411677aeSAaron LI	   7 - number of times chunk was sent
143411677aeSAaron LI	   8 - number of times chunk was acked
144411677aeSAaron LI	(all times are in decimal seconds, relative to the start
145411677aeSAaron LI	of the conversation.)
146411677aeSAaron LI
147411677aeSAaron LI	As an example, here is the first part of the output for
148411677aeSAaron LI	an ftp trace:
149411677aeSAaron LI
150411677aeSAaron LI	# 134 chunks.  536 packets sent.  508 acks.
151411677aeSAaron LI	1       1       0.00    5.80    0.20    0.20    4       1
152411677aeSAaron LI	2       513     0.28    6.20    0.40    0.40    4       1
153411677aeSAaron LI	3       1025    1.16    6.32    1.20    1.20    4       1
154411677aeSAaron LI	4       1561    1.86    15.00   2.00    2.00    6       1
155411677aeSAaron LI	5       2049    2.16    15.44   2.20    2.20    5       1
156411677aeSAaron LI	6       2585    2.64    16.44   2.80    2.80    5       1
157411677aeSAaron LI	7       3073    3.00    16.66   3.20    3.20    4       1
158411677aeSAaron LI	8       3609    3.20    17.24   3.40    5.82    4       11
159411677aeSAaron LI	9       4097    6.02    6.58    6.20    6.80    2       5
160411677aeSAaron LI
161411677aeSAaron LI	This says that 134 chunks were transferred (about 70K
162411677aeSAaron LI	since the average packet size was 512 bytes).  It took
163411677aeSAaron LI	536 packets to transfer the data (i.e., on the average
164411677aeSAaron LI	each chunk was transmitted four times).  Looking at,
165411677aeSAaron LI	say, chunk 4, we see it represents the 512 bytes of
166411677aeSAaron LI	sequence space from 1561 to 2048.  It was first sent
167411677aeSAaron LI	1.86 seconds into the conversation.  It was last
168411677aeSAaron LI	sent 15 seconds into the conversation and was sent
169411677aeSAaron LI	a total of 6 times (i.e., it was retransmitted every
170411677aeSAaron LI	2 seconds on the average).  It was acked once, 140ms
171411677aeSAaron LI	after it first arrived.
172411677aeSAaron LI
173411677aeSAaron LIstime.awk
174411677aeSAaron LIatime.awk
175411677aeSAaron LI	Output one line per send or ack, respectively, in the form
176411677aeSAaron LI		<time> <seq. number>
177411677aeSAaron LI	where <time> is the time in seconds since the start of the
178411677aeSAaron LI	transfer and <seq. number> is the sequence number being sent
179411677aeSAaron LI	or acked.  I typically plot this data looking for suspicious
180411677aeSAaron LI	patterns.
181411677aeSAaron LI
182411677aeSAaron LI
183411677aeSAaron LIThe problem I was looking at was the bulk-data-transfer
184411677aeSAaron LIthroughput of medium delay network paths (1-6 sec.  round trip
185411677aeSAaron LItime) under typical DARPA Internet conditions.  The trace of the
186411677aeSAaron LIftp transfer of a large file was used as the raw data source.
187411677aeSAaron LIThe method was:
188411677aeSAaron LI
189411677aeSAaron LI  - On a local host (but not the Sun running tcpdump), connect to
190411677aeSAaron LI    the remote ftp.
191411677aeSAaron LI
192411677aeSAaron LI  - On the monitor Sun, start the trace going.  E.g.,
193411677aeSAaron LI      tcpdump host local-host and remote-host and port ftp-data >tracefile
194411677aeSAaron LI
195411677aeSAaron LI  - On local, do either a get or put of a large file (~500KB),
196411677aeSAaron LI    preferably to the null device (to minimize effects like
197411677aeSAaron LI    closing the receive window while waiting for a disk write).
198411677aeSAaron LI
199411677aeSAaron LI  - When transfer is finished, stop tcpdump.  Use awk to make up
200411677aeSAaron LI    two files of summary data (maxsize is the maximum packet size,
201411677aeSAaron LI    tracedata is the file of tcpdump tracedata):
202411677aeSAaron LI      awk -f send-ack.awk packetsize=avgsize tracedata >sa
203411677aeSAaron LI      awk -f packetdat.awk packetsize=avgsize tracedata >pd
204411677aeSAaron LI
205411677aeSAaron LI  - While the summary data files are printing, take a look at
206411677aeSAaron LI    how the transfer behaved:
207411677aeSAaron LI      awk -f stime.awk tracedata | xgraph
208411677aeSAaron LI    (90% of what you learn seems to happen in this step).
209411677aeSAaron LI
210411677aeSAaron LI  - Do all of the above steps several times, both directions,
211411677aeSAaron LI    at different times of day, with different protocol
212411677aeSAaron LI    implementations on the other end.
213411677aeSAaron LI
214411677aeSAaron LI  - Using one of the Unix data analysis packages (in my case,
215411677aeSAaron LI    S and Gary Perlman's Unix|Stat), spend a few months staring
216411677aeSAaron LI    at the data.
217411677aeSAaron LI
218411677aeSAaron LI  - Change something in the local protocol implementation and
219411677aeSAaron LI    redo the steps above.
220411677aeSAaron LI
221411677aeSAaron LI  - Once a week, tell your funding agent that you're discovering
222411677aeSAaron LI    wonderful things and you'll write up that research report
223411677aeSAaron LI    "real soon now".
224411677aeSAaron LI```
225