xref: /minix3/external/bsd/llvm/dist/llvm/docs/Bugpoint.rst (revision f4a2713ac843a11c696ec80c0a5e3e5d80b4d338)
1*f4a2713aSLionel Sambuc====================================
2*f4a2713aSLionel SambucLLVM bugpoint tool: design and usage
3*f4a2713aSLionel Sambuc====================================
4*f4a2713aSLionel Sambuc
5*f4a2713aSLionel Sambuc.. contents::
6*f4a2713aSLionel Sambuc   :local:
7*f4a2713aSLionel Sambuc
8*f4a2713aSLionel SambucDescription
9*f4a2713aSLionel Sambuc===========
10*f4a2713aSLionel Sambuc
11*f4a2713aSLionel Sambuc``bugpoint`` narrows down the source of problems in LLVM tools and passes.  It
12*f4a2713aSLionel Sambuccan be used to debug three types of failures: optimizer crashes, miscompilations
13*f4a2713aSLionel Sambucby optimizers, or bad native code generation (including problems in the static
14*f4a2713aSLionel Sambucand JIT compilers).  It aims to reduce large test cases to small, useful ones.
15*f4a2713aSLionel SambucFor example, if ``opt`` crashes while optimizing a file, it will identify the
16*f4a2713aSLionel Sambucoptimization (or combination of optimizations) that causes the crash, and reduce
17*f4a2713aSLionel Sambucthe file down to a small example which triggers the crash.
18*f4a2713aSLionel Sambuc
19*f4a2713aSLionel SambucFor detailed case scenarios, such as debugging ``opt``, or one of the LLVM code
20*f4a2713aSLionel Sambucgenerators, see `How To Submit a Bug Report document <HowToSubmitABug.html>`_.
21*f4a2713aSLionel Sambuc
22*f4a2713aSLionel SambucDesign Philosophy
23*f4a2713aSLionel Sambuc=================
24*f4a2713aSLionel Sambuc
25*f4a2713aSLionel Sambuc``bugpoint`` is designed to be a useful tool without requiring any hooks into
26*f4a2713aSLionel Sambucthe LLVM infrastructure at all.  It works with any and all LLVM passes and code
27*f4a2713aSLionel Sambucgenerators, and does not need to "know" how they work.  Because of this, it may
28*f4a2713aSLionel Sambucappear to do stupid things or miss obvious simplifications.  ``bugpoint`` is
29*f4a2713aSLionel Sambucalso designed to trade off programmer time for computer time in the
30*f4a2713aSLionel Sambuccompiler-debugging process; consequently, it may take a long period of
31*f4a2713aSLionel Sambuc(unattended) time to reduce a test case, but we feel it is still worth it. Note
32*f4a2713aSLionel Sambucthat ``bugpoint`` is generally very quick unless debugging a miscompilation
33*f4a2713aSLionel Sambucwhere each test of the program (which requires executing it) takes a long time.
34*f4a2713aSLionel Sambuc
35*f4a2713aSLionel SambucAutomatic Debugger Selection
36*f4a2713aSLionel Sambuc----------------------------
37*f4a2713aSLionel Sambuc
38*f4a2713aSLionel Sambuc``bugpoint`` reads each ``.bc`` or ``.ll`` file specified on the command line
39*f4a2713aSLionel Sambucand links them together into a single module, called the test program.  If any
40*f4a2713aSLionel SambucLLVM passes are specified on the command line, it runs these passes on the test
41*f4a2713aSLionel Sambucprogram.  If any of the passes crash, or if they produce malformed output (which
42*f4a2713aSLionel Sambuccauses the verifier to abort), ``bugpoint`` starts the `crash debugger`_.
43*f4a2713aSLionel Sambuc
44*f4a2713aSLionel SambucOtherwise, if the ``-output`` option was not specified, ``bugpoint`` runs the
45*f4a2713aSLionel Sambuctest program with the "safe" backend (which is assumed to generate good code) to
46*f4a2713aSLionel Sambucgenerate a reference output.  Once ``bugpoint`` has a reference output for the
47*f4a2713aSLionel Sambuctest program, it tries executing it with the selected code generator.  If the
48*f4a2713aSLionel Sambucselected code generator crashes, ``bugpoint`` starts the `crash debugger`_ on
49*f4a2713aSLionel Sambucthe code generator.  Otherwise, if the resulting output differs from the
50*f4a2713aSLionel Sambucreference output, it assumes the difference resulted from a code generator
51*f4a2713aSLionel Sambucfailure, and starts the `code generator debugger`_.
52*f4a2713aSLionel Sambuc
53*f4a2713aSLionel SambucFinally, if the output of the selected code generator matches the reference
54*f4a2713aSLionel Sambucoutput, ``bugpoint`` runs the test program after all of the LLVM passes have
55*f4a2713aSLionel Sambucbeen applied to it.  If its output differs from the reference output, it assumes
56*f4a2713aSLionel Sambucthe difference resulted from a failure in one of the LLVM passes, and enters the
57*f4a2713aSLionel Sambuc`miscompilation debugger`_.  Otherwise, there is no problem ``bugpoint`` can
58*f4a2713aSLionel Sambucdebug.
59*f4a2713aSLionel Sambuc
60*f4a2713aSLionel Sambuc.. _crash debugger:
61*f4a2713aSLionel Sambuc
62*f4a2713aSLionel SambucCrash debugger
63*f4a2713aSLionel Sambuc--------------
64*f4a2713aSLionel Sambuc
65*f4a2713aSLionel SambucIf an optimizer or code generator crashes, ``bugpoint`` will try as hard as it
66*f4a2713aSLionel Sambuccan to reduce the list of passes (for optimizer crashes) and the size of the
67*f4a2713aSLionel Sambuctest program.  First, ``bugpoint`` figures out which combination of optimizer
68*f4a2713aSLionel Sambucpasses triggers the bug. This is useful when debugging a problem exposed by
69*f4a2713aSLionel Sambuc``opt``, for example, because it runs over 38 passes.
70*f4a2713aSLionel Sambuc
71*f4a2713aSLionel SambucNext, ``bugpoint`` tries removing functions from the test program, to reduce its
72*f4a2713aSLionel Sambucsize.  Usually it is able to reduce a test program to a single function, when
73*f4a2713aSLionel Sambucdebugging intraprocedural optimizations.  Once the number of functions has been
74*f4a2713aSLionel Sambucreduced, it attempts to delete various edges in the control flow graph, to
75*f4a2713aSLionel Sambucreduce the size of the function as much as possible.  Finally, ``bugpoint``
76*f4a2713aSLionel Sambucdeletes any individual LLVM instructions whose absence does not eliminate the
77*f4a2713aSLionel Sambucfailure.  At the end, ``bugpoint`` should tell you what passes crash, give you a
78*f4a2713aSLionel Sambucbitcode file, and give you instructions on how to reproduce the failure with
79*f4a2713aSLionel Sambuc``opt`` or ``llc``.
80*f4a2713aSLionel Sambuc
81*f4a2713aSLionel Sambuc.. _code generator debugger:
82*f4a2713aSLionel Sambuc
83*f4a2713aSLionel SambucCode generator debugger
84*f4a2713aSLionel Sambuc-----------------------
85*f4a2713aSLionel Sambuc
86*f4a2713aSLionel SambucThe code generator debugger attempts to narrow down the amount of code that is
87*f4a2713aSLionel Sambucbeing miscompiled by the selected code generator.  To do this, it takes the test
88*f4a2713aSLionel Sambucprogram and partitions it into two pieces: one piece which it compiles with the
89*f4a2713aSLionel Sambuc"safe" backend (into a shared object), and one piece which it runs with either
90*f4a2713aSLionel Sambucthe JIT or the static LLC compiler.  It uses several techniques to reduce the
91*f4a2713aSLionel Sambucamount of code pushed through the LLVM code generator, to reduce the potential
92*f4a2713aSLionel Sambucscope of the problem.  After it is finished, it emits two bitcode files (called
93*f4a2713aSLionel Sambuc"test" [to be compiled with the code generator] and "safe" [to be compiled with
94*f4a2713aSLionel Sambucthe "safe" backend], respectively), and instructions for reproducing the
95*f4a2713aSLionel Sambucproblem.  The code generator debugger assumes that the "safe" backend produces
96*f4a2713aSLionel Sambucgood code.
97*f4a2713aSLionel Sambuc
98*f4a2713aSLionel Sambuc.. _miscompilation debugger:
99*f4a2713aSLionel Sambuc
100*f4a2713aSLionel SambucMiscompilation debugger
101*f4a2713aSLionel Sambuc-----------------------
102*f4a2713aSLionel Sambuc
103*f4a2713aSLionel SambucThe miscompilation debugger works similarly to the code generator debugger.  It
104*f4a2713aSLionel Sambucworks by splitting the test program into two pieces, running the optimizations
105*f4a2713aSLionel Sambucspecified on one piece, linking the two pieces back together, and then executing
106*f4a2713aSLionel Sambucthe result.  It attempts to narrow down the list of passes to the one (or few)
107*f4a2713aSLionel Sambucwhich are causing the miscompilation, then reduce the portion of the test
108*f4a2713aSLionel Sambucprogram which is being miscompiled.  The miscompilation debugger assumes that
109*f4a2713aSLionel Sambucthe selected code generator is working properly.
110*f4a2713aSLionel Sambuc
111*f4a2713aSLionel SambucAdvice for using bugpoint
112*f4a2713aSLionel Sambuc=========================
113*f4a2713aSLionel Sambuc
114*f4a2713aSLionel Sambuc``bugpoint`` can be a remarkably useful tool, but it sometimes works in
115*f4a2713aSLionel Sambucnon-obvious ways.  Here are some hints and tips:
116*f4a2713aSLionel Sambuc
117*f4a2713aSLionel Sambuc* In the code generator and miscompilation debuggers, ``bugpoint`` only works
118*f4a2713aSLionel Sambuc  with programs that have deterministic output.  Thus, if the program outputs
119*f4a2713aSLionel Sambuc  ``argv[0]``, the date, time, or any other "random" data, ``bugpoint`` may
120*f4a2713aSLionel Sambuc  misinterpret differences in these data, when output, as the result of a
121*f4a2713aSLionel Sambuc  miscompilation.  Programs should be temporarily modified to disable outputs
122*f4a2713aSLionel Sambuc  that are likely to vary from run to run.
123*f4a2713aSLionel Sambuc
124*f4a2713aSLionel Sambuc* In the code generator and miscompilation debuggers, debugging will go faster
125*f4a2713aSLionel Sambuc  if you manually modify the program or its inputs to reduce the runtime, but
126*f4a2713aSLionel Sambuc  still exhibit the problem.
127*f4a2713aSLionel Sambuc
128*f4a2713aSLionel Sambuc* ``bugpoint`` is extremely useful when working on a new optimization: it helps
129*f4a2713aSLionel Sambuc  track down regressions quickly.  To avoid having to relink ``bugpoint`` every
130*f4a2713aSLionel Sambuc  time you change your optimization however, have ``bugpoint`` dynamically load
131*f4a2713aSLionel Sambuc  your optimization with the ``-load`` option.
132*f4a2713aSLionel Sambuc
133*f4a2713aSLionel Sambuc* ``bugpoint`` can generate a lot of output and run for a long period of time.
134*f4a2713aSLionel Sambuc  It is often useful to capture the output of the program to file.  For example,
135*f4a2713aSLionel Sambuc  in the C shell, you can run:
136*f4a2713aSLionel Sambuc
137*f4a2713aSLionel Sambuc  .. code-block:: console
138*f4a2713aSLionel Sambuc
139*f4a2713aSLionel Sambuc    $ bugpoint  ... |& tee bugpoint.log
140*f4a2713aSLionel Sambuc
141*f4a2713aSLionel Sambuc  to get a copy of ``bugpoint``'s output in the file ``bugpoint.log``, as well
142*f4a2713aSLionel Sambuc  as on your terminal.
143*f4a2713aSLionel Sambuc
144*f4a2713aSLionel Sambuc* ``bugpoint`` cannot debug problems with the LLVM linker. If ``bugpoint``
145*f4a2713aSLionel Sambuc  crashes before you see its "All input ok" message, you might try ``llvm-link
146*f4a2713aSLionel Sambuc  -v`` on the same set of input files. If that also crashes, you may be
147*f4a2713aSLionel Sambuc  experiencing a linker bug.
148*f4a2713aSLionel Sambuc
149*f4a2713aSLionel Sambuc* ``bugpoint`` is useful for proactively finding bugs in LLVM.  Invoking
150*f4a2713aSLionel Sambuc  ``bugpoint`` with the ``-find-bugs`` option will cause the list of specified
151*f4a2713aSLionel Sambuc  optimizations to be randomized and applied to the program. This process will
152*f4a2713aSLionel Sambuc  repeat until a bug is found or the user kills ``bugpoint``.
153*f4a2713aSLionel Sambuc
154*f4a2713aSLionel SambucWhat to do when bugpoint isn't enough
155*f4a2713aSLionel Sambuc=====================================
156*f4a2713aSLionel Sambuc
157*f4a2713aSLionel SambucSometimes, ``bugpoint`` is not enough. In particular, InstCombine and
158*f4a2713aSLionel SambucTargetLowering both have visitor structured code with lots of potential
159*f4a2713aSLionel Sambuctransformations.  If the process of using bugpoint has left you with still too
160*f4a2713aSLionel Sambucmuch code to figure out and the problem seems to be in instcombine, the
161*f4a2713aSLionel Sambucfollowing steps may help.  These same techniques are useful with TargetLowering
162*f4a2713aSLionel Sambucas well.
163*f4a2713aSLionel Sambuc
164*f4a2713aSLionel SambucTurn on ``-debug-only=instcombine`` and see which transformations within
165*f4a2713aSLionel Sambucinstcombine are firing by selecting out lines with "``IC``" in them.
166*f4a2713aSLionel Sambuc
167*f4a2713aSLionel SambucAt this point, you have a decision to make.  Is the number of transformations
168*f4a2713aSLionel Sambucsmall enough to step through them using a debugger?  If so, then try that.
169*f4a2713aSLionel Sambuc
170*f4a2713aSLionel SambucIf there are too many transformations, then a source modification approach may
171*f4a2713aSLionel Sambucbe helpful.  In this approach, you can modify the source code of instcombine to
172*f4a2713aSLionel Sambucdisable just those transformations that are being performed on your test input
173*f4a2713aSLionel Sambucand perform a binary search over the set of transformations.  One set of places
174*f4a2713aSLionel Sambucto modify are the "``visit*``" methods of ``InstCombiner`` (*e.g.*
175*f4a2713aSLionel Sambuc``visitICmpInst``) by adding a "``return false``" as the first line of the
176*f4a2713aSLionel Sambucmethod.
177*f4a2713aSLionel Sambuc
178*f4a2713aSLionel SambucIf that still doesn't remove enough, then change the caller of
179*f4a2713aSLionel Sambuc``InstCombiner::DoOneIteration``, ``InstCombiner::runOnFunction`` to limit the
180*f4a2713aSLionel Sambucnumber of iterations.
181*f4a2713aSLionel Sambuc
182*f4a2713aSLionel SambucYou may also find it useful to use "``-stats``" now to see what parts of
183*f4a2713aSLionel Sambucinstcombine are firing.  This can guide where to put additional reporting code.
184*f4a2713aSLionel Sambuc
185*f4a2713aSLionel SambucAt this point, if the amount of transformations is still too large, then
186*f4a2713aSLionel Sambucinserting code to limit whether or not to execute the body of the code in the
187*f4a2713aSLionel Sambucvisit function can be helpful.  Add a static counter which is incremented on
188*f4a2713aSLionel Sambucevery invocation of the function.  Then add code which simply returns false on
189*f4a2713aSLionel Sambucdesired ranges.  For example:
190*f4a2713aSLionel Sambuc
191*f4a2713aSLionel Sambuc.. code-block:: c++
192*f4a2713aSLionel Sambuc
193*f4a2713aSLionel Sambuc
194*f4a2713aSLionel Sambuc  static int calledCount = 0;
195*f4a2713aSLionel Sambuc  calledCount++;
196*f4a2713aSLionel Sambuc  DEBUG(if (calledCount < 212) return false);
197*f4a2713aSLionel Sambuc  DEBUG(if (calledCount > 217) return false);
198*f4a2713aSLionel Sambuc  DEBUG(if (calledCount == 213) return false);
199*f4a2713aSLionel Sambuc  DEBUG(if (calledCount == 214) return false);
200*f4a2713aSLionel Sambuc  DEBUG(if (calledCount == 215) return false);
201*f4a2713aSLionel Sambuc  DEBUG(if (calledCount == 216) return false);
202*f4a2713aSLionel Sambuc  DEBUG(dbgs() << "visitXOR calledCount: " << calledCount << "\n");
203*f4a2713aSLionel Sambuc  DEBUG(dbgs() << "I: "; I->dump());
204*f4a2713aSLionel Sambuc
205*f4a2713aSLionel Sambuccould be added to ``visitXOR`` to limit ``visitXor`` to being applied only to
206*f4a2713aSLionel Sambuccalls 212 and 217. This is from an actual test case and raises an important
207*f4a2713aSLionel Sambucpoint---a simple binary search may not be sufficient, as transformations that
208*f4a2713aSLionel Sambucinteract may require isolating more than one call.  In TargetLowering, use
209*f4a2713aSLionel Sambuc``return SDNode();`` instead of ``return false;``.
210*f4a2713aSLionel Sambuc
211*f4a2713aSLionel SambucNow that that the number of transformations is down to a manageable number, try
212*f4a2713aSLionel Sambucexamining the output to see if you can figure out which transformations are
213*f4a2713aSLionel Sambucbeing done.  If that can be figured out, then do the usual debugging.  If which
214*f4a2713aSLionel Sambuccode corresponds to the transformation being performed isn't obvious, set a
215*f4a2713aSLionel Sambucbreakpoint after the call count based disabling and step through the code.
216*f4a2713aSLionel SambucAlternatively, you can use "``printf``" style debugging to report waypoints.
217