1*f4a2713aSLionel Sambuc==================================== 2*f4a2713aSLionel SambucLLVM bugpoint tool: design and usage 3*f4a2713aSLionel Sambuc==================================== 4*f4a2713aSLionel Sambuc 5*f4a2713aSLionel Sambuc.. contents:: 6*f4a2713aSLionel Sambuc :local: 7*f4a2713aSLionel Sambuc 8*f4a2713aSLionel SambucDescription 9*f4a2713aSLionel Sambuc=========== 10*f4a2713aSLionel Sambuc 11*f4a2713aSLionel Sambuc``bugpoint`` narrows down the source of problems in LLVM tools and passes. It 12*f4a2713aSLionel Sambuccan be used to debug three types of failures: optimizer crashes, miscompilations 13*f4a2713aSLionel Sambucby optimizers, or bad native code generation (including problems in the static 14*f4a2713aSLionel Sambucand JIT compilers). It aims to reduce large test cases to small, useful ones. 15*f4a2713aSLionel SambucFor example, if ``opt`` crashes while optimizing a file, it will identify the 16*f4a2713aSLionel Sambucoptimization (or combination of optimizations) that causes the crash, and reduce 17*f4a2713aSLionel Sambucthe file down to a small example which triggers the crash. 18*f4a2713aSLionel Sambuc 19*f4a2713aSLionel SambucFor detailed case scenarios, such as debugging ``opt``, or one of the LLVM code 20*f4a2713aSLionel Sambucgenerators, see `How To Submit a Bug Report document <HowToSubmitABug.html>`_. 21*f4a2713aSLionel Sambuc 22*f4a2713aSLionel SambucDesign Philosophy 23*f4a2713aSLionel Sambuc================= 24*f4a2713aSLionel Sambuc 25*f4a2713aSLionel Sambuc``bugpoint`` is designed to be a useful tool without requiring any hooks into 26*f4a2713aSLionel Sambucthe LLVM infrastructure at all. It works with any and all LLVM passes and code 27*f4a2713aSLionel Sambucgenerators, and does not need to "know" how they work. Because of this, it may 28*f4a2713aSLionel Sambucappear to do stupid things or miss obvious simplifications. ``bugpoint`` is 29*f4a2713aSLionel Sambucalso designed to trade off programmer time for computer time in the 30*f4a2713aSLionel Sambuccompiler-debugging process; consequently, it may take a long period of 31*f4a2713aSLionel Sambuc(unattended) time to reduce a test case, but we feel it is still worth it. Note 32*f4a2713aSLionel Sambucthat ``bugpoint`` is generally very quick unless debugging a miscompilation 33*f4a2713aSLionel Sambucwhere each test of the program (which requires executing it) takes a long time. 34*f4a2713aSLionel Sambuc 35*f4a2713aSLionel SambucAutomatic Debugger Selection 36*f4a2713aSLionel Sambuc---------------------------- 37*f4a2713aSLionel Sambuc 38*f4a2713aSLionel Sambuc``bugpoint`` reads each ``.bc`` or ``.ll`` file specified on the command line 39*f4a2713aSLionel Sambucand links them together into a single module, called the test program. If any 40*f4a2713aSLionel SambucLLVM passes are specified on the command line, it runs these passes on the test 41*f4a2713aSLionel Sambucprogram. If any of the passes crash, or if they produce malformed output (which 42*f4a2713aSLionel Sambuccauses the verifier to abort), ``bugpoint`` starts the `crash debugger`_. 43*f4a2713aSLionel Sambuc 44*f4a2713aSLionel SambucOtherwise, if the ``-output`` option was not specified, ``bugpoint`` runs the 45*f4a2713aSLionel Sambuctest program with the "safe" backend (which is assumed to generate good code) to 46*f4a2713aSLionel Sambucgenerate a reference output. Once ``bugpoint`` has a reference output for the 47*f4a2713aSLionel Sambuctest program, it tries executing it with the selected code generator. If the 48*f4a2713aSLionel Sambucselected code generator crashes, ``bugpoint`` starts the `crash debugger`_ on 49*f4a2713aSLionel Sambucthe code generator. Otherwise, if the resulting output differs from the 50*f4a2713aSLionel Sambucreference output, it assumes the difference resulted from a code generator 51*f4a2713aSLionel Sambucfailure, and starts the `code generator debugger`_. 52*f4a2713aSLionel Sambuc 53*f4a2713aSLionel SambucFinally, if the output of the selected code generator matches the reference 54*f4a2713aSLionel Sambucoutput, ``bugpoint`` runs the test program after all of the LLVM passes have 55*f4a2713aSLionel Sambucbeen applied to it. If its output differs from the reference output, it assumes 56*f4a2713aSLionel Sambucthe difference resulted from a failure in one of the LLVM passes, and enters the 57*f4a2713aSLionel Sambuc`miscompilation debugger`_. Otherwise, there is no problem ``bugpoint`` can 58*f4a2713aSLionel Sambucdebug. 59*f4a2713aSLionel Sambuc 60*f4a2713aSLionel Sambuc.. _crash debugger: 61*f4a2713aSLionel Sambuc 62*f4a2713aSLionel SambucCrash debugger 63*f4a2713aSLionel Sambuc-------------- 64*f4a2713aSLionel Sambuc 65*f4a2713aSLionel SambucIf an optimizer or code generator crashes, ``bugpoint`` will try as hard as it 66*f4a2713aSLionel Sambuccan to reduce the list of passes (for optimizer crashes) and the size of the 67*f4a2713aSLionel Sambuctest program. First, ``bugpoint`` figures out which combination of optimizer 68*f4a2713aSLionel Sambucpasses triggers the bug. This is useful when debugging a problem exposed by 69*f4a2713aSLionel Sambuc``opt``, for example, because it runs over 38 passes. 70*f4a2713aSLionel Sambuc 71*f4a2713aSLionel SambucNext, ``bugpoint`` tries removing functions from the test program, to reduce its 72*f4a2713aSLionel Sambucsize. Usually it is able to reduce a test program to a single function, when 73*f4a2713aSLionel Sambucdebugging intraprocedural optimizations. Once the number of functions has been 74*f4a2713aSLionel Sambucreduced, it attempts to delete various edges in the control flow graph, to 75*f4a2713aSLionel Sambucreduce the size of the function as much as possible. Finally, ``bugpoint`` 76*f4a2713aSLionel Sambucdeletes any individual LLVM instructions whose absence does not eliminate the 77*f4a2713aSLionel Sambucfailure. At the end, ``bugpoint`` should tell you what passes crash, give you a 78*f4a2713aSLionel Sambucbitcode file, and give you instructions on how to reproduce the failure with 79*f4a2713aSLionel Sambuc``opt`` or ``llc``. 80*f4a2713aSLionel Sambuc 81*f4a2713aSLionel Sambuc.. _code generator debugger: 82*f4a2713aSLionel Sambuc 83*f4a2713aSLionel SambucCode generator debugger 84*f4a2713aSLionel Sambuc----------------------- 85*f4a2713aSLionel Sambuc 86*f4a2713aSLionel SambucThe code generator debugger attempts to narrow down the amount of code that is 87*f4a2713aSLionel Sambucbeing miscompiled by the selected code generator. To do this, it takes the test 88*f4a2713aSLionel Sambucprogram and partitions it into two pieces: one piece which it compiles with the 89*f4a2713aSLionel Sambuc"safe" backend (into a shared object), and one piece which it runs with either 90*f4a2713aSLionel Sambucthe JIT or the static LLC compiler. It uses several techniques to reduce the 91*f4a2713aSLionel Sambucamount of code pushed through the LLVM code generator, to reduce the potential 92*f4a2713aSLionel Sambucscope of the problem. After it is finished, it emits two bitcode files (called 93*f4a2713aSLionel Sambuc"test" [to be compiled with the code generator] and "safe" [to be compiled with 94*f4a2713aSLionel Sambucthe "safe" backend], respectively), and instructions for reproducing the 95*f4a2713aSLionel Sambucproblem. The code generator debugger assumes that the "safe" backend produces 96*f4a2713aSLionel Sambucgood code. 97*f4a2713aSLionel Sambuc 98*f4a2713aSLionel Sambuc.. _miscompilation debugger: 99*f4a2713aSLionel Sambuc 100*f4a2713aSLionel SambucMiscompilation debugger 101*f4a2713aSLionel Sambuc----------------------- 102*f4a2713aSLionel Sambuc 103*f4a2713aSLionel SambucThe miscompilation debugger works similarly to the code generator debugger. It 104*f4a2713aSLionel Sambucworks by splitting the test program into two pieces, running the optimizations 105*f4a2713aSLionel Sambucspecified on one piece, linking the two pieces back together, and then executing 106*f4a2713aSLionel Sambucthe result. It attempts to narrow down the list of passes to the one (or few) 107*f4a2713aSLionel Sambucwhich are causing the miscompilation, then reduce the portion of the test 108*f4a2713aSLionel Sambucprogram which is being miscompiled. The miscompilation debugger assumes that 109*f4a2713aSLionel Sambucthe selected code generator is working properly. 110*f4a2713aSLionel Sambuc 111*f4a2713aSLionel SambucAdvice for using bugpoint 112*f4a2713aSLionel Sambuc========================= 113*f4a2713aSLionel Sambuc 114*f4a2713aSLionel Sambuc``bugpoint`` can be a remarkably useful tool, but it sometimes works in 115*f4a2713aSLionel Sambucnon-obvious ways. Here are some hints and tips: 116*f4a2713aSLionel Sambuc 117*f4a2713aSLionel Sambuc* In the code generator and miscompilation debuggers, ``bugpoint`` only works 118*f4a2713aSLionel Sambuc with programs that have deterministic output. Thus, if the program outputs 119*f4a2713aSLionel Sambuc ``argv[0]``, the date, time, or any other "random" data, ``bugpoint`` may 120*f4a2713aSLionel Sambuc misinterpret differences in these data, when output, as the result of a 121*f4a2713aSLionel Sambuc miscompilation. Programs should be temporarily modified to disable outputs 122*f4a2713aSLionel Sambuc that are likely to vary from run to run. 123*f4a2713aSLionel Sambuc 124*f4a2713aSLionel Sambuc* In the code generator and miscompilation debuggers, debugging will go faster 125*f4a2713aSLionel Sambuc if you manually modify the program or its inputs to reduce the runtime, but 126*f4a2713aSLionel Sambuc still exhibit the problem. 127*f4a2713aSLionel Sambuc 128*f4a2713aSLionel Sambuc* ``bugpoint`` is extremely useful when working on a new optimization: it helps 129*f4a2713aSLionel Sambuc track down regressions quickly. To avoid having to relink ``bugpoint`` every 130*f4a2713aSLionel Sambuc time you change your optimization however, have ``bugpoint`` dynamically load 131*f4a2713aSLionel Sambuc your optimization with the ``-load`` option. 132*f4a2713aSLionel Sambuc 133*f4a2713aSLionel Sambuc* ``bugpoint`` can generate a lot of output and run for a long period of time. 134*f4a2713aSLionel Sambuc It is often useful to capture the output of the program to file. For example, 135*f4a2713aSLionel Sambuc in the C shell, you can run: 136*f4a2713aSLionel Sambuc 137*f4a2713aSLionel Sambuc .. code-block:: console 138*f4a2713aSLionel Sambuc 139*f4a2713aSLionel Sambuc $ bugpoint ... |& tee bugpoint.log 140*f4a2713aSLionel Sambuc 141*f4a2713aSLionel Sambuc to get a copy of ``bugpoint``'s output in the file ``bugpoint.log``, as well 142*f4a2713aSLionel Sambuc as on your terminal. 143*f4a2713aSLionel Sambuc 144*f4a2713aSLionel Sambuc* ``bugpoint`` cannot debug problems with the LLVM linker. If ``bugpoint`` 145*f4a2713aSLionel Sambuc crashes before you see its "All input ok" message, you might try ``llvm-link 146*f4a2713aSLionel Sambuc -v`` on the same set of input files. If that also crashes, you may be 147*f4a2713aSLionel Sambuc experiencing a linker bug. 148*f4a2713aSLionel Sambuc 149*f4a2713aSLionel Sambuc* ``bugpoint`` is useful for proactively finding bugs in LLVM. Invoking 150*f4a2713aSLionel Sambuc ``bugpoint`` with the ``-find-bugs`` option will cause the list of specified 151*f4a2713aSLionel Sambuc optimizations to be randomized and applied to the program. This process will 152*f4a2713aSLionel Sambuc repeat until a bug is found or the user kills ``bugpoint``. 153*f4a2713aSLionel Sambuc 154*f4a2713aSLionel SambucWhat to do when bugpoint isn't enough 155*f4a2713aSLionel Sambuc===================================== 156*f4a2713aSLionel Sambuc 157*f4a2713aSLionel SambucSometimes, ``bugpoint`` is not enough. In particular, InstCombine and 158*f4a2713aSLionel SambucTargetLowering both have visitor structured code with lots of potential 159*f4a2713aSLionel Sambuctransformations. If the process of using bugpoint has left you with still too 160*f4a2713aSLionel Sambucmuch code to figure out and the problem seems to be in instcombine, the 161*f4a2713aSLionel Sambucfollowing steps may help. These same techniques are useful with TargetLowering 162*f4a2713aSLionel Sambucas well. 163*f4a2713aSLionel Sambuc 164*f4a2713aSLionel SambucTurn on ``-debug-only=instcombine`` and see which transformations within 165*f4a2713aSLionel Sambucinstcombine are firing by selecting out lines with "``IC``" in them. 166*f4a2713aSLionel Sambuc 167*f4a2713aSLionel SambucAt this point, you have a decision to make. Is the number of transformations 168*f4a2713aSLionel Sambucsmall enough to step through them using a debugger? If so, then try that. 169*f4a2713aSLionel Sambuc 170*f4a2713aSLionel SambucIf there are too many transformations, then a source modification approach may 171*f4a2713aSLionel Sambucbe helpful. In this approach, you can modify the source code of instcombine to 172*f4a2713aSLionel Sambucdisable just those transformations that are being performed on your test input 173*f4a2713aSLionel Sambucand perform a binary search over the set of transformations. One set of places 174*f4a2713aSLionel Sambucto modify are the "``visit*``" methods of ``InstCombiner`` (*e.g.* 175*f4a2713aSLionel Sambuc``visitICmpInst``) by adding a "``return false``" as the first line of the 176*f4a2713aSLionel Sambucmethod. 177*f4a2713aSLionel Sambuc 178*f4a2713aSLionel SambucIf that still doesn't remove enough, then change the caller of 179*f4a2713aSLionel Sambuc``InstCombiner::DoOneIteration``, ``InstCombiner::runOnFunction`` to limit the 180*f4a2713aSLionel Sambucnumber of iterations. 181*f4a2713aSLionel Sambuc 182*f4a2713aSLionel SambucYou may also find it useful to use "``-stats``" now to see what parts of 183*f4a2713aSLionel Sambucinstcombine are firing. This can guide where to put additional reporting code. 184*f4a2713aSLionel Sambuc 185*f4a2713aSLionel SambucAt this point, if the amount of transformations is still too large, then 186*f4a2713aSLionel Sambucinserting code to limit whether or not to execute the body of the code in the 187*f4a2713aSLionel Sambucvisit function can be helpful. Add a static counter which is incremented on 188*f4a2713aSLionel Sambucevery invocation of the function. Then add code which simply returns false on 189*f4a2713aSLionel Sambucdesired ranges. For example: 190*f4a2713aSLionel Sambuc 191*f4a2713aSLionel Sambuc.. code-block:: c++ 192*f4a2713aSLionel Sambuc 193*f4a2713aSLionel Sambuc 194*f4a2713aSLionel Sambuc static int calledCount = 0; 195*f4a2713aSLionel Sambuc calledCount++; 196*f4a2713aSLionel Sambuc DEBUG(if (calledCount < 212) return false); 197*f4a2713aSLionel Sambuc DEBUG(if (calledCount > 217) return false); 198*f4a2713aSLionel Sambuc DEBUG(if (calledCount == 213) return false); 199*f4a2713aSLionel Sambuc DEBUG(if (calledCount == 214) return false); 200*f4a2713aSLionel Sambuc DEBUG(if (calledCount == 215) return false); 201*f4a2713aSLionel Sambuc DEBUG(if (calledCount == 216) return false); 202*f4a2713aSLionel Sambuc DEBUG(dbgs() << "visitXOR calledCount: " << calledCount << "\n"); 203*f4a2713aSLionel Sambuc DEBUG(dbgs() << "I: "; I->dump()); 204*f4a2713aSLionel Sambuc 205*f4a2713aSLionel Sambuccould be added to ``visitXOR`` to limit ``visitXor`` to being applied only to 206*f4a2713aSLionel Sambuccalls 212 and 217. This is from an actual test case and raises an important 207*f4a2713aSLionel Sambucpoint---a simple binary search may not be sufficient, as transformations that 208*f4a2713aSLionel Sambucinteract may require isolating more than one call. In TargetLowering, use 209*f4a2713aSLionel Sambuc``return SDNode();`` instead of ``return false;``. 210*f4a2713aSLionel Sambuc 211*f4a2713aSLionel SambucNow that that the number of transformations is down to a manageable number, try 212*f4a2713aSLionel Sambucexamining the output to see if you can figure out which transformations are 213*f4a2713aSLionel Sambucbeing done. If that can be figured out, then do the usual debugging. If which 214*f4a2713aSLionel Sambuccode corresponds to the transformation being performed isn't obvious, set a 215*f4a2713aSLionel Sambucbreakpoint after the call count based disabling and step through the code. 216*f4a2713aSLionel SambucAlternatively, you can use "``printf``" style debugging to report waypoints. 217