Home | History | Annotate | Download | only in CommandGuide
      1 FileCheck - Flexible pattern matching file verifier
      2 ===================================================
      3 
      4 SYNOPSIS
      5 --------
      6 
      7 :program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*]
      8 
      9 DESCRIPTION
     10 -----------
     11 
     12 :program:`FileCheck` reads two files (one from standard input, and one
     13 specified on the command line) and uses one to verify the other.  This
     14 behavior is particularly useful for the testsuite, which wants to verify that
     15 the output of some tool (e.g. :program:`llc`) contains the expected information
     16 (for example, a movsd from esp or whatever is interesting).  This is similar to
     17 using :program:`grep`, but it is optimized for matching multiple different
     18 inputs in one file in a specific order.
     19 
     20 The ``match-filename`` file specifies the file that contains the patterns to
     21 match.  The file to verify is read from standard input unless the
     22 :option:`--input-file` option is used.
     23 
     24 OPTIONS
     25 -------
     26 
     27 .. option:: -help
     28 
     29  Print a summary of command line options.
     30 
     31 .. option:: --check-prefix prefix
     32 
     33  FileCheck searches the contents of ``match-filename`` for patterns to
     34  match.  By default, these patterns are prefixed with "``CHECK:``".
     35  If you'd like to use a different prefix (e.g. because the same input
     36  file is checking multiple different tool or options), the
     37  :option:`--check-prefix` argument allows you to specify one or more
     38  prefixes to match. Multiple prefixes are useful for tests which might
     39  change for different run options, but most lines remain the same.
     40 
     41 .. option:: --input-file filename
     42 
     43   File to check (defaults to stdin).
     44 
     45 .. option:: --strict-whitespace
     46 
     47  By default, FileCheck canonicalizes input horizontal whitespace (spaces and
     48  tabs) which causes it to ignore these differences (a space will match a tab).
     49  The :option:`--strict-whitespace` argument disables this behavior. End-of-line
     50  sequences are canonicalized to UNIX-style ``\n`` in all modes.
     51 
     52 .. option:: -version
     53 
     54  Show the version number of this program.
     55 
     56 EXIT STATUS
     57 -----------
     58 
     59 If :program:`FileCheck` verifies that the file matches the expected contents,
     60 it exits with 0.  Otherwise, if not, or if an error occurs, it will exit with a
     61 non-zero value.
     62 
     63 TUTORIAL
     64 --------
     65 
     66 FileCheck is typically used from LLVM regression tests, being invoked on the RUN
     67 line of the test.  A simple example of using FileCheck from a RUN line looks
     68 like this:
     69 
     70 .. code-block:: llvm
     71 
     72    ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s
     73 
     74 This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe
     75 that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``.  This
     76 means that FileCheck will be verifying its standard input (the llc output)
     77 against the filename argument specified (the original ``.ll`` file specified by
     78 "``%s``").  To see how this works, let's look at the rest of the ``.ll`` file
     79 (after the RUN line):
     80 
     81 .. code-block:: llvm
     82 
     83    define void @sub1(i32* %p, i32 %v) {
     84    entry:
     85    ; CHECK: sub1:
     86    ; CHECK: subl
     87            %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v)
     88            ret void
     89    }
     90 
     91    define void @inc4(i64* %p) {
     92    entry:
     93    ; CHECK: inc4:
     94    ; CHECK: incq
     95            %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1)
     96            ret void
     97    }
     98 
     99 Here you can see some "``CHECK:``" lines specified in comments.  Now you can
    100 see how the file is piped into ``llvm-as``, then ``llc``, and the machine code
    101 output is what we are verifying.  FileCheck checks the machine code output to
    102 verify that it matches what the "``CHECK:``" lines specify.
    103 
    104 The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that
    105 must occur in order.  FileCheck defaults to ignoring horizontal whitespace
    106 differences (e.g. a space is allowed to match a tab) but otherwise, the contents
    107 of the "``CHECK:``" line is required to match some thing in the test file exactly.
    108 
    109 One nice thing about FileCheck (compared to grep) is that it allows merging
    110 test cases together into logical groups.  For example, because the test above
    111 is checking for the "``sub1:``" and "``inc4:``" labels, it will not match
    112 unless there is a "``subl``" in between those labels.  If it existed somewhere
    113 else in the file, that would not count: "``grep subl``" matches if "``subl``"
    114 exists anywhere in the file.
    115 
    116 The FileCheck -check-prefix option
    117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    118 
    119 The FileCheck :option:`-check-prefix` option allows multiple test
    120 configurations to be driven from one `.ll` file.  This is useful in many
    121 circumstances, for example, testing different architectural variants with
    122 :program:`llc`.  Here's a simple example:
    123 
    124 .. code-block:: llvm
    125 
    126    ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \
    127    ; RUN:              | FileCheck %s -check-prefix=X32
    128    ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \
    129    ; RUN:              | FileCheck %s -check-prefix=X64
    130 
    131    define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind {
    132            %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1
    133            ret <4 x i32> %tmp1
    134    ; X32: pinsrd_1:
    135    ; X32:    pinsrd $1, 4(%esp), %xmm0
    136 
    137    ; X64: pinsrd_1:
    138    ; X64:    pinsrd $1, %edi, %xmm0
    139    }
    140 
    141 In this case, we're testing that we get the expected code generation with
    142 both 32-bit and 64-bit code generation.
    143 
    144 The "CHECK-NEXT:" directive
    145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
    146 
    147 Sometimes you want to match lines and would like to verify that matches
    148 happen on exactly consecutive lines with no other lines in between them.  In
    149 this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify
    150 this.  If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``".
    151 For example, something like this works as you'd expect:
    152 
    153 .. code-block:: llvm
    154 
    155    define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) {
    156  	%tmp3 = load <2 x double>* %A, align 16
    157  	%tmp7 = insertelement <2 x double> undef, double %B, i32 0
    158  	%tmp9 = shufflevector <2 x double> %tmp3,
    159                                <2 x double> %tmp7,
    160                                <2 x i32> < i32 0, i32 2 >
    161  	store <2 x double> %tmp9, <2 x double>* %r, align 16
    162  	ret void
    163 
    164    ; CHECK:          t2:
    165    ; CHECK: 	        movl	8(%esp), %eax
    166    ; CHECK-NEXT: 	movapd	(%eax), %xmm0
    167    ; CHECK-NEXT: 	movhpd	12(%esp), %xmm0
    168    ; CHECK-NEXT: 	movl	4(%esp), %eax
    169    ; CHECK-NEXT: 	movapd	%xmm0, (%eax)
    170    ; CHECK-NEXT: 	ret
    171    }
    172 
    173 "``CHECK-NEXT:``" directives reject the input unless there is exactly one
    174 newline between it and the previous directive.  A "``CHECK-NEXT:``" cannot be
    175 the first directive in a file.
    176 
    177 The "CHECK-NOT:" directive
    178 ~~~~~~~~~~~~~~~~~~~~~~~~~~
    179 
    180 The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur
    181 between two matches (or before the first match, or after the last match).  For
    182 example, to verify that a load is removed by a transformation, a test like this
    183 can be used:
    184 
    185 .. code-block:: llvm
    186 
    187    define i8 @coerce_offset0(i32 %V, i32* %P) {
    188      store i32 %V, i32* %P
    189 
    190      %P2 = bitcast i32* %P to i8*
    191      %P3 = getelementptr i8* %P2, i32 2
    192 
    193      %A = load i8* %P3
    194      ret i8 %A
    195    ; CHECK: @coerce_offset0
    196    ; CHECK-NOT: load
    197    ; CHECK: ret i8
    198    }
    199 
    200 The "CHECK-DAG:" directive
    201 ~~~~~~~~~~~~~~~~~~~~~~~~~~
    202 
    203 If it's necessary to match strings that don't occur in a strictly sequential
    204 order, "``CHECK-DAG:``" could be used to verify them between two matches (or
    205 before the first match, or after the last match). For example, clang emits
    206 vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks
    207 in the natural order:
    208 
    209 .. code-block:: c++
    210 
    211     // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s
    212 
    213     struct Foo { virtual void method(); };
    214     Foo f;  // emit vtable
    215     // CHECK-DAG: @_ZTV3Foo =
    216 
    217     struct Bar { virtual void method(); };
    218     Bar b;
    219     // CHECK-DAG: @_ZTV3Bar =
    220 
    221 ``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to
    222 exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result,
    223 the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all
    224 occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind
    225 occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example,
    226 
    227 .. code-block:: llvm
    228 
    229    ; CHECK-DAG: BEFORE
    230    ; CHECK-NOT: NOT
    231    ; CHECK-DAG: AFTER
    232 
    233 This case will reject input strings where ``BEFORE`` occurs after ``AFTER``.
    234 
    235 With captured variables, ``CHECK-DAG:`` is able to match valid topological
    236 orderings of a DAG with edges from the definition of a variable to its use.
    237 It's useful, e.g., when your test cases need to match different output
    238 sequences from the instruction scheduler. For example,
    239 
    240 .. code-block:: llvm
    241 
    242    ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2
    243    ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4
    244    ; CHECK:     mul r5, [[REG1]], [[REG2]]
    245 
    246 In this case, any order of that two ``add`` instructions will be allowed.
    247 
    248 If you are defining `and` using variables in the same ``CHECK-DAG:`` block,
    249 be aware that the definition rule can match `after` its use.
    250 
    251 So, for instance, the code below will pass:
    252 
    253 .. code-block:: llvm
    254 
    255   ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
    256   ; CHECK-DAG: vmov.32 [[REG2]][1]
    257   vmov.32 d0[1]
    258   vmov.32 d0[0]
    259 
    260 While this other code, will not:
    261 
    262 .. code-block:: llvm
    263 
    264   ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0]
    265   ; CHECK-DAG: vmov.32 [[REG2]][1]
    266   vmov.32 d1[1]
    267   vmov.32 d0[0]
    268 
    269 While this can be very useful, it's also dangerous, because in the case of
    270 register sequence, you must have a strong order (read before write, copy before
    271 use, etc). If the definition your test is looking for doesn't match (because
    272 of a bug in the compiler), it may match further away from the use, and mask
    273 real bugs away.
    274 
    275 In those cases, to enforce the order, use a non-DAG directive between DAG-blocks.
    276 
    277 The "CHECK-LABEL:" directive
    278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    279 
    280 Sometimes in a file containing multiple tests divided into logical blocks, one
    281 or more ``CHECK:`` directives may inadvertently succeed by matching lines in a
    282 later block. While an error will usually eventually be generated, the check
    283 flagged as causing the error may not actually bear any relationship to the
    284 actual source of the problem.
    285 
    286 In order to produce better error messages in these cases, the "``CHECK-LABEL:``"
    287 directive can be used. It is treated identically to a normal ``CHECK``
    288 directive except that FileCheck makes an additional assumption that a line
    289 matched by the directive cannot also be matched by any other check present in
    290 ``match-filename``; this is intended to be used for lines containing labels or
    291 other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides
    292 the input stream into separate blocks, each of which is processed independently,
    293 preventing a ``CHECK:`` directive in one block matching a line in another block.
    294 For example,
    295 
    296 .. code-block:: llvm
    297 
    298   define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) {
    299   entry:
    300   ; CHECK-LABEL: C_ctor_base:
    301   ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0
    302   ; CHECK: bl A_ctor_base
    303   ; CHECK: mov r0, [[SAVETHIS]]
    304     %0 = bitcast %struct.C* %this to %struct.A*
    305     %call = tail call %struct.A* @A_ctor_base(%struct.A* %0)
    306     %1 = bitcast %struct.C* %this to %struct.B*
    307     %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x)
    308     ret %struct.C* %this
    309   }
    310 
    311   define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) {
    312   entry:
    313   ; CHECK-LABEL: D_ctor_base:
    314 
    315 The use of ``CHECK-LABEL:`` directives in this case ensures that the three
    316 ``CHECK:`` directives only accept lines corresponding to the body of the
    317 ``@C_ctor_base`` function, even if the patterns match lines found later in
    318 the file. Furthermore, if one of these three ``CHECK:`` directives fail,
    319 FileCheck will recover by continuing to the next block, allowing multiple test
    320 failures to be detected in a single invocation.
    321 
    322 There is no requirement that ``CHECK-LABEL:`` directives contain strings that
    323 correspond to actual syntactic labels in a source or output language: they must
    324 simply uniquely match a single line in the file being verified.
    325 
    326 ``CHECK-LABEL:`` directives cannot contain variable definitions or uses.
    327 
    328 FileCheck Pattern Matching Syntax
    329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    330 
    331 The "``CHECK:``" and "``CHECK-NOT:``" directives both take a pattern to match.
    332 For most uses of FileCheck, fixed string matching is perfectly sufficient.  For
    333 some things, a more flexible form of matching is desired.  To support this,
    334 FileCheck allows you to specify regular expressions in matching strings,
    335 surrounded by double braces: ``{{yourregex}}``.  Because we want to use fixed
    336 string matching for a majority of what we do, FileCheck has been designed to
    337 support mixing and matching fixed string matching with regular expressions.
    338 This allows you to write things like this:
    339 
    340 .. code-block:: llvm
    341 
    342    ; CHECK: movhpd	{{[0-9]+}}(%esp), {{%xmm[0-7]}}
    343 
    344 In this case, any offset from the ESP register will be allowed, and any xmm
    345 register will be allowed.
    346 
    347 Because regular expressions are enclosed with double braces, they are
    348 visually distinct, and you don't need to use escape characters within the double
    349 braces like you would in C.  In the rare case that you want to match double
    350 braces explicitly from the input, you can use something ugly like
    351 ``{{[{][{]}}`` as your pattern.
    352 
    353 FileCheck Variables
    354 ~~~~~~~~~~~~~~~~~~~
    355 
    356 It is often useful to match a pattern and then verify that it occurs again
    357 later in the file.  For codegen tests, this can be useful to allow any register,
    358 but verify that that register is used consistently later.  To do this,
    359 :program:`FileCheck` allows named variables to be defined and substituted into
    360 patterns.  Here is a simple example:
    361 
    362 .. code-block:: llvm
    363 
    364    ; CHECK: test5:
    365    ; CHECK:    notw	[[REGISTER:%[a-z]+]]
    366    ; CHECK:    andw	{{.*}}[[REGISTER]]
    367 
    368 The first check line matches a regex ``%[a-z]+`` and captures it into the
    369 variable ``REGISTER``.  The second line verifies that whatever is in
    370 ``REGISTER`` occurs later in the file after an "``andw``".  :program:`FileCheck`
    371 variable references are always contained in ``[[ ]]`` pairs, and their names can
    372 be formed with the regex ``[a-zA-Z][a-zA-Z0-9]*``.  If a colon follows the name,
    373 then it is a definition of the variable; otherwise, it is a use.
    374 
    375 :program:`FileCheck` variables can be defined multiple times, and uses always
    376 get the latest value.  Variables can also be used later on the same line they
    377 were defined on. For example:
    378 
    379 .. code-block:: llvm
    380 
    381     ; CHECK: op [[REG:r[0-9]+]], [[REG]]
    382 
    383 Can be useful if you want the operands of ``op`` to be the same register,
    384 and don't care exactly which register it is.
    385 
    386 FileCheck Expressions
    387 ~~~~~~~~~~~~~~~~~~~~~
    388 
    389 Sometimes there's a need to verify output which refers line numbers of the
    390 match file, e.g. when testing compiler diagnostics.  This introduces a certain
    391 fragility of the match file structure, as "``CHECK:``" lines contain absolute
    392 line numbers in the same file, which have to be updated whenever line numbers
    393 change due to text addition or deletion.
    394 
    395 To support this case, FileCheck allows using ``[[@LINE]]``,
    396 ``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These
    397 expressions expand to a number of the line where a pattern is located (with an
    398 optional integer offset).
    399 
    400 This way match patterns can be put near the relevant test lines and include
    401 relative line number references, for example:
    402 
    403 .. code-block:: c++
    404 
    405    // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator
    406    // CHECK-NEXT: {{^int a}}
    407    // CHECK-NEXT: {{^     \^}}
    408    // CHECK-NEXT: {{^     ;}}
    409    int a
    410 
    411