1 FileCheck - Flexible pattern matching file verifier 2 =================================================== 3 4 SYNOPSIS 5 -------- 6 7 :program:`FileCheck` *match-filename* [*--check-prefix=XXX*] [*--strict-whitespace*] 8 9 DESCRIPTION 10 ----------- 11 12 :program:`FileCheck` reads two files (one from standard input, and one 13 specified on the command line) and uses one to verify the other. This 14 behavior is particularly useful for the testsuite, which wants to verify that 15 the output of some tool (e.g. :program:`llc`) contains the expected information 16 (for example, a movsd from esp or whatever is interesting). This is similar to 17 using :program:`grep`, but it is optimized for matching multiple different 18 inputs in one file in a specific order. 19 20 The ``match-filename`` file specifies the file that contains the patterns to 21 match. The file to verify is read from standard input unless the 22 :option:`--input-file` option is used. 23 24 OPTIONS 25 ------- 26 27 .. option:: -help 28 29 Print a summary of command line options. 30 31 .. option:: --check-prefix prefix 32 33 FileCheck searches the contents of ``match-filename`` for patterns to 34 match. By default, these patterns are prefixed with "``CHECK:``". 35 If you'd like to use a different prefix (e.g. because the same input 36 file is checking multiple different tool or options), the 37 :option:`--check-prefix` argument allows you to specify one or more 38 prefixes to match. Multiple prefixes are useful for tests which might 39 change for different run options, but most lines remain the same. 40 41 .. option:: --input-file filename 42 43 File to check (defaults to stdin). 44 45 .. option:: --strict-whitespace 46 47 By default, FileCheck canonicalizes input horizontal whitespace (spaces and 48 tabs) which causes it to ignore these differences (a space will match a tab). 49 The :option:`--strict-whitespace` argument disables this behavior. End-of-line 50 sequences are canonicalized to UNIX-style ``\n`` in all modes. 51 52 .. option:: --implicit-check-not check-pattern 53 54 Adds implicit negative checks for the specified patterns between positive 55 checks. The option allows writing stricter tests without stuffing them with 56 ``CHECK-NOT``\ s. 57 58 For example, "``--implicit-check-not warning:``" can be useful when testing 59 diagnostic messages from tools that don't have an option similar to ``clang 60 -verify``. With this option FileCheck will verify that input does not contain 61 warnings not covered by any ``CHECK:`` patterns. 62 63 .. option:: -version 64 65 Show the version number of this program. 66 67 EXIT STATUS 68 ----------- 69 70 If :program:`FileCheck` verifies that the file matches the expected contents, 71 it exits with 0. Otherwise, if not, or if an error occurs, it will exit with a 72 non-zero value. 73 74 TUTORIAL 75 -------- 76 77 FileCheck is typically used from LLVM regression tests, being invoked on the RUN 78 line of the test. A simple example of using FileCheck from a RUN line looks 79 like this: 80 81 .. code-block:: llvm 82 83 ; RUN: llvm-as < %s | llc -march=x86-64 | FileCheck %s 84 85 This syntax says to pipe the current file ("``%s``") into ``llvm-as``, pipe 86 that into ``llc``, then pipe the output of ``llc`` into ``FileCheck``. This 87 means that FileCheck will be verifying its standard input (the llc output) 88 against the filename argument specified (the original ``.ll`` file specified by 89 "``%s``"). To see how this works, let's look at the rest of the ``.ll`` file 90 (after the RUN line): 91 92 .. code-block:: llvm 93 94 define void @sub1(i32* %p, i32 %v) { 95 entry: 96 ; CHECK: sub1: 97 ; CHECK: subl 98 %0 = tail call i32 @llvm.atomic.load.sub.i32.p0i32(i32* %p, i32 %v) 99 ret void 100 } 101 102 define void @inc4(i64* %p) { 103 entry: 104 ; CHECK: inc4: 105 ; CHECK: incq 106 %0 = tail call i64 @llvm.atomic.load.add.i64.p0i64(i64* %p, i64 1) 107 ret void 108 } 109 110 Here you can see some "``CHECK:``" lines specified in comments. Now you can 111 see how the file is piped into ``llvm-as``, then ``llc``, and the machine code 112 output is what we are verifying. FileCheck checks the machine code output to 113 verify that it matches what the "``CHECK:``" lines specify. 114 115 The syntax of the "``CHECK:``" lines is very simple: they are fixed strings that 116 must occur in order. FileCheck defaults to ignoring horizontal whitespace 117 differences (e.g. a space is allowed to match a tab) but otherwise, the contents 118 of the "``CHECK:``" line is required to match some thing in the test file exactly. 119 120 One nice thing about FileCheck (compared to grep) is that it allows merging 121 test cases together into logical groups. For example, because the test above 122 is checking for the "``sub1:``" and "``inc4:``" labels, it will not match 123 unless there is a "``subl``" in between those labels. If it existed somewhere 124 else in the file, that would not count: "``grep subl``" matches if "``subl``" 125 exists anywhere in the file. 126 127 The FileCheck -check-prefix option 128 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 129 130 The FileCheck :option:`-check-prefix` option allows multiple test 131 configurations to be driven from one `.ll` file. This is useful in many 132 circumstances, for example, testing different architectural variants with 133 :program:`llc`. Here's a simple example: 134 135 .. code-block:: llvm 136 137 ; RUN: llvm-as < %s | llc -mtriple=i686-apple-darwin9 -mattr=sse41 \ 138 ; RUN: | FileCheck %s -check-prefix=X32 139 ; RUN: llvm-as < %s | llc -mtriple=x86_64-apple-darwin9 -mattr=sse41 \ 140 ; RUN: | FileCheck %s -check-prefix=X64 141 142 define <4 x i32> @pinsrd_1(i32 %s, <4 x i32> %tmp) nounwind { 143 %tmp1 = insertelement <4 x i32>; %tmp, i32 %s, i32 1 144 ret <4 x i32> %tmp1 145 ; X32: pinsrd_1: 146 ; X32: pinsrd $1, 4(%esp), %xmm0 147 148 ; X64: pinsrd_1: 149 ; X64: pinsrd $1, %edi, %xmm0 150 } 151 152 In this case, we're testing that we get the expected code generation with 153 both 32-bit and 64-bit code generation. 154 155 The "CHECK-NEXT:" directive 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 158 Sometimes you want to match lines and would like to verify that matches 159 happen on exactly consecutive lines with no other lines in between them. In 160 this case, you can use "``CHECK:``" and "``CHECK-NEXT:``" directives to specify 161 this. If you specified a custom check prefix, just use "``<PREFIX>-NEXT:``". 162 For example, something like this works as you'd expect: 163 164 .. code-block:: llvm 165 166 define void @t2(<2 x double>* %r, <2 x double>* %A, double %B) { 167 %tmp3 = load <2 x double>* %A, align 16 168 %tmp7 = insertelement <2 x double> undef, double %B, i32 0 169 %tmp9 = shufflevector <2 x double> %tmp3, 170 <2 x double> %tmp7, 171 <2 x i32> < i32 0, i32 2 > 172 store <2 x double> %tmp9, <2 x double>* %r, align 16 173 ret void 174 175 ; CHECK: t2: 176 ; CHECK: movl 8(%esp), %eax 177 ; CHECK-NEXT: movapd (%eax), %xmm0 178 ; CHECK-NEXT: movhpd 12(%esp), %xmm0 179 ; CHECK-NEXT: movl 4(%esp), %eax 180 ; CHECK-NEXT: movapd %xmm0, (%eax) 181 ; CHECK-NEXT: ret 182 } 183 184 "``CHECK-NEXT:``" directives reject the input unless there is exactly one 185 newline between it and the previous directive. A "``CHECK-NEXT:``" cannot be 186 the first directive in a file. 187 188 The "CHECK-SAME:" directive 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 191 Sometimes you want to match lines and would like to verify that matches happen 192 on the same line as the previous match. In this case, you can use "``CHECK:``" 193 and "``CHECK-SAME:``" directives to specify this. If you specified a custom 194 check prefix, just use "``<PREFIX>-SAME:``". 195 196 "``CHECK-SAME:``" is particularly powerful in conjunction with "``CHECK-NOT:``" 197 (described below). 198 199 For example, the following works like you'd expect: 200 201 .. code-block:: llvm 202 203 !0 = !DILocation(line: 5, scope: !1, inlinedAt: !2) 204 205 ; CHECK: !DILocation(line: 5, 206 ; CHECK-NOT: column: 207 ; CHECK-SAME: scope: ![[SCOPE:[0-9]+]] 208 209 "``CHECK-SAME:``" directives reject the input if there are any newlines between 210 it and the previous directive. A "``CHECK-SAME:``" cannot be the first 211 directive in a file. 212 213 The "CHECK-NOT:" directive 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 216 The "``CHECK-NOT:``" directive is used to verify that a string doesn't occur 217 between two matches (or before the first match, or after the last match). For 218 example, to verify that a load is removed by a transformation, a test like this 219 can be used: 220 221 .. code-block:: llvm 222 223 define i8 @coerce_offset0(i32 %V, i32* %P) { 224 store i32 %V, i32* %P 225 226 %P2 = bitcast i32* %P to i8* 227 %P3 = getelementptr i8* %P2, i32 2 228 229 %A = load i8* %P3 230 ret i8 %A 231 ; CHECK: @coerce_offset0 232 ; CHECK-NOT: load 233 ; CHECK: ret i8 234 } 235 236 The "CHECK-DAG:" directive 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 239 If it's necessary to match strings that don't occur in a strictly sequential 240 order, "``CHECK-DAG:``" could be used to verify them between two matches (or 241 before the first match, or after the last match). For example, clang emits 242 vtable globals in reverse order. Using ``CHECK-DAG:``, we can keep the checks 243 in the natural order: 244 245 .. code-block:: c++ 246 247 // RUN: %clang_cc1 %s -emit-llvm -o - | FileCheck %s 248 249 struct Foo { virtual void method(); }; 250 Foo f; // emit vtable 251 // CHECK-DAG: @_ZTV3Foo = 252 253 struct Bar { virtual void method(); }; 254 Bar b; 255 // CHECK-DAG: @_ZTV3Bar = 256 257 ``CHECK-NOT:`` directives could be mixed with ``CHECK-DAG:`` directives to 258 exclude strings between the surrounding ``CHECK-DAG:`` directives. As a result, 259 the surrounding ``CHECK-DAG:`` directives cannot be reordered, i.e. all 260 occurrences matching ``CHECK-DAG:`` before ``CHECK-NOT:`` must not fall behind 261 occurrences matching ``CHECK-DAG:`` after ``CHECK-NOT:``. For example, 262 263 .. code-block:: llvm 264 265 ; CHECK-DAG: BEFORE 266 ; CHECK-NOT: NOT 267 ; CHECK-DAG: AFTER 268 269 This case will reject input strings where ``BEFORE`` occurs after ``AFTER``. 270 271 With captured variables, ``CHECK-DAG:`` is able to match valid topological 272 orderings of a DAG with edges from the definition of a variable to its use. 273 It's useful, e.g., when your test cases need to match different output 274 sequences from the instruction scheduler. For example, 275 276 .. code-block:: llvm 277 278 ; CHECK-DAG: add [[REG1:r[0-9]+]], r1, r2 279 ; CHECK-DAG: add [[REG2:r[0-9]+]], r3, r4 280 ; CHECK: mul r5, [[REG1]], [[REG2]] 281 282 In this case, any order of that two ``add`` instructions will be allowed. 283 284 If you are defining `and` using variables in the same ``CHECK-DAG:`` block, 285 be aware that the definition rule can match `after` its use. 286 287 So, for instance, the code below will pass: 288 289 .. code-block:: llvm 290 291 ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0] 292 ; CHECK-DAG: vmov.32 [[REG2]][1] 293 vmov.32 d0[1] 294 vmov.32 d0[0] 295 296 While this other code, will not: 297 298 .. code-block:: llvm 299 300 ; CHECK-DAG: vmov.32 [[REG2:d[0-9]+]][0] 301 ; CHECK-DAG: vmov.32 [[REG2]][1] 302 vmov.32 d1[1] 303 vmov.32 d0[0] 304 305 While this can be very useful, it's also dangerous, because in the case of 306 register sequence, you must have a strong order (read before write, copy before 307 use, etc). If the definition your test is looking for doesn't match (because 308 of a bug in the compiler), it may match further away from the use, and mask 309 real bugs away. 310 311 In those cases, to enforce the order, use a non-DAG directive between DAG-blocks. 312 313 The "CHECK-LABEL:" directive 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 316 Sometimes in a file containing multiple tests divided into logical blocks, one 317 or more ``CHECK:`` directives may inadvertently succeed by matching lines in a 318 later block. While an error will usually eventually be generated, the check 319 flagged as causing the error may not actually bear any relationship to the 320 actual source of the problem. 321 322 In order to produce better error messages in these cases, the "``CHECK-LABEL:``" 323 directive can be used. It is treated identically to a normal ``CHECK`` 324 directive except that FileCheck makes an additional assumption that a line 325 matched by the directive cannot also be matched by any other check present in 326 ``match-filename``; this is intended to be used for lines containing labels or 327 other unique identifiers. Conceptually, the presence of ``CHECK-LABEL`` divides 328 the input stream into separate blocks, each of which is processed independently, 329 preventing a ``CHECK:`` directive in one block matching a line in another block. 330 For example, 331 332 .. code-block:: llvm 333 334 define %struct.C* @C_ctor_base(%struct.C* %this, i32 %x) { 335 entry: 336 ; CHECK-LABEL: C_ctor_base: 337 ; CHECK: mov [[SAVETHIS:r[0-9]+]], r0 338 ; CHECK: bl A_ctor_base 339 ; CHECK: mov r0, [[SAVETHIS]] 340 %0 = bitcast %struct.C* %this to %struct.A* 341 %call = tail call %struct.A* @A_ctor_base(%struct.A* %0) 342 %1 = bitcast %struct.C* %this to %struct.B* 343 %call2 = tail call %struct.B* @B_ctor_base(%struct.B* %1, i32 %x) 344 ret %struct.C* %this 345 } 346 347 define %struct.D* @D_ctor_base(%struct.D* %this, i32 %x) { 348 entry: 349 ; CHECK-LABEL: D_ctor_base: 350 351 The use of ``CHECK-LABEL:`` directives in this case ensures that the three 352 ``CHECK:`` directives only accept lines corresponding to the body of the 353 ``@C_ctor_base`` function, even if the patterns match lines found later in 354 the file. Furthermore, if one of these three ``CHECK:`` directives fail, 355 FileCheck will recover by continuing to the next block, allowing multiple test 356 failures to be detected in a single invocation. 357 358 There is no requirement that ``CHECK-LABEL:`` directives contain strings that 359 correspond to actual syntactic labels in a source or output language: they must 360 simply uniquely match a single line in the file being verified. 361 362 ``CHECK-LABEL:`` directives cannot contain variable definitions or uses. 363 364 FileCheck Pattern Matching Syntax 365 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 367 All FileCheck directives take a pattern to match. 368 For most uses of FileCheck, fixed string matching is perfectly sufficient. For 369 some things, a more flexible form of matching is desired. To support this, 370 FileCheck allows you to specify regular expressions in matching strings, 371 surrounded by double braces: ``{{yourregex}}``. Because we want to use fixed 372 string matching for a majority of what we do, FileCheck has been designed to 373 support mixing and matching fixed string matching with regular expressions. 374 This allows you to write things like this: 375 376 .. code-block:: llvm 377 378 ; CHECK: movhpd {{[0-9]+}}(%esp), {{%xmm[0-7]}} 379 380 In this case, any offset from the ESP register will be allowed, and any xmm 381 register will be allowed. 382 383 Because regular expressions are enclosed with double braces, they are 384 visually distinct, and you don't need to use escape characters within the double 385 braces like you would in C. In the rare case that you want to match double 386 braces explicitly from the input, you can use something ugly like 387 ``{{[{][{]}}`` as your pattern. 388 389 FileCheck Variables 390 ~~~~~~~~~~~~~~~~~~~ 391 392 It is often useful to match a pattern and then verify that it occurs again 393 later in the file. For codegen tests, this can be useful to allow any register, 394 but verify that that register is used consistently later. To do this, 395 :program:`FileCheck` allows named variables to be defined and substituted into 396 patterns. Here is a simple example: 397 398 .. code-block:: llvm 399 400 ; CHECK: test5: 401 ; CHECK: notw [[REGISTER:%[a-z]+]] 402 ; CHECK: andw {{.*}}[[REGISTER]] 403 404 The first check line matches a regex ``%[a-z]+`` and captures it into the 405 variable ``REGISTER``. The second line verifies that whatever is in 406 ``REGISTER`` occurs later in the file after an "``andw``". :program:`FileCheck` 407 variable references are always contained in ``[[ ]]`` pairs, and their names can 408 be formed with the regex ``[a-zA-Z][a-zA-Z0-9]*``. If a colon follows the name, 409 then it is a definition of the variable; otherwise, it is a use. 410 411 :program:`FileCheck` variables can be defined multiple times, and uses always 412 get the latest value. Variables can also be used later on the same line they 413 were defined on. For example: 414 415 .. code-block:: llvm 416 417 ; CHECK: op [[REG:r[0-9]+]], [[REG]] 418 419 Can be useful if you want the operands of ``op`` to be the same register, 420 and don't care exactly which register it is. 421 422 FileCheck Expressions 423 ~~~~~~~~~~~~~~~~~~~~~ 424 425 Sometimes there's a need to verify output which refers line numbers of the 426 match file, e.g. when testing compiler diagnostics. This introduces a certain 427 fragility of the match file structure, as "``CHECK:``" lines contain absolute 428 line numbers in the same file, which have to be updated whenever line numbers 429 change due to text addition or deletion. 430 431 To support this case, FileCheck allows using ``[[@LINE]]``, 432 ``[[@LINE+<offset>]]``, ``[[@LINE-<offset>]]`` expressions in patterns. These 433 expressions expand to a number of the line where a pattern is located (with an 434 optional integer offset). 435 436 This way match patterns can be put near the relevant test lines and include 437 relative line number references, for example: 438 439 .. code-block:: c++ 440 441 // CHECK: test.cpp:[[@LINE+4]]:6: error: expected ';' after top level declarator 442 // CHECK-NEXT: {{^int a}} 443 // CHECK-NEXT: {{^ \^}} 444 // CHECK-NEXT: {{^ ;}} 445 int a 446 447