Home | History | Annotate | Download | only in dalvik
      1 <html devsite>
      2   <head>
      3     <title>Constraints</title>
      4     <meta name="project_path" value="/_project.yaml" />
      5     <meta name="book_path" value="/_book.yaml" />
      6   </head>
      7   <body>
      8   <!--
      9       Copyright 2017 The Android Open Source Project
     10 
     11       Licensed under the Apache License, Version 2.0 (the "License");
     12       you may not use this file except in compliance with the License.
     13       You may obtain a copy of the License at
     14 
     15           http://www.apache.org/licenses/LICENSE-2.0
     16 
     17       Unless required by applicable law or agreed to in writing, software
     18       distributed under the License is distributed on an "AS IS" BASIS,
     19       WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
     20       See the License for the specific language governing permissions and
     21       limitations under the License.
     22   -->
     23 
     24 
     25 
     26 <p>A <code>.dex</code> file is the transport format for Dalvik Bytecode. There are certain
     27 syntactical and semantical constraints for a file to be a valid <code>.dex</code> file, and
     28 a runtime is required to support only valid .dex files.</p>
     29 
     30     <h2 id="gen-constraints">General .dex integrity constraints</h2>
     31 
     32     <p>General integrity constraints are concerned with the larger structure of a
     33     <code>.dex</code> file, as described in detail in <a href="dex-format.html"><code>.dex</code>
     34     format</a>.</p>
     35 
     36     <table>
     37       <tr>
     38         <th>
     39           Identifier
     40         </th>
     41 
     42         <th>
     43           Description
     44         </th>
     45       </tr>
     46 
     47       <tr>
     48         <td>
     49           G1
     50         </td>
     51 
     52         <td>
     53           The <code>magic</code> number of the <code>.dex</code> file must be
     54           <code>dex\n035\0</code> or <code>dex\n037\0</code>.
     55         </td>
     56       </tr>
     57 
     58       <tr>
     59         <td>
     60           G2
     61         </td>
     62 
     63         <td>
     64           The checksum must be an Adler-32 checksum of the whole file contents
     65           except <code>magic</code> and <code>checksum</code> field.
     66         </td>
     67       </tr>
     68 
     69       <tr>
     70         <td>
     71           G3
     72         </td>
     73 
     74         <td>
     75           The signature must be a SHA-1 hash of the whole file contents except <code>magic</code>,
     76           <code>checksum</code>, and <code>signature</code>.
     77         </td>
     78       </tr>
     79 
     80       <tr>
     81         <td>
     82           G4
     83         </td>
     84 
     85         <td>
     86           The <code>file_size</code> must match the actual file size in bytes.
     87         </td>
     88       </tr>
     89 
     90 
     91       <tr>
     92         <td>
     93           G5
     94         </td>
     95 
     96         <td>
     97           The <code>header_size</code> must have the value: <code>0x70</code>
     98         </td>
     99       </tr>
    100 
    101       <tr>
    102         <td>
    103           G6
    104         </td>
    105 
    106         <td>
    107           The <code>endian_tag</code> must have either the value:
    108           <code>ENDIAN_CONSTANT</code> or <code>REVERSE_ENDIAN_CONSTANT</code>
    109         </td>
    110       </tr>
    111 
    112       <tr>
    113         <td>
    114           G7
    115         </td>
    116 
    117         <td>
    118           For each of the <code>link</code>, <code>string_ids</code>,
    119           <code>type_ids</code>, <code>proto_ids</code>,
    120           <code>field_ids</code>, <code>method_ids</code>, <code>class_defs</code>, and
    121           <code>data</code> sections, the <code>offset</code> and <code>size</code> fields must be
    122           either both zero or both non-zero. In the latter case, the offset must be
    123           four-byte-aligned.
    124         </td>
    125       </tr>
    126 
    127       <tr>
    128         <td>
    129           G8
    130         </td>
    131 
    132         <td>
    133           All offset fields in the header except <code>map_off</code> must be four-byte-aligned.
    134         </td>
    135       </tr>
    136 
    137       <tr>
    138         <td>
    139           G9
    140         </td>
    141 
    142         <td>
    143           The <code>map_off</code> field must be either zero or point into the
    144           data section. In the latter case, the <code>data</code> section must exist.
    145         </td>
    146       </tr>
    147 
    148       <tr>
    149         <td>
    150           G10
    151         </td>
    152 
    153         <td>
    154           None of the <code>link</code>, <code>string_ids</code>,
    155           <code>type_ids</code>, <code>proto_ids</code>, <code>field_ids</code>,
    156           <code>method_ids</code>, <code>class_defs</code> and <code>data</code> sections
    157           must overlap each other or the header.
    158         </td>
    159       </tr>
    160 
    161       <tr>
    162         <td>
    163           G11
    164         </td>
    165 
    166         <td>
    167           If a map exists, then each map entry must have a valid type. Each type may appear at most once.
    168         </td>
    169       </tr>
    170 
    171       <tr>
    172         <td>
    173           G12
    174         </td>
    175 
    176         <td>
    177           If a map exists, then each map entry must have a non-zero offset and
    178           size. The offset must point into the corresponding section of the file (i.e. a
    179           <code>string_id_item</code> must point into the <code>string_ids</code> section) and the explicit or
    180           implicit size of the item must match the actual contents and size of the
    181           section.
    182         </td>
    183       </tr>
    184 
    185       <tr>
    186         <td>
    187           G13
    188         </td>
    189 
    190         <td>
    191           If a map exists, then the offset of map entry <code>n+1</code> must be greater or
    192           equal to the offset of map entry <code>n plus than size of map entry n</code>. This implies
    193           non-overlapping entries and low-to-high ordering.
    194         </td>
    195       </tr>
    196 
    197       <tr>
    198         <td>
    199           G14
    200         </td>
    201 
    202         <td>
    203           The following types of entries must have an offset that is
    204           four-byte-aligned: <code>string_id_item</code>,
    205           <code>type_id_item</code>, <code>proto_id_item</code>,
    206           <code>field_id_item</code>,
    207           <code>method_id_item</code>, <code>class_def_item</code>,
    208           <code>type_list</code>, <code>code_item</code>,
    209           <code>annotations_directory_item</code>.
    210         </td>
    211       </tr>
    212 
    213       <tr>
    214         <td>
    215           G15
    216         </td>
    217 
    218         <td>
    219           For each <code>string_id_item</code>, the <code>string_data_off</code> field must contain a
    220           valid reference into the <code>data</code> section. For the referenced <code>string_data_item</code>, the
    221           <code>data</code> field must contain a valid MUTF-8 string, and the <code>utf16_size</code> must match
    222           the decoded length of the string.
    223         </td>
    224       </tr>
    225 
    226       <tr>
    227         <td>
    228           G16
    229         </td>
    230 
    231         <td>
    232           For each <code>type_id_item</code>, the <code>descriptor_idx</code> field must contain a valid
    233           reference into the <code>string_ids</code> list. The referenced string must be a valid type
    234           descriptor.
    235         </td>
    236       </tr>
    237 
    238       <tr>
    239         <td>
    240           G17
    241         </td>
    242 
    243         <td>
    244           For each <code>proto_id_item</code>, the <code>shorty_idx</code> field must contain a valid
    245           reference into the <code>string_ids</code> list. The referenced string must be a valid
    246           shorty descriptor. Also, the <code>return_type_idx</code> field must be a valid index into
    247           the <code>type_ids</code> section, and the <code>parameters_off</code> field must be either zero or a
    248           valid offset pointing into the <code>data</code> section. If non-zero, the parameter list
    249           must not contain any void entries.
    250         </td>
    251       </tr>
    252 
    253       <tr>
    254         <td>
    255           G18
    256         </td>
    257 
    258         <td>
    259           For each <code>field_id_item</code>, both the <code>class_idx</code> and <code>type_idx</code> fields must
    260           be valid indices into the <code>type_ids</code> list. The entry referenced by <code>class_idx</code>
    261           must be a non-array reference type. In addition, the <code>name_idx</code> field must be a
    262           valid reference into the <code>string_ids</code> section, and the contents of the referenced
    263           entry must conform to the <code>MemberName</code> specification.
    264         </td>
    265       </tr>
    266 
    267       <tr>
    268         <td>
    269           G19
    270         </td>
    271 
    272         <td>
    273           For each <code>method_id_item</code>, the <code>class_idx</code> field must be a valid index
    274           into the <code>type_ids</code> section, and the referenced entry must be a non-array
    275           reference type. The <code>proto_id</code> field must be a valid reference into the <code>proto_ids</code>
    276           list. The <code>name_idx</code> field must be a valid reference into the <code>string_ids</code> section,
    277           and the contents of the referenced entry must conform to the <code>MemberName</code>
    278           specification.
    279         </td>
    280       </tr>
    281 
    282       <tr>
    283         <td>
    284           G20
    285         </td>
    286 
    287         <td>
    288           For each <code>field_id_item</code>, the <code>class_idx</code> field must be a valid index
    289           into the <code>type_ids</code> list. The referenced entry must be a non-array reference
    290           type.
    291         </td>
    292       </tr>
    293 
    294 </table>
    295 
    296 <h2 id="static-constraints">
    297       Static bytecode constraints
    298     </h2>
    299 
    300     <p>
    301     Static constraints are constraints on individual elements of the bytecode.
    302     They usually can be checked without employing control or data-flow analysis
    303     techniques.
    304     </p>
    305 
    306     <table>
    307       <tr>
    308         <th>
    309           Identifier
    310         </th>
    311 
    312         <th>
    313           Description
    314         </th>
    315       </tr>
    316 
    317       <tr>
    318         <td>
    319           A1
    320         </td>
    321 
    322         <td>
    323           The <code>insns</code> array must not be empty.
    324         </td>
    325       </tr>
    326 
    327       <tr>
    328         <td>
    329           A2
    330         </td>
    331 
    332         <td>
    333           The first opcode in the <code>insns</code> array must have index zero.
    334         </td>
    335       </tr>
    336 
    337       <tr>
    338         <td>
    339           A3
    340         </td>
    341 
    342         <td>
    343           The <code>insns</code> array must contain only valid Dalvik opcodes.
    344         </td>
    345       </tr>
    346 
    347       <tr>
    348         <td>
    349           A4
    350         </td>
    351 
    352         <td>
    353           The index of instruction <code>n+1</code> must equal the index of
    354           instruction <code>n</code> plus the length of instruction
    355           <code>n</code>, taking into account possible operands.
    356         </td>
    357       </tr>
    358 
    359       <tr>
    360         <td>
    361           A5
    362         </td>
    363 
    364         <td>
    365           The last instruction in the <code>insns</code> array must end at index
    366           <code>insns_size-1</code>.
    367         </td>
    368       </tr>
    369 
    370       <tr>
    371         <td>
    372           A6
    373         </td>
    374 
    375         <td>
    376           All <code>goto</code> and <code>if-&lt;kind&gt;</code> targets must
    377           be opcodes within the same method.
    378         </td>
    379       </tr>
    380 
    381       <tr>
    382         <td>
    383           A7
    384         </td>
    385 
    386         <td>
    387           All targets of a <code>packed-switch</code> instruction must be
    388           opcodes within the same method. The size and the list of targets
    389           must be consistent.
    390         </td>
    391       </tr>
    392 
    393       <tr>
    394         <td>
    395           A8
    396         </td>
    397 
    398         <td>
    399           All targets of a <code>sparse-switch</code> instruction must be
    400           opcodes within the same method. The corresponding table must be
    401           consistent and sorted low-to-high.
    402         </td>
    403       </tr>
    404 
    405       <tr>
    406         <td>
    407           A9
    408         </td>
    409 
    410         <td>
    411           The <code>B</code> operand of the <code>const-string</code> and
    412           <code>const-string/jumbo</code> instructions must be a valid index
    413           into the string constant pool.
    414         </td>
    415       </tr>
    416 
    417       <tr>
    418         <td>
    419           A10
    420         </td>
    421 
    422         <td>
    423           The <code>C</code> operand of the <code>iget&lt;kind&gt;</code> and
    424           <code>iput&lt;kind&gt;</code> instructions must be a valid index into
    425           the field constant pool. The referenced entry must represent an
    426           instance field.
    427         </td>
    428       </tr>
    429 
    430       <tr>
    431         <td>
    432           A11
    433         </td>
    434 
    435         <td>
    436           The <code>C</code> operand of the <code>sget&lt;kind&gt;</code> and
    437           <code>sput&lt;kind&gt;</code> instructions must be a valid index into
    438           the field constant pool. The referenced entry must represent a static
    439           field.
    440         </td>
    441       </tr>
    442 
    443       <tr>
    444         <td>
    445           A12
    446         </td>
    447 
    448         <td>
    449           The <code>C</code> operand of the <code>invoke-virtual</code>,
    450           <code>invoke-super</code>, <code>invoke-direct</code> and
    451           <code>invoke-static</code> instructions must be a valid index into the
    452           method constant pool.
    453         </td>
    454       </tr>
    455 
    456       <tr>
    457         <td>
    458           A13
    459         </td>
    460 
    461         <td>
    462           The <code>B</code> operand of the <code>invoke-virtual/range</code>,
    463           <code>invoke-super/range</code>, <code>invoke-direct/range</code>, and
    464           <code>invoke-static/range</code> instructions must be a valid index
    465           into the method constant pool.
    466         </td>
    467       </tr>
    468 
    469       <tr>
    470         <td>
    471           A14
    472         </td>
    473 
    474         <td>
    475           A method the name of which starts with a '&lt;' must only be invoked
    476           implicitly by the VM, not by code originating from a <code>.dex</code> file. The
    477           only exception is the instance initializer, which may be invoked by
    478           <code>invoke-direct</code>.
    479         </td>
    480       </tr>
    481 
    482       <tr>
    483         <td>
    484           A15
    485         </td>
    486 
    487         <td>
    488           The <code>C</code> operand of the <code>invoke-interface</code>
    489           instruction must be a valid index into the method constant pool. The
    490           referenced <code>method_id</code> must belong to an interface (not a
    491           class).
    492         </td>
    493       </tr>
    494 
    495       <tr>
    496         <td>
    497           A16
    498         </td>
    499 
    500         <td>
    501           The <code>B</code> operand of the <code>invoke-interface/range</code>
    502           instruction must be a valid index into the method constant pool.
    503           The referenced <code>method_id</code> must belong to an interface (not
    504           a class).
    505         </td>
    506       </tr>
    507 
    508       <tr>
    509         <td>
    510           A17
    511         </td>
    512 
    513         <td>
    514           The <code>B</code> operand of the <code>const-class</code>,
    515           <code>check-cast</code>, <code>new-instance</code>, and
    516           <code>filled-new-array/range</code> instructions must be a valid index
    517           into the type constant pool.
    518         </td>
    519       </tr>
    520 
    521       <tr>
    522         <td>
    523           A18
    524         </td>
    525 
    526         <td>
    527           The <code>C</code> operand of the <code>instance-of</code>,
    528           <code>new-array</code>, and <code>filled-new-array</code>
    529           instructions must be a valid index into the type constant pool.
    530         </td>
    531       </tr>
    532 
    533      <tr>
    534         <td>
    535           A19
    536         </td>
    537 
    538         <td>
    539           The dimensions of an array created by a <code>new-array</code>
    540           instruction must be less than <code>256</code>.
    541         </td>
    542       </tr>
    543 
    544       <tr>
    545         <td>
    546           A20
    547         </td>
    548 
    549         <td>
    550           The <code>new</code> instruction must not refer to array classes,
    551           interfaces, or abstract classes.
    552         </td>
    553       </tr>
    554 
    555       <tr>
    556         <td>
    557           A21
    558         </td>
    559 
    560         <td>
    561           The type referred to by a <code>new-array</code> instruction must be
    562           a valid, non-reference type.
    563         </td>
    564       </tr>
    565 
    566       <tr>
    567         <td>
    568           A22
    569         </td>
    570 
    571         <td>
    572           All registers referred to by an instruction in a single-width
    573           (non-pair) fashion must be valid for the current method. That is,
    574           their indices must be non-negative and smaller than
    575           <code>registers_size</code>.
    576         </td>
    577       </tr>
    578 
    579       <tr>
    580         <td>
    581           A23
    582         </td>
    583 
    584         <td>
    585           All registers referred to by an instruction in a double-width (pair)
    586           fashion must be valid for the current method. That is, their indices
    587           must be non-negative and smaller than <code>registers_size-1</code>.
    588         </td>
    589       </tr>
    590       <tr>
    591         <td>
    592           A24
    593         </td>
    594 
    595         <td>
    596           The <code>method_id</code> operand of the <code>invoke-virtual</code>
    597           and <code>invoke-direct</code> instructions must belong to a class
    598           (not an interface). In Dex files prior to version <code>037</code>
    599           the same must be true of <code>invoke-super</code> and
    600           <code>invoke-static</code> instructions.
    601         </td>
    602       </tr>
    603 
    604       <tr>
    605         <td>
    606           A25
    607         </td>
    608 
    609         <td>
    610           The <code>method_id</code> operand of the
    611           <code>invoke-virtual/range</code> and
    612           <code>invoke-direct/range</code> instructions must belong to a class
    613           (not an interface). In Dex files prior to version <code>037</code>
    614           the same must be true of <code>invoke-super/range</code> and
    615           <code>invoke-static/range</code> instructions.
    616         </td>
    617       </tr>
    618     </table>
    619 
    620     <h2 id="struct-constraints">
    621       Structural bytecode constraints
    622     </h2>
    623 
    624     <p>
    625     Structural constraints are constraints on relationships between several
    626     elements of the bytecode. They usually can't be checked without employing
    627     control or data-flow analysis techniques.
    628     </p>
    629 
    630     <table>
    631       <tr>
    632         <th>
    633           Identifier
    634         </th>
    635 
    636         <th>
    637           Description
    638         </th>
    639       </tr>
    640 
    641       <tr>
    642         <td>
    643           B1
    644         </td>
    645 
    646         <td>
    647           The number and types of arguments (registers and immediate values)
    648           must always match the instruction.
    649         </td>
    650       </tr>
    651 
    652       <tr>
    653         <td>
    654           B2
    655         </td>
    656 
    657         <td>
    658           Register pairs must never be broken up.
    659         </td>
    660       </tr>
    661 
    662       <tr>
    663         <td>
    664           B3
    665         </td>
    666 
    667         <td>
    668           A register (or pair) has to be assigned first before it can be
    669           read.
    670         </td>
    671       </tr>
    672 
    673       <tr>
    674         <td>
    675           B4
    676         </td>
    677 
    678         <td>
    679           An <code>invoke-direct</code> instruction must invoke an instance
    680           initializer or a method only in the current class or one of its
    681           superclasses.
    682         </td>
    683       </tr>
    684 
    685       <tr>
    686         <td>
    687           B5
    688         </td>
    689 
    690         <td>
    691           An instance initializer must be invoked only on an uninitialized
    692           instance.
    693         </td>
    694       </tr>
    695 
    696       <tr>
    697         <td>
    698           B6
    699         </td>
    700 
    701         <td>
    702           Instance methods may be invoked only on and instance fields may only
    703           be accessed on already initialized instances.
    704         </td>
    705       </tr>
    706 
    707       <tr>
    708         <td>
    709           B7
    710         </td>
    711 
    712         <td>
    713           A register that holds the result of a <code>new-instance</code>
    714           instruction must not be used if the same
    715           <code>new-instance</code> instruction is again executed before
    716           the instance is initialized.
    717         </td>
    718       </tr>
    719 
    720       <tr>
    721         <td>
    722           B8
    723         </td>
    724 
    725         <td>
    726            An instance initializer must call another instance initializer (same
    727            class or superclass) before any instance members can be accessed.
    728            Exceptions are non-inherited instance fields, which can be assigned
    729            before calling another initializer, and the <code>Object</code> class
    730            in general.
    731         </td>
    732       </tr>
    733 
    734       <tr>
    735         <td>
    736           B9
    737         </td>
    738 
    739         <td>
    740            All actual method arguments must be assignment-compatible with their
    741            respective formal arguments.
    742         </td>
    743       </tr>
    744 
    745       <tr>
    746         <td>
    747           B10
    748         </td>
    749 
    750         <td>
    751            For each instance method invocation, the actual instance must be
    752            assignment-compatible with the class or interface specified in the
    753            instruction.
    754         </td>
    755       </tr>
    756 
    757       <tr>
    758         <td>
    759           B11
    760         </td>
    761 
    762         <td>
    763            A <code>return&lt;kind&gt;</code> instruction must match its
    764            method's return type.
    765         </td>
    766       </tr>
    767 
    768       <tr>
    769         <td>
    770           B12
    771         </td>
    772 
    773         <td>
    774            When accessing protected members of a superclass, the actual type of
    775            the instance being accessed must be either the current class or one
    776            of its subclasses.
    777         </td>
    778       </tr>
    779 
    780      <tr>
    781         <td>
    782           B13
    783         </td>
    784 
    785         <td>
    786            The type of a value stored into a static field must be
    787            assignment-compatible with or convertible to the field's type.
    788         </td>
    789       </tr>
    790 
    791       <tr>
    792         <td>
    793           B14
    794         </td>
    795 
    796         <td>
    797            The type of a value stored into a field must be assignment-compatible
    798            with or convertible to the field's type.
    799         </td>
    800       </tr>
    801 
    802       <tr>
    803         <td>
    804           B15
    805         </td>
    806 
    807         <td>
    808            The type of every value stored into an array must be
    809            assignment-compatible with the array's component type.
    810         </td>
    811       </tr>
    812 
    813       <tr>
    814         <td>
    815           B16
    816         </td>
    817 
    818         <td>
    819            The <code>A</code> operand of a <code>throw</code> instruction must
    820            be assignment-compatible with <code>java.lang.Throwable</code>.
    821         </td>
    822       </tr>
    823 
    824       <tr>
    825         <td>
    826           B17
    827         </td>
    828 
    829         <td>
    830            The last reachable instruction of a method must either be a backwards
    831            <code>goto</code> or branch, a <code>return</code>, or a
    832            <code>throw</code> instruction. It must not be possible to leave the
    833            <code>insns</code> array at the bottom.
    834         </td>
    835       </tr>
    836 
    837       <tr>
    838         <td>
    839           B18
    840         </td>
    841 
    842         <td>
    843           The unassigned half of a former register pair may not be read (is
    844           considered invalid) until it has been re-assigned by some other
    845           instruction.
    846         </td>
    847       </tr>
    848 
    849       <tr>
    850         <td>
    851           B19
    852         </td>
    853 
    854         <td>
    855           A <code>move-result&lt;kind&gt;</code> instruction must be immediately
    856           preceded (in the <code>insns</code> array) by an
    857           <code>invoke-&lt;kind&gt;</code> instruction. The only exception is
    858           the <code>move-result-object</code> instruction, which may also be
    859           preceded by a <code>filled-new-array</code> instruction.
    860         </td>
    861       </tr>
    862 
    863       <tr>
    864         <td>
    865           B20
    866         </td>
    867 
    868         <td>
    869           A <code>move-result&lt;kind&gt;</code> instruction must be immediately
    870           preceded (in actual control flow) by a matching
    871           <code>return-&lt;kind&gt;</code> instruction (it must not be jumped
    872           to). The only exception is the <code>move-result-object</code>
    873           instruction, which may also be preceded by a
    874           <code>filled-new-array</code> instruction.
    875         </td>
    876       </tr>
    877 
    878       <tr>
    879         <td>
    880           B21
    881         </td>
    882 
    883         <td>
    884           A <code>move-exception</code> instruction must appear only as the
    885           first instruction in an exception handler.
    886         </td>
    887       </tr>
    888 
    889       <tr>
    890         <td>
    891           B22
    892         </td>
    893 
    894         <td>
    895           The <code>packed-switch-data</code>, <code>sparse-switch-data</code>,
    896           and <code>fill-array-data</code> pseudo-instructions must not be
    897           reachable by control flow.
    898         </td>
    899       </tr>
    900     </table>
    901 
    902   </body>
    903 </html>
    904