Home | History | Annotate | Download | only in library
      1 :mod:`shutil` --- High-level file operations
      2 ============================================
      3 
      4 .. module:: shutil
      5    :synopsis: High-level file operations, including copying.
      6 .. sectionauthor:: Fred L. Drake, Jr. <fdrake (a] acm.org>
      7 .. partly based on the docstrings
      8 
      9 .. index::
     10    single: file; copying
     11    single: copying files
     12 
     13 **Source code:** :source:`Lib/shutil.py`
     14 
     15 --------------
     16 
     17 The :mod:`shutil` module offers a number of high-level operations on files and
     18 collections of files.  In particular, functions are provided  which support file
     19 copying and removal. For operations on individual files, see also the
     20 :mod:`os` module.
     21 
     22 .. warning::
     23 
     24    Even the higher-level file copying functions (:func:`shutil.copy`,
     25    :func:`shutil.copy2`) can't copy all file metadata.
     26 
     27    On POSIX platforms, this means that file owner and group are lost as well
     28    as ACLs.  On Mac OS, the resource fork and other metadata are not used.
     29    This means that resources will be lost and file type and creator codes will
     30    not be correct. On Windows, file owners, ACLs and alternate data streams
     31    are not copied.
     32 
     33 
     34 .. _file-operations:
     35 
     36 Directory and files operations
     37 ------------------------------
     38 
     39 .. function:: copyfileobj(fsrc, fdst[, length])
     40 
     41    Copy the contents of the file-like object *fsrc* to the file-like object *fdst*.
     42    The integer *length*, if given, is the buffer size. In particular, a negative
     43    *length* value means to copy the data without looping over the source data in
     44    chunks; by default the data is read in chunks to avoid uncontrolled memory
     45    consumption. Note that if the current file position of the *fsrc* object is not
     46    0, only the contents from the current file position to the end of the file will
     47    be copied.
     48 
     49 
     50 .. function:: copyfile(src, dst)
     51 
     52    Copy the contents (no metadata) of the file named *src* to a file named
     53    *dst*.  *dst* must be the complete target file name; look at
     54    :func:`shutil.copy` for a copy that accepts a target directory path.  If
     55    *src* and *dst* are the same files, :exc:`Error` is raised.
     56    The destination location must be writable; otherwise,  an :exc:`IOError` exception
     57    will be raised. If *dst* already exists, it will be replaced.   Special files
     58    such as character or block devices and pipes cannot be copied with this
     59    function.  *src* and *dst* are path names given as strings.
     60 
     61 
     62 .. function:: copymode(src, dst)
     63 
     64    Copy the permission bits from *src* to *dst*.  The file contents, owner, and
     65    group are unaffected.  *src* and *dst* are path names given as strings.
     66 
     67 
     68 .. function:: copystat(src, dst)
     69 
     70    Copy the permission bits, last access time, last modification time, and flags
     71    from *src* to *dst*.  The file contents, owner, and group are unaffected.  *src*
     72    and *dst* are path names given as strings.
     73 
     74 
     75 .. function:: copy(src, dst)
     76 
     77    Copy the file *src* to the file or directory *dst*.  If *dst* is a directory, a
     78    file with the same basename as *src*  is created (or overwritten) in the
     79    directory specified.  Permission bits are copied.  *src* and *dst* are path
     80    names given as strings.
     81 
     82 
     83 .. function:: copy2(src, dst)
     84 
     85    Similar to :func:`shutil.copy`, but metadata is copied as well -- in fact,
     86    this is just :func:`shutil.copy` followed by :func:`copystat`.  This is
     87    similar to the Unix command :program:`cp -p`.
     88 
     89 
     90 .. function:: ignore_patterns(\*patterns)
     91 
     92    This factory function creates a function that can be used as a callable for
     93    :func:`copytree`\'s *ignore* argument, ignoring files and directories that
     94    match one of the glob-style *patterns* provided.  See the example below.
     95 
     96    .. versionadded:: 2.6
     97 
     98 
     99 .. function:: copytree(src, dst, symlinks=False, ignore=None)
    100 
    101    Recursively copy an entire directory tree rooted at *src*.  The destination
    102    directory, named by *dst*, must not already exist; it will be created as
    103    well as missing parent directories.  Permissions and times of directories
    104    are copied with :func:`copystat`, individual files are copied using
    105    :func:`shutil.copy2`.
    106 
    107    If *symlinks* is true, symbolic links in the source tree are represented as
    108    symbolic links in the new tree, but the metadata of the original links is NOT
    109    copied; if false or omitted, the contents and metadata of the linked files
    110    are copied to the new tree.
    111 
    112    If *ignore* is given, it must be a callable that will receive as its
    113    arguments the directory being visited by :func:`copytree`, and a list of its
    114    contents, as returned by :func:`os.listdir`.  Since :func:`copytree` is
    115    called recursively, the *ignore* callable will be called once for each
    116    directory that is copied.  The callable must return a sequence of directory
    117    and file names relative to the current directory (i.e. a subset of the items
    118    in its second argument); these names will then be ignored in the copy
    119    process.  :func:`ignore_patterns` can be used to create such a callable that
    120    ignores names based on glob-style patterns.
    121 
    122    If exception(s) occur, an :exc:`Error` is raised with a list of reasons.
    123 
    124    The source code for this should be considered an example rather than the
    125    ultimate tool.
    126 
    127    .. versionchanged:: 2.3
    128       :exc:`Error` is raised if any exceptions occur during copying, rather than
    129       printing a message.
    130 
    131    .. versionchanged:: 2.5
    132       Create intermediate directories needed to create *dst*, rather than raising an
    133       error. Copy permissions and times of directories using :func:`copystat`.
    134 
    135    .. versionchanged:: 2.6
    136       Added the *ignore* argument to be able to influence what is being copied.
    137 
    138 
    139 .. function:: rmtree(path[, ignore_errors[, onerror]])
    140 
    141    .. index:: single: directory; deleting
    142 
    143    Delete an entire directory tree; *path* must point to a directory (but not a
    144    symbolic link to a directory).  If *ignore_errors* is true, errors resulting
    145    from failed removals will be ignored; if false or omitted, such errors are
    146    handled by calling a handler specified by *onerror* or, if that is omitted,
    147    they raise an exception.
    148 
    149    If *onerror* is provided, it must be a callable that accepts three
    150    parameters: *function*, *path*, and *excinfo*. The first parameter,
    151    *function*, is the function which raised the exception; it will be
    152    :func:`os.path.islink`, :func:`os.listdir`, :func:`os.remove` or
    153    :func:`os.rmdir`.  The second parameter, *path*, will be the path name passed
    154    to *function*.  The third parameter, *excinfo*, will be the exception
    155    information return by :func:`sys.exc_info`.  Exceptions raised by *onerror*
    156    will not be caught.
    157 
    158    .. versionchanged:: 2.6
    159       Explicitly check for *path* being a symbolic link and raise :exc:`OSError`
    160       in that case.
    161 
    162 
    163 .. function:: move(src, dst)
    164 
    165    Recursively move a file or directory (*src*) to another location (*dst*).
    166 
    167    If the destination is an existing directory, then *src* is moved inside that
    168    directory. If the destination already exists but is not a directory, it may
    169    be overwritten depending on :func:`os.rename` semantics.
    170 
    171    If the destination is on the current filesystem, then :func:`os.rename` is
    172    used.  Otherwise, *src* is copied (using :func:`shutil.copy2`) to *dst* and
    173    then removed.
    174 
    175    .. versionadded:: 2.3
    176 
    177 
    178 .. exception:: Error
    179 
    180    This exception collects exceptions that are raised during a multi-file
    181    operation. For :func:`copytree`, the exception argument is a list of 3-tuples
    182    (*srcname*, *dstname*, *exception*).
    183 
    184    .. versionadded:: 2.3
    185 
    186 
    187 .. _copytree-example:
    188 
    189 copytree example
    190 ::::::::::::::::
    191 
    192 This example is the implementation of the :func:`copytree` function, described
    193 above, with the docstring omitted.  It demonstrates many of the other functions
    194 provided by this module. ::
    195 
    196    def copytree(src, dst, symlinks=False, ignore=None):
    197        names = os.listdir(src)
    198        if ignore is not None:
    199            ignored_names = ignore(src, names)
    200        else:
    201            ignored_names = set()
    202 
    203        os.makedirs(dst)
    204        errors = []
    205        for name in names:
    206            if name in ignored_names:
    207                continue
    208            srcname = os.path.join(src, name)
    209            dstname = os.path.join(dst, name)
    210            try:
    211                if symlinks and os.path.islink(srcname):
    212                    linkto = os.readlink(srcname)
    213                    os.symlink(linkto, dstname)
    214                elif os.path.isdir(srcname):
    215                    copytree(srcname, dstname, symlinks, ignore)
    216                else:
    217                    copy2(srcname, dstname)
    218                # XXX What about devices, sockets etc.?
    219            except (IOError, os.error) as why:
    220                errors.append((srcname, dstname, str(why)))
    221            # catch the Error from the recursive copytree so that we can
    222            # continue with other files
    223            except Error as err:
    224                errors.extend(err.args[0])
    225        try:
    226            copystat(src, dst)
    227        except WindowsError:
    228            # can't copy file access times on Windows
    229            pass
    230        except OSError as why:
    231            errors.extend((src, dst, str(why)))
    232        if errors:
    233            raise Error(errors)
    234 
    235 Another example that uses the :func:`ignore_patterns` helper::
    236 
    237    from shutil import copytree, ignore_patterns
    238 
    239    copytree(source, destination, ignore=ignore_patterns('*.pyc', 'tmp*'))
    240 
    241 This will copy everything except ``.pyc`` files and files or directories whose
    242 name starts with ``tmp``.
    243 
    244 Another example that uses the *ignore* argument to add a logging call::
    245 
    246    from shutil import copytree
    247    import logging
    248 
    249    def _logpath(path, names):
    250        logging.info('Working in %s' % path)
    251        return []   # nothing will be ignored
    252 
    253    copytree(source, destination, ignore=_logpath)
    254 
    255 
    256 .. _archiving-operations:
    257 
    258 Archiving operations
    259 --------------------
    260 
    261 High-level utilities to create and read compressed and archived files are also
    262 provided.  They rely on the :mod:`zipfile` and :mod:`tarfile` modules.
    263 
    264 .. function:: make_archive(base_name, format, [root_dir, [base_dir, [verbose, [dry_run, [owner, [group, [logger]]]]]]])
    265 
    266    Create an archive file (eg. zip or tar) and returns its name.
    267 
    268    *base_name* is the name of the file to create, including the path, minus
    269    any format-specific extension. *format* is the archive format: one of
    270    "zip" (if the :mod:`zlib` module or external ``zip`` executable is
    271    available), "tar", "gztar" (if the :mod:`zlib` module is available), or
    272    "bztar" (if the :mod:`bz2` module is available).
    273 
    274    *root_dir* is a directory that will be the root directory of the
    275    archive; ie. we typically chdir into *root_dir* before creating the
    276    archive.
    277 
    278    *base_dir* is the directory where we start archiving from;
    279    ie. *base_dir* will be the common prefix of all files and
    280    directories in the archive.
    281 
    282    *root_dir* and *base_dir* both default to the current directory.
    283 
    284    *owner* and *group* are used when creating a tar archive. By default,
    285    uses the current owner and group.
    286 
    287    *logger* must be an object compatible with :pep:`282`, usually an instance of
    288    :class:`logging.Logger`.
    289 
    290    .. versionadded:: 2.7
    291 
    292 
    293 .. function:: get_archive_formats()
    294 
    295    Return a list of supported formats for archiving.
    296    Each element of the returned sequence is a tuple ``(name, description)``.
    297 
    298    By default :mod:`shutil` provides these formats:
    299 
    300    - *zip*: ZIP file (if the :mod:`zlib` module or external ``zip``
    301      executable is available).
    302    - *tar*: uncompressed tar file.
    303    - *gztar*: gzip'ed tar-file (if the :mod:`zlib` module is available).
    304    - *bztar*: bzip2'ed tar-file (if the :mod:`bz2` module is available).
    305 
    306    You can register new formats or provide your own archiver for any existing
    307    formats, by using :func:`register_archive_format`.
    308 
    309    .. versionadded:: 2.7
    310 
    311 
    312 .. function:: register_archive_format(name, function, [extra_args, [description]])
    313 
    314    Register an archiver for the format *name*. *function* is a callable that
    315    will be used to invoke the archiver.
    316 
    317    If given, *extra_args* is a sequence of ``(name, value)`` that will be
    318    used as extra keywords arguments when the archiver callable is used.
    319 
    320    *description* is used by :func:`get_archive_formats` which returns the
    321    list of archivers. Defaults to an empty list.
    322 
    323    .. versionadded:: 2.7
    324 
    325 
    326 .. function::  unregister_archive_format(name)
    327 
    328    Remove the archive format *name* from the list of supported formats.
    329 
    330    .. versionadded:: 2.7
    331 
    332 
    333 .. _archiving-example:
    334 
    335 Archiving example
    336 :::::::::::::::::
    337 
    338 In this example, we create a gzip'ed tar-file archive containing all files
    339 found in the :file:`.ssh` directory of the user::
    340 
    341     >>> from shutil import make_archive
    342     >>> import os
    343     >>> archive_name = os.path.expanduser(os.path.join('~', 'myarchive'))
    344     >>> root_dir = os.path.expanduser(os.path.join('~', '.ssh'))
    345     >>> make_archive(archive_name, 'gztar', root_dir)
    346     '/Users/tarek/myarchive.tar.gz'
    347 
    348 The resulting archive contains:
    349 
    350 .. code-block:: shell-session
    351 
    352     $ tar -tzvf /Users/tarek/myarchive.tar.gz
    353     drwx------ tarek/staff       0 2010-02-01 16:23:40 ./
    354     -rw-r--r-- tarek/staff     609 2008-06-09 13:26:54 ./authorized_keys
    355     -rwxr-xr-x tarek/staff      65 2008-06-09 13:26:54 ./config
    356     -rwx------ tarek/staff     668 2008-06-09 13:26:54 ./id_dsa
    357     -rwxr-xr-x tarek/staff     609 2008-06-09 13:26:54 ./id_dsa.pub
    358     -rw------- tarek/staff    1675 2008-06-09 13:26:54 ./id_rsa
    359     -rw-r--r-- tarek/staff     397 2008-06-09 13:26:54 ./id_rsa.pub
    360     -rw-r--r-- tarek/staff   37192 2010-02-06 18:23:10 ./known_hosts
    361