Home | History | Annotate | Download | only in faq
      1 :tocdepth: 2
      2 
      3 =========================
      4 Library and Extension FAQ
      5 =========================
      6 
      7 .. only:: html
      8 
      9    .. contents::
     10 
     11 General Library Questions
     12 =========================
     13 
     14 How do I find a module or application to perform task X?
     15 --------------------------------------------------------
     16 
     17 Check :ref:`the Library Reference <library-index>` to see if there's a relevant
     18 standard library module.  (Eventually you'll learn what's in the standard
     19 library and will be able to skip this step.)
     20 
     21 For third-party packages, search the `Python Package Index
     22 <https://pypi.python.org/pypi>`_ or try `Google <https://www.google.com>`_ or
     23 another Web search engine.  Searching for "Python" plus a keyword or two for
     24 your topic of interest will usually find something helpful.
     25 
     26 
     27 Where is the math.py (socket.py, regex.py, etc.) source file?
     28 -------------------------------------------------------------
     29 
     30 If you can't find a source file for a module it may be a built-in or
     31 dynamically loaded module implemented in C, C++ or other compiled language.
     32 In this case you may not have the source file or it may be something like
     33 :file:`mathmodule.c`, somewhere in a C source directory (not on the Python Path).
     34 
     35 There are (at least) three kinds of modules in Python:
     36 
     37 1) modules written in Python (.py);
     38 2) modules written in C and dynamically loaded (.dll, .pyd, .so, .sl, etc);
     39 3) modules written in C and linked with the interpreter; to get a list of these,
     40    type::
     41 
     42       import sys
     43       print sys.builtin_module_names
     44 
     45 
     46 How do I make a Python script executable on Unix?
     47 -------------------------------------------------
     48 
     49 You need to do two things: the script file's mode must be executable and the
     50 first line must begin with ``#!`` followed by the path of the Python
     51 interpreter.
     52 
     53 The first is done by executing ``chmod +x scriptfile`` or perhaps ``chmod 755
     54 scriptfile``.
     55 
     56 The second can be done in a number of ways.  The most straightforward way is to
     57 write ::
     58 
     59   #!/usr/local/bin/python
     60 
     61 as the very first line of your file, using the pathname for where the Python
     62 interpreter is installed on your platform.
     63 
     64 If you would like the script to be independent of where the Python interpreter
     65 lives, you can use the :program:`env` program.  Almost all Unix variants support
     66 the following, assuming the Python interpreter is in a directory on the user's
     67 :envvar:`PATH`::
     68 
     69   #!/usr/bin/env python
     70 
     71 *Don't* do this for CGI scripts.  The :envvar:`PATH` variable for CGI scripts is
     72 often very minimal, so you need to use the actual absolute pathname of the
     73 interpreter.
     74 
     75 Occasionally, a user's environment is so full that the :program:`/usr/bin/env`
     76 program fails; or there's no env program at all.  In that case, you can try the
     77 following hack (due to Alex Rezinsky)::
     78 
     79    #! /bin/sh
     80    """:"
     81    exec python $0 ${1+"$@"}
     82    """
     83 
     84 The minor disadvantage is that this defines the script's __doc__ string.
     85 However, you can fix that by adding ::
     86 
     87    __doc__ = """...Whatever..."""
     88 
     89 
     90 
     91 Is there a curses/termcap package for Python?
     92 ---------------------------------------------
     93 
     94 .. XXX curses *is* built by default, isn't it?
     95 
     96 For Unix variants the standard Python source distribution comes with a curses
     97 module in the :source:`Modules` subdirectory, though it's not compiled by default.
     98 (Note that this is not available in the Windows distribution -- there is no
     99 curses module for Windows.)
    100 
    101 The :mod:`curses` module supports basic curses features as well as many additional
    102 functions from ncurses and SYSV curses such as colour, alternative character set
    103 support, pads, and mouse support. This means the module isn't compatible with
    104 operating systems that only have BSD curses, but there don't seem to be any
    105 currently maintained OSes that fall into this category.
    106 
    107 For Windows: use `the consolelib module
    108 <http://effbot.org/zone/console-index.htm>`_.
    109 
    110 
    111 Is there an equivalent to C's onexit() in Python?
    112 -------------------------------------------------
    113 
    114 The :mod:`atexit` module provides a register function that is similar to C's
    115 :c:func:`onexit`.
    116 
    117 
    118 Why don't my signal handlers work?
    119 ----------------------------------
    120 
    121 The most common problem is that the signal handler is declared with the wrong
    122 argument list.  It is called as ::
    123 
    124    handler(signum, frame)
    125 
    126 so it should be declared with two arguments::
    127 
    128    def handler(signum, frame):
    129        ...
    130 
    131 
    132 Common tasks
    133 ============
    134 
    135 How do I test a Python program or component?
    136 --------------------------------------------
    137 
    138 Python comes with two testing frameworks.  The :mod:`doctest` module finds
    139 examples in the docstrings for a module and runs them, comparing the output with
    140 the expected output given in the docstring.
    141 
    142 The :mod:`unittest` module is a fancier testing framework modelled on Java and
    143 Smalltalk testing frameworks.
    144 
    145 To make testing easier, you should use good modular design in your program.
    146 Your program should have almost all functionality
    147 encapsulated in either functions or class methods -- and this sometimes has the
    148 surprising and delightful effect of making the program run faster (because local
    149 variable accesses are faster than global accesses).  Furthermore the program
    150 should avoid depending on mutating global variables, since this makes testing
    151 much more difficult to do.
    152 
    153 The "global main logic" of your program may be as simple as ::
    154 
    155    if __name__ == "__main__":
    156        main_logic()
    157 
    158 at the bottom of the main module of your program.
    159 
    160 Once your program is organized as a tractable collection of functions and class
    161 behaviours you should write test functions that exercise the behaviours.  A test
    162 suite that automates a sequence of tests can be associated with each module.
    163 This sounds like a lot of work, but since Python is so terse and flexible it's
    164 surprisingly easy.  You can make coding much more pleasant and fun by writing
    165 your test functions in parallel with the "production code", since this makes it
    166 easy to find bugs and even design flaws earlier.
    167 
    168 "Support modules" that are not intended to be the main module of a program may
    169 include a self-test of the module. ::
    170 
    171    if __name__ == "__main__":
    172        self_test()
    173 
    174 Even programs that interact with complex external interfaces may be tested when
    175 the external interfaces are unavailable by using "fake" interfaces implemented
    176 in Python.
    177 
    178 
    179 How do I create documentation from doc strings?
    180 -----------------------------------------------
    181 
    182 The :mod:`pydoc` module can create HTML from the doc strings in your Python
    183 source code.  An alternative for creating API documentation purely from
    184 docstrings is `epydoc <http://epydoc.sourceforge.net/>`_.  `Sphinx
    185 <http://sphinx-doc.org>`_ can also include docstring content.
    186 
    187 
    188 How do I get a single keypress at a time?
    189 -----------------------------------------
    190 
    191 For Unix variants there are several solutions.  It's straightforward to do this
    192 using curses, but curses is a fairly large module to learn.  Here's a solution
    193 without curses::
    194 
    195    import termios, fcntl, sys, os
    196    fd = sys.stdin.fileno()
    197 
    198    oldterm = termios.tcgetattr(fd)
    199    newattr = termios.tcgetattr(fd)
    200    newattr[3] = newattr[3] & ~termios.ICANON & ~termios.ECHO
    201    termios.tcsetattr(fd, termios.TCSANOW, newattr)
    202 
    203    oldflags = fcntl.fcntl(fd, fcntl.F_GETFL)
    204    fcntl.fcntl(fd, fcntl.F_SETFL, oldflags | os.O_NONBLOCK)
    205 
    206    try:
    207        while 1:
    208            try:
    209                c = sys.stdin.read(1)
    210                print "Got character", repr(c)
    211            except IOError: pass
    212    finally:
    213        termios.tcsetattr(fd, termios.TCSAFLUSH, oldterm)
    214        fcntl.fcntl(fd, fcntl.F_SETFL, oldflags)
    215 
    216 You need the :mod:`termios` and the :mod:`fcntl` module for any of this to work,
    217 and I've only tried it on Linux, though it should work elsewhere.  In this code,
    218 characters are read and printed one at a time.
    219 
    220 :func:`termios.tcsetattr` turns off stdin's echoing and disables canonical mode.
    221 :func:`fcntl.fnctl` is used to obtain stdin's file descriptor flags and modify
    222 them for non-blocking mode.  Since reading stdin when it is empty results in an
    223 :exc:`IOError`, this error is caught and ignored.
    224 
    225 
    226 Threads
    227 =======
    228 
    229 How do I program using threads?
    230 -------------------------------
    231 
    232 .. XXX it's _thread in py3k
    233 
    234 Be sure to use the :mod:`threading` module and not the :mod:`thread` module.
    235 The :mod:`threading` module builds convenient abstractions on top of the
    236 low-level primitives provided by the :mod:`thread` module.
    237 
    238 Aahz has a set of slides from his threading tutorial that are helpful; see
    239 http://www.pythoncraft.com/OSCON2001/.
    240 
    241 
    242 None of my threads seem to run: why?
    243 ------------------------------------
    244 
    245 As soon as the main thread exits, all threads are killed.  Your main thread is
    246 running too quickly, giving the threads no time to do any work.
    247 
    248 A simple fix is to add a sleep to the end of the program that's long enough for
    249 all the threads to finish::
    250 
    251    import threading, time
    252 
    253    def thread_task(name, n):
    254        for i in range(n): print name, i
    255 
    256    for i in range(10):
    257        T = threading.Thread(target=thread_task, args=(str(i), i))
    258        T.start()
    259 
    260    time.sleep(10) # <----------------------------!
    261 
    262 But now (on many platforms) the threads don't run in parallel, but appear to run
    263 sequentially, one at a time!  The reason is that the OS thread scheduler doesn't
    264 start a new thread until the previous thread is blocked.
    265 
    266 A simple fix is to add a tiny sleep to the start of the run function::
    267 
    268    def thread_task(name, n):
    269        time.sleep(0.001) # <---------------------!
    270        for i in range(n): print name, i
    271 
    272    for i in range(10):
    273        T = threading.Thread(target=thread_task, args=(str(i), i))
    274        T.start()
    275 
    276    time.sleep(10)
    277 
    278 Instead of trying to guess a good delay value for :func:`time.sleep`,
    279 it's better to use some kind of semaphore mechanism.  One idea is to use the
    280 :mod:`Queue` module to create a queue object, let each thread append a token to
    281 the queue when it finishes, and let the main thread read as many tokens from the
    282 queue as there are threads.
    283 
    284 
    285 How do I parcel out work among a bunch of worker threads?
    286 ---------------------------------------------------------
    287 
    288 Use the :mod:`Queue` module to create a queue containing a list of jobs.  The
    289 :class:`~Queue.Queue` class maintains a list of objects and has a ``.put(obj)``
    290 method that adds items to the queue and a ``.get()`` method to return them.
    291 The class will take care of the locking necessary to ensure that each job is
    292 handed out exactly once.
    293 
    294 Here's a trivial example::
    295 
    296    import threading, Queue, time
    297 
    298    # The worker thread gets jobs off the queue.  When the queue is empty, it
    299    # assumes there will be no more work and exits.
    300    # (Realistically workers will run until terminated.)
    301    def worker():
    302        print 'Running worker'
    303        time.sleep(0.1)
    304        while True:
    305            try:
    306                arg = q.get(block=False)
    307            except Queue.Empty:
    308                print 'Worker', threading.currentThread(),
    309                print 'queue empty'
    310                break
    311            else:
    312                print 'Worker', threading.currentThread(),
    313                print 'running with argument', arg
    314                time.sleep(0.5)
    315 
    316    # Create queue
    317    q = Queue.Queue()
    318 
    319    # Start a pool of 5 workers
    320    for i in range(5):
    321        t = threading.Thread(target=worker, name='worker %i' % (i+1))
    322        t.start()
    323 
    324    # Begin adding work to the queue
    325    for i in range(50):
    326        q.put(i)
    327 
    328    # Give threads time to run
    329    print 'Main thread sleeping'
    330    time.sleep(5)
    331 
    332 When run, this will produce the following output:
    333 
    334 .. code-block:: none
    335 
    336    Running worker
    337    Running worker
    338    Running worker
    339    Running worker
    340    Running worker
    341    Main thread sleeping
    342    Worker <Thread(worker 1, started)> running with argument 0
    343    Worker <Thread(worker 2, started)> running with argument 1
    344    Worker <Thread(worker 3, started)> running with argument 2
    345    Worker <Thread(worker 4, started)> running with argument 3
    346    Worker <Thread(worker 5, started)> running with argument 4
    347    Worker <Thread(worker 1, started)> running with argument 5
    348    ...
    349 
    350 Consult the module's documentation for more details; the :class:`~Queue.Queue`
    351 class provides a featureful interface.
    352 
    353 
    354 What kinds of global value mutation are thread-safe?
    355 ----------------------------------------------------
    356 
    357 A :term:`global interpreter lock` (GIL) is used internally to ensure that only
    358 one thread runs in the Python VM at a time.  In general, Python offers to switch
    359 among threads only between bytecode instructions; how frequently it switches can
    360 be set via :func:`sys.setcheckinterval`.  Each bytecode instruction and
    361 therefore all the C implementation code reached from each instruction is
    362 therefore atomic from the point of view of a Python program.
    363 
    364 In theory, this means an exact accounting requires an exact understanding of the
    365 PVM bytecode implementation.  In practice, it means that operations on shared
    366 variables of built-in data types (ints, lists, dicts, etc) that "look atomic"
    367 really are.
    368 
    369 For example, the following operations are all atomic (L, L1, L2 are lists, D,
    370 D1, D2 are dicts, x, y are objects, i, j are ints)::
    371 
    372    L.append(x)
    373    L1.extend(L2)
    374    x = L[i]
    375    x = L.pop()
    376    L1[i:j] = L2
    377    L.sort()
    378    x = y
    379    x.field = y
    380    D[x] = y
    381    D1.update(D2)
    382    D.keys()
    383 
    384 These aren't::
    385 
    386    i = i+1
    387    L.append(L[-1])
    388    L[i] = L[j]
    389    D[x] = D[x] + 1
    390 
    391 Operations that replace other objects may invoke those other objects'
    392 :meth:`__del__` method when their reference count reaches zero, and that can
    393 affect things.  This is especially true for the mass updates to dictionaries and
    394 lists.  When in doubt, use a mutex!
    395 
    396 
    397 Can't we get rid of the Global Interpreter Lock?
    398 ------------------------------------------------
    399 
    400 .. XXX mention multiprocessing
    401 .. XXX link to dbeazley's talk about GIL?
    402 
    403 The :term:`global interpreter lock` (GIL) is often seen as a hindrance to Python's
    404 deployment on high-end multiprocessor server machines, because a multi-threaded
    405 Python program effectively only uses one CPU, due to the insistence that
    406 (almost) all Python code can only run while the GIL is held.
    407 
    408 Back in the days of Python 1.5, Greg Stein actually implemented a comprehensive
    409 patch set (the "free threading" patches) that removed the GIL and replaced it
    410 with fine-grained locking.  Unfortunately, even on Windows (where locks are very
    411 efficient) this ran ordinary Python code about twice as slow as the interpreter
    412 using the GIL.  On Linux the performance loss was even worse because pthread
    413 locks aren't as efficient.
    414 
    415 Since then, the idea of getting rid of the GIL has occasionally come up but
    416 nobody has found a way to deal with the expected slowdown, and users who don't
    417 use threads would not be happy if their code ran at half the speed.  Greg's
    418 free threading patch set has not been kept up-to-date for later Python versions.
    419 
    420 This doesn't mean that you can't make good use of Python on multi-CPU machines!
    421 You just have to be creative with dividing the work up between multiple
    422 *processes* rather than multiple *threads*.  Judicious use of C extensions will
    423 also help; if you use a C extension to perform a time-consuming task, the
    424 extension can release the GIL while the thread of execution is in the C code and
    425 allow other threads to get some work done.
    426 
    427 It has been suggested that the GIL should be a per-interpreter-state lock rather
    428 than truly global; interpreters then wouldn't be able to share objects.
    429 Unfortunately, this isn't likely to happen either.  It would be a tremendous
    430 amount of work, because many object implementations currently have global state.
    431 For example, small integers and short strings are cached; these caches would
    432 have to be moved to the interpreter state.  Other object types have their own
    433 free list; these free lists would have to be moved to the interpreter state.
    434 And so on.
    435 
    436 And I doubt that it can even be done in finite time, because the same problem
    437 exists for 3rd party extensions.  It is likely that 3rd party extensions are
    438 being written at a faster rate than you can convert them to store all their
    439 global state in the interpreter state.
    440 
    441 And finally, once you have multiple interpreters not sharing any state, what
    442 have you gained over running each interpreter in a separate process?
    443 
    444 
    445 Input and Output
    446 ================
    447 
    448 How do I delete a file? (And other file questions...)
    449 -----------------------------------------------------
    450 
    451 Use ``os.remove(filename)`` or ``os.unlink(filename)``; for documentation, see
    452 the :mod:`os` module.  The two functions are identical; :func:`unlink` is simply
    453 the name of the Unix system call for this function.
    454 
    455 To remove a directory, use :func:`os.rmdir`; use :func:`os.mkdir` to create one.
    456 ``os.makedirs(path)`` will create any intermediate directories in ``path`` that
    457 don't exist. ``os.removedirs(path)`` will remove intermediate directories as
    458 long as they're empty; if you want to delete an entire directory tree and its
    459 contents, use :func:`shutil.rmtree`.
    460 
    461 To rename a file, use ``os.rename(old_path, new_path)``.
    462 
    463 To truncate a file, open it using ``f = open(filename, "r+")``, and use
    464 ``f.truncate(offset)``; offset defaults to the current seek position.  There's
    465 also ``os.ftruncate(fd, offset)`` for files opened with :func:`os.open`, where
    466 *fd* is the file descriptor (a small integer).
    467 
    468 The :mod:`shutil` module also contains a number of functions to work on files
    469 including :func:`~shutil.copyfile`, :func:`~shutil.copytree`, and
    470 :func:`~shutil.rmtree`.
    471 
    472 
    473 How do I copy a file?
    474 ---------------------
    475 
    476 The :mod:`shutil` module contains a :func:`~shutil.copyfile` function.  Note
    477 that on MacOS 9 it doesn't copy the resource fork and Finder info.
    478 
    479 
    480 How do I read (or write) binary data?
    481 -------------------------------------
    482 
    483 To read or write complex binary data formats, it's best to use the :mod:`struct`
    484 module.  It allows you to take a string containing binary data (usually numbers)
    485 and convert it to Python objects; and vice versa.
    486 
    487 For example, the following code reads two 2-byte integers and one 4-byte integer
    488 in big-endian format from a file::
    489 
    490    import struct
    491 
    492    f = open(filename, "rb")  # Open in binary mode for portability
    493    s = f.read(8)
    494    x, y, z = struct.unpack(">hhl", s)
    495 
    496 The '>' in the format string forces big-endian data; the letter 'h' reads one
    497 "short integer" (2 bytes), and 'l' reads one "long integer" (4 bytes) from the
    498 string.
    499 
    500 For data that is more regular (e.g. a homogeneous list of ints or floats),
    501 you can also use the :mod:`array` module.
    502 
    503 
    504 I can't seem to use os.read() on a pipe created with os.popen(); why?
    505 ---------------------------------------------------------------------
    506 
    507 :func:`os.read` is a low-level function which takes a file descriptor, a small
    508 integer representing the opened file.  :func:`os.popen` creates a high-level
    509 file object, the same type returned by the built-in :func:`open` function.
    510 Thus, to read *n* bytes from a pipe *p* created with :func:`os.popen`, you need to
    511 use ``p.read(n)``.
    512 
    513 
    514 How do I run a subprocess with pipes connected to both input and output?
    515 ------------------------------------------------------------------------
    516 
    517 .. XXX update to use subprocess
    518 
    519 Use the :mod:`popen2` module.  For example::
    520 
    521    import popen2
    522    fromchild, tochild = popen2.popen2("command")
    523    tochild.write("input\n")
    524    tochild.flush()
    525    output = fromchild.readline()
    526 
    527 Warning: in general it is unwise to do this because you can easily cause a
    528 deadlock where your process is blocked waiting for output from the child while
    529 the child is blocked waiting for input from you.  This can be caused by the
    530 parent expecting the child to output more text than it does or by data being
    531 stuck in stdio buffers due to lack of flushing.  The Python parent
    532 can of course explicitly flush the data it sends to the child before it reads
    533 any output, but if the child is a naive C program it may have been written to
    534 never explicitly flush its output, even if it is interactive, since flushing is
    535 normally automatic.
    536 
    537 Note that a deadlock is also possible if you use :func:`popen3` to read stdout
    538 and stderr. If one of the two is too large for the internal buffer (increasing
    539 the buffer size does not help) and you ``read()`` the other one first, there is
    540 a deadlock, too.
    541 
    542 Note on a bug in popen2: unless your program calls ``wait()`` or ``waitpid()``,
    543 finished child processes are never removed, and eventually calls to popen2 will
    544 fail because of a limit on the number of child processes.  Calling
    545 :func:`os.waitpid` with the :data:`os.WNOHANG` option can prevent this; a good
    546 place to insert such a call would be before calling ``popen2`` again.
    547 
    548 In many cases, all you really need is to run some data through a command and get
    549 the result back.  Unless the amount of data is very large, the easiest way to do
    550 this is to write it to a temporary file and run the command with that temporary
    551 file as input.  The standard module :mod:`tempfile` exports a
    552 :func:`~tempfile.mktemp` function to generate unique temporary file names. ::
    553 
    554    import tempfile
    555    import os
    556 
    557    class Popen3:
    558        """
    559        This is a deadlock-safe version of popen that returns
    560        an object with errorlevel, out (a string) and err (a string).
    561        (capturestderr may not work under windows.)
    562        Example: print Popen3('grep spam','\n\nhere spam\n\n').out
    563        """
    564        def __init__(self,command,input=None,capturestderr=None):
    565            outfile=tempfile.mktemp()
    566            command="( %s ) > %s" % (command,outfile)
    567            if input:
    568                infile=tempfile.mktemp()
    569                open(infile,"w").write(input)
    570                command=command+" <"+infile
    571            if capturestderr:
    572                errfile=tempfile.mktemp()
    573                command=command+" 2>"+errfile
    574            self.errorlevel=os.system(command) >> 8
    575            self.out=open(outfile,"r").read()
    576            os.remove(outfile)
    577            if input:
    578                os.remove(infile)
    579            if capturestderr:
    580                self.err=open(errfile,"r").read()
    581                os.remove(errfile)
    582 
    583 Note that many interactive programs (e.g. vi) don't work well with pipes
    584 substituted for standard input and output.  You will have to use pseudo ttys
    585 ("ptys") instead of pipes. Or you can use a Python interface to Don Libes'
    586 "expect" library.  A Python extension that interfaces to expect is called "expy"
    587 and available from http://expectpy.sourceforge.net.  A pure Python solution that
    588 works like expect is `pexpect <https://pypi.python.org/pypi/pexpect/>`_.
    589 
    590 
    591 How do I access the serial (RS232) port?
    592 ----------------------------------------
    593 
    594 For Win32, POSIX (Linux, BSD, etc.), Jython:
    595 
    596    http://pyserial.sourceforge.net
    597 
    598 For Unix, see a Usenet post by Mitch Chapman:
    599 
    600    https://groups.google.com/groups?selm=34A04430.CF9@ohioee.com
    601 
    602 
    603 Why doesn't closing sys.stdout (stdin, stderr) really close it?
    604 ---------------------------------------------------------------
    605 
    606 Python file objects are a high-level layer of abstraction on top of C streams,
    607 which in turn are a medium-level layer of abstraction on top of (among other
    608 things) low-level C file descriptors.
    609 
    610 For most file objects you create in Python via the built-in ``file``
    611 constructor, ``f.close()`` marks the Python file object as being closed from
    612 Python's point of view, and also arranges to close the underlying C stream.
    613 This also happens automatically in ``f``'s destructor, when ``f`` becomes
    614 garbage.
    615 
    616 But stdin, stdout and stderr are treated specially by Python, because of the
    617 special status also given to them by C.  Running ``sys.stdout.close()`` marks
    618 the Python-level file object as being closed, but does *not* close the
    619 associated C stream.
    620 
    621 To close the underlying C stream for one of these three, you should first be
    622 sure that's what you really want to do (e.g., you may confuse extension modules
    623 trying to do I/O).  If it is, use os.close::
    624 
    625     os.close(0)   # close C's stdin stream
    626     os.close(1)   # close C's stdout stream
    627     os.close(2)   # close C's stderr stream
    628 
    629 
    630 Network/Internet Programming
    631 ============================
    632 
    633 What WWW tools are there for Python?
    634 ------------------------------------
    635 
    636 See the chapters titled :ref:`internet` and :ref:`netdata` in the Library
    637 Reference Manual.  Python has many modules that will help you build server-side
    638 and client-side web systems.
    639 
    640 .. XXX check if wiki page is still up to date
    641 
    642 A summary of available frameworks is maintained by Paul Boddie at
    643 https://wiki.python.org/moin/WebProgramming\ .
    644 
    645 Cameron Laird maintains a useful set of pages about Python web technologies at
    646 http://phaseit.net/claird/comp.lang.python/web_python.
    647 
    648 
    649 How can I mimic CGI form submission (METHOD=POST)?
    650 --------------------------------------------------
    651 
    652 I would like to retrieve web pages that are the result of POSTing a form. Is
    653 there existing code that would let me do this easily?
    654 
    655 Yes. Here's a simple example that uses httplib::
    656 
    657    #!/usr/local/bin/python
    658 
    659    import httplib, sys, time
    660 
    661    # build the query string
    662    qs = "First=Josephine&MI=Q&Last=Public"
    663 
    664    # connect and send the server a path
    665    httpobj = httplib.HTTP('www.some-server.out-there', 80)
    666    httpobj.putrequest('POST', '/cgi-bin/some-cgi-script')
    667    # now generate the rest of the HTTP headers...
    668    httpobj.putheader('Accept', '*/*')
    669    httpobj.putheader('Connection', 'Keep-Alive')
    670    httpobj.putheader('Content-type', 'application/x-www-form-urlencoded')
    671    httpobj.putheader('Content-length', '%d' % len(qs))
    672    httpobj.endheaders()
    673    httpobj.send(qs)
    674    # find out what the server said in response...
    675    reply, msg, hdrs = httpobj.getreply()
    676    if reply != 200:
    677        sys.stdout.write(httpobj.getfile().read())
    678 
    679 Note that in general for percent-encoded POST operations, query strings must be
    680 quoted using :func:`urllib.urlencode`.  For example, to send
    681 ``name=Guy Steele, Jr.``::
    682 
    683    >>> import urllib
    684    >>> urllib.urlencode({'name': 'Guy Steele, Jr.'})
    685    'name=Guy+Steele%2C+Jr.'
    686 
    687 
    688 What module should I use to help with generating HTML?
    689 ------------------------------------------------------
    690 
    691 .. XXX add modern template languages
    692 
    693 You can find a collection of useful links on the `Web Programming wiki page
    694 <https://wiki.python.org/moin/WebProgramming>`_.
    695 
    696 
    697 How do I send mail from a Python script?
    698 ----------------------------------------
    699 
    700 Use the standard library module :mod:`smtplib`.
    701 
    702 Here's a very simple interactive mail sender that uses it.  This method will
    703 work on any host that supports an SMTP listener. ::
    704 
    705    import sys, smtplib
    706 
    707    fromaddr = raw_input("From: ")
    708    toaddrs  = raw_input("To: ").split(',')
    709    print "Enter message, end with ^D:"
    710    msg = ''
    711    while True:
    712        line = sys.stdin.readline()
    713        if not line:
    714            break
    715        msg += line
    716 
    717    # The actual mail send
    718    server = smtplib.SMTP('localhost')
    719    server.sendmail(fromaddr, toaddrs, msg)
    720    server.quit()
    721 
    722 A Unix-only alternative uses sendmail.  The location of the sendmail program
    723 varies between systems; sometimes it is ``/usr/lib/sendmail``, sometimes
    724 ``/usr/sbin/sendmail``.  The sendmail manual page will help you out.  Here's
    725 some sample code::
    726 
    727    import os
    728 
    729    SENDMAIL = "/usr/sbin/sendmail"  # sendmail location
    730    p = os.popen("%s -t -i" % SENDMAIL, "w")
    731    p.write("To: receiver (a] example.com\n")
    732    p.write("Subject: test\n")
    733    p.write("\n") # blank line separating headers from body
    734    p.write("Some text\n")
    735    p.write("some more text\n")
    736    sts = p.close()
    737    if sts != 0:
    738        print "Sendmail exit status", sts
    739 
    740 
    741 How do I avoid blocking in the connect() method of a socket?
    742 ------------------------------------------------------------
    743 
    744 The select module is commonly used to help with asynchronous I/O on sockets.
    745 
    746 To prevent the TCP connect from blocking, you can set the socket to non-blocking
    747 mode.  Then when you do the ``connect()``, you will either connect immediately
    748 (unlikely) or get an exception that contains the error number as ``.errno``.
    749 ``errno.EINPROGRESS`` indicates that the connection is in progress, but hasn't
    750 finished yet.  Different OSes will return different values, so you're going to
    751 have to check what's returned on your system.
    752 
    753 You can use the ``connect_ex()`` method to avoid creating an exception.  It will
    754 just return the errno value.  To poll, you can call ``connect_ex()`` again later
    755 -- 0 or ``errno.EISCONN`` indicate that you're connected -- or you can pass this
    756 socket to select to check if it's writable.
    757 
    758 
    759 Databases
    760 =========
    761 
    762 Are there any interfaces to database packages in Python?
    763 --------------------------------------------------------
    764 
    765 Yes.
    766 
    767 .. XXX remove bsddb in py3k, fix other module names
    768 
    769 Python 2.3 includes the :mod:`bsddb` package which provides an interface to the
    770 BerkeleyDB library.  Interfaces to disk-based hashes such as :mod:`DBM <dbm>`
    771 and :mod:`GDBM <gdbm>` are also included with standard Python.
    772 
    773 Support for most relational databases is available.  See the
    774 `DatabaseProgramming wiki page
    775 <https://wiki.python.org/moin/DatabaseProgramming>`_ for details.
    776 
    777 
    778 How do you implement persistent objects in Python?
    779 --------------------------------------------------
    780 
    781 The :mod:`pickle` library module solves this in a very general way (though you
    782 still can't store things like open files, sockets or windows), and the
    783 :mod:`shelve` library module uses pickle and (g)dbm to create persistent
    784 mappings containing arbitrary Python objects.  For better performance, you can
    785 use the :mod:`cPickle` module.
    786 
    787 A more awkward way of doing things is to use pickle's little sister, marshal.
    788 The :mod:`marshal` module provides very fast ways to store noncircular basic
    789 Python types to files and strings, and back again.  Although marshal does not do
    790 fancy things like store instances or handle shared references properly, it does
    791 run extremely fast.  For example, loading a half megabyte of data may take less
    792 than a third of a second.  This often beats doing something more complex and
    793 general such as using gdbm with pickle/shelve.
    794 
    795 
    796 Why is cPickle so slow?
    797 -----------------------
    798 
    799 .. XXX update this, default protocol is 2/3
    800 
    801 By default :mod:`pickle` uses a relatively old and slow format for backward
    802 compatibility.  You can however specify other protocol versions that are
    803 faster::
    804 
    805     largeString = 'z' * (100 * 1024)
    806     myPickle = cPickle.dumps(largeString, protocol=1)
    807 
    808 
    809 If my program crashes with a bsddb (or anydbm) database open, it gets corrupted. How come?
    810 ------------------------------------------------------------------------------------------
    811 
    812 Databases opened for write access with the bsddb module (and often by the anydbm
    813 module, since it will preferentially use bsddb) must explicitly be closed using
    814 the ``.close()`` method of the database.  The underlying library caches database
    815 contents which need to be converted to on-disk form and written.
    816 
    817 If you have initialized a new bsddb database but not written anything to it
    818 before the program crashes, you will often wind up with a zero-length file and
    819 encounter an exception the next time the file is opened.
    820 
    821 
    822 I tried to open Berkeley DB file, but bsddb produces bsddb.error: (22, 'Invalid argument'). Help! How can I restore my data?
    823 ----------------------------------------------------------------------------------------------------------------------------
    824 
    825 Don't panic! Your data is probably intact. The most frequent cause for the error
    826 is that you tried to open an earlier Berkeley DB file with a later version of
    827 the Berkeley DB library.
    828 
    829 Many Linux systems now have all three versions of Berkeley DB available.  If you
    830 are migrating from version 1 to a newer version use db_dump185 to dump a plain
    831 text version of the database.  If you are migrating from version 2 to version 3
    832 use db2_dump to create a plain text version of the database.  In either case,
    833 use db_load to create a new native database for the latest version installed on
    834 your computer.  If you have version 3 of Berkeley DB installed, you should be
    835 able to use db2_load to create a native version 2 database.
    836 
    837 You should move away from Berkeley DB version 1 files because the hash file code
    838 contains known bugs that can corrupt your data.
    839 
    840 
    841 Mathematics and Numerics
    842 ========================
    843 
    844 How do I generate random numbers in Python?
    845 -------------------------------------------
    846 
    847 The standard module :mod:`random` implements a random number generator.  Usage
    848 is simple::
    849 
    850    import random
    851    random.random()
    852 
    853 This returns a random floating point number in the range [0, 1).
    854 
    855 There are also many other specialized generators in this module, such as:
    856 
    857 * ``randrange(a, b)`` chooses an integer in the range [a, b).
    858 * ``uniform(a, b)`` chooses a floating point number in the range [a, b).
    859 * ``normalvariate(mean, sdev)`` samples the normal (Gaussian) distribution.
    860 
    861 Some higher-level functions operate on sequences directly, such as:
    862 
    863 * ``choice(S)`` chooses random element from a given sequence
    864 * ``shuffle(L)`` shuffles a list in-place, i.e. permutes it randomly
    865 
    866 There's also a ``Random`` class you can instantiate to create independent
    867 multiple random number generators.
    868