Home | History | Annotate | Download | only in async
      1 
      2 Normally, when SQLite writes to a database file, it waits until the write
      3 operation is finished before returning control to the calling application.
      4 Since writing to the file-system is usually very slow compared with CPU
      5 bound operations, this can be a performance bottleneck. This directory
      6 contains an extension that causes SQLite to perform all write requests
      7 using a separate thread running in the background. Although this does not
      8 reduce the overall system resources (CPU, disk bandwidth etc.) at all, it
      9 allows SQLite to return control to the caller quickly even when writing to
     10 the database, eliminating the bottleneck.
     11 
     12   1. Functionality
     13 
     14     1.1 How it Works
     15     1.2 Limitations
     16     1.3 Locking and Concurrency
     17 
     18   2. Compilation and Usage
     19 
     20   3. Porting
     21 
     22 
     23 
     24 1. FUNCTIONALITY
     25 
     26   With asynchronous I/O, write requests are handled by a separate thread
     27   running in the background.  This means that the thread that initiates
     28   a database write does not have to wait for (sometimes slow) disk I/O
     29   to occur.  The write seems to happen very quickly, though in reality
     30   it is happening at its usual slow pace in the background.
     31 
     32   Asynchronous I/O appears to give better responsiveness, but at a price.
     33   You lose the Durable property.  With the default I/O backend of SQLite,
     34   once a write completes, you know that the information you wrote is
     35   safely on disk.  With the asynchronous I/O, this is not the case.  If
     36   your program crashes or if a power loss occurs after the database
     37   write but before the asynchronous write thread has completed, then the
     38   database change might never make it to disk and the next user of the
     39   database might not see your change.
     40 
     41   You lose Durability with asynchronous I/O, but you still retain the
     42   other parts of ACID:  Atomic,  Consistent, and Isolated.  Many
     43   appliations get along fine without the Durablity.
     44 
     45   1.1 How it Works
     46 
     47     Asynchronous I/O works by creating a special SQLite "vfs" structure
     48     and registering it with sqlite3_vfs_register(). When files opened via 
     49     this vfs are written to (using the vfs xWrite() method), the data is not 
     50     written directly to disk, but is placed in the "write-queue" to be
     51     handled by the background thread.
     52 
     53     When files opened with the asynchronous vfs are read from 
     54     (using the vfs xRead() method), the data is read from the file on 
     55     disk and the write-queue, so that from the point of view of
     56     the vfs reader the xWrite() appears to have already completed.
     57 
     58     The special vfs is registered (and unregistered) by calls to the 
     59     API functions sqlite3async_initialize() and sqlite3async_shutdown().
     60     See section "Compilation and Usage" below for details.
     61 
     62   1.2 Limitations
     63 
     64     In order to gain experience with the main ideas surrounding asynchronous 
     65     IO, this implementation is deliberately kept simple. Additional 
     66     capabilities may be added in the future.
     67 
     68     For example, as currently implemented, if writes are happening at a 
     69     steady stream that exceeds the I/O capability of the background writer
     70     thread, the queue of pending write operations will grow without bound.
     71     If this goes on for long enough, the host system could run out of memory. 
     72     A more sophisticated module could to keep track of the quantity of 
     73     pending writes and stop accepting new write requests when the queue of 
     74     pending writes grows too large.
     75 
     76   1.3 Locking and Concurrency
     77 
     78     Multiple connections from within a single process that use this
     79     implementation of asynchronous IO may access a single database
     80     file concurrently. From the point of view of the user, if all
     81     connections are from within a single process, there is no difference
     82     between the concurrency offered by "normal" SQLite and SQLite
     83     using the asynchronous backend.
     84 
     85     If file-locking is enabled (it is enabled by default), then connections
     86     from multiple processes may also read and write the database file.
     87     However concurrency is reduced as follows:
     88 
     89       * When a connection using asynchronous IO begins a database
     90         transaction, the database is locked immediately. However the
     91         lock is not released until after all relevant operations
     92         in the write-queue have been flushed to disk. This means
     93         (for example) that the database may remain locked for some 
     94         time after a "COMMIT" or "ROLLBACK" is issued.
     95 
     96       * If an application using asynchronous IO executes transactions
     97         in quick succession, other database users may be effectively
     98         locked out of the database. This is because when a BEGIN
     99         is executed, a database lock is established immediately. But
    100         when the corresponding COMMIT or ROLLBACK occurs, the lock
    101         is not released until the relevant part of the write-queue 
    102         has been flushed through. As a result, if a COMMIT is followed
    103         by a BEGIN before the write-queue is flushed through, the database 
    104         is never unlocked,preventing other processes from accessing 
    105         the database.
    106 
    107     File-locking may be disabled at runtime using the sqlite3async_control()
    108     API (see below). This may improve performance when an NFS or other 
    109     network file-system, as the synchronous round-trips to the server be 
    110     required to establish file locks are avoided. However, if multiple 
    111     connections attempt to access the same database file when file-locking
    112     is disabled, application crashes and database corruption is a likely
    113     outcome.
    114 
    115 
    116 2. COMPILATION AND USAGE
    117 
    118   The asynchronous IO extension consists of a single file of C code
    119   (sqlite3async.c), and a header file (sqlite3async.h) that defines the 
    120   C API used by applications to activate and control the modules 
    121   functionality.
    122 
    123   To use the asynchronous IO extension, compile sqlite3async.c as
    124   part of the application that uses SQLite. Then use the API defined
    125   in sqlite3async.h to initialize and configure the module.
    126 
    127   The asynchronous IO VFS API is described in detail in comments in 
    128   sqlite3async.h. Using the API usually consists of the following steps:
    129 
    130     1. Register the asynchronous IO VFS with SQLite by calling the
    131        sqlite3async_initialize() function.
    132 
    133     2. Create a background thread to perform write operations and call
    134        sqlite3async_run().
    135 
    136     3. Use the normal SQLite API to read and write to databases via 
    137        the asynchronous IO VFS.
    138 
    139   Refer to sqlite3async.h for details.
    140 
    141 
    142 3. PORTING
    143 
    144   Currently the asynchronous IO extension is compatible with win32 systems
    145   and systems that support the pthreads interface, including Mac OSX, Linux, 
    146   and other varieties of Unix. 
    147 
    148   To port the asynchronous IO extension to another platform, the user must
    149   implement mutex and condition variable primitives for the new platform.
    150   Currently there is no externally available interface to allow this, but
    151   modifying the code within sqlite3async.c to include the new platforms
    152   concurrency primitives is relatively easy. Search within sqlite3async.c
    153   for the comment string "PORTING FUNCTIONS" for details. Then implement
    154   new versions of each of the following:
    155 
    156     static void async_mutex_enter(int eMutex);
    157     static void async_mutex_leave(int eMutex);
    158     static void async_cond_wait(int eCond, int eMutex);
    159     static void async_cond_signal(int eCond);
    160     static void async_sched_yield(void);
    161 
    162   The functionality required of each of the above functions is described
    163   in comments in sqlite3async.c.
    164 
    165