1 .. highlightlang:: c 2 3 4 .. _memory: 5 6 ***************** 7 Memory Management 8 ***************** 9 10 .. sectionauthor:: Vladimir Marangozov <Vladimir.Marangozov (a] inrialpes.fr> 11 12 13 14 .. _memoryoverview: 15 16 Overview 17 ======== 18 19 Memory management in Python involves a private heap containing all Python 20 objects and data structures. The management of this private heap is ensured 21 internally by the *Python memory manager*. The Python memory manager has 22 different components which deal with various dynamic storage management aspects, 23 like sharing, segmentation, preallocation or caching. 24 25 At the lowest level, a raw memory allocator ensures that there is enough room in 26 the private heap for storing all Python-related data by interacting with the 27 memory manager of the operating system. On top of the raw memory allocator, 28 several object-specific allocators operate on the same heap and implement 29 distinct memory management policies adapted to the peculiarities of every object 30 type. For example, integer objects are managed differently within the heap than 31 strings, tuples or dictionaries because integers imply different storage 32 requirements and speed/space tradeoffs. The Python memory manager thus delegates 33 some of the work to the object-specific allocators, but ensures that the latter 34 operate within the bounds of the private heap. 35 36 It is important to understand that the management of the Python heap is 37 performed by the interpreter itself and that the user has no control over it, 38 even if she regularly manipulates object pointers to memory blocks inside that 39 heap. The allocation of heap space for Python objects and other internal 40 buffers is performed on demand by the Python memory manager through the Python/C 41 API functions listed in this document. 42 43 .. index:: 44 single: malloc() 45 single: calloc() 46 single: realloc() 47 single: free() 48 49 To avoid memory corruption, extension writers should never try to operate on 50 Python objects with the functions exported by the C library: :c:func:`malloc`, 51 :c:func:`calloc`, :c:func:`realloc` and :c:func:`free`. This will result in mixed 52 calls between the C allocator and the Python memory manager with fatal 53 consequences, because they implement different algorithms and operate on 54 different heaps. However, one may safely allocate and release memory blocks 55 with the C library allocator for individual purposes, as shown in the following 56 example:: 57 58 PyObject *res; 59 char *buf = (char *) malloc(BUFSIZ); /* for I/O */ 60 61 if (buf == NULL) 62 return PyErr_NoMemory(); 63 ...Do some I/O operation involving buf... 64 res = PyString_FromString(buf); 65 free(buf); /* malloc'ed */ 66 return res; 67 68 In this example, the memory request for the I/O buffer is handled by the C 69 library allocator. The Python memory manager is involved only in the allocation 70 of the string object returned as a result. 71 72 In most situations, however, it is recommended to allocate memory from the 73 Python heap specifically because the latter is under control of the Python 74 memory manager. For example, this is required when the interpreter is extended 75 with new object types written in C. Another reason for using the Python heap is 76 the desire to *inform* the Python memory manager about the memory needs of the 77 extension module. Even when the requested memory is used exclusively for 78 internal, highly-specific purposes, delegating all memory requests to the Python 79 memory manager causes the interpreter to have a more accurate image of its 80 memory footprint as a whole. Consequently, under certain circumstances, the 81 Python memory manager may or may not trigger appropriate actions, like garbage 82 collection, memory compaction or other preventive procedures. Note that by using 83 the C library allocator as shown in the previous example, the allocated memory 84 for the I/O buffer escapes completely the Python memory manager. 85 86 87 .. _memoryinterface: 88 89 Memory Interface 90 ================ 91 92 The following function sets, modeled after the ANSI C standard, but specifying 93 behavior when requesting zero bytes, are available for allocating and releasing 94 memory from the Python heap: 95 96 97 .. c:function:: void* PyMem_Malloc(size_t n) 98 99 Allocates *n* bytes and returns a pointer of type :c:type:`void\*` to the 100 allocated memory, or *NULL* if the request fails. Requesting zero bytes returns 101 a distinct non-*NULL* pointer if possible, as if ``PyMem_Malloc(1)`` had 102 been called instead. The memory will not have been initialized in any way. 103 104 105 .. c:function:: void* PyMem_Realloc(void *p, size_t n) 106 107 Resizes the memory block pointed to by *p* to *n* bytes. The contents will be 108 unchanged to the minimum of the old and the new sizes. If *p* is *NULL*, the 109 call is equivalent to ``PyMem_Malloc(n)``; else if *n* is equal to zero, 110 the memory block is resized but is not freed, and the returned pointer is 111 non-*NULL*. Unless *p* is *NULL*, it must have been returned by a previous call 112 to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. If the request fails, 113 :c:func:`PyMem_Realloc` returns *NULL* and *p* remains a valid pointer to the 114 previous memory area. 115 116 117 .. c:function:: void PyMem_Free(void *p) 118 119 Frees the memory block pointed to by *p*, which must have been returned by a 120 previous call to :c:func:`PyMem_Malloc` or :c:func:`PyMem_Realloc`. Otherwise, or 121 if ``PyMem_Free(p)`` has been called before, undefined behavior occurs. If 122 *p* is *NULL*, no operation is performed. 123 124 The following type-oriented macros are provided for convenience. Note that 125 *TYPE* refers to any C type. 126 127 128 .. c:function:: TYPE* PyMem_New(TYPE, size_t n) 129 130 Same as :c:func:`PyMem_Malloc`, but allocates ``(n * sizeof(TYPE))`` bytes of 131 memory. Returns a pointer cast to :c:type:`TYPE\*`. The memory will not have 132 been initialized in any way. 133 134 135 .. c:function:: TYPE* PyMem_Resize(void *p, TYPE, size_t n) 136 137 Same as :c:func:`PyMem_Realloc`, but the memory block is resized to ``(n * 138 sizeof(TYPE))`` bytes. Returns a pointer cast to :c:type:`TYPE\*`. On return, 139 *p* will be a pointer to the new memory area, or *NULL* in the event of 140 failure. This is a C preprocessor macro; p is always reassigned. Save 141 the original value of p to avoid losing memory when handling errors. 142 143 144 .. c:function:: void PyMem_Del(void *p) 145 146 Same as :c:func:`PyMem_Free`. 147 148 In addition, the following macro sets are provided for calling the Python memory 149 allocator directly, without involving the C API functions listed above. However, 150 note that their use does not preserve binary compatibility across Python 151 versions and is therefore deprecated in extension modules. 152 153 :c:func:`PyMem_MALLOC`, :c:func:`PyMem_REALLOC`, :c:func:`PyMem_FREE`. 154 155 :c:func:`PyMem_NEW`, :c:func:`PyMem_RESIZE`, :c:func:`PyMem_DEL`. 156 157 158 .. _memoryexamples: 159 160 Examples 161 ======== 162 163 Here is the example from section :ref:`memoryoverview`, rewritten so that the 164 I/O buffer is allocated from the Python heap by using the first function set:: 165 166 PyObject *res; 167 char *buf = (char *) PyMem_Malloc(BUFSIZ); /* for I/O */ 168 169 if (buf == NULL) 170 return PyErr_NoMemory(); 171 /* ...Do some I/O operation involving buf... */ 172 res = PyString_FromString(buf); 173 PyMem_Free(buf); /* allocated with PyMem_Malloc */ 174 return res; 175 176 The same code using the type-oriented function set:: 177 178 PyObject *res; 179 char *buf = PyMem_New(char, BUFSIZ); /* for I/O */ 180 181 if (buf == NULL) 182 return PyErr_NoMemory(); 183 /* ...Do some I/O operation involving buf... */ 184 res = PyString_FromString(buf); 185 PyMem_Del(buf); /* allocated with PyMem_New */ 186 return res; 187 188 Note that in the two examples above, the buffer is always manipulated via 189 functions belonging to the same set. Indeed, it is required to use the same 190 memory API family for a given memory block, so that the risk of mixing different 191 allocators is reduced to a minimum. The following code sequence contains two 192 errors, one of which is labeled as *fatal* because it mixes two different 193 allocators operating on different heaps. :: 194 195 char *buf1 = PyMem_New(char, BUFSIZ); 196 char *buf2 = (char *) malloc(BUFSIZ); 197 char *buf3 = (char *) PyMem_Malloc(BUFSIZ); 198 ... 199 PyMem_Del(buf3); /* Wrong -- should be PyMem_Free() */ 200 free(buf2); /* Right -- allocated via malloc() */ 201 free(buf1); /* Fatal -- should be PyMem_Del() */ 202 203 In addition to the functions aimed at handling raw memory blocks from the Python 204 heap, objects in Python are allocated and released with :c:func:`PyObject_New`, 205 :c:func:`PyObject_NewVar` and :c:func:`PyObject_Del`. 206 207 These will be explained in the next chapter on defining and implementing new 208 object types in C. 209 210