1 .. _tut-classes: 2 3 ******* 4 Classes 5 ******* 6 7 Compared with other programming languages, Python's class mechanism adds classes 8 with a minimum of new syntax and semantics. It is a mixture of the class 9 mechanisms found in C++ and Modula-3. Python classes provide all the standard 10 features of Object Oriented Programming: the class inheritance mechanism allows 11 multiple base classes, a derived class can override any methods of its base 12 class or classes, and a method can call the method of a base class with the same 13 name. Objects can contain arbitrary amounts and kinds of data. As is true for 14 modules, classes partake of the dynamic nature of Python: they are created at 15 runtime, and can be modified further after creation. 16 17 In C++ terminology, normally class members (including the data members) are 18 *public* (except see below :ref:`tut-private`), and all member functions are 19 *virtual*. As in Modula-3, there are no shorthands for referencing the object's 20 members from its methods: the method function is declared with an explicit first 21 argument representing the object, which is provided implicitly by the call. As 22 in Smalltalk, classes themselves are objects. This provides semantics for 23 importing and renaming. Unlike C++ and Modula-3, built-in types can be used as 24 base classes for extension by the user. Also, like in C++, most built-in 25 operators with special syntax (arithmetic operators, subscripting etc.) can be 26 redefined for class instances. 27 28 (Lacking universally accepted terminology to talk about classes, I will make 29 occasional use of Smalltalk and C++ terms. I would use Modula-3 terms, since 30 its object-oriented semantics are closer to those of Python than C++, but I 31 expect that few readers have heard of it.) 32 33 34 .. _tut-object: 35 36 A Word About Names and Objects 37 ============================== 38 39 Objects have individuality, and multiple names (in multiple scopes) can be bound 40 to the same object. This is known as aliasing in other languages. This is 41 usually not appreciated on a first glance at Python, and can be safely ignored 42 when dealing with immutable basic types (numbers, strings, tuples). However, 43 aliasing has a possibly surprising effect on the semantics of Python code 44 involving mutable objects such as lists, dictionaries, and most other types. 45 This is usually used to the benefit of the program, since aliases behave like 46 pointers in some respects. For example, passing an object is cheap since only a 47 pointer is passed by the implementation; and if a function modifies an object 48 passed as an argument, the caller will see the change --- this eliminates the 49 need for two different argument passing mechanisms as in Pascal. 50 51 52 .. _tut-scopes: 53 54 Python Scopes and Namespaces 55 ============================ 56 57 Before introducing classes, I first have to tell you something about Python's 58 scope rules. Class definitions play some neat tricks with namespaces, and you 59 need to know how scopes and namespaces work to fully understand what's going on. 60 Incidentally, knowledge about this subject is useful for any advanced Python 61 programmer. 62 63 Let's begin with some definitions. 64 65 A *namespace* is a mapping from names to objects. Most namespaces are currently 66 implemented as Python dictionaries, but that's normally not noticeable in any 67 way (except for performance), and it may change in the future. Examples of 68 namespaces are: the set of built-in names (containing functions such as :func:`abs`, and 69 built-in exception names); the global names in a module; and the local names in 70 a function invocation. In a sense the set of attributes of an object also form 71 a namespace. The important thing to know about namespaces is that there is 72 absolutely no relation between names in different namespaces; for instance, two 73 different modules may both define a function ``maximize`` without confusion --- 74 users of the modules must prefix it with the module name. 75 76 By the way, I use the word *attribute* for any name following a dot --- for 77 example, in the expression ``z.real``, ``real`` is an attribute of the object 78 ``z``. Strictly speaking, references to names in modules are attribute 79 references: in the expression ``modname.funcname``, ``modname`` is a module 80 object and ``funcname`` is an attribute of it. In this case there happens to be 81 a straightforward mapping between the module's attributes and the global names 82 defined in the module: they share the same namespace! [#]_ 83 84 Attributes may be read-only or writable. In the latter case, assignment to 85 attributes is possible. Module attributes are writable: you can write 86 ``modname.the_answer = 42``. Writable attributes may also be deleted with the 87 :keyword:`del` statement. For example, ``del modname.the_answer`` will remove 88 the attribute :attr:`the_answer` from the object named by ``modname``. 89 90 Namespaces are created at different moments and have different lifetimes. The 91 namespace containing the built-in names is created when the Python interpreter 92 starts up, and is never deleted. The global namespace for a module is created 93 when the module definition is read in; normally, module namespaces also last 94 until the interpreter quits. The statements executed by the top-level 95 invocation of the interpreter, either read from a script file or interactively, 96 are considered part of a module called :mod:`__main__`, so they have their own 97 global namespace. (The built-in names actually also live in a module; this is 98 called :mod:`builtins`.) 99 100 The local namespace for a function is created when the function is called, and 101 deleted when the function returns or raises an exception that is not handled 102 within the function. (Actually, forgetting would be a better way to describe 103 what actually happens.) Of course, recursive invocations each have their own 104 local namespace. 105 106 A *scope* is a textual region of a Python program where a namespace is directly 107 accessible. "Directly accessible" here means that an unqualified reference to a 108 name attempts to find the name in the namespace. 109 110 Although scopes are determined statically, they are used dynamically. At any 111 time during execution, there are at least three nested scopes whose namespaces 112 are directly accessible: 113 114 * the innermost scope, which is searched first, contains the local names 115 * the scopes of any enclosing functions, which are searched starting with the 116 nearest enclosing scope, contains non-local, but also non-global names 117 * the next-to-last scope contains the current module's global names 118 * the outermost scope (searched last) is the namespace containing built-in names 119 120 If a name is declared global, then all references and assignments go directly to 121 the middle scope containing the module's global names. To rebind variables 122 found outside of the innermost scope, the :keyword:`nonlocal` statement can be 123 used; if not declared nonlocal, those variables are read-only (an attempt to 124 write to such a variable will simply create a *new* local variable in the 125 innermost scope, leaving the identically named outer variable unchanged). 126 127 Usually, the local scope references the local names of the (textually) current 128 function. Outside functions, the local scope references the same namespace as 129 the global scope: the module's namespace. Class definitions place yet another 130 namespace in the local scope. 131 132 It is important to realize that scopes are determined textually: the global 133 scope of a function defined in a module is that module's namespace, no matter 134 from where or by what alias the function is called. On the other hand, the 135 actual search for names is done dynamically, at run time --- however, the 136 language definition is evolving towards static name resolution, at "compile" 137 time, so don't rely on dynamic name resolution! (In fact, local variables are 138 already determined statically.) 139 140 A special quirk of Python is that -- if no :keyword:`global` statement is in 141 effect -- assignments to names always go into the innermost scope. Assignments 142 do not copy data --- they just bind names to objects. The same is true for 143 deletions: the statement ``del x`` removes the binding of ``x`` from the 144 namespace referenced by the local scope. In fact, all operations that introduce 145 new names use the local scope: in particular, :keyword:`import` statements and 146 function definitions bind the module or function name in the local scope. 147 148 The :keyword:`global` statement can be used to indicate that particular 149 variables live in the global scope and should be rebound there; the 150 :keyword:`nonlocal` statement indicates that particular variables live in 151 an enclosing scope and should be rebound there. 152 153 .. _tut-scopeexample: 154 155 Scopes and Namespaces Example 156 ----------------------------- 157 158 This is an example demonstrating how to reference the different scopes and 159 namespaces, and how :keyword:`global` and :keyword:`nonlocal` affect variable 160 binding:: 161 162 def scope_test(): 163 def do_local(): 164 spam = "local spam" 165 166 def do_nonlocal(): 167 nonlocal spam 168 spam = "nonlocal spam" 169 170 def do_global(): 171 global spam 172 spam = "global spam" 173 174 spam = "test spam" 175 do_local() 176 print("After local assignment:", spam) 177 do_nonlocal() 178 print("After nonlocal assignment:", spam) 179 do_global() 180 print("After global assignment:", spam) 181 182 scope_test() 183 print("In global scope:", spam) 184 185 The output of the example code is: 186 187 .. code-block:: none 188 189 After local assignment: test spam 190 After nonlocal assignment: nonlocal spam 191 After global assignment: nonlocal spam 192 In global scope: global spam 193 194 Note how the *local* assignment (which is default) didn't change *scope_test*\'s 195 binding of *spam*. The :keyword:`nonlocal` assignment changed *scope_test*\'s 196 binding of *spam*, and the :keyword:`global` assignment changed the module-level 197 binding. 198 199 You can also see that there was no previous binding for *spam* before the 200 :keyword:`global` assignment. 201 202 203 .. _tut-firstclasses: 204 205 A First Look at Classes 206 ======================= 207 208 Classes introduce a little bit of new syntax, three new object types, and some 209 new semantics. 210 211 212 .. _tut-classdefinition: 213 214 Class Definition Syntax 215 ----------------------- 216 217 The simplest form of class definition looks like this:: 218 219 class ClassName: 220 <statement-1> 221 . 222 . 223 . 224 <statement-N> 225 226 Class definitions, like function definitions (:keyword:`def` statements) must be 227 executed before they have any effect. (You could conceivably place a class 228 definition in a branch of an :keyword:`if` statement, or inside a function.) 229 230 In practice, the statements inside a class definition will usually be function 231 definitions, but other statements are allowed, and sometimes useful --- we'll 232 come back to this later. The function definitions inside a class normally have 233 a peculiar form of argument list, dictated by the calling conventions for 234 methods --- again, this is explained later. 235 236 When a class definition is entered, a new namespace is created, and used as the 237 local scope --- thus, all assignments to local variables go into this new 238 namespace. In particular, function definitions bind the name of the new 239 function here. 240 241 When a class definition is left normally (via the end), a *class object* is 242 created. This is basically a wrapper around the contents of the namespace 243 created by the class definition; we'll learn more about class objects in the 244 next section. The original local scope (the one in effect just before the class 245 definition was entered) is reinstated, and the class object is bound here to the 246 class name given in the class definition header (:class:`ClassName` in the 247 example). 248 249 250 .. _tut-classobjects: 251 252 Class Objects 253 ------------- 254 255 Class objects support two kinds of operations: attribute references and 256 instantiation. 257 258 *Attribute references* use the standard syntax used for all attribute references 259 in Python: ``obj.name``. Valid attribute names are all the names that were in 260 the class's namespace when the class object was created. So, if the class 261 definition looked like this:: 262 263 class MyClass: 264 """A simple example class""" 265 i = 12345 266 267 def f(self): 268 return 'hello world' 269 270 then ``MyClass.i`` and ``MyClass.f`` are valid attribute references, returning 271 an integer and a function object, respectively. Class attributes can also be 272 assigned to, so you can change the value of ``MyClass.i`` by assignment. 273 :attr:`__doc__` is also a valid attribute, returning the docstring belonging to 274 the class: ``"A simple example class"``. 275 276 Class *instantiation* uses function notation. Just pretend that the class 277 object is a parameterless function that returns a new instance of the class. 278 For example (assuming the above class):: 279 280 x = MyClass() 281 282 creates a new *instance* of the class and assigns this object to the local 283 variable ``x``. 284 285 The instantiation operation ("calling" a class object) creates an empty object. 286 Many classes like to create objects with instances customized to a specific 287 initial state. Therefore a class may define a special method named 288 :meth:`__init__`, like this:: 289 290 def __init__(self): 291 self.data = [] 292 293 When a class defines an :meth:`__init__` method, class instantiation 294 automatically invokes :meth:`__init__` for the newly-created class instance. So 295 in this example, a new, initialized instance can be obtained by:: 296 297 x = MyClass() 298 299 Of course, the :meth:`__init__` method may have arguments for greater 300 flexibility. In that case, arguments given to the class instantiation operator 301 are passed on to :meth:`__init__`. For example, :: 302 303 >>> class Complex: 304 ... def __init__(self, realpart, imagpart): 305 ... self.r = realpart 306 ... self.i = imagpart 307 ... 308 >>> x = Complex(3.0, -4.5) 309 >>> x.r, x.i 310 (3.0, -4.5) 311 312 313 .. _tut-instanceobjects: 314 315 Instance Objects 316 ---------------- 317 318 Now what can we do with instance objects? The only operations understood by 319 instance objects are attribute references. There are two kinds of valid 320 attribute names, data attributes and methods. 321 322 *data attributes* correspond to "instance variables" in Smalltalk, and to "data 323 members" in C++. Data attributes need not be declared; like local variables, 324 they spring into existence when they are first assigned to. For example, if 325 ``x`` is the instance of :class:`MyClass` created above, the following piece of 326 code will print the value ``16``, without leaving a trace:: 327 328 x.counter = 1 329 while x.counter < 10: 330 x.counter = x.counter * 2 331 print(x.counter) 332 del x.counter 333 334 The other kind of instance attribute reference is a *method*. A method is a 335 function that "belongs to" an object. (In Python, the term method is not unique 336 to class instances: other object types can have methods as well. For example, 337 list objects have methods called append, insert, remove, sort, and so on. 338 However, in the following discussion, we'll use the term method exclusively to 339 mean methods of class instance objects, unless explicitly stated otherwise.) 340 341 .. index:: object: method 342 343 Valid method names of an instance object depend on its class. By definition, 344 all attributes of a class that are function objects define corresponding 345 methods of its instances. So in our example, ``x.f`` is a valid method 346 reference, since ``MyClass.f`` is a function, but ``x.i`` is not, since 347 ``MyClass.i`` is not. But ``x.f`` is not the same thing as ``MyClass.f`` --- it 348 is a *method object*, not a function object. 349 350 351 .. _tut-methodobjects: 352 353 Method Objects 354 -------------- 355 356 Usually, a method is called right after it is bound:: 357 358 x.f() 359 360 In the :class:`MyClass` example, this will return the string ``'hello world'``. 361 However, it is not necessary to call a method right away: ``x.f`` is a method 362 object, and can be stored away and called at a later time. For example:: 363 364 xf = x.f 365 while True: 366 print(xf()) 367 368 will continue to print ``hello world`` until the end of time. 369 370 What exactly happens when a method is called? You may have noticed that 371 ``x.f()`` was called without an argument above, even though the function 372 definition for :meth:`f` specified an argument. What happened to the argument? 373 Surely Python raises an exception when a function that requires an argument is 374 called without any --- even if the argument isn't actually used... 375 376 Actually, you may have guessed the answer: the special thing about methods is 377 that the instance object is passed as the first argument of the function. In our 378 example, the call ``x.f()`` is exactly equivalent to ``MyClass.f(x)``. In 379 general, calling a method with a list of *n* arguments is equivalent to calling 380 the corresponding function with an argument list that is created by inserting 381 the method's instance object before the first argument. 382 383 If you still don't understand how methods work, a look at the implementation can 384 perhaps clarify matters. When an instance attribute is referenced that isn't a 385 data attribute, its class is searched. If the name denotes a valid class 386 attribute that is a function object, a method object is created by packing 387 (pointers to) the instance object and the function object just found together in 388 an abstract object: this is the method object. When the method object is called 389 with an argument list, a new argument list is constructed from the instance 390 object and the argument list, and the function object is called with this new 391 argument list. 392 393 394 .. _tut-class-and-instance-variables: 395 396 Class and Instance Variables 397 ---------------------------- 398 399 Generally speaking, instance variables are for data unique to each instance 400 and class variables are for attributes and methods shared by all instances 401 of the class:: 402 403 class Dog: 404 405 kind = 'canine' # class variable shared by all instances 406 407 def __init__(self, name): 408 self.name = name # instance variable unique to each instance 409 410 >>> d = Dog('Fido') 411 >>> e = Dog('Buddy') 412 >>> d.kind # shared by all dogs 413 'canine' 414 >>> e.kind # shared by all dogs 415 'canine' 416 >>> d.name # unique to d 417 'Fido' 418 >>> e.name # unique to e 419 'Buddy' 420 421 As discussed in :ref:`tut-object`, shared data can have possibly surprising 422 effects with involving :term:`mutable` objects such as lists and dictionaries. 423 For example, the *tricks* list in the following code should not be used as a 424 class variable because just a single list would be shared by all *Dog* 425 instances:: 426 427 class Dog: 428 429 tricks = [] # mistaken use of a class variable 430 431 def __init__(self, name): 432 self.name = name 433 434 def add_trick(self, trick): 435 self.tricks.append(trick) 436 437 >>> d = Dog('Fido') 438 >>> e = Dog('Buddy') 439 >>> d.add_trick('roll over') 440 >>> e.add_trick('play dead') 441 >>> d.tricks # unexpectedly shared by all dogs 442 ['roll over', 'play dead'] 443 444 Correct design of the class should use an instance variable instead:: 445 446 class Dog: 447 448 def __init__(self, name): 449 self.name = name 450 self.tricks = [] # creates a new empty list for each dog 451 452 def add_trick(self, trick): 453 self.tricks.append(trick) 454 455 >>> d = Dog('Fido') 456 >>> e = Dog('Buddy') 457 >>> d.add_trick('roll over') 458 >>> e.add_trick('play dead') 459 >>> d.tricks 460 ['roll over'] 461 >>> e.tricks 462 ['play dead'] 463 464 465 .. _tut-remarks: 466 467 Random Remarks 468 ============== 469 470 .. These should perhaps be placed more carefully... 471 472 Data attributes override method attributes with the same name; to avoid 473 accidental name conflicts, which may cause hard-to-find bugs in large programs, 474 it is wise to use some kind of convention that minimizes the chance of 475 conflicts. Possible conventions include capitalizing method names, prefixing 476 data attribute names with a small unique string (perhaps just an underscore), or 477 using verbs for methods and nouns for data attributes. 478 479 Data attributes may be referenced by methods as well as by ordinary users 480 ("clients") of an object. In other words, classes are not usable to implement 481 pure abstract data types. In fact, nothing in Python makes it possible to 482 enforce data hiding --- it is all based upon convention. (On the other hand, 483 the Python implementation, written in C, can completely hide implementation 484 details and control access to an object if necessary; this can be used by 485 extensions to Python written in C.) 486 487 Clients should use data attributes with care --- clients may mess up invariants 488 maintained by the methods by stamping on their data attributes. Note that 489 clients may add data attributes of their own to an instance object without 490 affecting the validity of the methods, as long as name conflicts are avoided --- 491 again, a naming convention can save a lot of headaches here. 492 493 There is no shorthand for referencing data attributes (or other methods!) from 494 within methods. I find that this actually increases the readability of methods: 495 there is no chance of confusing local variables and instance variables when 496 glancing through a method. 497 498 Often, the first argument of a method is called ``self``. This is nothing more 499 than a convention: the name ``self`` has absolutely no special meaning to 500 Python. Note, however, that by not following the convention your code may be 501 less readable to other Python programmers, and it is also conceivable that a 502 *class browser* program might be written that relies upon such a convention. 503 504 Any function object that is a class attribute defines a method for instances of 505 that class. It is not necessary that the function definition is textually 506 enclosed in the class definition: assigning a function object to a local 507 variable in the class is also ok. For example:: 508 509 # Function defined outside the class 510 def f1(self, x, y): 511 return min(x, x+y) 512 513 class C: 514 f = f1 515 516 def g(self): 517 return 'hello world' 518 519 h = g 520 521 Now ``f``, ``g`` and ``h`` are all attributes of class :class:`C` that refer to 522 function objects, and consequently they are all methods of instances of 523 :class:`C` --- ``h`` being exactly equivalent to ``g``. Note that this practice 524 usually only serves to confuse the reader of a program. 525 526 Methods may call other methods by using method attributes of the ``self`` 527 argument:: 528 529 class Bag: 530 def __init__(self): 531 self.data = [] 532 533 def add(self, x): 534 self.data.append(x) 535 536 def addtwice(self, x): 537 self.add(x) 538 self.add(x) 539 540 Methods may reference global names in the same way as ordinary functions. The 541 global scope associated with a method is the module containing its 542 definition. (A class is never used as a global scope.) While one 543 rarely encounters a good reason for using global data in a method, there are 544 many legitimate uses of the global scope: for one thing, functions and modules 545 imported into the global scope can be used by methods, as well as functions and 546 classes defined in it. Usually, the class containing the method is itself 547 defined in this global scope, and in the next section we'll find some good 548 reasons why a method would want to reference its own class. 549 550 Each value is an object, and therefore has a *class* (also called its *type*). 551 It is stored as ``object.__class__``. 552 553 554 .. _tut-inheritance: 555 556 Inheritance 557 =========== 558 559 Of course, a language feature would not be worthy of the name "class" without 560 supporting inheritance. The syntax for a derived class definition looks like 561 this:: 562 563 class DerivedClassName(BaseClassName): 564 <statement-1> 565 . 566 . 567 . 568 <statement-N> 569 570 The name :class:`BaseClassName` must be defined in a scope containing the 571 derived class definition. In place of a base class name, other arbitrary 572 expressions are also allowed. This can be useful, for example, when the base 573 class is defined in another module:: 574 575 class DerivedClassName(modname.BaseClassName): 576 577 Execution of a derived class definition proceeds the same as for a base class. 578 When the class object is constructed, the base class is remembered. This is 579 used for resolving attribute references: if a requested attribute is not found 580 in the class, the search proceeds to look in the base class. This rule is 581 applied recursively if the base class itself is derived from some other class. 582 583 There's nothing special about instantiation of derived classes: 584 ``DerivedClassName()`` creates a new instance of the class. Method references 585 are resolved as follows: the corresponding class attribute is searched, 586 descending down the chain of base classes if necessary, and the method reference 587 is valid if this yields a function object. 588 589 Derived classes may override methods of their base classes. Because methods 590 have no special privileges when calling other methods of the same object, a 591 method of a base class that calls another method defined in the same base class 592 may end up calling a method of a derived class that overrides it. (For C++ 593 programmers: all methods in Python are effectively ``virtual``.) 594 595 An overriding method in a derived class may in fact want to extend rather than 596 simply replace the base class method of the same name. There is a simple way to 597 call the base class method directly: just call ``BaseClassName.methodname(self, 598 arguments)``. This is occasionally useful to clients as well. (Note that this 599 only works if the base class is accessible as ``BaseClassName`` in the global 600 scope.) 601 602 Python has two built-in functions that work with inheritance: 603 604 * Use :func:`isinstance` to check an instance's type: ``isinstance(obj, int)`` 605 will be ``True`` only if ``obj.__class__`` is :class:`int` or some class 606 derived from :class:`int`. 607 608 * Use :func:`issubclass` to check class inheritance: ``issubclass(bool, int)`` 609 is ``True`` since :class:`bool` is a subclass of :class:`int`. However, 610 ``issubclass(float, int)`` is ``False`` since :class:`float` is not a 611 subclass of :class:`int`. 612 613 614 615 .. _tut-multiple: 616 617 Multiple Inheritance 618 -------------------- 619 620 Python supports a form of multiple inheritance as well. A class definition with 621 multiple base classes looks like this:: 622 623 class DerivedClassName(Base1, Base2, Base3): 624 <statement-1> 625 . 626 . 627 . 628 <statement-N> 629 630 For most purposes, in the simplest cases, you can think of the search for 631 attributes inherited from a parent class as depth-first, left-to-right, not 632 searching twice in the same class where there is an overlap in the hierarchy. 633 Thus, if an attribute is not found in :class:`DerivedClassName`, it is searched 634 for in :class:`Base1`, then (recursively) in the base classes of :class:`Base1`, 635 and if it was not found there, it was searched for in :class:`Base2`, and so on. 636 637 In fact, it is slightly more complex than that; the method resolution order 638 changes dynamically to support cooperative calls to :func:`super`. This 639 approach is known in some other multiple-inheritance languages as 640 call-next-method and is more powerful than the super call found in 641 single-inheritance languages. 642 643 Dynamic ordering is necessary because all cases of multiple inheritance exhibit 644 one or more diamond relationships (where at least one of the parent classes 645 can be accessed through multiple paths from the bottommost class). For example, 646 all classes inherit from :class:`object`, so any case of multiple inheritance 647 provides more than one path to reach :class:`object`. To keep the base classes 648 from being accessed more than once, the dynamic algorithm linearizes the search 649 order in a way that preserves the left-to-right ordering specified in each 650 class, that calls each parent only once, and that is monotonic (meaning that a 651 class can be subclassed without affecting the precedence order of its parents). 652 Taken together, these properties make it possible to design reliable and 653 extensible classes with multiple inheritance. For more detail, see 654 https://www.python.org/download/releases/2.3/mro/. 655 656 657 .. _tut-private: 658 659 Private Variables 660 ================= 661 662 "Private" instance variables that cannot be accessed except from inside an 663 object don't exist in Python. However, there is a convention that is followed 664 by most Python code: a name prefixed with an underscore (e.g. ``_spam``) should 665 be treated as a non-public part of the API (whether it is a function, a method 666 or a data member). It should be considered an implementation detail and subject 667 to change without notice. 668 669 Since there is a valid use-case for class-private members (namely to avoid name 670 clashes of names with names defined by subclasses), there is limited support for 671 such a mechanism, called :dfn:`name mangling`. Any identifier of the form 672 ``__spam`` (at least two leading underscores, at most one trailing underscore) 673 is textually replaced with ``_classname__spam``, where ``classname`` is the 674 current class name with leading underscore(s) stripped. This mangling is done 675 without regard to the syntactic position of the identifier, as long as it 676 occurs within the definition of a class. 677 678 Name mangling is helpful for letting subclasses override methods without 679 breaking intraclass method calls. For example:: 680 681 class Mapping: 682 def __init__(self, iterable): 683 self.items_list = [] 684 self.__update(iterable) 685 686 def update(self, iterable): 687 for item in iterable: 688 self.items_list.append(item) 689 690 __update = update # private copy of original update() method 691 692 class MappingSubclass(Mapping): 693 694 def update(self, keys, values): 695 # provides new signature for update() 696 # but does not break __init__() 697 for item in zip(keys, values): 698 self.items_list.append(item) 699 700 Note that the mangling rules are designed mostly to avoid accidents; it still is 701 possible to access or modify a variable that is considered private. This can 702 even be useful in special circumstances, such as in the debugger. 703 704 Notice that code passed to ``exec()`` or ``eval()`` does not consider the 705 classname of the invoking class to be the current class; this is similar to the 706 effect of the ``global`` statement, the effect of which is likewise restricted 707 to code that is byte-compiled together. The same restriction applies to 708 ``getattr()``, ``setattr()`` and ``delattr()``, as well as when referencing 709 ``__dict__`` directly. 710 711 712 .. _tut-odds: 713 714 Odds and Ends 715 ============= 716 717 Sometimes it is useful to have a data type similar to the Pascal "record" or C 718 "struct", bundling together a few named data items. An empty class definition 719 will do nicely:: 720 721 class Employee: 722 pass 723 724 john = Employee() # Create an empty employee record 725 726 # Fill the fields of the record 727 john.name = 'John Doe' 728 john.dept = 'computer lab' 729 john.salary = 1000 730 731 A piece of Python code that expects a particular abstract data type can often be 732 passed a class that emulates the methods of that data type instead. For 733 instance, if you have a function that formats some data from a file object, you 734 can define a class with methods :meth:`read` and :meth:`!readline` that get the 735 data from a string buffer instead, and pass it as an argument. 736 737 .. (Unfortunately, this technique has its limitations: a class can't define 738 operations that are accessed by special syntax such as sequence subscripting 739 or arithmetic operators, and assigning such a "pseudo-file" to sys.stdin will 740 not cause the interpreter to read further input from it.) 741 742 Instance method objects have attributes, too: ``m.__self__`` is the instance 743 object with the method :meth:`m`, and ``m.__func__`` is the function object 744 corresponding to the method. 745 746 747 .. _tut-iterators: 748 749 Iterators 750 ========= 751 752 By now you have probably noticed that most container objects can be looped over 753 using a :keyword:`for` statement:: 754 755 for element in [1, 2, 3]: 756 print(element) 757 for element in (1, 2, 3): 758 print(element) 759 for key in {'one':1, 'two':2}: 760 print(key) 761 for char in "123": 762 print(char) 763 for line in open("myfile.txt"): 764 print(line, end='') 765 766 This style of access is clear, concise, and convenient. The use of iterators 767 pervades and unifies Python. Behind the scenes, the :keyword:`for` statement 768 calls :func:`iter` on the container object. The function returns an iterator 769 object that defines the method :meth:`~iterator.__next__` which accesses 770 elements in the container one at a time. When there are no more elements, 771 :meth:`~iterator.__next__` raises a :exc:`StopIteration` exception which tells the 772 :keyword:`for` loop to terminate. You can call the :meth:`~iterator.__next__` method 773 using the :func:`next` built-in function; this example shows how it all works:: 774 775 >>> s = 'abc' 776 >>> it = iter(s) 777 >>> it 778 <iterator object at 0x00A1DB50> 779 >>> next(it) 780 'a' 781 >>> next(it) 782 'b' 783 >>> next(it) 784 'c' 785 >>> next(it) 786 Traceback (most recent call last): 787 File "<stdin>", line 1, in ? 788 next(it) 789 StopIteration 790 791 Having seen the mechanics behind the iterator protocol, it is easy to add 792 iterator behavior to your classes. Define an :meth:`__iter__` method which 793 returns an object with a :meth:`~iterator.__next__` method. If the class 794 defines :meth:`__next__`, then :meth:`__iter__` can just return ``self``:: 795 796 class Reverse: 797 """Iterator for looping over a sequence backwards.""" 798 def __init__(self, data): 799 self.data = data 800 self.index = len(data) 801 802 def __iter__(self): 803 return self 804 805 def __next__(self): 806 if self.index == 0: 807 raise StopIteration 808 self.index = self.index - 1 809 return self.data[self.index] 810 811 :: 812 813 >>> rev = Reverse('spam') 814 >>> iter(rev) 815 <__main__.Reverse object at 0x00A1DB50> 816 >>> for char in rev: 817 ... print(char) 818 ... 819 m 820 a 821 p 822 s 823 824 825 .. _tut-generators: 826 827 Generators 828 ========== 829 830 :term:`Generator`\s are a simple and powerful tool for creating iterators. They 831 are written like regular functions but use the :keyword:`yield` statement 832 whenever they want to return data. Each time :func:`next` is called on it, the 833 generator resumes where it left off (it remembers all the data values and which 834 statement was last executed). An example shows that generators can be trivially 835 easy to create:: 836 837 def reverse(data): 838 for index in range(len(data)-1, -1, -1): 839 yield data[index] 840 841 :: 842 843 >>> for char in reverse('golf'): 844 ... print(char) 845 ... 846 f 847 l 848 o 849 g 850 851 Anything that can be done with generators can also be done with class-based 852 iterators as described in the previous section. What makes generators so 853 compact is that the :meth:`__iter__` and :meth:`~generator.__next__` methods 854 are created automatically. 855 856 Another key feature is that the local variables and execution state are 857 automatically saved between calls. This made the function easier to write and 858 much more clear than an approach using instance variables like ``self.index`` 859 and ``self.data``. 860 861 In addition to automatic method creation and saving program state, when 862 generators terminate, they automatically raise :exc:`StopIteration`. In 863 combination, these features make it easy to create iterators with no more effort 864 than writing a regular function. 865 866 867 .. _tut-genexps: 868 869 Generator Expressions 870 ===================== 871 872 Some simple generators can be coded succinctly as expressions using a syntax 873 similar to list comprehensions but with parentheses instead of brackets. These 874 expressions are designed for situations where the generator is used right away 875 by an enclosing function. Generator expressions are more compact but less 876 versatile than full generator definitions and tend to be more memory friendly 877 than equivalent list comprehensions. 878 879 Examples:: 880 881 >>> sum(i*i for i in range(10)) # sum of squares 882 285 883 884 >>> xvec = [10, 20, 30] 885 >>> yvec = [7, 5, 3] 886 >>> sum(x*y for x,y in zip(xvec, yvec)) # dot product 887 260 888 889 >>> from math import pi, sin 890 >>> sine_table = {x: sin(x*pi/180) for x in range(0, 91)} 891 892 >>> unique_words = set(word for line in page for word in line.split()) 893 894 >>> valedictorian = max((student.gpa, student.name) for student in graduates) 895 896 >>> data = 'golf' 897 >>> list(data[i] for i in range(len(data)-1, -1, -1)) 898 ['f', 'l', 'o', 'g'] 899 900 901 902 .. rubric:: Footnotes 903 904 .. [#] Except for one thing. Module objects have a secret read-only attribute called 905 :attr:`~object.__dict__` which returns the dictionary used to implement the module's 906 namespace; the name :attr:`~object.__dict__` is an attribute but not a global name. 907 Obviously, using this violates the abstraction of namespace implementation, and 908 should be restricted to things like post-mortem debuggers. 909