1 2 :mod:`formatter` --- Generic output formatting 3 ============================================== 4 5 .. module:: formatter 6 :synopsis: Generic output formatter and device interface. 7 8 9 .. index:: single: HTMLParser (class in htmllib) 10 11 This module supports two interface definitions, each with multiple 12 implementations. The *formatter* interface is used by the :class:`~HTMLParser.HTMLParser` 13 class of the :mod:`htmllib` module, and the *writer* interface is required by 14 the formatter interface. 15 16 Formatter objects transform an abstract flow of formatting events into specific 17 output events on writer objects. Formatters manage several stack structures to 18 allow various properties of a writer object to be changed and restored; writers 19 need not be able to handle relative changes nor any sort of "change back" 20 operation. Specific writer properties which may be controlled via formatter 21 objects are horizontal alignment, font, and left margin indentations. A 22 mechanism is provided which supports providing arbitrary, non-exclusive style 23 settings to a writer as well. Additional interfaces facilitate formatting 24 events which are not reversible, such as paragraph separation. 25 26 Writer objects encapsulate device interfaces. Abstract devices, such as file 27 formats, are supported as well as physical devices. The provided 28 implementations all work with abstract devices. The interface makes available 29 mechanisms for setting the properties which formatter objects manage and 30 inserting data into the output. 31 32 33 .. _formatter-interface: 34 35 The Formatter Interface 36 ----------------------- 37 38 Interfaces to create formatters are dependent on the specific formatter class 39 being instantiated. The interfaces described below are the required interfaces 40 which all formatters must support once initialized. 41 42 One data element is defined at the module level: 43 44 45 .. data:: AS_IS 46 47 Value which can be used in the font specification passed to the ``push_font()`` 48 method described below, or as the new value to any other ``push_property()`` 49 method. Pushing the ``AS_IS`` value allows the corresponding ``pop_property()`` 50 method to be called without having to track whether the property was changed. 51 52 The following attributes are defined for formatter instance objects: 53 54 55 .. attribute:: formatter.writer 56 57 The writer instance with which the formatter interacts. 58 59 60 .. method:: formatter.end_paragraph(blanklines) 61 62 Close any open paragraphs and insert at least *blanklines* before the next 63 paragraph. 64 65 66 .. method:: formatter.add_line_break() 67 68 Add a hard line break if one does not already exist. This does not break the 69 logical paragraph. 70 71 72 .. method:: formatter.add_hor_rule(*args, **kw) 73 74 Insert a horizontal rule in the output. A hard break is inserted if there is 75 data in the current paragraph, but the logical paragraph is not broken. The 76 arguments and keywords are passed on to the writer's :meth:`send_line_break` 77 method. 78 79 80 .. method:: formatter.add_flowing_data(data) 81 82 Provide data which should be formatted with collapsed whitespace. Whitespace 83 from preceding and successive calls to :meth:`add_flowing_data` is considered as 84 well when the whitespace collapse is performed. The data which is passed to 85 this method is expected to be word-wrapped by the output device. Note that any 86 word-wrapping still must be performed by the writer object due to the need to 87 rely on device and font information. 88 89 90 .. method:: formatter.add_literal_data(data) 91 92 Provide data which should be passed to the writer unchanged. Whitespace, 93 including newline and tab characters, are considered legal in the value of 94 *data*. 95 96 97 .. method:: formatter.add_label_data(format, counter) 98 99 Insert a label which should be placed to the left of the current left margin. 100 This should be used for constructing bulleted or numbered lists. If the 101 *format* value is a string, it is interpreted as a format specification for 102 *counter*, which should be an integer. The result of this formatting becomes the 103 value of the label; if *format* is not a string it is used as the label value 104 directly. The label value is passed as the only argument to the writer's 105 :meth:`send_label_data` method. Interpretation of non-string label values is 106 dependent on the associated writer. 107 108 Format specifications are strings which, in combination with a counter value, 109 are used to compute label values. Each character in the format string is copied 110 to the label value, with some characters recognized to indicate a transform on 111 the counter value. Specifically, the character ``'1'`` represents the counter 112 value formatter as an Arabic number, the characters ``'A'`` and ``'a'`` 113 represent alphabetic representations of the counter value in upper and lower 114 case, respectively, and ``'I'`` and ``'i'`` represent the counter value in Roman 115 numerals, in upper and lower case. Note that the alphabetic and roman 116 transforms require that the counter value be greater than zero. 117 118 119 .. method:: formatter.flush_softspace() 120 121 Send any pending whitespace buffered from a previous call to 122 :meth:`add_flowing_data` to the associated writer object. This should be called 123 before any direct manipulation of the writer object. 124 125 126 .. method:: formatter.push_alignment(align) 127 128 Push a new alignment setting onto the alignment stack. This may be 129 :const:`AS_IS` if no change is desired. If the alignment value is changed from 130 the previous setting, the writer's :meth:`new_alignment` method is called with 131 the *align* value. 132 133 134 .. method:: formatter.pop_alignment() 135 136 Restore the previous alignment. 137 138 139 .. method:: formatter.push_font((size, italic, bold, teletype)) 140 141 Change some or all font properties of the writer object. Properties which are 142 not set to :const:`AS_IS` are set to the values passed in while others are 143 maintained at their current settings. The writer's :meth:`new_font` method is 144 called with the fully resolved font specification. 145 146 147 .. method:: formatter.pop_font() 148 149 Restore the previous font. 150 151 152 .. method:: formatter.push_margin(margin) 153 154 Increase the number of left margin indentations by one, associating the logical 155 tag *margin* with the new indentation. The initial margin level is ``0``. 156 Changed values of the logical tag must be true values; false values other than 157 :const:`AS_IS` are not sufficient to change the margin. 158 159 160 .. method:: formatter.pop_margin() 161 162 Restore the previous margin. 163 164 165 .. method:: formatter.push_style(*styles) 166 167 Push any number of arbitrary style specifications. All styles are pushed onto 168 the styles stack in order. A tuple representing the entire stack, including 169 :const:`AS_IS` values, is passed to the writer's :meth:`new_styles` method. 170 171 172 .. method:: formatter.pop_style([n=1]) 173 174 Pop the last *n* style specifications passed to :meth:`push_style`. A tuple 175 representing the revised stack, including :const:`AS_IS` values, is passed to 176 the writer's :meth:`new_styles` method. 177 178 179 .. method:: formatter.set_spacing(spacing) 180 181 Set the spacing style for the writer. 182 183 184 .. method:: formatter.assert_line_data([flag=1]) 185 186 Inform the formatter that data has been added to the current paragraph 187 out-of-band. This should be used when the writer has been manipulated 188 directly. The optional *flag* argument can be set to false if the writer 189 manipulations produced a hard line break at the end of the output. 190 191 192 .. _formatter-impls: 193 194 Formatter Implementations 195 ------------------------- 196 197 Two implementations of formatter objects are provided by this module. Most 198 applications may use one of these classes without modification or subclassing. 199 200 201 .. class:: NullFormatter([writer]) 202 203 A formatter which does nothing. If *writer* is omitted, a :class:`NullWriter` 204 instance is created. No methods of the writer are called by 205 :class:`NullFormatter` instances. Implementations should inherit from this 206 class if implementing a writer interface but don't need to inherit any 207 implementation. 208 209 210 .. class:: AbstractFormatter(writer) 211 212 The standard formatter. This implementation has demonstrated wide applicability 213 to many writers, and may be used directly in most circumstances. It has been 214 used to implement a full-featured World Wide Web browser. 215 216 217 .. _writer-interface: 218 219 The Writer Interface 220 -------------------- 221 222 Interfaces to create writers are dependent on the specific writer class being 223 instantiated. The interfaces described below are the required interfaces which 224 all writers must support once initialized. Note that while most applications can 225 use the :class:`AbstractFormatter` class as a formatter, the writer must 226 typically be provided by the application. 227 228 229 .. method:: writer.flush() 230 231 Flush any buffered output or device control events. 232 233 234 .. method:: writer.new_alignment(align) 235 236 Set the alignment style. The *align* value can be any object, but by convention 237 is a string or ``None``, where ``None`` indicates that the writer's "preferred" 238 alignment should be used. Conventional *align* values are ``'left'``, 239 ``'center'``, ``'right'``, and ``'justify'``. 240 241 242 .. method:: writer.new_font(font) 243 244 Set the font style. The value of *font* will be ``None``, indicating that the 245 device's default font should be used, or a tuple of the form ``(size, 246 italic, bold, teletype)``. Size will be a string indicating the size of 247 font that should be used; specific strings and their interpretation must be 248 defined by the application. The *italic*, *bold*, and *teletype* values are 249 Boolean values specifying which of those font attributes should be used. 250 251 252 .. method:: writer.new_margin(margin, level) 253 254 Set the margin level to the integer *level* and the logical tag to *margin*. 255 Interpretation of the logical tag is at the writer's discretion; the only 256 restriction on the value of the logical tag is that it not be a false value for 257 non-zero values of *level*. 258 259 260 .. method:: writer.new_spacing(spacing) 261 262 Set the spacing style to *spacing*. 263 264 265 .. method:: writer.new_styles(styles) 266 267 Set additional styles. The *styles* value is a tuple of arbitrary values; the 268 value :const:`AS_IS` should be ignored. The *styles* tuple may be interpreted 269 either as a set or as a stack depending on the requirements of the application 270 and writer implementation. 271 272 273 .. method:: writer.send_line_break() 274 275 Break the current line. 276 277 278 .. method:: writer.send_paragraph(blankline) 279 280 Produce a paragraph separation of at least *blankline* blank lines, or the 281 equivalent. The *blankline* value will be an integer. Note that the 282 implementation will receive a call to :meth:`send_line_break` before this call 283 if a line break is needed; this method should not include ending the last line 284 of the paragraph. It is only responsible for vertical spacing between 285 paragraphs. 286 287 288 .. method:: writer.send_hor_rule(*args, **kw) 289 290 Display a horizontal rule on the output device. The arguments to this method 291 are entirely application- and writer-specific, and should be interpreted with 292 care. The method implementation may assume that a line break has already been 293 issued via :meth:`send_line_break`. 294 295 296 .. method:: writer.send_flowing_data(data) 297 298 Output character data which may be word-wrapped and re-flowed as needed. Within 299 any sequence of calls to this method, the writer may assume that spans of 300 multiple whitespace characters have been collapsed to single space characters. 301 302 303 .. method:: writer.send_literal_data(data) 304 305 Output character data which has already been formatted for display. Generally, 306 this should be interpreted to mean that line breaks indicated by newline 307 characters should be preserved and no new line breaks should be introduced. The 308 data may contain embedded newline and tab characters, unlike data provided to 309 the :meth:`send_formatted_data` interface. 310 311 312 .. method:: writer.send_label_data(data) 313 314 Set *data* to the left of the current left margin, if possible. The value of 315 *data* is not restricted; treatment of non-string values is entirely 316 application- and writer-dependent. This method will only be called at the 317 beginning of a line. 318 319 320 .. _writer-impls: 321 322 Writer Implementations 323 ---------------------- 324 325 Three implementations of the writer object interface are provided as examples by 326 this module. Most applications will need to derive new writer classes from the 327 :class:`NullWriter` class. 328 329 330 .. class:: NullWriter() 331 332 A writer which only provides the interface definition; no actions are taken on 333 any methods. This should be the base class for all writers which do not need to 334 inherit any implementation methods. 335 336 337 .. class:: AbstractWriter() 338 339 A writer which can be used in debugging formatters, but not much else. Each 340 method simply announces itself by printing its name and arguments on standard 341 output. 342 343 344 .. class:: DumbWriter(file=None, maxcol=72) 345 346 Simple writer class which writes output on the file object passed in as *file* 347 or, if *file* is ``None``, on standard output. The output is simply word-wrapped 348 to the number of columns specified by *maxcol*. This class is suitable for 349 reflowing a sequence of paragraphs. 350 351