Home | History | Annotate | Download | only in hyphenation
      1 Compound word hyphenation
      2 
      3 Hyphen library supports better compound word hyphenation and special
      4 rules of compound word hyphenation of German languages and other
      5 languages with arbitrary number of compound words. The new options,
      6 COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN help to set the right
      7 style for the hyphenation of compound words.
      8 
      9 Algorithm
     10 
     11 The algorithm is an extension of the original pattern based hyphenation
     12 algorithm. It uses two hyphenation pattern sets, defined in the same
     13 pattern file and separated by the NEXTLEVEL keyword. First pattern
     14 set is for hyphenation only at compound word boundaries, the second one
     15 is for hyphenation within words or word parts.
     16 
     17 Recursive compound level hyphenation
     18 
     19 The algorithm is recursive: every word parts of a successful 
     20 first (compound) level hyphenation will be rehyphenated
     21 by the same (first) pattern set.
     22 
     23 Finally, when first level hyphenation is not possible, Hyphen uses
     24 the second level hyphenation for the word or the word parts.
     25 
     26 Word endings and word parts
     27 
     28 Patterns for word endings (patterns with ellipses) match the
     29 word parts, too.
     30 
     31 Options
     32 
     33 COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word boundary
     34 COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word boundary
     35 NEXTLEVEL: sign second level hyphenation patterns
     36 
     37 Default hyphenmin values
     38 
     39 Default values of COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN are 0,
     40 and 0 under the hyphenation, too. ("0" values of
     41 LEFTHYPHENMIN and RIGHTHYPHENMIN mean the default "2" under the hyphenation.)
     42 
     43 Examples
     44 
     45 See tests/compound* test files.
     46 
     47 Preparation of hyphenation patterns
     48 
     49 It hasn't been special pattern generator tool for compound hyphenation
     50 patterns, yet. It is possible to use PATGEN to generate both of
     51 pattern sets, concatenate it manually and set the requested HYPHENMIN values.
     52 (But don't forget the preprocessing steps by substrings.pl before
     53 concatenation.) One of the disadvantage of this method, that PATGEN
     54 doesn't know recursive compound hyphenation of Hyphen.
     55 
     56 Lszl Nmeth
     57 <nemeth (at) openoffice.org>
     58