1 Compound word hyphenation 2 3 Hyphen library supports better compound word hyphenation and special 4 rules of compound word hyphenation of German languages and other 5 languages with arbitrary number of compound words. The new options, 6 COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN help to set the right 7 style for the hyphenation of compound words. 8 9 Algorithm 10 11 The algorithm is an extension of the original pattern based hyphenation 12 algorithm. It uses two hyphenation pattern sets, defined in the same 13 pattern file and separated by the NEXTLEVEL keyword. First pattern 14 set is for hyphenation only at compound word boundaries, the second one 15 is for hyphenation within words or word parts. 16 17 Recursive compound level hyphenation 18 19 The algorithm is recursive: every word parts of a successful 20 first (compound) level hyphenation will be rehyphenated 21 by the same (first) pattern set. 22 23 Finally, when first level hyphenation is not possible, Hyphen uses 24 the second level hyphenation for the word or the word parts. 25 26 Word endings and word parts 27 28 Patterns for word endings (patterns with ellipses) match the 29 word parts, too. 30 31 Options 32 33 COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word boundary 34 COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word boundary 35 NEXTLEVEL: sign second level hyphenation patterns 36 37 Default hyphenmin values 38 39 Default values of COMPOUNDLEFTHYPHENMIN and COMPOUNDRIGHTHYPHENMIN are 0, 40 and 0 under the hyphenation, too. ("0" values of 41 LEFTHYPHENMIN and RIGHTHYPHENMIN mean the default "2" under the hyphenation.) 42 43 Examples 44 45 See tests/compound* test files. 46 47 Preparation of hyphenation patterns 48 49 It hasn't been special pattern generator tool for compound hyphenation 50 patterns, yet. It is possible to use PATGEN to generate both of 51 pattern sets, concatenate it manually and set the requested HYPHENMIN values. 52 (But don't forget the preprocessing steps by substrings.pl before 53 concatenation.) One of the disadvantage of this method, that PATGEN 54 doesn't know recursive compound hyphenation of Hyphen. 55 56 Lszl Nmeth 57 <nemeth (at) openoffice.org> 58