It is possible to define hyphenation exception dictionaries at two levels.
A general exception dictionary is applied on all languages. In addition, each language has own exception dictionary.
For better performance, dictionaries are stored in compiled form.
Exception dictionaries are stored in database.
The source file of the language independent hyphenation exception dictionary is stored in database. This dictionary has no binary pair, but it is incorporated in each language related exception dictionary. It contains the list of words that deserve special treatment. Typically, these are the proper names that are to be hyphenated differently than they would be hyphenated by the hyphenation rule for the current language. An example are Italian names in English language newspaper: they have to by hyphenated as in Italian language, not as in English. The words contained in the global exception dictionary are translated to lower case using the standard table defined in Windows. |
The sources of language related hyphenation exception dictionaries are stored in database A language dependent exception dictionary is in use only when the corresponding hyphenation is active. The words contained in language related hyphenation exception dictionaries are translated to lower case using the standard table defined in Windows. |
When a word needs to be hyphenated, it is first searched in the general exception dictionary, then in the one specific to the current hyphenation language, and finally the algorithm for that language is applied. |