[Trennmuster] Quality control on wortliste

Pander pander at users.sourceforge.net
Mi Jun 5 23:53:22 CEST 2013


Hi all,

In order to check the quality of the hyphenation definitions I have
created a simple script which generated histograms for all the
characters used in the words and in the hyphenation definitions.

Please run the attached script in a directory in which also
  git clone git://repo.or.cz/wortliste.git
has been done.

The resulting CSV files will show the use of some /exotic/ characters.
Perhaps these need to be investigated and fixed.

Keep me updated if you might have any improvements on the script or what
in general has been done with the findings.

Best regards,

Pander
-------------- nächster Teil --------------
Ein Dateianhang mit Binärdaten wurde abgetrennt...
Dateiname   : index.py
Dateityp    : text/x-python
Dateigröße  : 2113 bytes
Beschreibung: nicht verfügbar
URL         : <https://listi.jpberlin.de/pipermail/trennmuster/attachments/20130605/09d2b66f/attachment.py>


Mehr Informationen über die Mailingliste Trennmuster