[Trennmuster] pattern and character analysis

Pander pander at users.sourceforge.net
Mi Nov 13 15:20:21 CET 2013


On 11/05/2013 10:11 PM, Pander wrote:
> Hi all,
> 
> I would like to contribute the following analysis to this project. It is
> an analysis of the characters and reserved characters used in the German
> hyphenation pattern definitions.
> 
> With it I was already able to get a small typo in one of the patterns
> fixed. It is a convenient tool to spot errors and get statistics on the
> patterns. As a bonus, you can use histogram-wortzeichen.png to win with
> Galgenmännchen.
> 
> You could even use these results to improve
> https://de.wikipedia.org/wiki/Buchstabenh%C3%A4ufigkeit A similar
> analysis of character frequency in Dutch using the same GNUplot scripts
> was published https://nl.wikipedia.org/wiki/Letterfrequentie and
> http://opentaal.org/het-laatste-nieuws/171-karakterfrequentie
> 
> Werner has already reviewed my contribition and send me corrections
> which I have fixed. Could one of you review the attached file as well
> and when all is OK add it to the GIT repo in wortliste/skripte/python/
> please?

Hi all,

Attached is an updated version with also a case insensitive histogram
and small corrections contributed by Werner. Could one of you add this
to skripte/python in the git repo so you can create updates of the
histograms yourself?

Thanks,

Pander


> Best regards,
> 
> Pander
> 
> 
> 
> _______________________________________________
> Trennmuster mailing list
> Trennmuster at dante.de
> https://lists.dante.de/mailman/listinfo/trennmuster
> 

-------------- nächster Teil --------------
Ein Dateianhang mit Binärdaten wurde abgetrennt...
Dateiname   : histogramm.tar.bz2
Dateityp    : application/x-bzip
Dateigröße  : 123739 bytes
Beschreibung: nicht verfügbar
URL         : <https://listi.jpberlin.de/pipermail/trennmuster/attachments/20131113/9d5aba3c/attachment.bz2>


Mehr Informationen über die Mailingliste Trennmuster