> Below is first a histogram of all the characters found in the words
> (before the semi column):
>
> 41671,€

This is plain wrong.  There is *no* Euro sign in the whole file.  Your
script apparently doesn't handle the file encoding correctly (which is
utf-8, BTW).


    Werner
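
For illustration, here is a minimal Python sketch of how a spurious Euro sign can end up in such a histogram. The script under discussion is not shown, so everything here is an assumption: the guess is that the UTF-8 file was decoded as Windows-1252, where the byte 0x80 maps to the Euro sign. The sample line and the helper name `char_histogram` are hypothetical.

```python
from collections import Counter


def char_histogram(lines, sep=";"):
    """Count the characters of the word before the first separator on each line."""
    counts = Counter()
    for line in lines:
        word = line.split(sep, 1)[0]
        counts.update(word)
    return counts


EURO = "\u20ac"      # the Euro sign
A_GRAVE = "\u00c0"   # LATIN CAPITAL LETTER A WITH GRAVE, UTF-8 bytes C3 80

# Hypothetical sample entry, stored as UTF-8 bytes like the real file.
raw = (A_GRAVE + " la carte;example\n").encode("utf-8")

# Assumed mistake: decoding the UTF-8 bytes as Windows-1252.
# The continuation byte 0x80 is then read as the Euro sign.
wrong = char_histogram(raw.decode("cp1252").splitlines())

# Decoding with the file's real encoding gives the intended characters.
right = char_histogram(raw.decode("utf-8").splitlines())

print(EURO in wrong, EURO in right)
```

When reading from disk, the same fix amounts to passing the encoding explicitly, for example `open(path, encoding="utf-8")`, rather than relying on the platform default.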