Hi, one obvious similarity in all failing tests: break.aff:WORDCHARS -– checksharpsutf.aff:WORDCHARS ß. compoundrule5.aff:WORDCHARS 0123456789‰. All have a utf-8 character in WORDCHARS. I didn't spot anything obvious in WORDCHARS parsing which could break only on arm.