[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: suggestion for checking unicode characters against "trojan source attacks"





Le lun. 1 nov. 2021 à 22:51, Jérémy Lal <kapouer@melix.org> a écrit :


Le lun. 1 nov. 2021 à 22:29, Felix Lechner <felix.lechner@lease-up.com> a écrit :
Hi,

On Mon, Nov 1, 2021 at 2:21 PM Jérémy Lal <kapouer@melix.org> wrote:
>
> grep -r $'[\u061C\u200E\u200F\u202A\u202B\u202C\u202D\u202E\u2066\u2067\u2068\u2069]'

Does that cover both conditions?
 
It seems from the paper at
and the list given also at
that those nine characters are the ones that should be checked. 

There is a risk that it will be slow, by the way—but I generally favor
doing things right, so no problem here.

Maybe debian security team has already something in mind, or has a better understanding of this
CVE-2021-42574 and CVE-2021-42694 issue.

Update: the python script i linked at the start of the conversion is now available at
https://github.com/siddhesh/find-unicode-control
i'm not sure it's worth packaging it - using grep looks somewhat simpler.

Jérémy

Reply to: