Re: suggestion for checking unicode characters against "trojan source attacks"

To: Felix Lechner <felix.lechner@lease-up.com>
Cc: Jérémy Lal <kapouer@melix.org>, security@debian.org, lintian-maint <lintian-maint@debian.org>
Subject: Re: suggestion for checking unicode characters against "trojan source attacks"
From: Jérémy Lal <kapouer@melix.org>
Date: Tue, 9 Nov 2021 20:06:56 +0100
Message-id: <[🔎] CAJxTCxw+6J3+d1qQT2gCfDyK+=LEzcJcCVopH4T1gOQGfyq0AA@mail.gmail.com>
In-reply-to: <[🔎] CAFHYt56eJzFa8vmTjCAyjBuFRm=XyFebpnbxiziFjTUU8N4MMA@mail.gmail.com>
References: <[🔎] CAJxTCxyXM9+SFUpfUPVS9kkhRgHpOyOLhd+SHDsitHaRoG9EKA@mail.gmail.com> <[🔎] CAFHYt54OGu16gnRMSEiMqt8ysqAoe4roUYvhn0L_Dju_HG-QNw@mail.gmail.com> <[🔎] CAJxTCxwYCwUYWXYY6LBYeC3M1zHMWn=Hfp4PnQNubLTtFioKNA@mail.gmail.com> <[🔎] CAFHYt57eBYUHHDaVU=PQWc76V1gyfrpoXvLHomb+uqMGaPYyYw@mail.gmail.com> <[🔎] CAJxTCxx-20VfP6HMVbXJ3EpLv888qzLCHQTjHJC3f_grWcE=UQ@mail.gmail.com> <[🔎] CAFHYt55PBwWWu3q1C89ktZw9SoNZrJQkt+o6XA6+SwS=AexFXg@mail.gmail.com> <[🔎] CAJxTCxzLm9pOVN1ndKLdkJQ0sTe7BF-nPq36-DoPsJpQ2gHu6w@mail.gmail.com> <[🔎] CAJxTCxwzeACiyL0NqKJ-DwfjH8EeX7N4JvXuDuCVEPEOz=W-Vw@mail.gmail.com> <[🔎] CAFHYt56ny+tuB_JPSpybnri11Bx_=b7gOAqG4qVjJ7L8iLHi3Q@mail.gmail.com> <[🔎] CAFHYt57_MgE-7=iZqOX++wroSobMpwOKjrxQbGO70WmpXKUWug@mail.gmail.com> <[🔎] CAFHYt56eJzFa8vmTjCAyjBuFRm=XyFebpnbxiziFjTUU8N4MMA@mail.gmail.com>

Le ven. 5 nov. 2021 à 15:00, Felix Lechner <felix.lechner@lease-up.com> a écrit :

Dear Jérémy,

> > grep -r $'[\u061C\u200E\u200F\u202A\u202B\u202C\u202D\u202E\u2066\u2067\u2068\u2069]'

Here are the results from the archive. [1] It's about half-way done.

Lintian shows which character was encountered, but there are lots of
false positives (all on contents). So far there are no hits on file
names.

Please help to identify classes of false positives. Otherwise, I have
to turn the tag into a classification (or disable it) which means we
won't see the results on the website. Thanks!

Awesome ! This is really cool. I've started fishing for exploits.

Most files indeed are just declaring unicode chars among others,

so i suppose the test needs to account for that fact.

As an example of an odd case, i don't understand why in

https://salsa.debian.org/multimedia-team/intel-media-driver/-/blob/master/media_driver/agnostic/common/os/mos_utilities.cpp#4351

We have those two characters u202D u202C:

MOS_DECLARE_UF_KEY_DBGONLY(__MEDIA_USER_FEATURE_VALUE_MOCKADAPTOR_DEVICE_ID,
"MockAdaptor Device ID",
__MEDIA_USER_FEATURE_SUBKEY_INTERNAL,
__MEDIA_USER_FEATURE_SUBKEY_REPORT,
"MOS",
MOS_USER_FEATURE_TYPE_USER,
MOS_USER_FEATURE_VALUE_TYPE_INT32,
"\u202D‭39497\u202C‬",

"Device ID of mock device, default is 0x9A49"),

Any suggestion is welcome

Kind regards
Felix Lechner

[1] https://lintian.debian.org/tags/unicode-trojan

Reply to:

Follow-Ups:
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>

References:
- suggestion for checking unicode characters against "trojan source attacks"
  - From: Jérémy Lal <kapouer@melix.org>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Jérémy Lal <kapouer@melix.org>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Jérémy Lal <kapouer@melix.org>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Jérémy Lal <kapouer@melix.org>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Jérémy Lal <kapouer@melix.org>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>
- Re: suggestion for checking unicode characters against "trojan source attacks"
  - From: Felix Lechner <felix.lechner@lease-up.com>

Prev by Date: Bug#915384: lintian: check that debian/copyright has an entry for each component
Next by Date: Re: suggestion for checking unicode characters against "trojan source attacks"
Previous by thread: Re: suggestion for checking unicode characters against "trojan source attacks"
Next by thread: Re: suggestion for checking unicode characters against "trojan source attacks"
Index(es):
- Date
- Thread