Bug#1014908: ITP: gender-guesser -- Guess the gender from first name
Package: wnpp
Severity: wishlist
Owner: Edward Betts <edward@4angle.com>
X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org
* Package name : gender-guesser
Version : 0.4.0
Upstream Author : Israel Saeta Pérez <israel@lead-ratings.com>
* URL : https://github.com/lead-ratings/gender-guesser
* License : GPL-3 & GFDL-1.2+
Programming Lang: Python
Description : Guess the gender from first name
This package uses the underlying data from the program "gender" by Jorg
Michael.
.
Its use is pretty straightforward:
.
>>> import gender_guesser.detector as gender
>>> d = gender.Detector()
>>> print(d.get_gender(u"Bob"))
male
>>> print(d.get_gender(u"Sally"))
female
>>> print(d.get_gender(u"Pauley")) # should be androgynous
andy
.
The result will be one of 'unknown' (name not found), 'andy' (androgynous),
'male', 'female', 'mostly_male', or 'mostly_female'. The difference
between 'andy' and 'unknown' is that the former is found to have the same
probability to be male than to be female, while the later means that the name
wasn't found in the database.
.
I18N is fully supported:
.
>>> print(d.get_gender(u"\xc1lfr\xfan")) # u"Álfrún"
female
.
Additionally, you can give preference to specific countries:
.
>>> print(d.get_gender(u"Jamie"))
mostly_female
>>> print(d.get_gender(u"Jamie", u'great_britain'))
mostly_male
.
Additionally, you can create a detector that is not case sensitive (default
is to be case sensitive):
.
>>> d = gender.Detector(case_sensitive=False)
>>> print(d.get_gender(u"sally"))
female
>>> print(d.get_gender(u"Sally"))
female
I plan to maintain this package as part of the Python modules team.
Reply to: