Their method of identifying genders is to use a data source of Github user email...

johnhess · on Feb 10, 2016

If you make to the end of the article, they do account for exactly this issue by comparing cases where they could find an out-of-band gender indication (google+) but where the name was not identifiable via github name/profile.

From the article:

> For gender-neutral profiles, we included GitHub users that used an identicon, that Michael’s tool could not infer a gender for, and that a mixed-culture panel of judges could not guess the gender for.

astine · on Feb 10, 2016

Only 35% percent of the accounts have their gender listed in a linked Google+ account. Checking someone's social media profile is a relatively sure way of automatically determining the gender of a lot of people. The authors did use another automated tool to see if they could figure out the gender of users from their Github profile as well, which is something they needed for the second part of their analysis. They don't specify how accurate that procedure was, so it's possible that they are more accurate than you think.

Of course, there is still the issue that they have effectively limited their sample to people with Google+ accounts which may affect the results of the study. Given that men's acceptance rate also dropped when their gender was identifiable (but not by as much) gives credence to the idea that there might be a flaw in their Github profile analyzer.

marcammann · on Feb 10, 2016

Well, they used two steps. First they identified a sample set of contributors that have self identified, thus validated to some extent, their gender.

Further down they then distinguish between contributors where the gender can be inferred from looking at their name & profile picture. Splitting the group of those 35% which were identified via Google+ into two separate groups - identifiable vs. non-identifiable.

Mz · on Feb 11, 2016

You have talked me into not reading this.

Which leaves me frustrated. It seems I want things that mostly do not exist.

mucker · on Feb 10, 2016

Not to mention it doesn't account for biological sex.

ianremsen · on Feb 11, 2016

This is irrelevant, I think.

mucker · on Feb 17, 2016

You thinking it does not magically make it so.

ianremsen · on Feb 25, 2016

I'd like to hear your reasoning for why it's relevant enough to have added value to the results.