Gender Classifier
Estimates the likelihood of a name historically being associated with men or women.
Estimates the likelihood of a name historically being associated with men or women.
Input and output
To use this tool, you need a text column.
The tool adds two columns to your data: a binary gender estimate ("male" or "female") and a probability column. The probability columns shows the likelihood of the estimate being correct.
When to use this tool
You can use this tool when you need demographic information and have no other way to get that data. For example, let's say you want to use the Gender Pay app but don't have gender information in your data source. You can use this tool to get an approximate idea of gender distribution, which you can use to help you determine pay equity in your organization. Be careful when using this tool since it's only an estimate.
Tip
This training data used by this tool comes from these sources: Dbpedia Person Data (direct download); Popular baby names in the US; and Names data set curated by Milos Bejda. It was last updated with data from 2017. You could use the Text Classifier tool instead to train your own model to estimate this type of information using historic data that is more appropriate to your use case.
Usage example
In this example, we start with a data set that looks like this:
Name |
---|
Liam Gonzalez |
Alejandro Hernandez |
Earl Cooley |
Isis Roberts |
Imogen Eaton |
Herbert Rees |
Ava-Rose Floyd |
George Carty |
Geoffrey Robinson |
Travis Stephenson |
After running the tool, there are two new columns in the data set:
Name | Gender | Probability |
---|---|---|
Liam Gonzalez | Male | 0.984 215 795 993 804 9 |
Alejandro Hernandez | Male | 0.998 553 931 713 104 2 |
Earl Cooley | Male | 0.937 227 010 726 928 7 |
Isis Roberts | Female | 0.984 409 387 223 422 5 |
Imogen Eaton | Female | 0.808 390 051 126 480 1 |
Herbert Rees | Male | 0.999 955 415 725 708 |
Ava-Rose Floyd | Female | 0.999 999 992 828 745 3 |
George Carty | Male | 0.986 132 025 718 689 |
Geoffrey Robinson | Male | 0.997 940 957 546 234 1 |
Travis Stephenson | Male | 0.989 474 117 755 889 9 |