mirror of
https://github.com/bvanroll/unnamed_chatgpt_project.git
synced 2025-08-29 03:52:42 +00:00
10 lines
881 B
Markdown
10 lines
881 B
Markdown
# unnamed_chatgpt_project
|
|
|
|
|
|
## names
|
|
|
|
had to get a bunch of names for this to work since I didn't want to generate these, I wanted these to be the input for the generation of the other attributes.
|
|
For the names I wanted a diverse mix of countries of origin. Initial google results were mostly from US statistics but I soon found this [stackexchange comment](https://opendata.stackexchange.com/a/5003) and thus used this [dataset](ftp://ftp.heise.de/pub/ct/listings/0717-182.zip)
|
|
|
|
While looking through this dataset I found that apart from country and popularity statistics it also had information regarding the "possible" gender of the name which I could also use as part of the input when generating the attributes. Genders were defined from M (male), 1M (male if first part of then ame else mostly male), ?m (mostly male), F (female), 1F (see 1M), ?F (mostly female), ? (unisex)
|