Files
unnamed_chatgpt_project/README.md
2023-01-20 13:38:58 +01:00

10 lines
881 B
Markdown

# unnamed_chatgpt_project
## names
had to get a bunch of names for this to work since I didn't want to generate these, I wanted these to be the input for the generation of the other attributes.
For the names I wanted a diverse mix of countries of origin. Initial google results were mostly from US statistics but I soon found this [stackexchange comment](https://opendata.stackexchange.com/a/5003) and thus used this [dataset](ftp://ftp.heise.de/pub/ct/listings/0717-182.zip)
While looking through this dataset I found that apart from country and popularity statistics it also had information regarding the "possible" gender of the name which I could also use as part of the input when generating the attributes. Genders were defined from M (male), 1M (male if first part of then ame else mostly male), ?m (mostly male), F (female), 1F (see 1M), ?F (mostly female), ? (unisex)