OkCupid data for introductory statistics and data science courses

AY Kim, A Escobedo-Land - Journal of Statistics Education, 2015 - Taylor & Francis
We present a data set consisting of user profile data for 59,946 San Francisco OkCupid
users (a free online dating website) from a period in the 2010s. The data set includes typical
user information, lifestyle variables, and text responses to 10 essays questions. We present
four example analyses suitable for use in undergraduate introductory probability and
statistics and data science courses that use R. The statistical and data science concepts
covered include basic data visualization, exploratory data analysis, multivariate …

OkCupid Data for Introductory Statistics and Data

AY Kim, A Escobedo-Land - Journal of Statistics and Data Science … - search.proquest.com
–This line was added:“Note that random noise was added to the age variable for de-
identification purposes”.–The data file is renamed to “profiles_revised. csv”.–This line was
added:“However, the essay data has been randomized by rows to decouple them from the
profiles data. In other words, the user represented in the first row of profiles_revised does not
necessarily correspond to the user that wrote the responses in the first row of
essays_revised_and_shuffled. We load this randomized essays data as follows …
以上显示的是最相近的搜索结果。 查看全部搜索结果