Resources
This page is a work in progress and will be updated as the course proceeds!
Possible sources for data
Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals.
Data.gov is the United States government’s open data website. It provides access to datasets published by agencies across the federal government.
Pew The Pew Research Center is a nonpartisan American think tank based in Washington, D.C. It provides information on social issues, public opinion, and demographic trends shaping the United States and the world.
FiveThirtyEight uses statistical analysis - hard numbers - to tell compelling stories about elections, politics and American society.
IPUMS provides census and survey data from around the world integrated across time and space. Note that this one takes a little training to use - if we have time we will walk through the CPS data here!
UCI Machine Learning Data Repository Maintains 644 datasets as a service to the machine learning community. Here, you can donate and find datasets used by millions of people all around the world!
FRED Download, graph, and track 823000 economic time series from 114 sources.
Los Angeles Open Data A great source for data sets having to do with the city of Los Angeles
Data is Plural a fun list of interesting and public data sets - kind of random!
Tidy Tuesday a nice community of R users puts out different data sets every Tuesday for a community challenge