Finding datasets can be a bit tricky and it’s handy to have a few go-to places when you’re looking for something either for work, portfolio-building or just general interest.
Google dataset search
It’s the Google of datasets. Indexes lots of data from other repositories and research.
https://datasetsearch.research.google.com/
Data is Plural
This is a weekly newsletter highlighting interesting open datasets. You can sign up to the newsletter and browse old articles. The full datasets list is available this google sheet.
https://docs.google.com/spreadsheets/d/1wZhPLMCHKJvwOkP4juclhjFgqIY8fQFMemwKL2c64vk/edit#gid=0
TidyTuesday
There’s a weekly R community challenge called TidyTuesday, and you can find all the available datasets in the github repo.
https://github.com/rfordatascience/tidytuesday
Kaggle
A very popular collection of datasets used for machine learning exercises but useful for analysis too.
https://www.kaggle.com/datasets
Data.gov
The official open data repo of the USA.
Data.Europa.EU
The official open data repo of the European Union
Keep up to date with new Data posts and/or Big Book of R updates by signing up to my newsletter. Subscribers get a free copy of Project Management Fundamentals for Data Analysts worth $12.
Once you’ve subscribed, you’ll get a follow up email with a link to your free copy.