Finding datasets can be a bit tricky, so it’s handy to have a few go-to places when you’re looking for something for work, portfolio-building or just general interest.
Google Dataset Search
It’s the Google of datasets: it indexes lots of data from other repositories and research.
https://datasetsearch.research.google.com/
Data is Plural
This is a weekly newsletter highlighting interesting open datasets. You can sign up to the newsletter and browse past editions. The full list of datasets is available in this Google Sheet.
https://docs.google.com/spreadsheets/d/1wZhPLMCHKJvwOkP4juclhjFgqIY8fQFMemwKL2c64vk/edit#gid=0
TidyTuesday
There’s a weekly R community challenge called TidyTuesday, and you can find all the available datasets in the GitHub repo.
https://github.com/rfordatascience/tidytuesday
Kaggle
A very popular collection of datasets, mostly used for machine learning exercises but useful for analysis too.
https://www.kaggle.com/datasets
Data.gov
The official open data repo of the USA.
Data.Europa.EU
The official open data repo of the European Union.