Where to find datasets

Finding datasets can be tricky, so it’s handy to have a few go-to places when you’re looking for something, whether for work, portfolio-building or just general interest.

Google Dataset Search

It’s the Google of datasets: it indexes lots of data from other repositories and research.

https://datasetsearch.research.google.com/

Data is Plural

This is a weekly newsletter highlighting interesting open datasets. You can sign up to the newsletter and browse past editions. The full list of datasets is available in this Google Sheet.

https://docs.google.com/spreadsheets/d/1wZhPLMCHKJvwOkP4juclhjFgqIY8fQFMemwKL2c64vk/edit#gid=0

TidyTuesday

There’s a weekly R community challenge called TidyTuesday, and you can find all the available datasets in the GitHub repo.

https://github.com/rfordatascience/tidytuesday

Kaggle

A very popular collection of datasets, used for machine learning exercises but useful for analysis too.

https://www.kaggle.com/datasets

Data.gov

The official open data repo of the USA.

https://data.gov/

Data.Europa.EU

The official open data repo of the European Union.

https://data.europa.eu/en

