As Data Science Practitioner, most of us find difficulty in finding the right dataset for our projects, and lately, I’ve been getting a lot of messages from my followers, asking my guidance in helping them to find the best dataset for their project, so I gathered a few websites that offer a good amount of datasets
1- KDDCUP Archive
KDDCup is an annual competition for data science practitioners, and I highly recommend that you pick your dataset from this site as it is in a raw format and most of the time it’s in a high volume, so they are perfect if you are doing Masters or PhD
You can also filter the dataset by their type ( text data for NLP projects — Images for Computer vision..etc)
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/1-KddCup.png)
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/2-KddCup-Type-data.png)
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/3-KddCup-Type-data.png)
2- Google Dataset Search
A free datasets search engine from Google that helps you find datasets, it contains over 25 million datasets
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/4-Google.png)
3- UCI Machine Learning Repository
All the datasets were uploaded by the users and you can filter them by attribute and data type and area of expertise
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/5--UCI.png)
4-OpenML
An online machine learning platform for sharing and organizing data with more than 21.000 datasets
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/6-OpenML.png)
5- Wikipedia
A very well organized repository for different types of datasets from Wikipedia
![](https://images.sharemyimage.com/file/sharemi/2022/02/07/7-Wiki.png)
Thanks for reading this article, hope you liked it, stay tuned for Part 2 of this article, Make sure to like it ( Clap 👏)and share it with your friends
You can check my social media accounts and courses on this link