1. Google Dataset Search
Link: https://datasetsearch.research.google.com/
2. Hugging Face Datasets
Link 1: GitHub - huggingface/datasets: 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools (https://github.com/huggingface/datasets)
Link 2: https://huggingface.co/datasets
3. Kaggle Datasets
Link: Find Open Datasets and Machine Learning Projects | Kaggle
4. Papers With Code Datasets
Link: Machine Learning Datasets | Papers With Code
5. Reddit Datasets (r/datasets)
Link: https://www.reddit.com/r/datasets/
6. CLUE Datasets
Link: https://www.cluebenchmarks.com/dataSet_search.html
7. Machine Learning Datasets
Link: Dataset list - A list of the biggest machine learning datasets
10. ChineseNLPCorpus
Link: https://github.com/InsaneLife/ChineseNLPCorpus
11. CV Datasets on the Web
Link: http://www.cvpapers.com/datasets.html
12. Yet Another Computer Vision Index To Datasets (YACVID)
Link: http://yacvid.hayko.at/
13. Tianchi Datasets (Alibaba Cloud Tianchi, the Alibaba group's only publicly open data-sharing platform)
Link: https://tianchi.aliyun.com/dataset/
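14. Graviti Open Datasets (free downloads of public machine learning datasets for image recognition and NLP; Graviti is an unstructured data platform)
Link: https://gas.graviti.cn/open-datasets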