| awesomedata/awesome-public-datasets |
57,596 |
|
0 |
0 |
over 2 years ago |
0 |
|
126 |
mit |
|
| A topic-centric list of HQ open datasets. |
| github/CodeSearchNet |
2,054 |
|
0 |
0 |
about 4 years ago |
0 |
|
7 |
mit |
Jupyter Notebook |
| Datasets, tools, and benchmarks for representation learning of code. |
| FinMind/FinMind |
2,003 |
|
0 |
0 |
about 2 years ago |
130 |
December 06, 2023 |
45 |
apache-2.0 |
Jupyter Notebook |
| Open Data, more than 50 financial data. 提供超過 50 個金融資料(台股為主),每天更新 https://finmind.github.io/ |
| mdeff/fma |
1,773 |
|
0 |
0 |
over 3 years ago |
0 |
|
10 |
mit |
Jupyter Notebook |
| FMA: A Dataset For Music Analysis |
| awslabs/open-data-registry |
1,271 |
|
0 |
0 |
about 2 years ago |
0 |
|
26 |
apache-2.0 |
Python |
| A registry of publicly available datasets on AWS |
| qri-io/qri |
1,053 |
|
0 |
1 |
over 4 years ago |
271 |
December 13, 2021 |
220 |
gpl-3.0 |
Go |
| you're invited to a data party! |
| alibaba/data-juicer |
994 |
|
0 |
0 |
about 2 years ago |
3 |
September 28, 2023 |
16 |
apache-2.0 |
Python |
| A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据! |
| papyrussolution/UhttBarcodeReference |
758 |
|
0 |
0 |
over 2 years ago |
0 |
|
8 |
|
|
| Universe-HTT barcode reference |
| openml/OpenML |
624 |
|
0 |
0 |
over 2 years ago |
0 |
|
364 |
bsd-3-clause |
PHP |
| Open Machine Learning |
| github/covid-19-repo-data |
442 |
|
0 |
0 |
over 3 years ago |
0 |
|
15 |
cc0-1.0 |
|
| Data archive of identifiable COVID-19 related public projects on GitHub |