ML research datasets from ArXiv and Semantic Scholar (JSONL, quality-scored) (opens in new tab)
Export-ready, continuously-updated training datasets from arXiv, GitHub & more. Describe what you want to fine-tune on → get a dataset
Read the original article