Processing 99% of U.S. Caselaw for Under $1
daft.ai·5d·
Discuss: Hacker News
📋CSV Processing
Preview
Report Post

Introduction

One of the most impactful areas of open-source software and artificial intelligence is the total democratization of legal tech and data. Teraflop AI and Eventual collaborated to support the release of the Common Pile, an 8TB, 1 Trillion Token Dataset of Public Domain and Openly Licensed Text, along with Eleuther AI, Vector Institute, Allen AI, Hugging Face, and the Data Provenance Initiative.

In collaboration with Daft, Teraflop AI provisioned 99% of US precedential caselaw data from the Caselaw Access Project (CAP) and CourtListener (CL) using the highly-efficient [Daft dataframe library](https://www….

Similar Posts

Loading similar posts...