DOT Data Labs
Dataset

PACER-CourtRecords

US federal court docket and filing dataset extracted from PACER, normalized and full-text indexed.

24M+ federal cases
Cases
480M+ docket entries
Filings
320M+ pages
OCR'd PDFs
All US federal districts + circuits, 1990–present
Coverage

Comprehensive PACER (Public Access to Court Electronic Records) extraction covering all US federal district and appellate courts. Each docket entry includes parties, judges, filed motions, orders, and full-text PDFs OCR'd and indexed. Useful for legal research AI, case-outcome prediction, and citation-graph models.

Tags

LegalPACERCourt RecordsOCRNLPFederal Court

Delivery formats

  • JSONL
  • PDF (per-filing)
  • Parquet

License

Commercial AI training license, perpetual. Public-record data, source attribution preserved.

Data sample

What a record looks like

Sample docket entry

JSONIllustrative — full sample available under NDA
{
  "case_id": "1:24-cv-04812-AKH",
  "court": "S.D.N.Y.",
  "parties": [{"role":"plaintiff","name":"..."}],
  "filed": "2024-06-18",
  "docket_no": 142,
  "event": "MEMORANDUM in Opposition to motion to dismiss",
  "filing_pages": 38
}
← Back to all datasets