This repo will remain for posterity, but no new updates will be added here. Instead I will be updating this repo under LawSnipe going forward.
This project aims to make the United States legal code more accessible, transparent, and easier to analyze. By hosting the legal code on GitHub, changes can be tracked in a cleaner manner, enabling greater scrutiny and collaboration.
This repository, compressed stands at 161MB. Expanded into markdown it sits at 681MB.
The primary goals of this repository are:
- Public Accessibility: Provide the US legal code in an easily accessible and readable Markdown format.
- Change Tracking: Use Git's version control to track changes to the legal code, making it clear when and how laws evolve over time.
- Scrutiny and Collaboration: Facilitate public and academic scrutiny of the legal code, enabling meaningful discussions and improvements.
- Digestion by LLMs: Markdown format is easier for Large Language Models (LLMs) to parse, analyze, and learn from, thereby supporting the development of AI tools for legal research and education.
- Each section of the US Legal Code is organized into folders and files based on its title and chapter.
- The first document under each folder is called
introduction.md, which contains introductory information about that particular part, subpart, chapter, or subchapter. - The lowest level document is a "section", denoted by a "§" symbol.
- Readable Format: Markdown format ensures a clean and readable presentation of the law.
- Trackable Changes: Git enables users to view and analyze changes over time.
- Lightweight: Pruned non-crucial text minimizes storage requirements while preserving the core content of the law.
On Windows, you may run into Filename too long errors. This is actually not a Windows issue as far as I can tell, but the git implementation on Windows.
The solution is simpler than you might expect.
- Open "Terminal" in a PowerShell environment.
- Run the following command:
git config --system core.longpaths trueThis will update the config file located at C:\Program Files\Git\etc\gitconfig to add a core.longpaths = true.
You should be able to redownload the corpus without issue.
We appreciate your interest in improving this repository! However, I will not be accepting pull requests. Instead, we encourage users to open issues to identify areas for improvement. Below are some guidelines:
-
Encouraged Topics:
- Suggestions for improving the clarity or accuracy of the Markdown documents.
- Identification of contradictions or ambiguities within the legal code.
- Issues related to the specificity of language or missing content.
-
Civility:
- Please keep all discussions civil and constructive.
- Issues containing disrespectful or inflammatory language will be closed without further consideration.
-
Substance:
- All issues must include a clear description of the problem and a possible improvement or resolution.
- Issues lacking substance or actionable feedback will be closed.
Thank you for your understanding and cooperation in making this repository as accurate and useful as possible.
This repository is not an official source of the US legal code. While every effort has been made to ensure accuracy, users should consult official sources or legal professionals for authoritative information. This project is intended for educational and informational purposes only.