Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.3k 692

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.4k 163

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.6k 265

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 16.6k 1.3k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 941 88

Repositories

Showing 10 of 540 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 656 Apache-2.0 119 6 45 Updated Jan 5, 2026
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 63 Apache-2.0 11 1 31 Updated Jan 4, 2026
  • IFBench Public
    allenai/IFBench’s past year of commit activity
    Python 94 Apache-2.0 18 1 0 Updated Jan 4, 2026
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,505 Apache-2.0 478 11 (1 issue needs help) 38 Updated Jan 2, 2026
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 16,573 Apache-2.0 1,305 35 15 Updated Jan 3, 2026
  • olmoearth_projects Public

    OlmoEarth projects

    allenai/olmoearth_projects’s past year of commit activity
    Python 52 9 3 4 Updated Jan 1, 2026
  • datamap-rs Public

    Data mapping framework for rust stuff

    allenai/datamap-rs’s past year of commit activity
    Rust 42 Apache-2.0 4 0 3 Updated Dec 29, 2025
  • regmixer Public
    allenai/regmixer’s past year of commit activity
    Jupyter Notebook 6 1 0 2 Updated Dec 29, 2025
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 120 20 2 14 Updated Dec 27, 2025
  • allenai/asta-extension’s past year of commit activity
    JavaScript 1 Apache-2.0 0 0 1 Updated Dec 25, 2025