Skip to content
Change the repository type filter

All

    Repositories list

    • Language-Model-SAEs

      Public
      Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.
      Python
      2821280Updated Apr 16, 2026Apr 16, 2026
    • MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenar…
      Python
      310810Updated Apr 16, 2026Apr 16, 2026
    • mlx-audio

      Public
      A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Sil…
      Python
      MIT License
      554500Updated Apr 16, 2026Apr 16, 2026
    • MOSS-TTS-Nano

      Public
      MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for real…
      Python
      Apache License 2.0
      1511.3k193Updated Apr 16, 2026Apr 16, 2026
    • JavaScript
      11900Updated Apr 14, 2026Apr 14, 2026
    • MOSS-VL

      Public
      MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
      Python
      Apache License 2.0
      422010Updated Apr 14, 2026Apr 14, 2026
    • sglang

      Public
      Python
      Apache License 2.0
      0300Updated Apr 14, 2026Apr 14, 2026
    • MOSS-Audio-Tokenizer

      Public
      MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming an…
      Python
      Apache License 2.0
      1219131Updated Apr 13, 2026Apr 13, 2026
    • MOSS-TTS-Nano-Demo

      Public
      CSS
      1100Updated Apr 13, 2026Apr 13, 2026
    • MOSS-TTS

      Public
      MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressive…
      Python
      Apache License 2.0
      1351.5k231Updated Apr 13, 2026Apr 13, 2026
    • A real-time video understanding foundation model built on Llama-3.2-Vision, featuring comprehensively extended video processing and multimodal reasoning capabil…
      Python
      Apache License 2.0
      413500Updated Apr 13, 2026Apr 13, 2026
    • Vue
      0500Updated Apr 9, 2026Apr 9, 2026
    • llama.cpp

      Public
      C++
      MIT License
      2302Updated Apr 8, 2026Apr 8, 2026
    • BandPO

      Public
      Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning. BandPO replaces canoni…
      Python
      GNU General Public License v3.0
      44800Updated Apr 8, 2026Apr 8, 2026
    • Python
      0700Updated Apr 3, 2026Apr 3, 2026
    • MOVA

      Public
      MOVA: Towards Scalable and Synchronized Video–Audio Generation
      Python
      Apache License 2.0
      77951271Updated Apr 1, 2026Apr 1, 2026
    • A library for mechanistic interpretability of GPT-style language models
      Python
      MIT License
      553200Updated Mar 31, 2026Mar 31, 2026
    • Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
      Python
      Apache License 2.0
      12704Updated Mar 30, 2026Mar 30, 2026
    • DiRL

      Public
      Python
      Apache License 2.0
      715801Updated Mar 30, 2026Mar 30, 2026
    • OurClaw

      Public
      Institutional OpenClaw Solution. Share One Claw with Others.
      TypeScript
      MIT License
      32400Updated Mar 30, 2026Mar 30, 2026
    • RoboOmni

      Public
      Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"
      Python
      510560Updated Mar 28, 2026Mar 28, 2026
    • MOSS-TTSD

      Public
      MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, a…
      Python
      Apache License 2.0
      1251.3k520Updated Mar 23, 2026Mar 23, 2026
    • TTSD-eval

      Public
      Python
      0300Updated Mar 16, 2026Mar 16, 2026
    • JavaScript
      0200Updated Mar 3, 2026Mar 3, 2026
    • Website

      Public
      wangye
      JavaScript
      3001Updated Mar 2, 2026Mar 2, 2026
    • FRoM-W1

      Public
      [ArXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
      Python
      Apache License 2.0
      715130Updated Feb 13, 2026Feb 13, 2026
    • MOSS-Speech is a true speech-to-speech large language model without text guidance.
      Python
      Apache License 2.0
      712920Updated Feb 13, 2026Feb 13, 2026
    • RoboJuDo

      Public
      [ArXiv 26] The Depolyment Framework for the FRoM-W1 Project
      Python
      Other
      63300Updated Jan 28, 2026Jan 28, 2026
    • Python
      12200Updated Jan 22, 2026Jan 22, 2026
    • ABC-Bench

      Public
      ABC-Bench is a benchmark for Agentic Backend Coding. It evaluates whether code agents can explore real repositories, edit code, configure environments, deploy c…
      22710Updated Jan 20, 2026Jan 20, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.