Skip to content
Change the repository type filter

All

    Repositories list

    • web-eval-agent

      Public
      An MCP server that autonomously evaluates web applications.
      Python
      1051.2k011Updated Jan 8, 2026Jan 8, 2026
    • claude-code-sandbox

      Public
      Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.
      Shell
      3.9k000Updated Jan 5, 2026Jan 5, 2026
    • OSS RL environment + evals toolkit
      Python
      56000Updated Nov 14, 2025Nov 14, 2025
    • Open source codebase for Scale Agentex
      Python
      26000Updated Nov 12, 2025Nov 12, 2025
    • TypeScript
      1300Updated Oct 21, 2025Oct 21, 2025
    • JavaScript
      0100Updated Aug 22, 2025Aug 22, 2025
    • demo

      Public template
      🤖 Fork me to try out Dependabot
      Ruby
      4.1k000Updated Jul 21, 2025Jul 21, 2025