    npm install

Set your Anthropic API key via an environment variable:

    export ANTHROPIC_API_KEY=sk-ant-your-api-key-here

For server-side exam processing with Docker, also set:
- `SIB_WORKER_IMAGE` (Docker image built from `docker/worker/Dockerfile`)
- `SIB_REPO_URL` (clone URL for the base system-intelligence-benchmark repo)
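A startup check can catch a missing variable before the first request fails. The following is a minimal sketch, not code from this project: `missingEnvVars` and its `dockerProcessing` flag are hypothetical names, and it assumes `SIB_WORKER_IMAGE` and `SIB_REPO_URL` are only needed when Docker-based exam processing is enabled.

```typescript
// A plain map of variable names to values, compatible with Node's process.env.
type Env = Record<string, string | undefined>;

// Hypothetical helper: returns the names of required variables
// that are unset or blank in the given environment.
function missingEnvVars(env: Env, dockerProcessing: boolean): string[] {
  // The API key is always required.
  const required = ["ANTHROPIC_API_KEY"];
  // Assumption: the worker settings only matter for Docker-based processing.
  if (dockerProcessing) {
    required.push("SIB_WORKER_IMAGE", "SIB_REPO_URL");
  }
  return required.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}
```

On a Node server this could be called as `missingEnvVars(process.env, true)`, with the server refusing to start if the returned list is non-empty.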
Development:

    npm run dev

Production:

    npm run build
    node ./dist/server/entry.mjs

The server runs on http://localhost:3000 by default.
The intended deployment runs the UI on Vercel and the API on a Docker-capable host.
UI (Vercel):
- Set the Vercel project framework to Astro.
- Build command: `ASTRO_OUTPUT=static npm run build`.
- Output directory: `dist`.
- Update `vercel.json` to point `/api/*` to your API host.
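For the last step, one way to express the `/api/*` forwarding is a `rewrites` entry in `vercel.json`. The sketch below builds such an entry. It is an illustration under assumptions: `buildApiRewrite` is a hypothetical helper, `https://api.example.com` is a placeholder host, the API host is assumed to serve its routes under `/api`, and the `vercel.json` shipped with the repository may be structured differently.

```typescript
interface Rewrite {
  source: string;
  destination: string;
}

// Hypothetical helper: builds a vercel.json fragment that forwards
// every /api/* request to the same path on the given API host.
function buildApiRewrite(apiHost: string): { rewrites: Rewrite[] } {
  // Drop trailing slashes so the destination has exactly one separator.
  const base = apiHost.replace(/\/+$/, "");
  return {
    rewrites: [
      { source: "/api/:path*", destination: `${base}/api/:path*` },
    ],
  };
}

// Placeholder host; replace with the real API host before use.
const vercelConfig = JSON.stringify(buildApiRewrite("https://api.example.com/"), null, 2);
```

The resulting JSON can be merged into the existing `vercel.json` by hand.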
API (Docker host):
- Build the worker image: `docker build -t sib-worker -f docker/worker/Dockerfile .`
- Set env vars: `ANTHROPIC_API_KEY`, `SIB_WORKER_IMAGE=sib-worker`, `SIB_REPO_URL`.
- Build and run the server: `npm run build`, then `node ./dist/server/entry.mjs`.
- Navigate to `/exams`
- Fill in exam metadata (ID, name, course, institution, year)
- Upload the exam PDF/TXT file
- Upload the solutions PDF/TXT file
- Optionally upload reference materials
- Click "Process and Add Exam"
GitHub username and token are required to create a draft pull request.
The AI will parse the exam and solutions, generating a structured exam.md file in the courseexam format.
- Navigate to `/labs`
- Enter the GitHub repository URL
- Fill in course metadata
- Click "Clone and Analyze Lab"
The AI agent will:
- Clone the repository
- Analyze the structure to identify tasks
- Generate config.json, task.md, compose.yaml, and evaluate.sh for each task
- Copy starter files
- Update courses.json
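After the agent finishes, it can be useful to confirm that each task received all four generated files. The helper below is a sketch rather than project code: `missingTaskFiles` is a hypothetical name, and it assumes the four files sit directly inside a task's directory, which this README does not state.

```typescript
// File names the agent is described as generating for each task.
const TASK_FILES = ["config.json", "task.md", "compose.yaml", "evaluate.sh"] as const;

// Hypothetical helper: given the file names found in one task directory,
// returns the generated files that are not present.
function missingTaskFiles(present: string[]): string[] {
  return TASK_FILES.filter((name) => present.indexOf(name) === -1);
}
```

Starter files and any other extra entries in the directory are ignored; only the four expected names are checked.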
- Exams: `benchmarks/courseexam_bench/data/raw/{exam_id}/`
- Labs: `benchmarks/courselab_bench/data/{course_id}/`
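The two path templates above can be filled in as follows. This is a minimal sketch: the function names are hypothetical, and the ID check reflects an assumption (IDs are single path segments) rather than a rule documented by the project.

```typescript
// Base directories taken from the path templates in this README.
const EXAM_BASE = "benchmarks/courseexam_bench/data/raw";
const LAB_BASE = "benchmarks/courselab_bench/data";

// Assumption: an ID is a single path segment, so empty values,
// "." and "..", and anything containing a path separator are rejected.
function checkId(id: string): string {
  if (id === "" || id === "." || id === ".." || /[\/\\]/.test(id)) {
    throw new Error(`invalid id: ${JSON.stringify(id)}`);
  }
  return id;
}

// Hypothetical helper: output directory for a processed exam.
function examOutputDir(examId: string): string {
  return `${EXAM_BASE}/${checkId(examId)}/`;
}

// Hypothetical helper: output directory for an analyzed lab course.
function labOutputDir(courseId: string): string {
  return `${LAB_BASE}/${checkId(courseId)}/`;
}
```

Rejecting separators keeps a mistyped ID from writing outside the two benchmark data directories.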