autoresearch-continual-learning is a public fork and design pivot inspired by karpathy/autoresearch.
This repository is building a constrained autonomous research harness for:
- continual learning
- knowledge editing
- interference-sensitive updates
- multi-metric evaluation with regression risk
- single-GPU workflows where heavyweight runs must be serialized
The repository is focused on the research loop itself:
- bounded editable surfaces
- bounded runs
- structured artifacts
- anti-shim constraints
- promotion and discard rules
This is no longer just a design-doc repo, but it is also not yet a fully self-running autonomous experimentation system.
Current state:
- a real
protocol/constitution layer exists - preflight, schema, parsing, submission, and decision scripts exist
- the 3090 pilot benchmark is complete
- the active implementation family is now Qwen 3.5:
- surrogate lane:
Qwen/Qwen3.5-0.8B-Base - champion lane:
Qwen/Qwen3.5-4B-Base - editable surface:
qwen35_top8_hybrid_attention_mlp
- surrogate lane:
- baseline and method scaffolding exist
- Qwen-family surrogate, champion, and protected-confirmation run classes are frozen for the current launch envelope
- a 12-case CounterFact smoke lane is available only for plumbing checks
- the active v4 lane now uses a 96-case CounterFact standard substrate for in-loop development and protected confirmation
baseline-20260510T133850Zis the current accepted Qwen-family baseline champion on the corrected active v4 standard substrate
Not done yet:
- copying a launch-clean sealed workspace onto the 3090
- hardening monitor/restart behavior for longer unattended operation
- accumulating valid mainline method comparisons against the accepted Qwen baseline champion
So the honest description is:
this repo is a constitution-first continual-learning harness with an accepted Qwen-family baseline champion and a Qwen-wide HyperLoRA first method branch ready for sealed in-loop iteration.
autoresearch is powerful because it makes autonomous research tractable by constraining the loop:
- narrow editable surface
- fixed evaluation surface
- fixed compute budget
- machine-readable result
- keep-or-discard discipline
For continual learning and knowledge editing, those same ideas still matter, but the problem is harder:
- success is not one scalar metric
- regressions matter as much as improvements
- evaluation can be gamed more easily
- hidden capacity and shims are real failure modes
- heavyweight runs are expensive and must be serialized
This repo adapts the autoresearch mentality to that harder setting.
The first 3090 pilot benchmark compared:
Qwen/Qwen3.5-4B-Basemeta-llama/Llama-3.1-8Bgoogle/gemma-3-4b-pt
Original pilot outcome:
- all three fit on the observed 3090
- all three reached
1.0on the repaired bounded visible-dev smoke pack - Gemma initially won the implementation pilot because it matched the legitimacy checks and had the best throughput on the fixed probe
Current launch outcome:
- the harness pivoted to Qwen-family surrogate/champion lanes
Qwen/Qwen3.5-0.8B-Baseis the fast surrogate lane for method developmentQwen/Qwen3.5-4B-Baseis the champion lane for accepted comparisonsbaseline_seq_lora_ft_v11_qwen35_wide_fact_replaycleared protected confirmation as the accepted baseline champion
Active launch pair:
Qwen/Qwen3.5-4B-Baseqwen35_top8_hybrid_attention_mlp
The first target repo is:
conflict_aware_editing
That case study is useful because it already has:
- explicit quality gates
- artifact contracts
- competing method families
- evaluation slices with regression risk
- a strong anti-cheating stance
The design principle here is:
fit conflict_aware_editing to the best constrained autonomous research loop, not the other way around.
This repo is intentionally closer to autoresearch than to a general-purpose software-engineering orchestrator.
Its center of gravity is:
- hypothesis
- bounded change
- bounded run
- structured result
- promote, discard, or escalate
not:
- issue
- agent swarm
- PR theater
The main layers now present are:
protocol/- loop contract
- immutable/editable surfaces
- anti-shim policy
- promotion rules
- run-class policy
- pilot model/surface selection
scripts/- workspace preflight
- spec freezing
- 3090 submission
- artifact parsing
- decision logic
- pilot calibration and visible-dev profiling
eval/- schema validation
- metrics and aggregation
- protected confirmation normalization
- sentinel checks
method/- editable-surface definitions
- trainer shell
- method scaffolding
- baseline wrappers
experiments/- champion state
- append-only ledgers
This repository is a public fork of karpathy/autoresearch, and that inspiration should remain explicit.
What is preserved:
- tight loop constraints
- fixed-budget thinking
- structured result parsing
- human ownership of research organization
What is adapted:
- the target domain
- the evaluation complexity
- the decision logic
- the anti-shim and anti-cheat requirements
Because this is a public fork, some upstream prototype files still exist in the tree. Treat those as inherited reference material, not as the implementation of the current continual-learning harness.