This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
The project uses Pixi for package management and task automation. Key commands:
- `pixi run ci` - Run complete CI pipeline (format, lint, test with coverage)
- `pixi run test` - Run pytest test suite
- `pixi run format` - Format code with ruff
- `pixi run lint` - Run ruff linting and pylint
- `pixi run coverage` - Run tests with coverage reporting
- `pixi run generate-docs` - Generate documentation from examples
- `pixi run demo` - Run demo example (example_image.py)
- `pixi run agent-iterate` - Full CI cycle for AI agents (includes docs, tests, commits, and fixes)
To test a specific example: `pixi run python bencher/example/example_simple_float.py`
- ALWAYS use the pixi environment for every command. Never run raw `python`, `pytest`, `ruff`, or any other tool directly — always prefix with `pixi run` (e.g. `pixi run python ...`, `pixi run pytest ...`). This ensures the correct dependencies and environment are used.
See `docs/how_to_use_bencher.md` for the complete guide on using bencher — sweep types,
result types, the `benchmark()` pattern, plot callbacks, and common mistakes. Read this
before writing any benchmark or example.
- Add or modify the example implementation under `bencher/example/`.
- Register the example in `bencher/example/meta/generate_examples.py` so the documentation generator emits a notebook (pick an appropriate gallery subdirectory).
- Run `pixi run generate-docs` to regenerate gallery notebooks.
- Update relevant user docs (for instance `docs/intro.md` or gallery sections) to mention the new example.
- Make sure `conf.py` includes the docs that are added.
- Execute `pixi run ci` before committing to ensure formatting, linting, and tests all pass.
Bencher is a benchmarking framework built around these core concepts:
- Bench: Main benchmarking class that orchestrates parameter sweeps and result collection
- BenchRunner: Higher-level interface for managing multiple benchmark runs
- BenchCfg/BenchRunCfg: Configuration classes for benchmark setup and execution
- ParametrizedSweep: Base class for defining parameter sweep configurations
- Uses the `param` library for parameter definitions with metadata
- Sweep Classes: `IntSweep`, `FloatSweep`, `StringSweep`, `EnumSweep`, `BoolSweep`
- Parameters define search spaces with bounds and sampling strategies
- Results stored in N-dimensional xarray structures
- BenchResult: Container for benchmark results and visualizations
- HoloviewResult: Base class for interactive plots (scatter, line, heatmap, etc.)
- ComposableContainer: Framework for combining multiple result types
- Video/Image Results: Support for multimedia outputs
- Results automatically cached using diskcache based on parameter hashes
- Result type selection: Use `ResultBool` for binary outcomes (success/failure), `ResultFloat` for continuous metrics, `ResultString` for text, `ResultImage`/`ResultVideo`/`ResultPath` for files
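Results are cached based on parameter hashes. The sketch below shows the general idea with a hand-rolled key function; `param_cache_key` is illustrative only and not part of Bencher's API — Bencher's real hashing lives inside its diskcache-backed caching layer.

```python
import hashlib
import json

def param_cache_key(params: dict) -> str:
    """Build a deterministic cache key from a parameter dict.

    Keys are sorted so that dict insertion order does not affect the hash.
    (Illustrative only; not Bencher's actual implementation.)
    """
    canonical = json.dumps(params, sort_keys=True, default=str)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

key_a = param_cache_key({"x": 1.0, "samples": 10})
key_b = param_cache_key({"samples": 10, "x": 1.0})  # same params, different order
key_c = param_cache_key({"samples": 10, "x": 2.0})  # different value

assert key_a == key_b
assert key_a != key_c
```

The same key is produced for equal parameter sets regardless of dict order, so a cached result can be reused on any re-run with identical inputs.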
- Define parameter sweep configuration class inheriting from ParametrizedSweep
- Implement benchmark function that takes config instance, returns metrics dict
- Bench calculates the Cartesian product of all parameter combinations
- Each combination executed (with caching), results stored in N-D tensor
- Automatic plot type deduction based on parameter/result types
- Results cached persistently for reuse
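The sweep-execution steps above can be sketched in plain Python. The sweep definition, `benchmark` function, and cost metric here are hypothetical stand-ins, not Bencher API; the point is the Cartesian expansion of the parameter space that `Bench` performs internally.

```python
import itertools

# Hypothetical sweep: each parameter maps to its candidate values.
sweep = {
    "threads": [1, 2, 4],
    "algo": ["fast", "accurate"],
}

def benchmark(cfg: dict) -> dict:
    # Stand-in metric; a real benchmark function would measure something.
    return {"cost": cfg["threads"] * (2 if cfg["algo"] == "accurate" else 1)}

# Cartesian product of all parameter combinations.
names = list(sweep)
results = {}
for combo in itertools.product(*sweep.values()):
    cfg = dict(zip(names, combo))
    results[combo] = benchmark(cfg)

print(len(results))  # 3 floats x 2 categories = 6 combinations
```

Bencher stores these results in an N-dimensional xarray structure (one axis per swept parameter) rather than a flat dict, which is what enables automatic plot-type deduction.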
- `bencher/` - Main package source
- `bencher/example/` - Comprehensive examples organized by input dimensions
- `bencher/variables/` - Parameter sweep and result variable definitions
- `bencher/results/` - Result containers and visualization classes
- `test/` - Test suite
- `pyproject.toml` - Project dependencies and Pixi task definitions
- `ruff.toml` - Code formatting/linting configuration
- Line length limit: 100 characters (configured in ruff.toml)
- Uses pytest framework
- Coverage reporting with coverage.py
- Examples serve as integration tests
- Meta-generated examples in `bencher/example/meta/`
All auto-generated examples live under bencher/example/generated/. Each filename must
be globally unique across the entire generated tree — no two files may share a basename
even if they are in different subdirectories. This is required because the documentation
build uses filenames as RST page stems and thumbnail identifiers.
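The uniqueness rule above is easy to verify mechanically. The helper below is a hypothetical checker (not part of the repository's tooling) that reports any `.py` basename appearing more than once under a directory tree:

```python
from collections import Counter
from pathlib import Path

def duplicate_basenames(root: str) -> list[str]:
    """Return .py basenames that occur more than once anywhere under root."""
    counts = Counter(p.name for p in Path(root).rglob("*.py"))
    return sorted(name for name, n in counts.items() if n > 1)

# Demo on a throwaway tree with a deliberate clash:
import tempfile
with tempfile.TemporaryDirectory() as root:
    for sub in ("plot_types", "bool_plot_types"):
        d = Path(root, sub)
        d.mkdir()
        (d / "example_plot_line.py").touch()  # same basename in two subdirs
    print(duplicate_basenames(root))  # ['example_plot_line.py']
```

Running it over `bencher/example/generated/` before committing would catch a clash before the documentation build turns it into two RST pages with the same stem.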
Pattern: `{section_prefix}{descriptive_dimensions}.py`

Every filename uses a section prefix from the table below (each already includes
the `example_` prefix), followed by the varying dimensions encoded in the name:
| Section Prefix | Output Directory | Varying Dimensions |
|---|---|---|
| `example_sweep_` | `{N}_float/{variant}/` | float count, cat count, variant |
| `example_plot_` | `plot_types/` | plot type |
| `example_bool_plot_` | `bool_plot_types/` | plot type |
| `example_result_` | `result_types/result_{type}/` | result type, input dims |
| `example_composable_` | `composable_containers/` | backend, compose type |
| `example_sampling_` | `sampling/` | strategy |
| `example_stats_` | `statistics/` | variant |
| `example_const_vars_` | `const_vars/` | example |
| `example_optim_` | `optimization*/` | objectives, dims, over_time |
| `example_advanced_` | `advanced/` | example |
| `example_workflow_` | `workflows/` | example |
| `example_perf_` | `performance/` | variant |
| `example_regression_` | `regression/` | variant |
| `example_yaml_` | `yaml/` | format |
| `example_publish_` | `publishing/` | example |
| `example_rerun_` | `rerun/` | example |
| `example_agg_` | `aggregation/` | aggregation form, agg_fn |
| `example_cartesian_` | `cartesian_animation/` | (single example) |
| `example_container_tab_` | `container_tabs/` | layout mode |
| `example_levels_` | `levels/` | variant |
Rules for adding new generators:

- Every filename must start with `example_` followed by a unique section prefix.
- Encode every varying dimension in the filename — never rely on the folder path alone.
- The Python function inside the generated file must also start with `example_` (the test harness and doc builder use this prefix for discovery).
- Register the generator in `generate_examples.py:generate_python_files()` and add corresponding entries to `SECTION_GROUPS` for gallery placement.
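The naming rules above lend themselves to a simple lint pass. The checker below is a hypothetical sketch (not part of `generate_examples.py`) that flags a generated file whose filename or function name violates the `example_` prefix convention:

```python
import re

# Lowercase snake_case filename starting with example_, ending in .py.
FILENAME_RE = re.compile(r"^example_[a-z0-9_]+\.py$")

def check_generated_example(filename: str, function_name: str) -> list[str]:
    """Return a list of naming violations (empty list means the names pass)."""
    problems = []
    if not FILENAME_RE.match(filename):
        problems.append(f"filename {filename!r} must match 'example_*.py'")
    if not function_name.startswith("example_"):
        problems.append(f"function {function_name!r} must start with 'example_'")
    return problems

assert check_generated_example("example_plot_heatmap.py", "example_plot_heatmap") == []
assert check_generated_example("plot_heatmap.py", "run_demo") != []
```

A check like this could run inside the test harness so a mis-named generator fails CI rather than silently producing an undiscoverable example.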