feat(experimentation): experiment results model, task and endpoints by gagantrivedi · Pull Request #7796 · Flagsmith/flagsmith

gagantrivedi · 2026-06-16T10:44:00Z

Thanks for submitting a PR! Please check the boxes below:

I have read the Contributing Guide.
I have added information to docs/ if required so people know about the feature.
I have filled in the "Changes" section below.
I have filled in the "How did you test this code" section below.

Changes

Contributes to the experiment results stats layer (stacked on #7781; merge after it).

To let the experiment detail page show per-metric results (lift, chance-to-win, and a sample-ratio-mismatch check), this PR computes them from the warehouse on demand and stores one row per experiment, updated in place (mirroring the exposures panel).

ExperimentResults model (migration 0008), on a new abstract ExperimentComputation base now shared with ExperimentExposures: as_of / payload / last_error_at / refresh_requested_at, is_final, and record_refresh / record_failure / record_refresh_request.
compute_results_summary (services.py): derives the metric specs and the expected SRM split from the environment's multivariate allocations (control = unallocated remainder), then runs the results aggregation (#7781) and the Bayesian stats kernel (#7769). SRM is skipped (and srm.unkeyed_variant logged) when an option has no variant key to attribute its share to.
compute_experiment_results task: recomputes the full window; on warehouse failure keeps the last good payload and logs results.compute_failed.
GET …/results/ and POST …/results/refresh/: read the row, or enqueue a refresh (202). _validate_refresh_request raises ValidationError before start / once final (400) and Throttled within the refresh interval (429 + Retry-After).
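
The expected SRM split described for compute_results_summary can be illustrated with a small standalone sketch. It is not the PR's `_expected_variant_shares` implementation: the function name, the `(variant_key, percentage)` input shape, and the `"control"` label are assumptions made for illustration. It only shows the two rules stated above: control receives the unallocated remainder, and the check is skipped when an option has no variant key.

```python
from decimal import Decimal
from typing import Optional

CONTROL_KEY = "control"  # hypothetical label for the control arm


def expected_variant_shares(
    allocations: list[tuple[Optional[str], Decimal]],
) -> Optional[dict[str, Decimal]]:
    """Return the expected traffic share per variant key, or None to skip SRM.

    `allocations` holds one (variant_key, percentage_allocation) pair per
    multivariate option; this input shape is an assumption of the sketch.
    """
    shares: dict[str, Decimal] = {}
    for key, percentage in allocations:
        if not key:
            # No variant key to attribute this share to: signal "skip SRM".
            return None
        shares[key] = shares.get(key, Decimal(0)) + percentage / Decimal(100)
    # Control is whatever the multivariate options leave unallocated.
    shares[CONTROL_KEY] = Decimal(1) - sum(shares.values(), Decimal(0))
    return shares
```

With two options allocated 30% and 20%, the sketch assigns the remaining 50% to control. Returning `None` rather than a partial split keeps the caller from running an SRM check against shares that cannot be attributed.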
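
The refresh gating on the POST endpoint can be summarised as a pure function. This is a hedged sketch, not the PR's `_validate_refresh_request`: the function name, its parameters, and the five-minute interval are assumptions (the PR does not state the interval), and it returns status codes directly instead of raising DRF's ValidationError and Throttled. It also assumes the throttle window is measured from refresh_requested_at.

```python
import math
from datetime import datetime, timedelta
from typing import Optional

REFRESH_INTERVAL = timedelta(minutes=5)  # assumed value, not taken from the PR


def check_refresh(
    now: datetime,
    start: datetime,
    is_final: bool,
    refresh_requested_at: Optional[datetime],
) -> tuple[int, Optional[int]]:
    """Return (http_status, retry_after_seconds) for a refresh request."""
    if now < start or is_final:
        # Before the experiment starts, or once results are final: 400.
        return 400, None
    if refresh_requested_at is not None:
        elapsed = now - refresh_requested_at
        if elapsed < REFRESH_INTERVAL:
            # Inside the refresh interval: 429 with a Retry-After hint.
            remaining = (REFRESH_INTERVAL - elapsed).total_seconds()
            return 429, math.ceil(remaining)
    # Otherwise the refresh is enqueued and the endpoint answers 202.
    return 202, None
```

Keeping the decision separate from the view makes each branch (400, 429, 202) easy to unit test without a request object.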

How did you test this code?

make test for the experimentation app — unit tests for the model, task, compute_results_summary / _experiment_metric_specs / _expected_variant_shares, and both endpoints, at 100% diff coverage. mypy and ruff clean.

vercel · 2026-06-16T10:44:07Z

Project	Deployment	Actions	Updated (UTC)
docs	Ready	Preview, Comment	Jun 19, 2026 9:21am

2 Skipped Deployments

Project	Deployment	Actions	Updated (UTC)
flagsmith-frontend-preview	Ignored	Preview	Jun 19, 2026 9:21am
flagsmith-frontend-staging	Ignored	Preview	Jun 19, 2026 9:21am

codecov · 2026-06-16T10:50:00Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 98.59%. Comparing base (1bdb7ca) to head (b4b866e).
⚠️ Report is 1 commit behind head on main.

Additional details and impacted files

@@           Coverage Diff            @@
##             main    #7796    +/-   ##
========================================
  Coverage   98.58%   98.59%            
========================================
  Files        1466     1467     +1     
  Lines       57010    57332   +322     
========================================
+ Hits        56203    56525   +322     
  Misses        807      807

github-actions · 2026-06-17T10:42:26Z

Docker builds report

Image	Build Status	Security report
`ghcr.io/flagsmith/flagsmith-e2e:pr-7796`	Finished ✅	Skipped
`ghcr.io/flagsmith/flagsmith-api-test:pr-7796`	Finished ✅	Skipped
`ghcr.io/flagsmith/flagsmith-frontend:pr-7796`	Finished ✅	Results ✅
`ghcr.io/flagsmith/flagsmith-api:pr-7796`	Finished ✅	Results ✅
`ghcr.io/flagsmith/flagsmith:pr-7796`	Finished ✅	Results ✅
`ghcr.io/flagsmith/flagsmith-private-cloud:pr-7796`	Finished ✅	Results ✅

github-actions · 2026-06-17T10:48:52Z

Playwright Test Results (oss - depot-ubuntu-latest-16)

1 passed

1 test across 1 suite
41.5 seconds
17e9032
🔄 Run: #17579 (attempt 1)

Playwright Test Results (oss - depot-ubuntu-latest-arm-16)

1 passed

1 test across 1 suite
37 seconds
17e9032
🔄 Run: #17579 (attempt 1)

Playwright Test Results (private-cloud - depot-ubuntu-latest-16)

1 failed