feat(nimbus): Add enrollment alert data GCS reader and Celery fetch task by yashikakhurana · Pull Request #15035 · mozilla/experimenter

yashikakhurana · 2026-03-25T16:48:22Z

Because

The enrollment count JSON is now available, and we can use it to set the monitoring data for each experiment. That data can then be used to alert users about thresholds and SRM.

This commit

Fetches the JSON data, parses it, and sets it on the experiment's monitoring_data field.

mikewilli · 2026-03-26T18:08:45Z

experimenter/experimenter/settings.py

+    "fetch_monitoring_data": {
+        "task": "experimenter.jetstream.tasks.fetch_monitoring_data",
+        "schedule": crontab(minute=0, hour=8),
+        "options": {"expires": 3600},


What does this expiration option do?

The timeout is 1 hour for this request.

Is this useful here? We don't use it for any of the other tasks, and I can't really tell from the Celery docs what scenario this parameter is trying to handle.

If we do want this, I think a more appropriate timeout would be ~5min. Were you seeing really long execution times in testing this? Or is there another reason this should be 1hr?
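For context on the question above: in Celery, an `expires` value passed through a beat entry's `options` is an expiry on the queued task message, counted in seconds from publish time. A worker that receives the message after that point discards it instead of running it, so the option does not bound how long the task may run. The following is a simplified stdlib model of that rule, not Celery's own code; `is_expired` and the timestamps are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def is_expired(published_at: datetime, expires_seconds: int, received_at: datetime) -> bool:
    """Simplified model: a message expires `expires_seconds` after it was published."""
    return received_at > published_at + timedelta(seconds=expires_seconds)


# Hypothetical run: beat publishes the task at 08:00 UTC with expires=3600.
published = datetime(2026, 3, 26, 8, 0, tzinfo=timezone.utc)

# A worker that picks the message up five minutes later still runs it.
picked_up_quickly = is_expired(published, 3600, published + timedelta(minutes=5))

# A worker that was down and only sees the message at 09:30 would skip it.
picked_up_late = is_expired(published, 3600, published + timedelta(minutes=90))

print(picked_up_quickly, picked_up_late)
```

Read this way, `expires` mostly guards against stale runs piling up after worker downtime. A cap on execution time would be a separate setting, such as a task time limit.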

mikewilli · 2026-03-26T18:11:14Z

experimenter/experimenter/settings.py

    },
+    "fetch_monitoring_data": {
+        "task": "experimenter.jetstream.tasks.fetch_monitoring_data",
+        "schedule": crontab(minute=0, hour=8),


This is fine, but be aware that there will be times when the data isn't available by 8 (especially if this is UTC, which I think it is?). It might be nice to have a way to trigger this manually in those cases, but I think it's OK to worry about it later if it happens often enough to be worth fixing.

Makes sense, let me note it down somewhere so that we don't miss it.

mikewilli · 2026-03-26T18:12:39Z

experimenter/experimenter/slack/constants.py

It doesn't look like we're actually using these constants yet. Can you wait to include them in the PR where they are actually needed?

Okay, I can remove these. I just added them while prepping for the next task.

mikewilli · 2026-03-26T18:14:35Z

experimenter/experimenter/jetstream/client.py

+    except Exception as e:
+        logger.error(f"Failed to load monitoring data from GCS: {e}")
+        return {}


I think we should just raise this exception and let the task handle it along with any logging. That fits with what we're doing in the other tasks, and it also doesn't seem useful to overwrite yesterday's data with blank data if there was an error getting the new data.
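A minimal sketch of the shape suggested here, assuming the reader lets errors propagate and the task owns the logging and metrics. Everything in it is a stand-in: `FakeMetrics`, the `loader` argument, and the `store` dict are hypothetical, and whether the real task should also re-raise after logging is left open.

```python
import logging

logger = logging.getLogger("monitoring_sketch")


class FakeMetrics:
    """Hypothetical stand-in for the project's metrics client."""

    def __init__(self):
        self.counts = {}

    def incr(self, name):
        self.counts[name] = self.counts.get(name, 0) + 1


def get_monitoring_data(loader):
    # No try/except here: a failed read raises instead of returning {}.
    return loader()


def fetch_monitoring_data(loader, store, metrics):
    try:
        data = get_monitoring_data(loader)
    except Exception:
        # The task handles the error; the previous data in `store` is untouched.
        logger.exception("Failed to load monitoring data")
        metrics.incr("fetch_monitoring_data.failed")
        return
    store.update(data)
    metrics.incr("fetch_monitoring_data.completed")


def failing_loader():
    raise ConnectionError("GCS unavailable")


# Yesterday's data survives a failed fetch.
store = {"exp-a": {"enrolled": 10}}
metrics = FakeMetrics()
fetch_monitoring_data(failing_loader, store, metrics)
print(store, metrics.counts)
```

Because the reader never substitutes an empty dict, the task can tell "no data today" apart from "the read failed" and record each differently.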

mikewilli · 2026-03-26T18:19:48Z

experimenter/experimenter/jetstream/tasks.py

+            logger.warning("No enrollment alert data found in GCS")
+            metrics.incr("fetch_monitoring_data.completed")


Should this be an error log and a .failed metric status?

mikewilli · 2026-03-26T18:53:38Z

experimenter/experimenter/jetstream/tests/test_tasks.py

+
+        self.assertEqual(result, {})
+
+    @patch("experimenter.jetstream.client.load_data_from_gcs")


I think you can patch these for everything without annotating every function by creating a fixture like this, and having it take a parameter to set the return value dynamically.
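One way to do this with plain `unittest` is to start the patch once in `setUp` and expose a small helper that sets the return value per test. This is a sketch under assumptions, not the fixture the comment links to: the `client` namespace below is a stand-in for the real client module, and `set_gcs_data` is a hypothetical helper name.

```python
import types
import unittest
from unittest.mock import patch

# Stand-in for the real client module (hypothetical).
client = types.SimpleNamespace(load_data_from_gcs=lambda path: {"real": True})


def get_monitoring_data():
    return client.load_data_from_gcs("enrollment_alerts.json")


class MonitoringDataTests(unittest.TestCase):
    def setUp(self):
        # Patch once for every test in the class; no per-test decorator needed.
        patcher = patch.object(client, "load_data_from_gcs")
        self.mock_load = patcher.start()
        self.addCleanup(patcher.stop)

    def set_gcs_data(self, value):
        # Each test chooses what the patched loader returns.
        self.mock_load.return_value = value

    def test_returns_empty_payload(self):
        self.set_gcs_data({})
        self.assertEqual(get_monitoring_data(), {})

    def test_returns_loaded_payload(self):
        self.set_gcs_data({"exp-a": {"enrolled": 5}})
        self.assertEqual(get_monitoring_data(), {"exp-a": {"enrolled": 5}})


suite = unittest.defaultTestLoader.loadTestsFromTestCase(MonitoringDataTests)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

`addCleanup(patcher.stop)` restores the original function after each test, even when the test fails.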

mikewilli · 2026-03-26T18:55:01Z

experimenter/experimenter/jetstream/tests/test_tasks.py

+        result = get_monitoring_data()
+
+        self.assertEqual(result, {})


Yeah, see my other comment on this, but I really think this should result in an exception from get_monitoring_data that is handled by the task.

mikewilli · 2026-03-26T18:58:53Z

experimenter/experimenter/jetstream/tests/test_tasks.py

+        ):
+            mock_get.return_value = experiment
+            # Should not raise, should log and continue
+            tasks.fetch_monitoring_data()


I'm not totally sure this test is necessary, but I guess it ensures that a random exception doesn't break the task?

Can we assert on something here, such as the status metric or the log message being emitted?
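As a sketch of what such an assertion could look like, `unittest`'s `assertLogs` can check that the error was logged while the loop kept going. The `update_experiments` function and the logger name are hypothetical stand-ins for the task's per-experiment loop.

```python
import logging
import unittest

logger = logging.getLogger("monitoring_sketch.tasks")


def update_experiments(data, save):
    """Apply each payload; log and continue if a single experiment fails."""
    failed = []
    for slug, payload in data.items():
        try:
            save(slug, payload)
        except Exception:
            logger.exception("Failed to update monitoring data for %s", slug)
            failed.append(slug)
    return failed


class UpdateExperimentsTests(unittest.TestCase):
    def test_error_is_logged_and_loop_continues(self):
        saved = {}

        def save(slug, payload):
            if slug == "bad":
                raise RuntimeError("boom")
            saved[slug] = payload

        # assertLogs fails the test if no ERROR record is emitted on this logger.
        with self.assertLogs("monitoring_sketch.tasks", level="ERROR") as captured:
            failed = update_experiments({"bad": 1, "good": 2}, save)

        self.assertEqual(failed, ["bad"])
        self.assertEqual(saved, {"good": 2})
        self.assertIn("bad", captured.output[0])


suite = unittest.defaultTestLoader.loadTestsFromTestCase(UpdateExperimentsTests)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

The same pattern extends to a metrics mock, for example asserting that a `.failed` counter was incremented once.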

mikewilli · 2026-03-26T19:00:36Z

experimenter/experimenter/jetstream/tasks.py

+            try:
+                experiment = NimbusExperiment.objects.get(
+                    slug=exp_slug,
+                    status=NimbusConstants.Status.LIVE,


This should work for COMPLETE also, no?

mikewilli · 2026-03-26T19:01:00Z

experimenter/experimenter/jetstream/tests/test_tasks.py

+    @patch("experimenter.jetstream.tasks.get_monitoring_data")
+    def test_fetch_monitoring_data_updates_live_experiment(self, mock_get_data):
+        experiment = NimbusExperimentFactory.create(
+            status=NimbusExperiment.Status.LIVE,


Maybe parametrize this so it takes both LIVE and COMPLETE.
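A sketch of the parametrized form, covering both this suggestion and the earlier question about COMPLETE experiments. It uses `unittest`'s `subTest` and plain dicts so it runs without Django; `Status`, `UPDATABLE_STATUSES`, and `apply_monitoring_data` are hypothetical stand-ins, and in the real task the equivalent change would presumably be a queryset filter over both statuses.

```python
import enum
import unittest


class Status(enum.Enum):
    DRAFT = "Draft"
    LIVE = "Live"
    COMPLETE = "Complete"


# Assumption: monitoring data applies to experiments that are live or complete.
UPDATABLE_STATUSES = frozenset({Status.LIVE, Status.COMPLETE})


def apply_monitoring_data(experiments, data):
    """Set monitoring_data on matching experiments; return the updated slugs."""
    updated = []
    for slug, payload in data.items():
        experiment = experiments.get(slug)
        if experiment is None or experiment["status"] not in UPDATABLE_STATUSES:
            continue
        experiment["monitoring_data"] = payload
        updated.append(slug)
    return updated


class ApplyMonitoringDataTests(unittest.TestCase):
    def test_updates_live_and_complete(self):
        for status in (Status.LIVE, Status.COMPLETE):
            with self.subTest(status=status):
                experiments = {"exp-a": {"status": status, "monitoring_data": None}}
                updated = apply_monitoring_data(experiments, {"exp-a": {"enrolled": 7}})
                self.assertEqual(updated, ["exp-a"])
                self.assertEqual(experiments["exp-a"]["monitoring_data"], {"enrolled": 7})

    def test_skips_other_statuses(self):
        experiments = {"exp-a": {"status": Status.DRAFT, "monitoring_data": None}}
        self.assertEqual(apply_monitoring_data(experiments, {"exp-a": {"enrolled": 7}}), [])
        self.assertIsNone(experiments["exp-a"]["monitoring_data"])


suite = unittest.defaultTestLoader.loadTestsFromTestCase(ApplyMonitoringDataTests)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

With `subTest`, a failure for one status is reported separately and the other status still runs.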

yashikakhurana added 2 commits March 25, 2026 09:45

feat(nimbus): Add enrollment alert data GCS reader and Celery fetch task

86bdbaf

feat(nimbus): Add enrollment alert data GCS reader and Celery fetch task

dd7ac4a

yashikakhurana marked this pull request as ready for review March 25, 2026 17:19

yashikakhurana requested review from freshstrangemusic, jaredlockhart, mikewilli and relud as code owners March 25, 2026 17:19

yashikakhurana linked an issue Mar 25, 2026 that may be closed by this pull request

Experimenter- Add enrollment alert thresholds and message templates #15036

Open

7 tasks

mikewilli reviewed Mar 26, 2026

Update experimenter/experimenter/jetstream/tasks.py

1b9b930

Co-authored-by: Mike Williams <102263964+mikewilli@users.noreply.github.com>
