smite-ir: implement `OpenChannelGenerator` by morehouse · Pull Request #18 · morehouse/smite

morehouse · 2026-04-01T22:23:40Z

Adds the functionality needed to generate IR programs that exercise the open_channel -> accept_channel flow.

Key pieces:

ProgramBuilder is the toolkit for building type-correct programs.
Generator is the trait defining the generator interface.
OpenChannelGenerator generates IR programs that build and send open_channel messages, then wait for the accept_channel reply.

Ref: #5 (Milestone 1)

Provides a list of all fields extractable from a compound variable, along with the Operations required to extract them. The immediate use case is for extracting fields from AcceptChannel compound variables when generating programs for the open_channel -> accept_channel flow. Eventually we will also use this for other compound variables.
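As a rough illustration of the idea, a field table could map a compound type to (field type, extracting operation) pairs. The names `VariableType`, `Operation`, and `extractable_fields`, and the three fields shown, are assumptions made for this sketch and are not taken from the PR's code.

```rust
/// Hypothetical variable types; the real enum in smite-ir may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum VariableType {
    AcceptChannel,
    ChannelId,
    Point,
    Amount,
}

/// Hypothetical extraction operations, one per extractable field.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Operation {
    ExtractTemporaryChannelId,
    ExtractFundingPubkey,
    ExtractDustLimit,
}

/// Lists (field type, extracting operation) pairs for a compound type.
/// Non-compound types have nothing to extract.
pub fn extractable_fields(ty: VariableType) -> &'static [(VariableType, Operation)] {
    match ty {
        VariableType::AcceptChannel => &[
            (VariableType::ChannelId, Operation::ExtractTemporaryChannelId),
            (VariableType::Point, Operation::ExtractFundingPubkey),
            (VariableType::Amount, Operation::ExtractDustLimit),
        ],
        _ => &[],
    }
}
```

A generator could walk this table after a receive step and emit one extraction instruction per entry.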

ProgramBuilder is the toolkit for building Programs and is intended to be used by generators and mutators. It maintains:
- The instruction list being generated (append-only, SSA).
- A type-indexed variable registry.
- The pick_variable method for selecting type-correct variables.
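A minimal sketch of how an append-only instruction list and a type-indexed registry might fit together. The struct layout, the `append` and `variables_of` methods, and the two-variant `VariableType` are assumptions for illustration; the actual ProgramBuilder in the PR may be organized differently.

```rust
use std::collections::HashMap;

/// Hypothetical variable types, reduced to two for the sketch.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub enum VariableType {
    Amount,
    Point,
}

/// In SSA form each instruction defines exactly one variable, so an
/// instruction's index doubles as the index of the variable it defines.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Instruction {
    pub output: VariableType,
}

#[derive(Default)]
pub struct ProgramBuilder {
    instructions: Vec<Instruction>,
    by_type: HashMap<VariableType, Vec<usize>>,
}

impl ProgramBuilder {
    pub fn new() -> Self {
        Self::default()
    }

    /// Appends an instruction and registers its output variable by type.
    pub fn append(&mut self, output: VariableType) -> usize {
        let index = self.instructions.len();
        self.instructions.push(Instruction { output });
        self.by_type.entry(output).or_default().push(index);
        index
    }

    /// Existing variables of type `ty`, oldest first.
    pub fn variables_of(&self, ty: VariableType) -> &[usize] {
        self.by_type.get(&ty).map(Vec::as_slice).unwrap_or(&[])
    }
}
```

Keeping the registry keyed by type means a type-correct candidate lookup is a single map access rather than a scan over the instruction list.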

The generator creates IR sequences that do the following:
1. Build an open_channel message with mostly arbitrary fields, except that the correct chain_hash is pulled from the context.
2. Send the open_channel message to the target.
3. Receive any accept_channel response from the target.
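The three steps might translate into an instruction sequence along these lines. This is only a sketch: the `Operation` variants, the `Instruction` layout, and `generate_open_channel` are hypothetical, and a single amount stands in for the many open_channel fields.

```rust
/// Hypothetical IR operations for the open_channel flow.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum Operation {
    LoadChainHashFromContext,
    LoadAmount(u64),
    BuildOpenChannel,
    SendMessage,
    RecvAcceptChannel,
}

#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Instruction {
    pub operation: Operation,
    /// Indices of earlier instructions whose outputs are consumed.
    pub inputs: Vec<usize>,
}

/// Emits build, send, and receive instructions and returns the program.
pub fn generate_open_channel(funding_satoshis: u64) -> Vec<Instruction> {
    let mut program = Vec::new();
    // Appends one instruction and returns the index of its output variable.
    let mut emit = |operation: Operation, inputs: Vec<usize>| -> usize {
        program.push(Instruction { operation, inputs });
        program.len() - 1
    };
    // Step 1: build open_channel; chain_hash comes from the context.
    let chain_hash = emit(Operation::LoadChainHashFromContext, vec![]);
    let amount = emit(Operation::LoadAmount(funding_satoshis), vec![]);
    let msg = emit(Operation::BuildOpenChannel, vec![chain_hash, amount]);
    // Step 2: send the message to the target.
    emit(Operation::SendMessage, vec![msg]);
    // Step 3: receive the accept_channel response.
    emit(Operation::RecvAcceptChannel, vec![]);
    program
}
```

Every input index refers to an earlier instruction, which is the property the SSA, append-only instruction list relies on.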

We have access to expected input and output types, so we can ensure the types match up during Program construction. Currently we panic on the first invalid Instruction since generators are expected to *always* generate type-correct programs, and we want to detect generator bugs early on in development. Once we add support for mutators that rewrite existing programs, we will need to handle the situation where a corpus input deserializes to an invalid Program, causing us to panic on rewrite. The simplest solution is to implement and use Program::validate() to check deserialized corpus inputs, refusing to mutate inputs that do not validate.
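Since Program::validate() is only proposed here, the following is a guess at its shape rather than the planned implementation. It assumes each instruction records its expected input types and its output type; `ValidationError` and all field names are invented for the sketch.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum VariableType {
    Amount,
    Message,
}

#[derive(Debug, Clone)]
pub struct Instruction {
    /// Types the operation expects, in order.
    pub input_types: Vec<VariableType>,
    /// Indices of the instructions that supply each input.
    pub inputs: Vec<usize>,
    pub output_type: VariableType,
}

#[derive(Debug, PartialEq, Eq)]
pub enum ValidationError {
    WrongInputCount { instruction: usize },
    ForwardReference { instruction: usize, input: usize },
    TypeMismatch { instruction: usize, input: usize },
}

pub struct Program {
    pub instructions: Vec<Instruction>,
}

impl Program {
    /// Returns the first problem found instead of panicking, so callers
    /// can refuse to mutate corpus inputs that do not validate.
    pub fn validate(&self) -> Result<(), ValidationError> {
        for (i, ins) in self.instructions.iter().enumerate() {
            if ins.inputs.len() != ins.input_types.len() {
                return Err(ValidationError::WrongInputCount { instruction: i });
            }
            for (&input, &expected) in ins.inputs.iter().zip(&ins.input_types) {
                // SSA: inputs must be defined by strictly earlier instructions.
                if input >= i {
                    return Err(ValidationError::ForwardReference { instruction: i, input });
                }
                if self.instructions[input].output_type != expected {
                    return Err(ValidationError::TypeMismatch { instruction: i, input });
                }
            }
        }
        Ok(())
    }
}
```

A mutator entry point could call `validate()` right after deserialization and skip the input on `Err`, keeping the panic-on-invalid behavior for generator bugs only.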

devvaansh

Took a deep look at the IR builder and Operation logic. I have a few suggestions regarding determinism for reproducibility, plus some potential hot-path optimizations for fuzzing throughput.

smite-ir/src/builder.rs

smite-ir/src/operation.rs

ekzyis

lgtm, just one question

ekzyis · 2026-04-04T17:31:11Z

smite-ir/src/generators.rs

+/// A generator that emits instructions into a `ProgramBuilder`.
+pub trait Generator {
+    /// Emits instructions for this generator's protocol interaction.
+    fn generate(&self, builder: &mut ProgramBuilder, rng: &mut impl Rng);


nit: rng is only used inside ProgramBuilder, and #5 mentions this, which suggests to me that a generator shouldn't be in control of what a builder will select:

> Each generator [...] delegates value selection and variable reuse to ProgramBuilder.

Should the generator then be in control of the RNG? Have you considered passing it in ProgramBuilder::new instead?

morehouse · 2026-04-06

I think it makes sense to have rng in the generate interface. Currently generators don't use it directly, but in the future they probably will. For example, an interactive-tx generator may randomly choose what transaction inputs and outputs to construct, or randomly arrange the tx_add_* and tx_remove_* messages to send.

erickcestari

LGTM!

devvaansh · 2026-04-05T05:44:14Z

Also LGTM👍

harsh04044

The separation of generate_fresh() for keys vs pick_variable() for scalar parameters is a clean design. For the interactive-tx generator, serial_ids have a similar constraint -- parity is protocol-mandated (even for initiator, odd for non-initiator), so they'd need to be generated fresh with explicit parity enforcement rather than picked from the candidate pool.

Is the intent that generators are responsible for enforcing protocol constraints like this, or is there a planned mechanism in ProgramBuilder to support constrained variable generation?

morehouse · 2026-04-06T15:35:15Z

> The separation of generate_fresh() for keys vs pick_variable() for scalar parameters is a clean design. For the interactive-tx generator, serial_ids have a similar constraint -- parity is protocol-mandated (even for initiator, odd for non-initiator), so they'd need to be generated fresh with explicit parity enforcement rather than picked from the candidate pool.
>
> Is the intent that generators are responsible for enforcing protocol constraints like this, or is there a planned mechanism in ProgramBuilder to support constrained variable generation?

I would lean towards managing those kinds of constraints in the generators. If we have a recurring need for the same functionality we could consider moving it to ProgramBuilder.
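For the serial_id case discussed above, generator-side enforcement could be as small as adjusting the low bit of a freshly generated value. The helper name `enforce_serial_id_parity` is hypothetical and does not come from the PR.

```rust
/// Forces the low bit of a freshly generated serial_id to match the role:
/// even for the initiator, odd for the non-initiator.
pub fn enforce_serial_id_parity(raw: u64, is_initiator: bool) -> u64 {
    if is_initiator {
        raw & !1 // clear the low bit
    } else {
        raw | 1 // set the low bit
    }
}
```

Because only one bit is touched, the remaining 63 bits keep whatever distribution the value generator produced.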

Chand-ra · 2026-04-07T15:36:49Z

smite-ir/src/builder.rs

+    /// Selects or creates a variable of the given type using probabilistic
+    /// variable selection (75% most recent, 15% any existing, 10% fresh).
+    #[allow(clippy::missing_panics_doc)] // candidates is always non-empty
+    pub fn pick_variable(&mut self, var_type: VariableType, rng: &mut impl Rng) -> usize {


Nit: I think a TODO comment noting that we'd like to replace these hardcoded probabilities with an adaptive mutation scheduler (like MOpt) would make it easy to circle back here later.

Other than this, LGTM.
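To make the 75/15/10 split concrete, here is a sketch of the selection logic with the suggested TODO in place. It is not the PR's pick_variable: the random draws are passed in as plain numbers (`roll` in 0..100 and an arbitrary `pick`) instead of coming from an `Rng`, the `Choice` enum is invented, and an empty candidate list is assumed to fall back to a fresh variable.

```rust
#[derive(Debug, PartialEq, Eq)]
pub enum Choice {
    MostRecent(usize),
    AnyExisting(usize),
    Fresh,
}

/// Chooses among existing variables of one type (oldest first).
/// `roll` is expected to be uniform in 0..100; `pick` is any uniform value.
// TODO: replace these hardcoded probabilities with an adaptive scheduler
// (e.g. MOpt-style) once there is data to tune against.
pub fn choose(candidates: &[usize], roll: u32, pick: usize) -> Choice {
    match (candidates.last(), roll) {
        // Nothing to reuse: a fresh variable is the only option.
        (None, _) => Choice::Fresh,
        // 75%: reuse the most recently defined variable.
        (Some(&last), 0..=74) => Choice::MostRecent(last),
        // 15%: reuse any existing variable.
        (Some(_), 75..=89) => Choice::AnyExisting(candidates[pick % candidates.len()]),
        // 10%: generate a fresh variable.
        (Some(_), _) => Choice::Fresh,
    }
}
```

Separating the draw from the decision also makes the thresholds easy to unit test without seeding an RNG.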

morehouse added 5 commits April 1, 2026 14:07

smite-ir: add generator and builder tests

4c0a34a

devvaansh reviewed Apr 3, 2026

smite-ir/src/builder.rs (resolved)

smite-ir/src/operation.rs (resolved)

ekzyis approved these changes Apr 4, 2026

erickcestari approved these changes Apr 5, 2026

harsh04044 approved these changes Apr 5, 2026

Chand-ra reviewed Apr 7, 2026

morehouse wants to merge 5 commits into master from generators

6 participants
