Add cut-point rules for specific functions / intrinsics by dkcumming · Pull Request #931 · runtimeverification/mir-semantics

dkcumming · 2026-02-03T12:55:13Z

This PR adds a `--break-on-function <STR>` flag to the `kmir prove-rs` command. Each string provided is treated as a cut point: execution breaks whenever a function call matches the string. The strings are added to a set in the K configuration, and each function / intrinsic call is modified to follow this logic:

If a string in the breakOnFunctions cell is a substring of the function / intrinsic call, then break on the call (processing the terminator in the same way);
else: continue as before
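The matching logic above can be sketched in Python. This is only an illustration of the substring check, not the K rule itself; `should_break` and the example names are hypothetical.

```python
def should_break(call_name: str, break_on: set[str]) -> bool:
    """Return True if any configured string occurs inside the callee name."""
    return any(pattern in call_name for pattern in break_on)


# Hypothetical contents of the breakOnFunctions set.
break_on = {"transfer", "black_box"}

print(should_break("spl_token::processor::transfer", break_on))  # True
print(should_break("core::ptr::read", break_on))                 # False
```

Because the check is a substring match, a short string such as `transfer` also matches every longer path that contains it.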

Multiple strings can be added to the set, but each needs its own --break-on-function flag, since I couldn't think of a separator that is common and would not appear in the fully qualified path of a function (if we want to provide the fully qualified path).
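A minimal sketch of the repeated-flag pattern, using Python's standard `argparse` with `action="append"`. This is not the actual kmir CLI code; the parser name and the example paths are made up for illustration.

```python
import argparse

# Hypothetical stand-in parser, not the real kmir argument parser.
parser = argparse.ArgumentParser(prog="prove-rs-sketch")
parser.add_argument(
    "--break-on-function",
    dest="break_on_function",
    action="append",  # each occurrence of the flag appends one string
    default=[],
    metavar="STR",
)

# The second value contains a comma, so a comma separator would split it.
args = parser.parse_args([
    "--break-on-function", "foo::bar",
    "--break-on-function", "Vec<(u8, u16)>::push",
])

break_on = set(args.break_on_function)
print(sorted(break_on))
```

Repeating the flag avoids having to pick a separator and escape it inside the values.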

I tested this on the local mir-semantics tests and on SPL token and both worked as expected.

The `breakOnFunctions` cell will contain identifiers of functions that should be broken on by the `termCallFunctionFilter` rule, which is identical to `termCallFunction` except that it checks whether the id is in the `breakOnFunctions` cell and has a different identifier for the cut point rules.

Fully qualified paths may have commas. Instead of using a separator, use multiple occurrences of the flag

jberthold · 2026-02-04T00:01:42Z

As discussed today in a meeting: This seems like a very useful feature for debugging proofs and semantic corner cases.
The one drawback I see is that, because the function names are held in a config cell, a proof cannot (currently) be continued from a saved KCFG with a changed set of function names: they will be loaded together with the proof.
Nevertheless this is useful, with the caveat that the proof needs to be restarted when the set of "cut functions" changes.

Maybe there is an easy way to modify the pending nodes in a proof (updating this config cell)?

jberthold

Approving, we can merge this for now (will be easy to back out later).
In the long run we might want to use some K IO for this (CC @ehildenb), something like reading an environment variable with https://github.com/runtimeverification/k/blob/master/k-distribution/include/kframework/builtin/domains.md#shell-access

tothtamas28 · 2026-02-04T14:46:58Z

An alternative approach that @dkcumming and I have considered was instrumenting the program definition as follows:

For each function symbol foo in the SMIR JSON, generate a single rule labeled call-function-foo (escaping characters allowed in the symbol but not in the label) that fires exactly once, immediately before the function call is interpreted.

With this approach, there’s no need to extend the configuration. The open question is what the call-function-foo rules would actually look like in practice, and in particular how much the existing semantics would need to be modified to support this.
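The label-generation step of this alternative could look roughly like the following Python sketch. The escaping scheme is an assumption (every character other than ASCII letters and digits becomes `_<hex>_`); the characters K actually permits in rule labels are not stated here, and `rule_label` is a hypothetical helper.

```python
import re


def rule_label(symbol: str) -> str:
    """Build a label such as 'call-function-foo' from a function symbol.

    Assumed scheme: anything outside [A-Za-z0-9] is replaced by its
    code point in hex, wrapped in underscores. Literal underscores are
    escaped as well, so the mapping stays unambiguous.
    """
    def escape(match: re.Match) -> str:
        return f"_{ord(match.group(0)):x}_"

    return "call-function-" + re.sub(r"[^A-Za-z0-9]", escape, symbol)


print(rule_label("foo"))              # call-function-foo
print(rule_label("core::mem::swap"))  # call-function-core_3a__3a_mem_3a__3a_swap
```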

…on)) (#960) This PR builds upon #931, modifying the approach in response to the comments on that PR. For full context, read #931 _first_. The `kmir prove-rs` flag `--break-on-function` is implemented in this PR as a compiled definition with a hooked function that retrieves the function names to match on. This is similar to the existing pattern that compiles the static data of a KMIR configuration into the definition. It allows function names to be provided both when creating the initial proof and when reading a proof from disk (providing different flags triggers a recompile of llvm). I added a test that demonstrates this working on functions and intrinsics, matching only those provided. I do not have a test for reading a partial proof and adding different function names; I did test it, but a dedicated test seemed like overkill for now. I also tried the [K shell access impure function](https://github.com/runtimeverification/k/blob/master/k-distribution/include/kframework/builtin/domains.md#shell-access), but this created branching on every function call because the result was stored in a symbolic value. I couldn't figure out how to get that working concretely (I don't think it is possible, but I might be wrong).
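One way to decide when a definition must be recompiled is to fingerprint the set of function names and compare it with the fingerprint stored next to the previous build. The sketch below assumes that approach; the text above does not say how kmir actually detects changed flags, and both function names are hypothetical.

```python
import hashlib
from typing import Iterable, Optional


def definition_digest(break_on: Iterable[str]) -> str:
    """Order-independent fingerprint of the break-on-function strings."""
    canonical = "\n".join(sorted(set(break_on)))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def needs_recompile(stored_digest: Optional[str], break_on: Iterable[str]) -> bool:
    """True when no build exists yet or the flags differ from the stored ones."""
    return stored_digest != definition_digest(break_on)


stored = definition_digest(["foo", "bar"])
print(needs_recompile(stored, ["bar", "foo"]))  # False: same set, different order
print(needs_recompile(stored, ["foo"]))         # True: the set changed
```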

dkcumming · 2026-03-04T11:20:45Z

#960 supersedes

dkcumming added 9 commits February 3, 2026 15:37

Added helper to get MonoItemKind name

f60a7b2

Plumbing break_on_function from cli to initial proof configuration

b860983

Changed explicit string match to identifier-contains-substring

2b0582b

Added support for intrinsics too

4b2e037

Support comma separated list of identifiers to match on

4a3c81e

Changed list from comma separated to append.

a709603

Fully qualified paths may have commas. Instead of using a separator, use multiple occurrences of the flag

Fix KAST in decode test

2030315

Updated test output

81b4e8d

dkcumming requested review from jberthold, mariaKt, palinatolmach and tothtamas28 February 3, 2026 12:55

tothtamas28 reviewed Feb 3, 2026

Comment thread kmir/src/tests/integration/data/decode-value/tmp/llvm/README.md Outdated

Removed accidentally committed definition

8515c55

jberthold approved these changes Feb 4, 2026

Merge remote-tracking branch 'origin/master' into dc/filter-function-…

e1bc16d

…cut-points

dkcumming mentioned this pull request Feb 28, 2026

Add cut-point rules for specific functions / intrinsics (via definition)) #960

Merged

dkcumming closed this Mar 4, 2026

dkcumming wants to merge 11 commits into master from dc/filter-function-cut-points

3 participants
