Structural Simplifier for Symbolic Simulation #

It is a specialized simplifier designed for symbolic simulation workloads. It addresses performance bottlenecks identified in the standard simp tactic when applied to large terms typical in symbolic execution.

Design Goals #

Efficient caching via pointer-based keys on maximally shared terms
Fast pattern matching using the Pattern infrastructure instead of isDefEq
Minimal proof term overhead by using shareCommon and efficient congruence lemma application

Key Performance Problems Addressed #

1. Cache Inefficiency (Hash Collisions) #

The standard simplifier uses structural equality for cache keys. With large terms, hash collisions cause O(n) comparisons that fail, leading to O(n²) behavior.

Solution: Require maximally shared input terms (shareCommon) and use pointer-based cache keys. Structurally equal terms have equal pointers, making cache lookup O(1).

2. `isDefEq` in Rewrite Matching #

Profiling shows that isDefEq dominates simplification time in many workloads. For each candidate rewrite rule, definitional equality checking with metavariable unification is performed, and sometimes substantial time is spent checking whether the scopes are compatible.

Solution: Use the Pattern infrastructure for syntactic matching:

No metavariable creation or assignment
No occurs check (CheckAssignment)
Direct de Bruijn index comparison

3. `inferType` in Proof Construction #

In the standard simplifier, building Eq.trans and congrArg proofs uses inferType on proof terms, which reconstructs large expressions and destroys sharing. It often causes O(n²) allocation.

Solution:

Never perform inferType on proof terms when constructing congruence and transitivity proofs.
Use a pointer-based cache for inferType on terms.

4. `funext` Proof Explosion #

Nested funext applications create O(n²) proof terms due to implicit arguments {f} {g} of size O(n) repeated n times.

Solution: Generate funext_k for k binders at once, stating the implicit arguments only once.

5. Binder Re-entry Cache Invalidation #

When a simplification theorem restructures a lambda, re-entering creates a fresh free variable, invalidating all cached results for subterms.

Solution: Reuse free variables across binder re-entry when safe:

Stack-based approach: reuse when types match on re-entry path
Instance-aware: track local type class instances separately from hypotheses
If simplifier doesn't discharge hypotheses from local context, only instances matter

6. Contextual `ite` Handling #

The standard ite_congr theorem adds hypotheses when entering branches, invalidating the cache and causing O(2^n) behavior on conditional trees.

Solution: Non-contextual ite handling for symbolic simulation:

Only simplify the condition
If condition reduces to True/False, take the appropriate branch
Otherwise, keep ite symbolic (let the simulator handle case-splitting)

Architecture #

Input Requirements #

Terms must be maximally shared via shareCommon like every other module in Sym.
Enables pointer-based cache keys throughout

Integration with SymM #

SimpM is designed to work within the SymM symbolic simulation framework:

Uses BackwardRule for control-flow (monadic bind, match, combinators)
SimpM handles term simplification between control-flow steps
Avoids entering control-flow binders

Structural Simplifier for Symbolic Simulation #

Design Goals #

Key Performance Problems Addressed #

1. Cache Inefficiency (Hash Collisions) #

2. isDefEq in Rewrite Matching #

3. inferType in Proof Construction #

4. funext Proof Explosion #

5. Binder Re-entry Cache Invalidation #

6. Contextual ite Handling #

Architecture #

Input Requirements #

Integration with SymM #

Use cases for done = true #

In pre simprocs #

In post simprocs #

Behavior #

2. `isDefEq` in Rewrite Matching #

3. `inferType` in Proof Construction #

4. `funext` Proof Explosion #

6. Contextual `ite` Handling #

Use cases for `done = true` #

In `pre` simprocs #

In `post` simprocs #