Miden VM Program decoder

Miden VM program decoder is responsible for ensuring that a program with a given MAST root is executed by the VM. As the VM executes a program, the decoder does the following:

Decodes a sequence of field elements supplied by the prover into individual operation codes (or opcodes for short).
Organizes the sequence of field elements into code blocks, and computes the hash of the program according to the methodology described here.

At the end of program execution, the decoder outputs the computed program hash. This hash binds the sequence of opcodes executed by the VM to a program the prover claims to have executed. The verifier uses this hash during the STARK proof verification process to verify that the proof attests to a correct execution of a specific program (i.e., the prover didn't claim to execute program $A$ while in fact executing a different program $B$ ).

The sections below describe how Miden VM decoder works. Throughout these sections we make the following assumptions:

An opcode requires $7$ bits to represent.
An immediate value requires one full field element to represent.
A NOOP operation has a numeric value of $0$ , and thus, can be encoded as seven zeros. Executing a NOOP operation does not change the state of the VM, but it does advance operation counter, and may affect program hash.

Program execution

Miden VM programs consist of a set of code blocks organized into a binary tree. The leaves of the tree contain linear sequences of instructions, and control flow is defined by the internal nodes of the tree.

Managing control flow in the VM is accomplished by executing control flow operations listed in the table below. Each of these operations requires exactly one VM cycle to execute.

Operation	Description
`JOIN`	Initiates processing of a new Join block.
`SPLIT`	Initiates processing of a new Split block.
`LOOP`	Initiates processing of a new Loop block.
`REPEAT`	Initiates a new iteration of an executing loop.
`SPAN`	Initiates processing of a new Span block.
`RESPAN`	Initiates processing of a new operation batch within a span block.
`DYN`	Initiates processing of a new Dyn block.
`CALL`	Initiates processing of a new Call block.
`SYSCALL`	Initiates processing ofa new Syscall block.
`END`	Marks the end of a program block.
`HALT`	Marks the end of the entire program.

Let's consider a simple program below:


begin
    <operations1>
    if.true
        <operations2>
    else
        <operations3>
    end
end

Block structure of this program is shown below.


JOIN
    SPAN
        <operations1>
    END
    SPLIT
        SPAN
            <operations2>
        END
        SPAN
            <operations3>
        END
    END
END

Executing this program on the VM can result in one of two possible instruction sequences. First, if after operations in <operations1> are executed the top of the stack is $1$ , the VM will execute the following:


JOIN
SPAN
<operations1>
END
SPLIT
SPAN
<operations2>
END
END
END
HALT

However, if after <operations1> are executed, the top of the stack is $0$ , the VM will execute the following:


JOIN
SPAN
<operations1>
END
SPLIT
SPAN
<operations3>
END
END
END
HALT

The main task of the decoder is to output exactly the same program hash, regardless of which one of the two possible execution paths was taken. However, before we can describe how this is achieved, we need to give an overview of the overall decoder structure.

Decoder structure

The decoder is one of the more complex parts of the VM. It consists of the following components:

Main execution trace consisting of $24$ trace columns which contain the state of the decoder at a given cycle of a computation.
Connection to the hash chiplet, which is used to offload hash computations from the decoder.
$3$ virtual tables (implemented via multi-set checks), which keep track of code blocks and operations executing on the VM.

Decoder trace

Decoder trace columns can be grouped into several logical sets of registers as illustrated below.

These registers have the following meanings:

Block address register $a$ . This register contains address of the hasher for the current block (row index from the auxiliary hashing table). It also serves the role of unique block identifiers. This is convenient, because hasher addresses are guaranteed to be unique.
Registers $b_{0}, ..., b_{6}$ , which encode opcodes for operation to be executed by the VM. Each of these registers can contain a single binary value (either $1$ or $0$ ). And together these values describe a single opcode.
Hasher registers $h_{0}, ..., h_{7}$ . When control flow operations are executed, these registers are used to provide inputs for the current block's hash computation (e.g., for JOIN, SPLIT, LOOP, SPAN, CALL, SYSCALL operations) or to record the result of the hash computation (i.e., for END operation). However, when regular operations are executed, $2$ of these registers are used to help with op group decoding, and the remaining $6$ can be used to hold operation-specific helper variables.
Register $s p$ which contains a binary flag indicating whether the VM is currently executing instructions inside a span block. The flag is set to $1$ when the VM executes non-control flow instructions, and is set to $0$ otherwise.
Register $g c$ which keeps track of the number of unprocessed operation groups in a given span block.
Register $o x$ which keeps track of a currently executing operation's index within its operation group.
Operation batch flags $c_{0}, c_{1}, c_{2}$ which indicate how many operation groups a given operation batch contains. These flags are set only for SPAN and RESPAN operations, and are set to $0$ 's otherwise.
Two additional registers (not shown) are used primarily for constraint degree reduction.

Program block hashing

To compute hashes of program blocks, the decoder relies on the hash chiplet. Specifically, the decoder needs to perform two types of hashing operations:

A simple 2-to-1 hash, where we provide a sequence of $8$ field elements, and get back $4$ field elements representing the result. Computing such a hash requires $8$ rows in the hash chiplet.
A sequential hash of $n$ elements. Computing such a hash requires multiple absorption steps, and at each step $8$ field elements are absorbed into the hasher. Thus, computing a sequential hash of $n$ elements requires $⌈ n /8 ⌉$ rows in the hash chiplet. At the end, we also get $4$ field elements representing the result.

To make hashing requests to the hash chiplet and to read the results from it, we will need to divide out relevant values from the chiplets bus column $b_{c hi p}$ as described below.

Simple 2-to-1 hash

To initiate a 2-to-1 hash of $8$ elements ( $v_{0}, ..., v_{7}$ ) we need to divide $b_{c hi p}$ by the following value:

$α_{0} + α_{1} \cdot m_{b p} + α_{2} \cdot r + i = 0 \sum 7 (α_{i + 8} \cdot v_{i})$

where:

$m_{b p}$ is a label indicating beginning of a new permutation. Value of this label is computed based on hash chiplet selector flags according to the methodology described here.
$r$ is the address of the row at which the hashing begins.
Some $α$ values are skipped in the above (e.g., $α_{3}$ ) because of the specifics of how auxiliary hasher table rows are reduced to field elements (described here). For example, $α_{3}$ is used as a coefficient for node index values during Merkle path computations in the hasher, and thus, is not relevant in this case. The $α_{4}$ term is omitted when the number of items being hashed is a multiple of the rate width ( $8$ ) because it is multiplied by 0 - the value of the first capacity register as determined by the hasher chiplet logic.

To read the $4$ -element result ( $u_{0}, ..., u_{3}$ ), we need to divide $b_{c hi p}$ by the following value:

$α_{0} + α_{1} \cdot m_{h o u t} + α_{2} \cdot (r + 7) + i = 0 \sum 3 (α_{i + 8} \cdot u_{i})$

where:

$m_{h o u t}$ is a label indicating return of the hash value. Value of this label is computed based on hash chiplet selector flags according to the methodology described here.
$r$ is the address of the row at which the hashing began.

Sequential hash

To initiate a sequential hash of $n$ elements ( $v_{0}, ..., v_{n - 1}$ ), we need to divide $b_{c hi p}$ by the following value:

$α_{0} + α_{1} \cdot m_{b p} + α_{2} \cdot r + α_{4} \cdot n + i = 0 \sum 7 (α_{i + 8} \cdot v_{i})$

This also absorbs the first $8$ elements of the sequence into the hasher state. Then, to absorb the next sequence of $8$ elements (e.g., $v_{8}, ..., v_{15}$ ), we need to divide $b_{c hi p}$ by the following value:

$α_{0} + α_{1} \cdot m_{ab p} + α_{2} \cdot (r + 7) + i = 0 \sum 7 (α_{i + 8} \cdot v_{i + 8})$

Where $m_{ab p}$ is a label indicating absorption of more elements into the hasher state. Value of this label is computed based on hash chiplet selector flags according to the methodology described here.

We can keep absorbing elements into the hasher in the similar manner until all elements have been absorbed. Then, to read the result (e.g., $u_{0}, ..., u_{3}$ ), we need to divide $b_{c hi p}$ by the following value:

$α_{0} + α_{1} \cdot m_{h o u t} + α_{2} \cdot (r + ⌈ n /8 ⌉ \cdot 8 - 1) + i = 0 \sum 3 (α_{i + 8} \cdot u_{i})$

Thus, for example, if $n = 14$ , the result will of the hash will be available at hasher row $r + 15$ .

Control flow tables

In addition to the hash chiplet, control flow operations rely on $3$ virtual tables: block stack table, block hash table, and op group table. These tables are virtual in that they don't require separate trace columns. Their state is described solely by running product columns: $p_{1}$ , $p_{2}$ , and $p_{3}$ . The tables are described in the following sections.

Block stack table

When the VM starts executing a new program block, it adds its block ID together with the ID of its parent block (and some additional info) to the block stack table. When a program block is fully executed, it is removed from the table. In this way, the table represents a stack of blocks which are currently executing on the VM. By the time program execution completes, block stack table must be empty.

The block stack table is also used to ensure that execution contexts are managed properly across the CALL and SYSCALL operations.

The table can be thought of as consisting of $11$ columns as shown below:

decoder_block_stack_table

where:

The first column ( $t_{0}$ ) contains the ID of the block.
The second column ( $t_{1}$ ) contains the ID of the parent block. If the block has no parent (i.e., it is a root block of the program), parent ID is 0.
The third column ( $t_{2}$ ) contains a binary value which is set to $1$ is the block is a loop block, and to $0$ otherwise.
The following 8 columns are only set to non-zero values for CALL and SYSCALL operations. They save all the necessary information to be able to restore the parent context properly upon the corresponding END operation
- the prnt_b0 and prnt_b1 columns refer to the stack helper columns B0 and B1 (current stack depth and last overflow address, respectively)

In the above diagram, the first 2 rows correspond to 2 different CALL operations. The first CALL operation is called from the root context, and hence its parent fn hash is the zero hash. Additionally, the second CALL operation has a parent fn hash of [h0, h1, h2, h3], indicating that the first CALL was to a procedure with that hash.

Running product column $p_{1}$ is used to keep track of the state of the table. At any step of the computation, the current value of $p_{1}$ defines which rows are present in the table.

To reduce a row in the block stack table to a single value, we compute the following.

$ro w = α_{0} + i = 0 \sum 10 (α_{i + 1} \cdot t_{i}),$

where $α_{0}, ..., α_{11}$ are the random values provided by the verifier.

Block hash table

When the VM starts executing a new program block, it adds hashes of the block's children to the block hash table. And when the VM finishes executing a block, it removes its hash from the block hash table. Thus, by the time program execution completes, block hash table must be empty.

The table can be thought of as consisting of $7$ columns as shown below:

block_hash_table

where:

The first column ( $t_{0}$ ) contains the ID of the block's parent. For program root, parent ID is $0$ .
The next $4$ columns ( $t_{1}, ..., t_{4}$ ) contain the hash of the block.
The next column ( $t_{5}$ ) contains a binary value which is set to $1$ if the block is the first child of a join block, and to $0$ otherwise.
The last column ( $t_{6}$ ) contains a binary value which is set to $1$ if the block is a body of a loop, and to $0$ otherwise.

Running product column $p_{2}$ is used to keep track of the state of the table. At any step of the computation, the current value of $p_{2}$ defines which rows are present in the table.

To reduce a row in the block hash table to a single value, we compute the following.

$ro w = α_{0} + i = 0 \sum 6 (α_{i + 1} \cdot t_{i})$

Where $α_{0}, ..., α_{7}$ are the random values provided by the verifier.

Unlike other virtual tables, block hash table does not start out in an empty state. Specifically, it is initialized with a single row containing the hash of the program's root block. This needs to be done because the root block does not have a parent and, thus, otherwise it would never be added to the block hash table.

Initialization of the block hash table is done by setting the initial value of $p_{2}$ to the value of the row containing the hash of a program's root block.

Op group table

Op group table is used in decoding of span blocks, which are leaves in a program's MAST. As described here, a span block can contain one or more operation batches, each batch containing up to $8$ operation groups.

When the VM starts executing a new batch of operations, it adds all operation groups within a batch, except for the first one, to the op group table. Then, as the VM starts executing an operation group, it removes the group from the table. Thus, by the time all operation groups in a batch have been executed, the op group table must be empty.

The table can be thought of as consisting of $3$ columns as shown below:

decoder_op_group_table

The meaning of the columns is as follows:

The first column ( $t_{0}$ ) contains operation batch ID. During the execution of the program, each operation batch is assigned a unique ID.
The second column ( $t_{1}$ ) contains the position of the group in the span block (not just in the current batch). The position is $1$ -based and is counted from the end. Thus, for example, if a span block consists of a single batch with $4$ groups, the position of the first group would be $4$ , the position of the second group would be $3$ etc. (the reason for this is explained in this section). Note that the group with position $4$ is not added to the table, because it is the first group in the batch, so the first row of the table will be for the group with position $3$ .
The third column ( $t_{2}$ ) contains the actual values of operation groups (this could include up to $9$ opcodes or a single immediate value).

Permutation column $p_{3}$ is used to keep track of the state of the table. At any step of the computation, the current value of $p_{3}$ defines which rows are present in the table.

To reduce a row in the op group table to a single value, we compute the following.

$ro w = α_{0} + i = 0 \sum 2 (α_{i + 1} \cdot t_{i})$

Where $α_{0}, ..., α_{3}$ are the random values provided by the verifier.

Control flow operation semantics

In this section we describe high-level semantics of executing all control flow operations. The descriptions are not meant to be complete and omit some low-level details. However, they provide good intuition on how these operations work.

JOIN operation

Before a JOIN operation is executed by the VM, the prover populates $h_{0}, ..., h_{7}$ registers with hashes of left and right children of the join program block as shown in the diagram below.

decoder_join_operation

In the above diagram, blk is the ID of the join block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent.

When the VM executes a JOIN operation, it does the following:

Adds a tuple (blk, prnt, 0, 0...) to the block stack table.
Adds tuples (blk, left_child_hash, 1, 0) and (blk, right_child_hash, 0, 0) to the block hash table.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{7}$ as input values.

SPLIT operation

Before a SPLIT operation is executed by the VM, the prover populates $h_{0}, ..., h_{7}$ registers with hashes of true and false branches of the split program block as shown in the diagram below.

decoder_split_operation

In the above diagram, blk is the ID of the split block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent.

When the VM executes a SPLIT operation, it does the following:

Adds a tuple (blk, prnt, 0, 0...) to the block stack table.
Pops the stack and:
a. If the popped value is $1$ , adds a tuple (blk, true_branch_hash, 0, 0) to the block hash table.
b. If the popped value is $0$ , adds a tuple (blk, false_branch_hash, 0, 0) to the block hash table.
c. If the popped value is neither $1$ nor $0$ , the execution fails.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{7}$ as input values.

LOOP operation

Before a LOOP operation is executed by the VM, the prover populates $h_{0}, ..., h_{3}$ registers with hash of the loop's body as shown in the diagram below.

decoder_loop_operation

In the above diagram, blk is the ID of the loop block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent.

When the VM executes a LOOP operation, it does the following:

Pops the stack and:
a. If the popped value is $1$ adds a tuple (blk, prnt, 1, 0...) to the block stack table (the 1 indicates that the loop's body is expected to be executed). Then, adds a tuple (blk, loop_body_hash, 0, 1) to the block hash table.
b. If the popped value is $0$ , adds (blk, prnt, 0, 0...) to the block stack table. In this case, nothing is added to the block hash table.
c. If the popped value is neither $1$ nor $0$ , the execution fails.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{3}$ as input values.

SPAN operation

Before a SPAN operation is executed by the VM, the prover populates $h_{0}, ..., h_{7}$ registers with contents of the first operation batch of the span block as shown in the diagram below. The prover also sets the group count register $g c$ to the total number of operation groups in the span block.

decoder_span_block

In the above diagram, blk is the ID of the span block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent. g0_op0 is the first operation of the batch, and g_0' is the first operation group of the batch with the first operation removed.

When the VM executes a SPAN operation, it does the following:

Adds a tuple (blk, prnt, 0, 0...) to the block stack table.
Adds groups of the operation batch, as specified by op batch flags (see here) to the op group table.
Initiates a sequential hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{7}$ as input values.
Sets the in_span register to $1$ .
Decrements group_count register by $1$ .
Sets the op_index register to $0$ .

DYN operation

decoder_dyn_operation

In the above diagram, blk is the ID of the dyn block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. p_addr is the ID of the block's parent.

When the VM executes a DYN operation, it does the following:

Adds a tuple (blk, p_addr, 0, 0...) to the block stack table.
Sends a memory read request to the memory chiplet, using s0 as the memory address. The result hash of callee is placed in the decoder hasher trace at $h_{0}, h_{1}, h_{2}, h_{3}$ .
Adds the tuple (blk, hash of callee, 0, 0) to the block hash table.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and [ZERO; 8] as input values.
Performs a stack left shift
- Above s16 was pulled from the stack overflow table if present; otherwise set to 0.

Note that unlike DYNCALL, the fmp, ctx, in_syscall and fn_hash registers are unchanged.

DYNCALL operation

decoder_dyncall_operation

When the VM executes a DYNCALL operation, it does the following:

Adds a tuple (blk, p_addr, 0, ctx, fmp, b_0, b_1, fn_hash[0..3]) to the block stack table.
Sends a memory read request to the memory chiplet, using s0 as the memory address. The result hash of callee is placed in the decoder hasher trace at $h_{0}, h_{1}, h_{2}, h_{3}$ .
Adds the tuple (blk, hash of callee, 0, 0) to the block hash table.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and [ZERO; 8] as input values.
Performs a stack left shift
- Above s16 was pulled from the stack overflow table if present; otherwise set to 0.

Similar to CALL, DYNCALL resets the fmp, sets up a new ctx, and sets the fn_hash registers to the callee hash. in_syscall needs to be 0, since calls are not allowed during a syscall.

END operation

Before an END operation is executed by the VM, the prover populates $h_{0}, ..., h_{3}$ registers with the hash of the block which is about to end. The prover also sets values in $h_{4}$ and $h_{5}$ registers as follows:

$h_{4}$ is set to $1$ if the block is a body of a loop block. We denote this value as f0.
$h_{5}$ is set to $1$ if the block is a loop block. We denote this value as f1.
$h_{6}$ is set to $1$ if the block is a call block. We denote this value as f2.
$h_{7}$ is set to $1$ if the block is a syscall block. We denote this value as f3.

decoder_end_operation

In the above diagram, blk is the ID of the block which is about to finish executing. prnt is the ID of the block's parent.

When the VM executes an END operation, it does the following:

Removes a tuple from the block stack table.
- if f2 or f3 is set, we remove a row (blk, prnt, 0, ctx_next, fmp_next, b0_next, b1_next, fn_hash_next[0..4])
  - in the above, the x_next variables denote the column x in the next row
- else, we remove a row (blk, prnt, f1, 0, 0, 0, 0, 0)
Removes a tuple (prnt, current_block_hash, nxt, f0) from the block hash table, where $n x t = 0$ if the next operation is either END or REPEAT, and $1$ otherwise.
Reads the hash result from the hash chiplet (as described here) using blk + 7 as row address in the auxiliary hashing table.
If $h_{5} = 1$ (i.e., we are exiting a loop block), pops the value off the top of the stack and verifies that the value is $0$ .
Verifies that group_count register is set to $0$ .

HALT operation

Before a HALT operation is executed by the VM, the VM copies values in $h_{0}, ..., h_{3}$ registers to the next row as illustrated in the diagram below:

decoder_halt_operation

In the above diagram, blk is the ID of the block which is about to finish executing.

When the VM executes a HALT operation, it does the following:

Verifies that block address register is set to $0$ .
If we are not at the last row of the trace, verifies that the next operation is HALT.
Copies values of $h_{0}, ..., h_{3}$ registers to the next row.
Populates all other decoder registers with $0$ 's in the next row.

REPEAT operation

Before a REPEAT operation is executed by the VM, the VM copies values in registers $h_{0}, ..., h_{4}$ to the next row as shown in the diagram below.

decoder_repeat_operation

In the above diagram, blk is the ID of the loop's body and prnt is the ID of the loop.

When the VM executes a REPEAT operation, it does the following:

Checks whether register $h_{4}$ is set to $1$ . If it isn't (i.e., we are not in a loop), the execution fails.
Pops the stack and if the popped value is $1$ , adds a tuple (prnt, loop_body_loop 0, 1) to the block hash table. If the popped value is not $1$ , the execution fails.

The effect of the above is that the VM needs to execute the loop's body again to clear the block hash table.

RESPAN operation

Before a RESPAN operation is executed by the VM, the VM copies the ID of the current block blk and the number of remaining operation groups in the span to the next row, and sets the value of in_span column to $0$ . The prover also sets the value of $h_{1}$ register for the next row to the ID of the current block's parent prnt as shown in the diagram below:

decoder_respan_operation

In the above diagram, g0_op0 is the first operation of the new operation batch, and g0' is the first operation group of the batch with g0_op0 operation removed.

When the VM executes a RESPAN operation, it does the following:

Increments block address by $8$ .
Removes the tuple (blk, prnt, 0, 0...) from the block stack table.
Adds the tuple (blk+8, prnt, 0, 0...) to the block stack table.
Absorbs values in registers $h_{0}, ..., h_{7}$ into the hasher state of the hash chiplet (as described here).
Sets the in_span register to $1$ .
Adds groups of the operation batch, as specified by op batch flags (see here) to the op group table using blk+8 as batch ID.

The net result of the above is that we incremented the ID of the current block by $8$ and added the next set of operation groups to the op group table.

CALL operation

Recall that the purpose of a CALL operation is to execute a procedure in a new execution context. Specifically, this means that the entire memory is zero'd in the new execution context, and the stack is truncated to a depth of 16 (i.e. any element in the stack overflow table is not available in the new context). On the corresponding END instruction, the prover will restore the previous execution context (verified by the block stack table).

Before a CALL operation, the prover populates $h_{0}, ..., h_{3}$ registers with the hash of the procedure being called. In the next row, the prover

resets the FMP register (free memory pointer),
sets the context ID to the next row's CLK value
sets the fn hash registers to the hash of the callee
- This register is what the caller instruction uses to return the hash of the caller in a syscall
resets the stack B0 register to 16 (which tracks the current stack depth)
resets the overflow address to 0 (which tracks the "address" of the last element added to the overflow table)
- it is set to 0 to indicate that the overflow table is empty

decoder_call_operation

In the above diagram, blk is the ID of the call block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent.

When the VM executes a CALL operation, it does the following:

Adds a tuple (blk, prnt, 0, p_ctx, p_fmp, p_b0, p_b1, prnt_fn_hash[0..4]) to the block stack table.
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{3}$ as input values.

SYSCALL operation

Similarly to the CALL operation, a SYSCALL changes the execution context. However, it always jumps back to the root context, and executes kernel procedures only.

Before a SYSCALL operation, the prover populates $h_{0}, ..., h_{3}$ registers with the hash of the procedure being called. In the next row, the prover

resets the FMP register (free memory pointer),
sets the context ID to 0,
does NOT modify the fn hash register
- Hence, the fn hash register contains the procedure hash of the caller, to be accessed by the caller instruction,
resets the stack B0 register to 16 (which tracks the current stack depth)
resets the overflow address to 0 (which tracks the "address" of the last element added to the overflow table)
- it is set to 0 to indicate that the overflow table is empty

decoder_syscall_operation

In the above diagram, blk is the ID of the syscall block which is about to be executed. blk is also the address of the hasher row in the auxiliary hasher table. prnt is the ID of the block's parent.

When the VM executes a SYSCALL operation, it does the following:

Adds a tuple (blk, prnt, 0, p_ctx, p_fmp, p_b0, p_b1, prnt_fn_hash[0..4]) to the block stack table.
Sends a request to the kernel ROM chiplet indicating that hash of callee is being accessed.
- this results in a fault if hash of callee does not correspond to the hash of a kernel procedure
Initiates a 2-to-1 hash computation in the hash chiplet (as described here) using blk as row address in the auxiliary hashing table and $h_{0}, ..., h_{3}$ as input values.

Program decoding

When decoding a program, we start at the root block of the program. We can compute the hash of the root block directly from hashes of its children. The prover provides hashes of the child blocks non-deterministically, and we use them to compute the program's hash (here we rely on the hash chiplet). We then verify the program hash via boundary constraints. Thus, if the prover provided valid hashes for the child blocks, we will get the expected program hash.

Now, we need to verify that the VM executed the child blocks correctly. We do this recursively similar to what is described above: for each of the blocks, the prover provides hashes of its children non-deterministically and we verify that the hash has been computed correctly. We do this until we get to the leaf nodes (i.e., span blocks). Hashes of span blocks are computed sequentially from the instructions executed by the VM.

The sections below illustrate how different types of code blocks are decoded by the VM.

JOIN block decoding

When decoding a join bock, the VM first executes a JOIN operation, then executes the first child block, followed by the second child block. Once the children of the join block are executed, the VM executes an END operation. This is illustrated in the diagram below.

decoder_join_block_decoding

As described previously, when the VM executes a JOIN operation, hashes of both children are added to the block hash table. These hashes are removed only when the END operations for the child blocks are executed. Thus, until both child blocks are executed, the block hash table is not cleared.

SPLIT block decoding

When decoding a split block, the decoder pops an element off the top of the stack, and if the popped element is $1$ , executes the block corresponding to the true branch. If the popped element is $0$ , the decoder executes the block corresponding to the false branch. This is illustrated on the diagram below.

decoder_split_block_decoding

As described previously, when the VM executes a SPLIT operation, only the hash of the branch to be executed is added to the block hash table. Thus, until the child block corresponding to the required branch is executed, the block hash table is not cleared.

LOOP block decoding

When decoding a loop bock, we need to consider two possible scenarios:

When the top of the stack is $1$ , we need to enter the loop and execute loop body at least once.
When the top of the stack is, $0$ we need to skip the loop.

In both cases, we need to pop an element off the top of the stack.

Executing the loop

If the top of the stack is $1$ , the VM executes a LOOP operation. This removes the top element from the stack and adds the hash of the loop's body to the block hash table. It also adds a row to the block stack table setting the is_loop value to $1$ .

To clear the block hash table, the VM needs to execute the loop body (executing the END operation for the loop body block will remove the corresponding row from the block hash table). After loop body is executed, if the top of the stack is $1$ , the VM executes a REPEAT operation (executing REPEAT operation when the top of the stack is $0$ will result in an error). This operation again adds the hash of the loop's body to the block hash table. Thus, the VM needs to execute the loop body again to clear the block hash table.

This process is illustrated on the diagram below.

decoder_loop_execution

The above steps are repeated until the top of the stack becomes $0$ , at which point the VM executes the END operation. Since in the beginning we set is_loop column in the block stack table to $1$ , $h_{6}$ column will be set to $1$ when the END operation is executed. Thus, executing the END operation will also remove the top value from the stack. If the removed value is not $0$ , the operation will fail. Thus, the VM can exit the loop block only when the top of the stack is $0$ .

Skipping the loop

If the top of the stack is $0$ , the VM still executes the LOOP operation. But unlike in the case when we need to enter the loop, the VM sets is_loop flag to $0$ in the block stack table, and does not add any rows to the block hash table. The last point means that the only possible operation to be executed after the LOOP operation is the END operation. This is illustrated in the diagram below.

decoder_loop_skipping

Moreover, since we've set the is_loop flag to $0$ , executing the END operation does not remove any items from the stack.

DYN block decoding

When decoding a dyn bock, the VM first executes a DYN operation, then executes the child block dynamically specified by the top of the stack. Once the child of the dyn block has been executed, the VM executes an END operation. This is illustrated in the diagram below.

decoder_dyn_block_decoding

As described previously, when the VM executes a DYN operation, the hash of the child is added to the block hash table. This hash is removed only when the END operation for the child block is executed. Thus, until the child block corresponding to the dynamically specified target is executed, the block hash table is not cleared.

SPAN block decoding

As described here, a span block can contain one or more operation batches, each batch containing up to $8$ operation groups. At the high level, decoding of a span block is done as follows:

At the beginning of the block, we make a request to the hash chiplet which initiates the hasher, absorbs the first operation batch ( $8$ field elements) into the hasher, and returns the row address of the hasher, which we use as the unique ID for the span block (see here).
We then add groups of the operation batch, as specified by op batch flags (but always skipping the first one) to the op group table.
We then remove operation groups from the op group table in the FIFO order one by one, and decode them in the manner similar to the one described here.
Once all operation groups in a batch have been decoded, we absorb the next batch into the hasher and repeat the process described above.
Once all batches have been decoded, we return the hash of the span block from the hasher.

Overall, three control flow operations are used when decoding a span block:

SPAN operation is used to initialize a hasher and absorbs the first operation batch into it.
RESPAN operation is used to absorb any additional batches in the span block.
END operation is used to end the decoding of a span block and retrieve its hash from the hash chiplet.

Operation group decoding

As described here, an operation group is a sequence of operations which can be encoded into a single field element. For a field element of $64$ bits, we can fit up to $9$ operations into a group. We do this by concatenating binary representations of opcodes together with the first operation located in the least significant position.

We can read opcodes from the group by simply subtracting them from the op group value and then dividing the result by $2^{7}$ . Once the value of the op group reaches $0$ , we know that all opcodes have been read. Graphically, this can be illustrated like so:

decoder_operation_group_decoding

Notice that despite their appearance, op bits is actually $7$ separate registers, while op group is just a single register.

We also need to make sure that at most $9$ operations are executed as a part of a single group. For this purpose we use the op_index column. Values in this column start out at $0$ for each operation group, and are incremented by $1$ for each executed operation. To make sure that at most $9$ operations can be executed in a group, the value of the op_index column is not allowed to exceed $8$ .

Operation batch flags

Operation batch flags are used to specify how many operation groups comprise a given operation batch. For most batches, the number of groups will be equal to $8$ . However, for the last batch in a block (or for the first batch, if the block consists of only a single batch), the number of groups may be less than $8$ . Since processing of new batches starts only on SPAN and RESPAN operations, only for these operations the flags can be set to non-zero values.

To simplify the constraint system, the number of groups in a batch can be only one of the following values: $1$ , $2$ , $4$ , and $8$ . If the number of groups in a batch does not match one of these values, the batch is simply padded with NOOP's (one NOOP per added group). Consider the diagram below.

decoder_OPERATION_batch_flags

In the above, the batch contains $3$ operation groups. To bring the count up to $4$ , we consider the $4$ -th group (i.e., $0$ ) to be a part of the batch. Since a numeric value for NOOP operation is $0$ , op group value of $0$ can be interpreted as a single NOOP.

Operation batch flags (denoted as $c_{0}, c_{1}, c_{2}$ ), encode the number of groups and define how many groups are added to the op group table as follows:

(1, -, -) - $8$ groups. Groups in $h_{1}, ... h_{7}$ are added to the op group table.
(0, 1, 0) - $4$ groups. Groups in $h_{1}, ... h_{3}$ are added to the op group table
(0, 0, 1) - $2$ groups. Groups in $h_{1}$ is added to the op group table.
(0, 1, 1) - $1$ group. Nothing is added to the op group table
(0, 0, 0) - not a SPAN or RESPAN operation.

Single-batch span

The simplest example of a span block is a block with a single batch. This batch may contain up to $8$ operation groups (e.g., $g_{0}, ..., g_{7}$ ). Decoding of such a block is illustrated in the diagram below.

decoder_single_batch_span

Before the VM starts processing this span block, the prover populates registers $h_{0}, ..., h_{7}$ with operation groups $g_{0}, ..., g_{7}$ . The prover also puts the total number of groups into the group_count register $g c$ . In this case, the total number of groups is $8$ .

When the VM executes a SPAN operation, it does the following:

Initiates hashing of elements $g_{0}, ..., g_{7}$ using hash chiplet. The hasher address is used as the block ID blk, and it is inserted into addr register in the next row.
Adds a tuple (blk, prnt, 0) to the block stack table.
Sets the is_span register to $1$ in the next row.
Sets the op_index register to $0$ in the next row.
Decrements group_count register by $1$ .
Sets op bits registers at the next step to the first operation of $g_{0}$ , and also copies $g_{0}$ with the first operation removed (denoted as $g_{0}^{'}$ ) to the next row.
Adds groups $g_{1}, ..., g_{7}$ to the op group table. Thus, after the SPAN operation is executed, op group table looks as shown below.

decoder_op_group_table_after_span_op

Then, with every step the next operation is removed from $g_{0}$ , and by step $9$ , the value of $g_{0}$ is $0$ . Once this happens, the VM does the following:

Decrements group_count register by $1$ .
Sets op bits registers at the next step to the first operation of $g_{1}$ .
Sets hasher register $h_{0}$ to the value of $g_{1}$ with the first operation removed (denoted as $g_{1}^{'}$ ).
Removes row (blk, 7, g1) from the op group table. This row can be obtained by taking values from registers: addr, group_count, and $h_{0}^{'} + i = 0 \sum 6 (2^{i} \cdot b_{i}^{'})$ for $i \in [0, 7)$ , where $h_{0}^{'}$ and $b_{i}^{'}$ refer to values in the next row for the first hasher column and op_bits columns respectively.

Note that we rely on the group_count column to construct the row to be removed from the op group table. Since group count is decremented from the total number of groups to $0$ , to remove groups from the op group table in correct order, we need to assign group position to groups in the op group table in the reverse order. For example, the first group to be removed should have position $7$ , the second group to be removed should have position $6$ etc.

Decoding of $g_{1}$ is performed in the same manner as decoding of $g_{0}$ : with every subsequent step the next operation is removed from $g_{1}$ until its value reaches $0$ , at which point, decoding of group $g_{2}$ begins.

The above steps are executed until value of group_count reaches $0$ . Once group_count reaches $0$ and the last operation group $g_{7}$ is executed, the VM executes the END operation. Semantics of the END operation are described here.

Notice that by the time we get to the END operation, all rows are removed from the op group table.

Multi-batch span

A span block may contain an unlimited number of operation batches. As mentioned previously, to absorb a new batch into the hasher, the VM executes a RESPAN operation. The diagram below illustrates decoding of a span block consisting of two operation batches.

decoder_multi_batch_span

Decoding of such a block will look very similar to decoding of the single-span block described previously, but there also will be some differences.

First, after the SPAN operation is executed, the op group table will look as follows:

decoder_op_group_table_multi_span

Notice that while the same groups ( $g_{1}, ..., g_{7}$ ) are added to the table, their positions now reflect the total number of groups in the span block.

Second, executing a RESPAN operation increments hasher address by $8$ . This is done because absorbing additional $8$ elements into the hasher state requires $8$ more rows in the auxiliary hasher table.

Incrementing value of addr register actually changes the ID of the span block (though, for a span block, it may be more appropriate to view values in this column as IDs of individual operation batches). This means that we also need to update the block stack table. Specifically, we need to remove row (blk, prnt, 0) from it, and replace it with row (blk + 8, prnt, 0). To perform this operation, the prover sets the value of $h_{1}$ in the next row to prnt.

Executing a RESPAN operation also adds groups $g_{9}, g_{10}, g_{11}$ to the op group table, which now would look as follows:

decoder_op_group_table_post_respan

Then, the execution of the second batch proceeds in a manner similar to the first batch: we remove operations from the current op group, execute them, and when the value of the op group reaches $0$ , we start executing the next group in the batch. Thus, by the time we get to the END operation, the op group table should be empty.

When executing the END operation, the hash of the span block will be read from hasher row at address addr + 7, which, in our example, will be equal to blk + 15.

Handling immediate values

Miden VM operations can carry immediate values. Currently, the only such operation is a PUSH operation. Since immediate values can be thought of as constants embedded into program code, we need to make sure that changing immediate values affects program hash.

To achieve this, we treat immediate values in a manner similar to how we treat operation groups. Specifically, when computing hash of a span block, immediate values are absorbed into the hasher state in the same way as operation groups are. As mentioned previously, an immediate value is represented by a single field element, and thus, an immediate value takes place of a single operation group.

The diagram below illustrates decoding of a span block with $9$ operations one of which is a PUSH operation.

decoder_decoding_span_block_with_push

In the above, when the SPAN operation is executed, immediate value imm0 will be added to the op group table, which will look as follows:

decoder_imm_vale_op_group_table

Then, when the PUSH operation is executed, the VM will do the following:

Decrement group_count by $1$ .
Remove a row from the op group table equal to (addr, group_count, s0'), where $s_{0}^{'}$ is the value of the top of the stack at the next row (i.e., it is the value that is pushed onto the stack).

Thus, after the PUSH operation is executed, the op group table is cleared, and group count decreases to $0$ (which means that there are no more op groups to execute). Decoding of the rest of the op group proceeds as described in the previous sections.

Program decoding example

Let's run through an example of decoding a simple program shown previously:


begin
    <operations1>
    if.true
        <operations2>
    else
        <operations3>
    end
end

Translating this into code blocks with IDs assigned, we get the following:


b0: JOIN
    b1: SPAN
        <operations1>
    b1: END
    b2: SPLIT
        b3: SPAN
            <operations2>
        b3: END
        b4: SPAN
            <operations3>
        b4: END
    b2: END
b0: END

The root of the program is a join block $b_{0}$ . This block contains two children: a span bock $b_{1}$ and a split block $b_{2}$ . In turn, the split block $b_{2}$ contains two children: a span block $b_{3}$ and a span block $b_{4}$ .

When this program is executed on the VM, the following happens:

Before the program starts executing, block hash table is initialized with a single row containing the hash of $b_{0}$ .
Then, JOIN operation for $b_{0}$ is executed. It adds hashes of $b_{1}$ and $b_{2}$ to the block hash table. It also adds an entry for $b_{0}$ to the block stack table. States of both tables after this step are illustrated below.
Then, span $b_{1}$ is executed and a sequential hash of its operations is computed. Also, when SPAN operation for $b_{1}$ is executed, an entry for $b_{1}$ is added to the block stack table. At the end of $b_{1}$ (when END is executed), entries for $b_{1}$ are removed from both the block hash and block stack tables.
Then, SPLIT operation for $b_{2}$ is executed. It adds an entry for $b_{2}$ to the block stack table. Also, depending on whether the top of the stack is $1$ or $0$ , either hash of $b_{3}$ or hash of $b_{4}$ is added to the block hash table. Let's say the top of the stack is $1$ . Then, at this point, the block hash and block stack tables will look like in the second picture below.
Then, span $b_{3}$ is executed and a sequential hash of its instructions is computed. Also, when SPAN operation for $b_{3}$ is executed, an entry for $b_{3}$ is added to the block stack table. At the end of $b_{3}$ (when END is executed), entries for $b_{3}$ are removed from both the block hash and block stack tables.
Then, END operation for $b_{2}$ is executed. It removes the hash of $b_{2}$ from the block hash table, and also removes the entry for $b_{2}$ from the block stack table. The third picture below illustrates the states of block stack and block hash tables after this step.
Then, END for $b_{0}$ is executed, which removes entries for $b_{0}$ from the block stack and block hash tables. At this point both tables are empty.
Finally, a sequence of HALT operations is executed until the length of the trace reaches a power of two.

States of block hash and block stack tables after step 2: decoder_state_block_hash_2

States of block hash and block stack tables after step 4: decoder_state_block_hash_4

States of block hash and block stack tables after step 6: decoder_state_block_hash_6

The Miden virtual machine