16 Control Formalism II

8.15

16 Control Formalism II🔗

Last time we saw how to use evaluation contexts to define the small-step semantics of control flow operators like return. Today we will revisit this topic to explore how to define the small step semantics of exceptions. Along the way, we will revisit the evaluation contexts formalism to clean it up a bit and make it easier to reason about.

16.1 Unique Decompositions🔗

Recall from last time that we defined a notion of an evaluation context E that represents the fragment of the program that an expression is being evaluated in. We defined E as a "program with a hole in it", where the hole is where computation occurrs. In order to implement small step semantics of control operators it’s extremely useful to know exactly which evaluation context an expression is to be evaluated in.

Last time we saw that there was more than 1 evaluation context for a given term. This is quite annoying and undesirable, and it was actually not intentional; let’s fix this!

Recall our tiny language CalcRet from last time and its evaluation contexts:

‹e›
::=
( + ‹e› ‹e› )

|
‹num›

|
( return ‹num› )
Figure 21: Surface syntax for CalcRet.

‹E›
::=
[.]

|
( + ‹num› ‹E› )

|
( + ‹E› ‹e› )
Figure 22: Evaluation context grammar for CalcRet.

Further recall that we defined the syntax E[e] to mean "substitute in the expression e for the hole in E". Now let’s define our small-step semantics, which will be different from last time:

Notice that these rules no longer have an (E-Ctx) rule for stepping an expression under a context. We’ll see why this matters.

\dfrac{~}{E[\texttt{(+ n1 n2)}] \rightarrow E[\texttt{n1} + \texttt{n2}]}~\text{(E-Add)}\dfrac{~}{E[\texttt{(return n)}] \rightarrow \texttt{n}}~\text{(E-Ret)}
Figure 23: Small step reduction semantics for CalcRet using evaluation contexts for enforcing unique decompositions.

The small-step semantics in the above figure identify our reducible expressions (sometimes abbreviated as "redexes"): expressions that are ready to take a step of computation. The reducible expressions are in the holes: they are addition of numbers (+ n1 n2) and return expression (return n). We can write down a grammar for the reducible expressions that we label as e_{\rightarrow}:

e_{\rightarrow} ::= \texttt{(+ n1 n2)} \mid \texttt{(return n)}

Now we can state our desired property:

Theorem: (Uniqueness of Decomposition) Let e be a CalcLang program and suppose e \rightarrow e{}’. Then, there exists a unique evaluation context E and reducible expression e{}’ where e = E[e{}’].

To prove this fact, we’ll write down another grammar called E[e_{\rightarrow}] that is the set of evaluation contexts with reducible expressions in the holes. Let’s define this grammar:

E[e_\rightarrow] ::= \underbrace{e_{\rightarrow}}_{(1)} \mid \underbrace{(+~E[e_\rightarrow]~e)}_{(2)} \mid \underbrace{(+~\texttt{n}~E[e_\rightarrow])}_{(3)}

The uniqueness of decompositions follows from two well-formedness conditions about this grammar:

There is exactly one application of the redex production rule (1). We say that the redex produced by rule (1) is in the focus.
The grammar is unambiguous (i.e., there is at most 1 valid parse tree). This means there can’t be multiple trees that place the redex in different positions.

Before we establish these facts let’s see some examples of parsing programs using the E[e_{\rightarrow}] grammar:

From the above parse tree we can conclude that the unique evaluation context is E1 = (+ [.] (+ 3 4))and e1 = (+ 1 2) is the redex.

From the above parse tree we can conclude that the unique evaluation context is E2 = (+ (+ [.] 3) 4) and e1 = (+ 1 2) is the redex.

Notice what’s happening when we parse programs using this grammar: by parsing the program, we identify the evaluation context E and the hole [\cdot]. From this tree we can extract an evaluation context E by replacing the focus with the hole [\cdot].

This block of text can be skipped for those that are willing to accept that the two properties are true by inspection.

Now let’s establish that these two facts are true about this grammar. They follow by fairly simple inspection of the structure of the grammar. First, we know there is a single application of rule (1) for any parse tree because every other rule contains exactly one recursive application of the E[e_\rightarrow] production and there are no other terminal production rules. The second property follows by induction on the structure of the syntax of expressions: we will show that at most 1 production rule applies to any given program, which establishes unambiguity. Assume e=n. Then, no rules apply, so this term has no parse tree. Assume e = (+~e_1~e_2). Then, do case analysis on e_1. If e_1 = n then do case analysis on e_2; if e_2 = n then rule (1) applies, otherwise rule (3) applies. The other case is that e_1 = (+~e_1{}’~e_2{}’); this means rule (2) applies.

Using these properties we can now prove the uniqueness of decomposition:

Proof of uniqueness of decomposition. By assumption we know e \rightarrow e{}’. This means either the (E-Add) rule or the (E-Ret) rule was applied. In either case, this implies e can be parsed using the E[e_\rightarrow] grammar. By the fact that the grammar is unambiguous and there is at most 1 redex position, this means that there is a unique parse tree that identifies a unique context E where the hole is at the hole position in the parsre tree of E[e_{\rightarrow}].

16.2 Derivation Trees🔗

Phew! That was a lot of work, but there’s a real payoff: now we can always immediately identify the reducible expression and its (unique!) context by parsing a program using the E[e_\rightarrow] grammar. This is very powerful, since it lets us immediately "focus in on" the reducible expression and write down semantics that manipulate the outer context. This is a powerful tool for describing control-flow operators.

Once you get good at this, you don’t need to draw the parse tree anymore; you will know that the redex that should be stepped is the inner-most left-most redex according to the grammar of E.

Let’s see it in action by let’s running a program like (+ 1 (return 2)). First, let’s parse the program to see which redex we step next:

So, from this parsetree we can conclude that E = (+ 1 [.]) and e = (return 2). Now we can apply our E-Add rule:

\dfrac{~}{\texttt{(+ 1 [return 2])}\rightarrow 2}~\text{(E-Add)}

And we see this program runs to 2.

16.3 Semantics of Exceptions🔗

Now let’s see the power of the formalism by extending the untyped lambda calculus with exceptions.

Here is the semantics, where we use the same evaluation context as the untyped lambda calculus above:

\dfrac{~}{E[(\lambda x . e)~v] \rightarrow E[e[x \mapsto v]]}~\text{(E-Beta)}\dfrac{~}{E[\texttt{raise}] \rightarrow \texttt{raise}}~\text{(E-Raise)}\dfrac{\texttt{e} \rightarrow \texttt{e{}’}} {\texttt{E[(try e h)]} \rightarrow \texttt{E[(try e{}’ h)]}}~\text{(E-Try)}\dfrac{~}{E[\texttt{(try v h)]} \rightarrow \texttt{v}}~\text{(E-TryV)}\dfrac{~}{E[\texttt{(try raise h)]} \rightarrow \texttt{h}}~\text{(E-Catch)}
Figure 24: Call-by-value semantics for the untyped lambda calculus with exceptions.

We will define our evaluation context grammar as:

E ::= [\cdot] \mid v~E \mid E~e

Let’s see how these rules handle various programs. Here are some examples. First, a simple case with just a single nesting of try/handle:

--------------------------- (E-Raise)

(+ [(raise)] 3) --> [raise]

----------------------------------------- (E-Try)

[(try (+ 1 (raise) 3))] --> (try raise 10)

Let’s check that nested handlers work as expected by properly evaluating to the inner-most handler:

------------------------- (E-Catch)

[(try (raise) 10)] --> 10

------------------------------------------- (E-Try)

[(try (try (raise) 10) 20)] --> (try 10 20)

Let’s see how handlers interact with lambdas. This time we’ll make use of the multi-step operator \rightarrow^* to fully evaluate a term:

----------------------- E-Catch ---------- E-Refl

[(try raise 25)] --> 25 25 -->* 25

---------------------------------------------------- (E-Beta) ---------------------------------------------------------- (E-Trans)

[(try ((λ x . (raise)) 10) 25)] --> [(try raise 25)] [(try raise 25)] -->* 25

--------------------------------------------------------------------------- E-Beta ------------------------------------------------------------------------------------------------------------------------------------------(E-Trans)

[(λ f . (try (f 10) 25)) (λ x . (raise))] --> (try ((λ x . (raise)) 10) 25) [(try ((λ x . (raise)) 10) 25)] -->* 25

---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------(E-Trans)

[(λ f . (try (f 10) 25)) (λ x . (raise))] -->* 25

contents ← prev up next →

1	Introduction
2	Interpreting Programs
3	Scope and Substitution
4	The Lambda Calculus
5	Compilation
6	Formalism
7	Types
8	Types II: Typing Formalism
9	Types III: Type Soundness
10	Parametric Polymorphism
11	Compiling to System-F
12	Subtyping
13	Control
14	Implementing Exceptions
15	Control Formalism
16	Control Formalism II
17	Abstract Machines
18	Abstract Machines II
19	Memory Safety
20	Memory Management
21	Probabilistic Programming
22	Managing Effects

16.1	Unique Decompositions
16.2	Derivation Trees
16.3	Semantics of Exceptions

‹e›	::=	( + ‹e› ‹e› )
	\|	‹num›
	\|	( return ‹num› )