Post 2

This is the second post in the PLT section. It introduces the Lambda Calculus for Types-Dependent-on-Types, $λ_{ω}$ (or at least, a version of it): the motivation behind it, its semantics, and its properties.

I expect the reader to be familiar with the basics of Lambda Calculus, Type Theory, and the System $λ 2$ , which is the Second-Order Typed Lambda Calculus.

Once again, I will use $\to_{β}$ to refer to singular and chained $β$ -reductions.

MOTIVATION : Let’s start by motivating the need for ( $λ_{ω}$ ) and seeing why ( $λ 2$ ) isn’t enough for sophisticated types. In order to see this, let’s briefly remind ourselves of the generating Grammars for the set of $𝕋$ 2-types and ( $λ 2$ ) :

$𝕋2 = 𝕍 \mid (𝕋2 \to 𝕋2) \mid \prod 𝕍 : * . 𝕋2$

$Λ_{𝕋2} = V \mid λ V : 𝕋2 . Λ_{𝕋2} \mid Λ_{𝕋2} Λ_{𝕋2} \mid λ 𝕍 : * . Λ_{𝕋2} \mid Λ_{𝕋2} 𝕋2$

The $λ 2$ , as we can see, is great at dealing with Terms-Dependent-on-Types, by introducing the $\prod$ -type. This allows a term to "bind" to many different types. However, the bindings are relatively straightforward, and therefore, so are the $\prod$ -types.

An example is the definition of the ’Nat’-type :

$ℕ$ $\equiv$ $\prod α$ : $*$ . $(α \to α) \to α \to α$

All elements of ‘Nat‘ will have this type, and can be constructed generally as : $λ α$ : $*$ . $λ$ f : $α \to α$ . $λ$ x : $α$ . $f (f (f (. . . (f x))))$

It is easy to see that - $f (f (f (. . . (f x))))$ : $α$

So, $λ 2$ gives us the ability to concisely write functions with terms for generic types. For instance, one could construct a generic identity function, and then simply substitute in the type and term (in that order), to make it act as an identity function for that specific Type. However, it doesn’t allow us to construct Types-Dependent-on-Types. The reason for this is in the 2nd-order application/abstraction rules in $λ 2$ :

$(appl_{2})$ $\dfrac{Γ ⊢ M : \prod α : * . A \qquad Γ ⊢ B : *}{Γ ⊢ M B : A [α := B]}$

$(abst_{2})$ $\dfrac{Γ, α : * ⊢ M : A}{Γ ⊢ λ α : * . M : \prod α : * . A}$

As one can see, the premise $Γ ⊢ B : *$ of $(appl_{2})$ only lets us apply a term to something of type $*$ , and nothing more complicated. Symmetrically, $(abst_{2})$ only lets us abstract over (and so make polymorphic) some $α$ of type $*$ .

This limits the expressiveness of $λ 2$ . Here, we can have terms $x, y, \ldots$ that are dependent on some types $α, β, \ldots$ , but we can’t have some arbitrary type be dependent on another type.

Specifically, expressions of the sort :

i) $λ α$ : $*$ . $α \to α$

ii) $λ α$ : $*$ . $λ β$ : $*$ . $α \to β$

are not possible in $λ 2$ .

Ideally, we would like these expressions to become placeholders for more generic types via $β$ -reduction :

i) $(λ α$ : $*$ . $α \to α) γ$ $\to_{β}$ $γ \to γ$

ii) $(λ α$ : $*$ . $α \to α) δ$ $\to_{β}$ $δ \to δ$

iii) $(λ α$ : $*$ . $α \to α) (τ_{1} \to τ_{2})$ $\to_{β}$ $(τ_{1} \to τ_{2}) \to (τ_{1} \to τ_{2})$

We can see that these allow us to substitute types into types. This is a very useful abstraction upon which many modern functional programming constructs rely. Monads, for example, are defined over type constructors of kind $* \to *$ , i.e. over Types-Dependent-on-Types.
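The substitutions above can be sketched in a few lines of Python. This is only an illustration, not part of $λ_{ω}$ itself: the class names (`TVar`, `Arrow`, `TLam`, `TApp`) and the helpers `subst`, `beta` and `show` are hypothetical, and the substitution assumes that bound variable names never clash with the free variables being substituted in (so no renaming is done).

```python
from dataclasses import dataclass


# Hypothetical mini-AST for type-level expressions (names are illustrative).
@dataclass(frozen=True)
class TVar:  # a type-variable
    name: str


@dataclass(frozen=True)
class Arrow:  # the arrow type A -> B
    left: object
    right: object


@dataclass(frozen=True)
class TLam:  # type-level abstraction: lambda var : kind . body
    var: str
    kind: object
    body: object


@dataclass(frozen=True)
class TApp:  # type-level application
    fun: object
    arg: object


def subst(t, name, s):
    """Replace the free occurrences of `name` in `t` by `s`.

    Assumption: bound names are distinct from the free variables of `s`,
    so capture cannot happen and no renaming is performed.
    """
    if isinstance(t, TVar):
        return s if t.name == name else t
    if isinstance(t, Arrow):
        return Arrow(subst(t.left, name, s), subst(t.right, name, s))
    if isinstance(t, TLam):
        if t.var == name:  # `name` is shadowed below this binder
            return t
        return TLam(t.var, t.kind, subst(t.body, name, s))
    return TApp(subst(t.fun, name, s), subst(t.arg, name, s))


def beta(t):
    """Contract the outermost redex (lambda a : k . M) N to M[a := N], if any."""
    if isinstance(t, TApp) and isinstance(t.fun, TLam):
        return subst(t.fun.body, t.fun.var, t.arg)
    return t


def show(t):
    if isinstance(t, TVar):
        return t.name
    if isinstance(t, Arrow):
        return f"({show(t.left)} -> {show(t.right)})"
    if isinstance(t, TLam):
        return f"(\\{t.var}:{t.kind}. {show(t.body)})"
    return f"({show(t.fun)} {show(t.arg)})"


endo = TLam("a", "*", Arrow(TVar("a"), TVar("a")))  # lambda a : * . a -> a
tau = Arrow(TVar("t1"), TVar("t2"))                 # t1 -> t2

print(show(beta(TApp(endo, TVar("g")))))  # (g -> g)
print(show(beta(TApp(endo, tau))))        # ((t1 -> t2) -> (t1 -> t2))
```

The two printed lines correspond to examples i) and iii): the same abstraction is instantiated once with a type-variable and once with an arrow type.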

However, none of this machinery is possible in $λ 2$ , and we shall need new Semantics to introduce them : Enter $λ_{ω}$ .

SEMANTICS : Now that we know what shortcomings we had before, and what new expressive abilities we would like in our system (vaguely), let us begin to discuss the enhancements via new semantics.

First and foremost, let us analyze one of the previously mentioned lambda-terms that we would like in our new framework :

$λ α$ : $*$ . $α \to α$

We would like this term to be a placeholder for any Type formed by applying the arrow to a type and itself. Therefore, we would like to type the term as :

$λ α : * . α \to α$ : $* \to *$

In order to be able to generically type these "higher-order" Types, we need a new Super-Type to represent $*$ and all expressions built from $*$ with the binary operator $\to$ . Expressions from the said set are typed as : $□$

So, all types like : $*, * \to *, (* \to *) \to *, . . .$ have type : $□$

Furthermore, to formalize this notion of abstract structure over these "higher-types", we introduce some simple terminology and grammar :

$𝕂$ = $*$ $|$ $𝕂 \to 𝕂$

It is easy to see (by definition, in fact), that, $\forall κ \in 𝕂$ , $κ : □$ .

These abstract "higher-types" are termed Kinds. They represent the Set of the new, more powerful Types that we want to type terms with and substitute into (in $λ_{ω}$ ).

The types : $*, □$ have a special name - sorts. They are represented as : sorts = { $*, □$ }. The symbol $s$ refers to $*$ or $□$ .

Terms such as the ones defined above, that take Types and construct new Types from them, are called Type-Constructors.

Formally :

A Type-Constructor is a term $M \in Λ_{ω}$ such that $M : κ$ for some $κ \in 𝕂$ .

If $κ \neq *$ , then it is referred to as a Proper Type-Constructor.

Now that we have an idea of what we want our Lambda-Terms in $λ_{ω}$ to look like, and what exactly our "Higher-Types" must look like, we can begin to introduce the Rules and Formal Semantics of our system.

The first thing we need to do is introduce the "Super-Type" $□$ in our Derivation rules so as to construct complex types. This is done via the $(s o r t)$ rule :

$(sort)$ $ϕ ⊢ * : □$

This rule is necessary for 2 reasons :

a) It allows us to introduce $*$ and generically typed type-variables

b) It lays the foundations for the usage of the $(f o r m)$ rule (introduced ahead) on empty contexts ( $Γ \equiv ϕ$ )

Now that we can introduce the super-type ( $□$ ) into our Lambda-Calculus, the next thing we want is the ability to introduce variables at 2 levels :

a) Those with type $*$

b) Those whose type is itself a type-variable (such as $x$ : $α$ )

This is achieved via the $(v a r)$ rule :

$(var)$ $\dfrac{Γ ⊢ C : s}{Γ, x : C ⊢ x : C}$ , if $x \notin Γ$

As $s$ represents a term from the set $\{*, □\}$ , we can see that 2 levels of terms can be introduced here. As an example, consider the following :

(1.) $ϕ ⊢ * : □$ ( $(sort)$ )

(2.) $α : * ⊢ α : *$ ( $(var)$ on $(1.)$ )

(3.) $α : *, x : α ⊢ x : α$ ( $(var)$ on $(2.)$ )

However, with just $(sort)$ and $(var)$ we can’t derive everything that is declared in the context ( $Γ$ ): $(var)$ only ever yields the most recently added variable, and $(sort)$ only works in the empty context. As an example, the following judgements are not derivable :

$α$ : $*, β$ : $* ⊢ β$ : $*$

$α$ : $*, β$ : $* ⊢ α$ : $*$

We solve this issue by allowing the weakening of $Γ$ : a judgement that has already been derived (such as $*$ : $□$ ) stays derivable when $Γ$ is extended with a fresh variable. The rule that introduces this into the semantics of $λ_{ω}$ is $(weak)$ :

$(weak)$ $\dfrac{Γ ⊢ A : B \qquad Γ ⊢ C : s}{Γ, x : C ⊢ A : B}$ , if $x \notin Γ$

Again, it is easy to see that the rule allows for "weakening" at both levels: the added variable may be a term-variable (typed by a type) or a type-variable (typed by a kind). This can be seen in the following example :

(1.) $ϕ ⊢ * : □$ and $ϕ ⊢ * : □$ ( $(sort) \times 2$ ), hence $α : * ⊢ * : □$ ( $(weak)$ )

(2.) $ϕ ⊢ * : □$ ( $(sort)$ ), hence $α : * ⊢ α : *$ ( $(var)$ )

(3.) $α : * ⊢ * : □$ (from $(1.)$ ), hence $α : *, β : * ⊢ β : *$ ( $(var)$ on $(1.)$ )

(4.) $α : * ⊢ α : *$ and $α : * ⊢ * : □$ (from $(2.)$ and $(1.)$ ), hence $α : *, β : * ⊢ α : *$ ( $(weak)$ )

So, we have gained the ability to fully express our Context ( $Γ$ ) and introduce the super-type into our derivations. The only thing we still need before introducing $(appl)$ and $(abst)$ into our rules is the ability to construct any $κ \in 𝕂$ . This is provided by the simple $(form)$ rule :

$(form)$ $\dfrac{Γ ⊢ A : s \qquad Γ ⊢ B : s}{Γ ⊢ A \to B : s}$

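Once again, it is easy to see that this not only introduces the elements of $𝕂$ (taking $s \equiv □$ ), but also lets us build arrow types such as $α \to β$ out of type-variables (taking $s \equiv *$ ), which we can then use to type term-variables.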

We are now in a place to introduce the much awaited $(a p p l)$ and $(a b s t)$ rules for $λ_{ω}$ :

$(appl)$ $\dfrac{Γ ⊢ M : A \to B \qquad Γ ⊢ N : A}{Γ ⊢ M N : B}$

$(abst)$ $\dfrac{Γ, x : A ⊢ M : B \qquad Γ ⊢ A \to B : s}{Γ ⊢ λ x : A . M : A \to B}$

These rules are fairly intuitive, and it is once again easy to verify that they work at both levels: for abstracting over and applying to term-variables as well as type-variables.
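As a worked example, here is a sketch of how the type-constructor from the start of this section can be derived in the empty context, using only the rules introduced so far. The step numbers are just labels for this derivation.

```latex
\begin{array}{lll}
(1) & ϕ ⊢ * : □ & (sort) \\
(2) & α : * ⊢ α : * & (var) \text{ on } (1) \\
(3) & α : * ⊢ α \to α : * & (form) \text{ on } (2), (2) \\
(4) & ϕ ⊢ * \to * : □ & (form) \text{ on } (1), (1) \\
(5) & ϕ ⊢ λ α : * . α \to α : * \to * & (abst) \text{ on } (3), (4)
\end{array}
```

Note that $(form)$ is used twice, once with $s \equiv *$ in step (3) and once with $s \equiv □$ in step (4), and that $(abst)$ here abstracts over a type-variable rather than a term-variable.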

The last rule that we need to construct our Types-Dependent-on-Types $λ_{ω}$ is the $(conv)$ rule :

$(conv)$ $\dfrac{Γ ⊢ A : B \qquad Γ ⊢ B' : s}{Γ ⊢ A : B'}$ , if $B =_{β} B'$

The reason we need the $(c o n v)$ rule is to formalize our intuition for circumstances where a variable can be typed with 2 $β$ -equivalent types. Consider the following type-term :

$(λ α : * . α \to α) β$ $\to_{β}$ $β \to β$

Suppose we have some $Γ$ in which $x$ is declared as $x$ : $(λ α : * . α \to α) β$ .

Clearly, $x$ : $β \to β$ as a type is equivalent to the one that $x$ is typed with in the context $Γ$ . This is the exact intuition that the $(c o n v)$ rule formalizes. The $(c o n v)$ rule allows this for all types up to $β$ -equivalence.
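Spelled out as a derivation sketch, and assuming the concrete context $Γ \equiv β : *, x : (λ α : * . α \to α) β$ (chosen here only for illustration), the use of $(conv)$ looks like this:

```latex
\begin{array}{lll}
(1) & Γ ⊢ x : (λ α : * . α \to α) β & (var) \\
(2) & Γ ⊢ β \to β : * & (form), \text{ from } Γ ⊢ β : * \\
(3) & (λ α : * . α \to α) β =_{β} β \to β & \text{one } β\text{-step} \\
(4) & Γ ⊢ x : β \to β & (conv) \text{ on } (1), (2) \text{ using } (3)
\end{array}
```

Step (1) presupposes that the declared type of $x$ is itself well-formed, i.e. that $β : * ⊢ (λ α : * . α \to α) β : *$ , which follows from $(appl)$ . Step (2) is the second premise of $(conv)$ : the new type must be well-formed too.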

One should be careful so as to not confuse the Subject-Reduction Lemma with the $(conv)$ rule. The former states that $β$ -reducing a term preserves its type (if $Γ ⊢ A : B$ and $A \to_{β} A'$ , then $Γ ⊢ A' : B$ ), while the latter states that a term typed with $B$ can also be typed with any well-formed type that is $β$ -equivalent to $B$ . If there is some confusion regarding the fact that $β$ -reduction applies here to types as well as terms, then one can simply glance at the rules for $λ_{ω}$ : types are more complicated in $λ_{ω}$ , and can be applied and abstracted over to create more types, so type-variables can be manipulated in ways that they can’t in $λ 2$ . Hence, the concept of Types-Dependent-on-Types.

Having derived all the Formal Semantics of $λ_{ω}$ we can now rewrite them collectively, and move on to the important Properties :

$(sort)$ $ϕ ⊢ * : □$

$(var)$ $\dfrac{Γ ⊢ C : s}{Γ, x : C ⊢ x : C}$ , if $x \notin Γ$

$(weak)$ $\dfrac{Γ ⊢ A : B \qquad Γ ⊢ C : s}{Γ, x : C ⊢ A : B}$ , if $x \notin Γ$

$(form)$ $\dfrac{Γ ⊢ A : s \qquad Γ ⊢ B : s}{Γ ⊢ A \to B : s}$

$(appl)$ $\dfrac{Γ ⊢ M : A \to B \qquad Γ ⊢ N : A}{Γ ⊢ M N : B}$

$(abst)$ $\dfrac{Γ, x : A ⊢ M : B \qquad Γ ⊢ A \to B : s}{Γ ⊢ λ x : A . M : A \to B}$

$(conv)$ $\dfrac{Γ ⊢ A : B \qquad Γ ⊢ B' : s}{Γ ⊢ A : B'}$ , if $B =_{β} B'$

This is it. This is the $λ_{ω}$ .

As a last note - The $λ_{ω}$ does not contain $\prod$ -types. However, Girard’s extension of $λ_{ω}$ into what is called "System $F_{ω}$ " does.
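To make the rules a little more concrete, here is a minimal sketch of a kind checker for the type-level fragment of $λ_{ω}$ , written in Python. It is an illustration under assumptions, not a full implementation of the system: the names (`kind_of`, `is_proper_constructor`, `show_kind` and the AST classes) are hypothetical, contexts are simplified to a dictionary from type-variable names to kinds (so shadowing simply overrides, instead of enforcing $x \notin Γ$ ), and term-level typing and $(conv)$ are left out, since kinds contain no redexes.

```python
from dataclasses import dataclass

STAR = "*"  # the kind of ordinary types


@dataclass(frozen=True)
class KArrow:  # the kind k1 -> k2
    dom: object
    cod: object


@dataclass(frozen=True)
class TVar:  # a type-variable
    name: str


@dataclass(frozen=True)
class Arrow:  # the arrow type A -> B
    left: object
    right: object


@dataclass(frozen=True)
class TLam:  # type-level abstraction: lambda var : kind . body
    var: str
    kind: object
    body: object


@dataclass(frozen=True)
class TApp:  # type-level application
    fun: object
    arg: object


def kind_of(t, ctx):
    """Infer the kind of the type-level expression `t`; `ctx` maps names to kinds."""
    if isinstance(t, TVar):  # plays the role of (var) and (weak)
        if t.name not in ctx:
            raise TypeError(f"unbound type-variable {t.name}")
        return ctx[t.name]
    if isinstance(t, Arrow):  # (form) with s = *
        if kind_of(t.left, ctx) != STAR or kind_of(t.right, ctx) != STAR:
            raise TypeError("both sides of an arrow type must have kind *")
        return STAR
    if isinstance(t, TLam):  # (abst): check the body under the extended context
        return KArrow(t.kind, kind_of(t.body, {**ctx, t.var: t.kind}))
    if isinstance(t, TApp):  # (appl): the argument must match the domain kind
        fun_kind = kind_of(t.fun, ctx)
        if not isinstance(fun_kind, KArrow) or fun_kind.dom != kind_of(t.arg, ctx):
            raise TypeError("ill-kinded application")
        return fun_kind.cod
    raise TypeError(f"not a type-level expression: {t!r}")


def is_proper_constructor(t, ctx):
    """A constructor is proper when its kind is not *."""
    return kind_of(t, ctx) != STAR


def show_kind(k):
    if k == STAR:
        return "*"
    return f"({show_kind(k.dom)} -> {show_kind(k.cod)})"


endo = TLam("a", STAR, Arrow(TVar("a"), TVar("a")))
arrow2 = TLam("a", STAR, TLam("b", STAR, Arrow(TVar("a"), TVar("b"))))

print(show_kind(kind_of(endo, {})))    # (* -> *)
print(show_kind(kind_of(arrow2, {})))  # (* -> (* -> *))
print(is_proper_constructor(TApp(endo, TVar("g")), {"g": STAR}))  # False
```

The two abstractions are the expressions i) and ii) from the motivation; both are proper type-constructors, while applying the first one to a type-variable gives something of kind $*$ again.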

PROPERTIES : This section is going to be succinct and I will mention the 3 most important properties about $λ_{ω}$ :

Theorem 1 : Uniqueness of Types - $\forall A \in Λ_{ω}$ , if, $(Γ ⊢ A$ : $B_{1}) \land (Γ ⊢ A$ : $B_{2})$ , then, $B_{1} =_{β} B_{2}$ .

This Theorem formalizes the fact that a term can only be typed by two types if those types are $β$ -equivalent. The proof follows by Induction on the ways a term $M \in Λ_{ω}$ can be built, using the $(conv)$ rule for the Inductive Hypothesis.

Theorem 2 : Church-Rosser Theorem - $\forall M, N_{1}, N_{2} \in Λ_{ω}$ such that $(M \to_{β} N_{1}) \land (M \to_{β} N_{2})$ , $\exists$ $N \in Λ_{ω}$ such that $(N_{1} \to_{β} N) \land (N_{2} \to_{β} N)$ .

For a generic Proof of this Theorem, one may refer to the work by Henk Barendregt or that of Takahashi, 1995. The important part is that the theorem holds for the System $λ_{ω}$ .

Theorem 3 : Strong Normalization Theorem - Every legal (i.e. typable) $M \in Λ_{ω}$ is strongly normalizing: there is no infinite sequence of single $β$ -steps $M \to_{β} M_{1} \to_{β} M_{2} \to_{β} \ldots$ , so every reduction path from $M$ ends, after finitely many steps, in some $M_{n}$ that is in $β$ -nf.

This Theorem basically states that evaluating a legal term of $λ_{ω}$ always terminates, whichever redex is contracted at each step (so there is no infinite looping). The Proof for this Theorem is highly non-trivial and uses the concept of Logical Relations to find a property that is an inductive invariant for the System.
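Theorems 2 and 3 can be observed (though of course not proved) on small examples. The following Python sketch explores every reduction path of a well-kinded type-level expression and collects the normal forms it reaches. All names here (`reducts`, `normal_forms` and the AST classes) are hypothetical, and the substitution assumes that bound names never clash with the free variables being substituted in.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TVar:  # a type-variable
    name: str


@dataclass(frozen=True)
class Arrow:  # the arrow type A -> B
    left: object
    right: object


@dataclass(frozen=True)
class TLam:  # type-level abstraction: lambda var : kind . body
    var: str
    kind: object
    body: object


@dataclass(frozen=True)
class TApp:  # type-level application
    fun: object
    arg: object


def subst(t, name, s):
    """Replace free `name` in `t` by `s` (assumes no variable capture can occur)."""
    if isinstance(t, TVar):
        return s if t.name == name else t
    if isinstance(t, Arrow):
        return Arrow(subst(t.left, name, s), subst(t.right, name, s))
    if isinstance(t, TLam):
        if t.var == name:
            return t
        return TLam(t.var, t.kind, subst(t.body, name, s))
    return TApp(subst(t.fun, name, s), subst(t.arg, name, s))


def reducts(t):
    """All expressions obtained from `t` by contracting exactly one redex."""
    out = []
    if isinstance(t, TApp):
        if isinstance(t.fun, TLam):  # the redex at the root
            out.append(subst(t.fun.body, t.fun.var, t.arg))
        out += [TApp(f, t.arg) for f in reducts(t.fun)]
        out += [TApp(t.fun, a) for a in reducts(t.arg)]
    elif isinstance(t, Arrow):
        out += [Arrow(l, t.right) for l in reducts(t.left)]
        out += [Arrow(t.left, r) for r in reducts(t.right)]
    elif isinstance(t, TLam):
        out += [TLam(t.var, t.kind, b) for b in reducts(t.body)]
    return out


def normal_forms(t):
    """Normal forms reachable from `t`, following every reduction path.

    Terminates only if `t` has no infinite reduction path.
    """
    rs = reducts(t)
    if not rs:  # no redex left: `t` is in normal form
        return {t}
    return set().union(*(normal_forms(r) for r in rs))


endo = TLam("a", "*", Arrow(TVar("a"), TVar("a")))  # lambda a : * . a -> a
ident = TLam("b", "*", TVar("b"))                   # lambda b : * . b
m = TApp(endo, TApp(ident, TVar("g")))              # two redexes, so two paths

print(len(reducts(m)))                                     # 2
print(normal_forms(m) == {Arrow(TVar("g"), TVar("g"))})    # True
```

Here `m` can be reduced outer-redex first or inner-redex first; both paths terminate and meet in the same normal form, $γ \to γ$ , which is what the two theorems lead us to expect.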

At this point, we have covered a lot of ground and laid the foundations for $λ_{ω}$ by stating the motivation behind its construction, its semantics, and its three most crucial properties.

So, until the next post (where we will cover the Calculus-of-Constructions), the reader is encouraged to prove the 3 Properties!