# User:Michiexile/MATH198/Lecture 10

(Difference between revisions)

IMPORTANT NOTE: THESE NOTES ARE STILL UNDER DEVELOPMENT. PLEASE WAIT UNTIL AFTER THE LECTURE WITH HANDING ANYTHING IN, OR TREATING THE NOTES AS READY TO READ.

This lecture will be shallow, and leave many things undefined, hinted at, and is mostly meant as an appetizer, enticing the audience to go forth and seek out the literature on topos theory for further studies.

## Contents

### 1 Subobject classifier

One very useful property of the category Set is that the powerset of a given set is still a set; we have an internal concept of object of all subobjects. Certainly, for any category (small enough) C, we have a contravariant functor $Sub(-): C\to Set$ taking an object to the set of all equivalence classes of monomorphisms into that object; with the image Sub(f) given by the pullback diagram

If the functor Sub( − ) is representable - meaning that there is some object $X\in C_0$ such that Sub( − ) = hom( − ,X) - then the theory surrounding representable functors, connected to the Yoneda lemma - give us a number of good properties.

One of them is that every representable functor has a universal element; a generalization of the kind of universal mapping properties we've seen in definitions over and over again during this course; all the definitions that posit the unique existence of some arrow in some diagram given all other arrows.

Thus, in a category with a representable subobject functor, we can pick a representing object $\Omega\in C_0$, such that Sub(X) = hom(X,Ω). Furthermore, picking a universal element corresponds to picking a subobject $\Omega_0\hookrightarrow\Omega$ such that for any object A and subobject $A_0\hookrightarrow A$, there is a unique arrow $\chi: A\to\Omega$ such that there is a pullback diagram

One can prove that Ω0 is terminal in C, and we shall call Ω the subobject classifier, and this arrow $\Omega_0=1\to\Omega$ true. The arrow χ is called the characteristic arrow of the subobject.

In Set, all this takes on a familiar tone: the subobject classifier is a 2-element set, with a true element distinguished; and a characteristic function of a subset takes on the true value for every element in the subset, and the other (false) value for every element not in the subset.

### 2 Defining topoi

Definition A topos is a cartesian closed category with all finite limits and with a subobject classifier.

It is worth noting that this is a far stronger condition than anything we can even hope to fulfill for the category of Haskell types and functions. The functional programming relevance will take a back seat in this lecture, in favour of usefulness in logic and set theory replacements.

### 3 Properties of topoi

The meat is in the properties we can prove about topoi, and in the things that turn out to be topoi.

Theorem Let E be a topos.

• E has finite colimits.

#### 3.1 Power object

Since a topos is closed, we can take exponentials. Specifically, we can consider $[A\to\Omega]$. This is an object such that $hom(B,[A\to\Omega]) = hom(A\times B, \Omega) = Sub(A\times B)$. Hence, we get an internal version of the subobject functor. (pick B to be the terminal object to get a sense for how global elements of $[A\to\Omega]$ correspond to subobjects of A)

#### 3.2 Internal logic

We can use the properties of a topos to develop a logic theory - mimicking the development of logic by considering operations on subsets in a given universe:

Classically, in Set, and predicate logic, we would say that a predicate is some function from a universe to a set of truth values. So a predicate takes some sort of objects, and returns either True or False.

Furthermore, we allow the definition of sets using predicates:

$\{x\in U: P(x)\}$

Looking back, though, there is no essential difference between this, and defining the predicate as the subset of the universe directly; the predicate-as-function appears, then, as the characteristic function of the subset. And types are added as easily - we specify each variable, each object, to have a set it belongs to.

This way, predicates really are subsets. Type annotations decide which set the predicate lives in. And we have everything set up in a way that opens up for the topos language above.

We'd define, for predicates P,Q acting on the same type:

$\{x\in A : \top\} = A$
$\{x\in A : \bot\} = \emptyset$
$\{x : (P \wedge Q)(x)\} = \{x : P(x)\} \cap \{x : Q(x)\}$
$\{x : (P \vee Q)(x)\} = \{x : P(x)\} \cup \{x : Q(x)\}$
$\{x\in A : (\neg P)(x) \} = A \setminus \{x\in A : P(x)\}$

We could then start to define primitive logic connectives as set operations; the intersection of two sets is the set on which both the corresponding predicates hold true, so $\wedge = \cap$. Similarily, the union of two sets is the set on which either of the corresponding predicates holds true, so $\vee = \cup$. The complement of a set, in the universe, is the negation of the predicate, and all other propositional connectives (implication, equivalence, ...) can be built with conjunction (and), disjunction (or) and negation (not).

So we can mimic all these in a given topos:

We say that a universe U is just an object in a given topos. (Note that by admitting several universes, we arrive at a typed predicate logic, with basically no extra work.)

A predicate is a subobject of the universe.

We can now proceed to define all the familiar logic connectives one by one, using the topos setting. While doing this, we shall introduce the notation $t_A: A\to\Omega$ for the morphism $t_A = A\to 1\to^{true}\Omega$ that takes on the value true on all of A. We note that with this convention, $\chi_{A_0}$, the characteristic morphism of a subobject, is the arrow such that $\chi_{A_0}\circ i = t_{A_0}$.

Conjunction: Given predicates P,Q, we need to define the conjunction $P\wedge Q$ as some $P\wedge Q: U\to\Omega$ that corresponds to both P and Q simultaneously.

We may define $true\times true: 1\to\Omega\times\Omega$, a subobject of $\Omega\times\Omega$. Being a subobject, this has a characteristic arrow $\wedge:\Omega\times\Omega\to\Omega$, that we call the conjunction arrow.

Now, we may define $\chi_P\times\chi_Q:U\to\Omega\times\Omega$ for subobjects $P,Q\subseteq U$ - and we take their composition $\wedge\circ\chi_P\times\chi_Q$ to be the characteristic arrow of the subobject $P\wedge Q$.

And, indeed, this results in a topoidal version of intersection of subobjects.

Implication: Next we define $\leq_1: \Omega_1\to\Omega\times\Omega$ to be the equalizer of $\wedge$ and proj1. Given $v, w: U\to\Omega$, we write $v\leq_1 w$ if $v\times w$ factors through $\leq_1$.

Using the definition of an equalizer we arrive at $v\leq_1 w$ iff $v = v\wedge w$. From this, we can deduce

$u\leq_1 true$
$u\leq_1 u$
If $u\leq_1 v$ and $v\leq_1 w$ then $u\leq_1 w$.
If $u\leq_1 v$ and $v\leq_1 u$ then u = v

and thus, $\leq_1$ is a partial order on $[U\to\Omega]$. Intuitively, $u\leq_1 v$ if v is at least as true as u.

This relation corresponds to inclusion on subobjects. Note that $\leq_1:\Omega_1\to\Omega\times\Omega$, given from the equalizer, gives us Ω1 as a relation on Ω - a subobject of $\Omega\times\Omega$. Specifically, it has a classifying arrow $\Rightarrow:\Omega\times\Omega\to\Omega$. We write $h\Rightarrow k = (\Rightarrow)\circ h\times k$. And for subobjects $P,Q\subseteq A$, we write $P\Rightarrow Q$ for the subobject classified by $\chi_P\Rightarrow\chi_Q$.

It turns out, without much work, that this $P\Rightarrow Q$ behaves just like classical implication in its relationship to $\wedge$.

Membership: We can internalize the notion of membership as a subobject $\in^U\subseteq U\times\Omega^U$, and thus get the membership relation from a pullback:

For elements $x\times h: 1\to U\times\Omega^U$, we write $x\in^U h$ for $x\times h\in\in^U$. Yielding a subset of the product $U\times\Omega^U$, this is readily interpretable as a relation relating things in U with subsets of U, so that for any x,h we can answer whether $x\in^Uh$. Both notations indicate $ev_A\circ h\times x = true$.

Universal quantification: For any object U, the maximal subobject of U is U itself, embedded with 1U into itself. There is an arrow $\tau_U:1\to\Omega^U$ represents this subobject. Being a map from 1, it is specifically monic, so it has a classifying arrow $\forall_U:\Omega^U\to\Omega$ that takes a given subobject of U to true precisely if it is in fact the maximal subobject.

Now, with a relation $r:R\to B\times A$, we define $\forall a. R$ by the following pullback:

where λχR comes from the universal property of the exponential.

Theorem For any $s:S\to B$ monic, $S\subseteq \forall a.R$ iff $S\times A\subseteq R$.

This theorem tells us that the subobject given by $\forall a.R$ is the largest subobject of B that is related by R to all of A.

Falsum: We can define the false truth value using these tools as $\forall w\in\Omega.w$. This might be familiar to the more advanced Haskell type hackers - as the type

x :: forall a. a
which has to be able to give us an element of any type, regardless of the type itself. And in Haskell, the only element that inhabits all types is
undefined
.

From a logical perspective, we use a few basic inference rules:

and connect them up to derive

for any φ not involving w - and we can always adjust any φ to avoid w.

Thus, the formula $\forall w.w$ has the property that it implies everything - and thus is a good candidate for the false truth value; since the inference

is the defining introduction rule for false.

Negation: We define negation the same way as in classical logic: $\neg \phi = \phi \Rightarrow false$.

Disjunction: We can define

$P\vee Q = \forall w. ((\phi\Rightarrow w)\wedge(\psi\Rightarrow w))\Rightarrow w$

Note that this definition uses one of our primary inference rules:

as the defining property for the disjunction, and we may derive any properties we like from these.

Existential quantifier: Finally, the existential quantifier is derived similarly to the disjunction - by figuring out a rule we want it to obey, and using that as a definition for it:

$\exists x.\phi = \forall w. (\forall x. \phi \Rightarrow w)\Rightarrow w$

Here, the rule we use as defining property is

Before we leave this exploration of logic, some properties worth knowing about: While we can prove $\neg(\phi\wedge\neg\phi)$ and $\phi\Rightarrow\neg\neg\phi$, we cannot, in just topos logic, prove things like

$\neg(\phi\wedge\psi)\Rightarrow(\neg\phi\vee\neg\psi)$
$\neg\neg\phi\Rightarrow\phi$

nor any statements like

$\neg(\forall x.\neg\phi)\Rightarrow(\exists x.\phi)$
$\neg(\forall x.\phi)\Rightarrow(\exists x.\neg\phi)$
$\neg(\exists x.\neg\phi)\Rightarrow(\forall x.\phi)$

We can, though, prove

$\neg(\exists x.\phi)\Rightarrow(\forall x.\neg\phi)$

If we include, extra, an additional inference rule (called the Boolean negation rule) given by

then suddenly we're back in classical logic, and can prove $\neg\neg\phi\Rightarrow\phi$ and $\phi\or\neg\phi$.

#### 3.3 Examples: Sheaves, topology and time sheaves

The first interesting example of a topos is the category of (small enough) sets; in some sense clear already since we've been modelling our axioms and workflows pretty closely on the theory of sets.

Generating logic and set theory in the topos of sets, we get a theory that captures several properties of intuitionistic logic; such as the lack of Boolean negation, of exclusion of the third, and of double negation rules.

For the more interesting examples, however, we shall introduce the concepts of topology and of sheaf:

Definition A (set-valued) presheaf on a category C is a contravariant functor $E: C^{op}\to Set$.

Presheaves occur all over the place in geometry and topology - and occasionally in computer science too: There is a construction in which a functor $A\to Set$ for a discrete small category A identified with its underlying set of objects as a set, corresponds to the data type of bags of elements from A - for $a\in A$, the image F(a) denotes the multiplicity of a in the bag.

Theorem The category of all presheaves (with natural transformations as the morphisms) on a category C form a topos.

Example Pick a category on the shape

A contravariant functor on this category is given by a pair of sets G0,G1 and a pair of function $source, target: G_1\to G_0$. Identities are sent to identities.

The category of presheaves on this category, thus, is the category of graphs. Thus graphs form a topos.

The subobject classifier in the category of graphs is a graph with two nodes: in and out, and five arrows:

$in \to^{all} in$
$in \to^{both} in$
$in \to^{source} out$
$out \to^{target} in$
$out \to^{neither} out$

Now, given a subgraph $H \leq G$, we define a function $\chi_H:G\to\Omega$ by sending nodes to in or out dependent on their membership. For an arrow a, we send it to all if the arrow is in H, and otherwise we send it to both/source/target/neither according to where its source and target reside.

To really get into sheaves, though, we introduce more structure - specifically, we define what we mean by a topology:

Definition Suppose P is a partially ordered set. We call P a complete Heyting algebra if

• There is a top element 1 such that $x\leq 1 \forall x\in P$.
• Any two elements x,y have an infimum (greatest lower bound) $x\wedge y$.
• Every subset $Q\subseteq P$ has a supremum (least upper bound) $\bigvee_{p\in P} p$.
• $x\wedge(\bigvee y_i) = \bigvee x\wedge y_i$

Note that for the partial order by inclusion of a family of subsets of a given set, being a complete Heyting algebra is the same as being a topology in the classical sense - you can take finite unions and any intersections of open sets and still get an open set.

If {xi} is a subset with supremum x, and E is a presheaf, we get functions $e_i:E(x)\to E(x_i)$ from functoriality. We can summarize all these ei into $e = \prod_i e_i: E(x)\to\prod_i E(x_i)$.

Furthermore, functoriality gives us families of functions $c_{ij}: E(x_i)\to E(x_i\wedge x_j)$ and $d_{ij}: E(x_j)\to E(x_i\wedge x_j)$. These can be collected into $c: \prod_i E(x_i)\to\prod_{ij}E(x_i\wedge x_j)$ and $d:\prod_j E(x_j)\to\prod_{ij}E(x_i\wedge x_j)$.

Definition A presheaf E on a Heyting algebra is called a sheaf if it satisfies:

$x = \bigvee x_i$

implies that

is an equalizer. If you have seen sheaves before, you may recognize this as the covering axiom.

In other words, E is a sheaf if whenever $x=\bigvee x_i$ and c(α) = d(α), then there is some $\bar\alpha$ such that $\alpha = e(\bar\alpha)$.

Theorem The category of sheaves on a Heyting algebra is a topos.

For context, we can think of sheaves over Heyting algebras as sets in a logic with an expanded notion of truth. Our Heyting algebra is the collection of truth values, and the sheaves are the fuzzy sets with fuzziness introduced by the Heyting algebra.

Recalling that subsets and predicates are viewed as the same thing, we can view the set E(p) as the part of the fuzzy set E that is at least p true.

As it turns out, to really make sense of this approach, we realize that equality is a predicate as well - and thus can hold or not depending on the truth value we use.

Definition Let P be a complete Heyting algebra. A P-valued set is a pair (S,σ) of a set S and a function $\sigma: S\to P$. A category of fuzzy sets is a category of P-valued sets. A morphism $f:(S,\sigma)\to(T,\tau)$ of P-valued sets is a function $f:S\to T$ such that $\tau\circ f = \sigma$.

From these definitions emerges a fuzzy set theory where all components of it being a kind of set theory emerges from the topoidal approach above. Thus, say, subsets in a fuzzy sense are just monics, thus are injective on the set part, and such that the valuation, on the image of the injection, increases from the previous valuation: $(T,\tau)\subseteq(S,\sigma)$ if $T\subseteq S$ and σ | T = τ.

To get to topoi, though, there are a few matters we need to consider. First, we may well have several versions of the empty set - either a bona fide empty set, or just a set where every element is never actually there. This issue is minor. Much more significant though, is that while we can easily make (S,σ) give rise to a presheaf, by defining

$E(x) = \{s\in S: \sigma(s)\geq x\}$

this definition will not yield a sheaf. The reason for this boils down to $E(0) = S \neq 1$. We can fix this, though, by adjoining another element - $\bot$ - to P giving P + . The new element $\bot$ is imbued with two properties: it is smaller, in P + , than any other element, and it is mapped, by E to 1.

Theorem The construction above gives a fuzzy set (S,σ) the structure of a sheaf on the augmented Heyting algebra.

Corollary The category of fuzzy sets for a Heyting algebra P forms a topos.

Final note While this construction allows us to make membership a fuzzy concept, we're not really done fuzzy-izing sets. There are two fundamental predicates on sets: equality and membership. While fuzzy set theory, classically, only allows us to make one of these fuzzy, topos theory allows us - rather easily - to make both these predicates fuzzy. Not only that, but membership reduces - with the power object construction - to equality testing, by which the fuzzy set theory ends up somewhat inconsistent in its treatment of the predicates.

### 4 Literature

At this point, I would warmly recommend the interested reader to pick up one, or more, of:

• Steve Awodey: Category Theory
• Michael Barr & Charles Wells: Categories for Computing Science
• Colin McLarty: Elementary Categories, Elementary Toposes

or for more chewy books

• Peter T. Johnstone: Sketches of an Elephant: a Topos Theory compendium
• Michael Barr & Charles Wells: Toposes, Triples and Theories

### 5 Exercises

No homework at this point. However, if you want something to think about, a few questions and exercises: