Difference between revisions of "User:Michiexile/MATH198/Lecture 1"

Revision as of 14:19, 8 September 2009

Welcome, administrivia

I'm Mikael Vejdemo-Johansson. I can be reached in my office 383-BB, especially during my office hours; or by email to mik@math.stanford.edu.

I strongly encourage student interaction.

Introduction

Why this course?

An introduction to Haskell will usually come with pointers toward Category Theory as a useful tool, though not with much more than the mention of the subject. This course is intended to fill that gap, and provide an introduction to Category Theory that ties into Haskell and functional programming as a source of examples and applications.

What will we cover?

The definition of categories, special objects and morphisms, functors, natural transformations, (co-)limits and special cases of these, adjunctions, freeness and presentations as categorical constructs, monads and Kleisli arrows, recursion with categorical constructs.

Maybe, just maybe, if we have enough time, we'll finish by looking at the definition of a topos and how it encodes logic internal to a category, with applications to fuzzy sets.

What do we require?

Our examples will be drawn from discrete mathematics, logic, Haskell programming and linear algebra. I expect the following concepts to be at least vaguely familiar to anyone taking this course:

Sets
Functions
Permutations
Groups
Partially ordered sets
Vector spaces
Linear maps
Matrices
Homomorphisms

Category

Graphs

We recall the definition of a (directed) graph. A graph $G$ is a collection of edges (arrows) and vertices (nodes). Each edge is assigned a source node and a target node.

$source\to target$

Given a graph $G$ , we denote the collection of nodes by $G_{0}$ and the collection of arrows by $G_{1}$ . These two collections are connected, and the graph is given its structure, by two functions: the source function $s:G_{1}\to G_{0}$ and the target function $t:G_{1}\to G_{0}$ .

We shall not, in general, require either of the collections to be a set, but will happily accept larger collections; dealing with set-theoretical paradoxes as and when we have to. A graph where both nodes and arrows are sets shall be called small. A graph where either is a class shall be called large.

If both $G_{0}$ and $G_{1}$ are finite, the graph is called finite too.

The empty graph has $G_{0}=G_{1}=\emptyset$ .

A discrete graph has $G_{1}=\emptyset$ .

A complete graph has $G_{1}=\{(v,w)|v,w\in G_{0}\}$ .

A simple graph has at most one arrow between each pair of nodes. Any relation on a set can be interpreted as a simple graph.

Show some examples.
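
As one possible set of examples, here is a minimal Haskell sketch of finite graphs stored as two lists together with the source and target functions. The names `Graph`, `nodes`, `arrows`, `src`, `tgt`, `discrete`, `complete`, `fromRelation` and `isSimple` are illustrative choices and not part of the lecture; lists stand in for the collections $G_{0}$ and $G_{1}$ , so only small graphs are covered.

```haskell
-- A finite directed graph: the two collections G0 and G1 together
-- with the source and target functions s, t : G1 -> G0.
-- All names in this sketch are illustrative assumptions.
data Graph n a = Graph
  { nodes  :: [n]      -- G0
  , arrows :: [a]      -- G1
  , src    :: a -> n   -- s
  , tgt    :: a -> n   -- t
  }

-- The discrete graph on a list of nodes: G1 is empty.
discrete :: [n] -> Graph n (n, n)
discrete ns = Graph ns [] fst snd

-- The complete graph: one arrow (v, w) for every pair of nodes.
complete :: [n] -> Graph n (n, n)
complete ns = Graph ns [(v, w) | v <- ns, w <- ns] fst snd

-- A relation, given as a duplicate-free list of pairs, read as a graph.
fromRelation :: [n] -> [(n, n)] -> Graph n (n, n)
fromRelation ns rel = Graph ns rel fst snd

-- All arrows with source v and target w.
arrowsBetween :: Eq n => Graph n a -> n -> n -> [a]
arrowsBetween g v w = [e | e <- arrows g, src g e == v, tgt g e == w]

-- Simple: at most one arrow from v to w for each ordered pair (v, w).
isSimple :: Eq n => Graph n a -> Bool
isSimple g =
  and [length (arrowsBetween g v w) <= 1 | v <- nodes g, w <- nodes g]
```

For instance, `complete "abc"` has nine arrows and is simple, while a graph with two distinct arrows from `'a'` to `'b'` is not.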

A homomorphism $f:G\to H$ of graphs is a pair of functions $f_{0}:G_{0}\to H_{0}$ and $f_{1}:G_{1}\to H_{1}$ such that sources map to sources and targets map to targets, or in other words:

$s(f_{1}(e))=f_{0}(s(e))$
$t(f_{1}(e))=f_{0}(t(e))$
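
The two conditions can be checked mechanically for finite graphs. The following Haskell sketch assumes a list-based `Graph` record (an illustrative encoding, not something fixed by the lecture) and tests only the two equations above; it does not verify that $f_{0}$ and $f_{1}$ land inside the node and arrow lists of $H$ .

```haskell
-- Illustrative encoding of a finite graph; all names are assumptions.
data Graph n a = Graph
  { nodes  :: [n]
  , arrows :: [a]
  , src    :: a -> n
  , tgt    :: a -> n
  }

-- Check s(f1 e) = f0 (s e) and t(f1 e) = f0 (t e) for every arrow e of g.
isHomomorphism
  :: Eq m => Graph n a -> Graph m b -> (n -> m) -> (a -> b) -> Bool
isHomomorphism g h f0 f1 = all respects (arrows g)
  where
    respects e =
      src h (f1 e) == f0 (src g e) && tgt h (f1 e) == f0 (tgt g e)

-- The graph 0 -> 1 -> 2, with arrows encoded as (source, target) pairs.
line :: Graph Int (Int, Int)
line = Graph [0, 1, 2] [(0, 1), (1, 2)] fst snd

-- A single node carrying a single loop.
loop :: Graph () ((), ())
loop = Graph [()] [((), ())] fst snd
```

Collapsing everything onto the loop, `isHomomorphism line loop (const ()) (const ((), ()))`, satisfies both equations. Keeping the nodes fixed while reversing every arrow of `line` does not, since sources would be sent to targets.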

By a path in a graph $G$ from the node $x$ to the node $y$ of length $k$ , we mean a sequence of edges $(f_{1},f_{2},\dots ,f_{k})$ such that:

$s(f_{k})=x$
$t(f_{1})=y$
$t(f_{i})=s(f_{i-1})$ for $2\leq i\leq k$ , so the path is written in composition order, with $f_{k}$ traversed first.

Paths with start and end point identical are called closed. For any node $x$ , there is a unique closed path $()$ starting and ending in $x$ of length 0.

For any edge $f$ , there is a unique path from $s(f)$ to $t(f)$ of length 1: $(f)$ .

We denote by $G_{k}$ the set of paths in $G$ of length $k$ .
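
A short Haskell sketch may help to fix the indexing convention, in which the last edge of the sequence is the one leaving $x$ . It assumes arrows are encoded as (source, target) pairs of `Int`; the names `Arrow`, `isPath` and `pathsOfLength` are hypothetical. Paths of length 0 need the node collection and are left out of the enumeration.

```haskell
-- Hypothetical encoding: an arrow is a (source, target) pair.
type Arrow = (Int, Int)

s, t :: Arrow -> Int
s = fst
t = snd

-- Is (f1, ..., fk) a path from x to y? That is: s(fk) = x, t(f1) = y,
-- and t(f_i) = s(f_{i-1}) for 2 <= i <= k.
isPath :: Int -> Int -> [Arrow] -> Bool
isPath x y [] = x == y          -- the empty path is closed
isPath x y fs@(f1 : rest) =
  s (last fs) == x
    && t f1 == y
    && and (zipWith (\prev next -> t next == s prev) fs rest)

-- G_k for k >= 1: extend each path of length k - 1 by putting a new
-- arrow in front, whose source is the target of the old first arrow.
pathsOfLength :: [Arrow] -> Int -> [[Arrow]]
pathsOfLength _ k
  | k < 1 = error "pathsOfLength: this sketch only handles k >= 1"
pathsOfLength g 1 = [[f] | f <- g]
pathsOfLength g k =
  [f : p | p@(f2 : _) <- pathsOfLength g (k - 1), f <- g, s f == t f2]

-- A triangle 0 -> 1 -> 2 -> 0.
triangle :: [Arrow]
triangle = [(0, 1), (1, 2), (2, 0)]
```

With this convention the path from 0 to 2 that first follows `(0, 1)` and then `(1, 2)` is written `[(1, 2), (0, 1)]`; the reversed list fails `isPath 0 2`.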

Morphisms and objects

Some morphisms and some objects are special enough to garner special names that we will use regularly.

Isomorphisms and existence of inverses.
Epi- and mono-morphisms and cancellability.
- Examples in concrete categories.
- Monomorphisms and subobjects:
  - Factoring through. Equivalence relation by mutual factoring.
  - Subobjects as equivalence classes of monomorphisms.
- Splitting and the existence of inverses.
Terminal and initial objects.
- Constants. Pointless sets.

Examples

The empty category.
The one object/one arrow category $1$
The categories $2$ and $1+1$
The category Set of sets.
The category FSet of finite sets.
The category PFn of sets and partial functions.
- $PFn(A,B)$ is a partially ordered set.
Every partial order is a category. Each hom-set has at most one element.
Every monoid is a category. Only one object.
- Kleene closure. Free monoids.
The category of Sets and injective functions.
The category of Sets and surjective functions.
The category of $k$ -vector spaces and linear maps.
The category with objects the natural numbers and $Hom(m,n)$ the set of $m\times n$ -matrices.
The category of Data Types with Computable Functions.
- Our ideal programming language has:
  - Primitive data types.
  - Constants of each primitive type.
  - Operations, given as functions between types.
  - Constructors, producing elements from data types, and producing derived data types and operations.
- We will assume that the language is equipped with
  - A do-nothing operation for each data type. Haskell has id.
  - A unit type $1$ (a type with exactly one value), with the property that each type has exactly one function to this type. Haskell has (). We will use this to define the constants of type $t$ as functions $1\to t$ . Thus, constants end up being 0-ary functions.
  - A composition constructor, taking an operator $f:A\to B$ and another operator $g:B\to C$ and producing an operator $g\circ f:A\to C$ . Haskell has (.).
- This allows us to model a functional programming language with a category.
The category with objects logical propositions and arrows proofs.
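
To make the data-types example concrete, the three assumed ingredients (`id`, `()` and `(.)`) can be tried out directly in Haskell. The sketch below is only an illustration: the names `Constant`, `three`, `toUnit`, `double`, `describe` and `pipeline` are made up for this purpose, and the uniqueness of the function into `()` holds only if non-terminating and undefined values are ignored.

```haskell
-- A constant of type t, modelled as a function from the unit type ().
type Constant t = () -> t

-- The constant 3 as a 0-ary function.
three :: Constant Int
three () = 3

-- The function from any type to (), unique up to undefined values.
toUnit :: a -> ()
toUnit _ = ()

-- Two operations between types.
double :: Int -> Int
double = (* 2)

describe :: Int -> String
describe n = "value " ++ show n

-- The composition constructor (.): g . f applies f first, then g.
pipeline :: Constant String
pipeline = describe . double . three
```

Here `pipeline ()` evaluates to `"value 6"`, and composing with `id` on either side changes nothing pointwise: `(id . double) 7` and `(double . id) 7` both give `14`.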

Homework

For a passing mark, a written, acceptable solution to at least 2 of the 4 questions should be given no later than midnight before the next lecture.

Prove the general associative law: that for any path, and any bracketing of that path, the same composition may be found.
Suppose $u:A\to A$ in some category $C$ .
1. If $g\circ u=g$ for all $g:A\to B$ in the category, then $u=1_{A}$ .
2. If $u\circ h=h$ for all $h:B\to A$ in the category, then $u=1_{A}$ .
3. These two results characterize the objects in a category by the properties of their corresponding identity arrows completely.
For as many of the examples given as you can, prove that they really do form a category. Passing mark is at least 60% of the given examples.
For this question, all parts are required:
1. For which sets is the free monoid on that set commutative?
2. Prove that for any category $C$ , the set $Hom(A,A)$ is a monoid under composition for every object $A$ .

Graphs and paths

A graph is a collection $G_{0}$ of vertices and a collection $G_{1}$ of arrows. The structure of the graph is captured in the existence of two functions, that we shall call source and target, both going from $G_{1}$ to $G_{0}$ . In other words, each arrow has a source and a target.

We denote by $[v,w]$ the collection of arrows with source $v$ and target $w$ .

We extend the notation, and denote by $G_{i}$ the collection of all paths of length $i$ . Such a path is a sequence $f_{1},\dots ,f_{i}$ of arrows, such that for each $j$ with $2\leq j\leq i$ , $target(f_{j-1})=source(f_{j})$ .

Definition of a category

A category is a graph with some special structure:

Each $[v,w]$ is a set, and there is a composition operation $[u,v]\times [v,w]\to [u,w]$ . In other words, any two arrows such that the target of the first is the source of the second can be composed to give a new arrow from the source of the first to the target of the second.

We write $f:u\to v$ if $f\in [u,v]$ .

$u\to v\to w\;\Rightarrow \;u\to w$

The composition of arrows is associative.
Each vertex $v$ has a dedicated arrow $1_{v}$ with source and target $v$ , called the identity arrow.
Each identity arrow is a left- and right-identity for the composition operation.

The composition of $f:u\to v$ with $g:v\to w$ is denoted by $gf:u\to v\to w$ . A mnemonic here is that you write things so associativity looks right. Hence, $(gf)(x)=g(f(x))$ . This will make more sense once we get around to generalized elements later on.
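
Written out as equations in this notation, for arbitrary arrows $f:u\to v$ , $g:v\to w$ and $h:w\to x$ (the names $h$ and $x$ are introduced here only for illustration), the associativity and identity requirements read:

```latex
\begin{align*}
  h(gf) &= (hg)f     && \text{(associativity)} \\
  f\,1_{u} &= f      && \text{(right identity)} \\
  1_{v}\,f &= f      && \text{(left identity)}
\end{align*}
```

Both sides of the first equation are arrows $u\to x$ , so a composable chain can be written without brackets.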

Examples

The empty category with no vertices and no arrows.
The category 1 with a single vertex and only its identity arrow.
The category 2 with two objects, their identity arrows and the arrow $a\to b$ .
For vertices take vector spaces. For arrows, take linear maps. This is a category, the identity arrow is just the identity map $f(x)=x$ and composition is just function composition.
For vertices take finite sets. For arrows, take functions.
For vertices take logical propositions. For arrows take proofs in propositional logic. The identity arrow is the empty proof: P proves P without an actual proof. And if you can prove P using Q and then R using P, then this composes to a proof of R using Q.
For vertices, take data types. For arrows take (computable) functions. This forms a category, in which we can discuss an abstraction that mirrors most of Haskell. There are issues making Haskell not quite a category on its own, but we get close enough to draw helpful conclusions and analogies.
Suppose P is a set equipped with a partial ordering relation ≤. Then we can form a category out of this set with elements for vertices and with a single element in [v,w] if and only if v≤w. Reflexivity of the ordering provides the identity arrows and transitivity provides the composition, so this forms a category.
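
The last example can be sketched in Haskell, taking the usual ordering `<=` on `Int` as the partial order. The names `Leq`, `mkArrow`, `identity` and `compose` are hypothetical, and the sketch assumes arrows are only ever built through `mkArrow` and `identity`, so that a value `Leq v w` always witnesses `v <= w`.

```haskell
-- An arrow v -> w in the category built from (Int, <=).
-- Assumption: values are only created via mkArrow or identity.
data Leq = Leq Int Int
  deriving (Eq, Show)

-- [v,w] has exactly one element when v <= w and none otherwise.
mkArrow :: Int -> Int -> Maybe Leq
mkArrow v w
  | v <= w    = Just (Leq v w)
  | otherwise = Nothing

-- Reflexivity supplies the identity arrow on each vertex.
identity :: Int -> Leq
identity v = Leq v v

-- Transitivity supplies composition: from u <= v and v <= w we get
-- u <= w. As in the notation gf, the later arrow is the first argument.
compose :: Leq -> Leq -> Maybe Leq
compose (Leq v' w) (Leq u v)
  | v == v'   = Just (Leq u w)
  | otherwise = Nothing   -- the arrows do not meet, so no composite
```

For example, `compose (Leq 2 5) (Leq 1 2)` gives `Just (Leq 1 5)`, and composing with `identity` on either side returns the original arrow. Since each $[v,w]$ has at most one element, associativity holds automatically.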

Some language we want settled:

A category is concrete if it is like the vector spaces and the sets among the examples - the collection of all sets-with-specific-additional-structure equipped with all functions-respecting-that-structure. We require already that [v,w] is always a set.

A category is small if the collection of all vertices, too, is a set.

@@ Line 1: / Line 1: @@
 ==Welcome, administrativia==
+I'm Mikael Vejdemo-Johansson. I can be reached in my office 383-BB, especially during my office hours; or by email to mik@math.stanford.edu.
+I encourage, strongly, student interactions.
 ==Introduction==
@@ Line 29: / Line 33: @@
 ===Graphs===
-* Graph is vertices and edges. Each edge has source and target.
+We recall the definition of a ''(directed) graph''. A graph <math>G</math> is a collection of ''edges (arrows)'' and ''vertices (nodes)''. Each edge is assigned a ''source'' node and a ''target'' node.
-* Notation: <math>G</math> a graph, <math>G_0</math> the vertices, <math>G_1</math> the edges, or arrows.
+<math>source \to target</math>
-* Some examples.
-* Complete structure given by <math>G_0</math>, <math>G_1</math> and the two functions <math>s,t:G_1\to G_0</math>.
+Given a graph <math>G</math>, we denote the collection of nodes by <math>G_0</math> and the collection of arrows by <math>G_1</math>. These two collections are connected, and the graph given its structure, by two functions: the source function <math>s:G_1\to G_0</math> and the target function <math>t:G_1\to G_0</math>.
+We shall not, in general, require either of the collections to be a set, but will happily accept larger collections; dealing with set-theoretical paradoxes as and when we have to. A graph where both nodes and arrows are sets shall be called ''small''. A graph where either is a class shall be called ''large''.
+If both <math>G_0</math> and <math>G_1</math> are finite, the graph is called ''finite'' too.
+The ''empty graph'' has <math>G_0 = G_1 = \emptyset</math>.
+A ''discrete graph'' has <math>G_1=\emptyset</math>.
+A ''complete graph'' has <math>G_1 = \{ (v,w) | v,w\in G_0\}</math>.
+A ''simple graph'' has at most one arrow between each pair of nodes. Any relation on a set can be interpreted as a simple graph.
+* Show some examples.
+A ''homomorphism'' <math>f:G\to H</math> of graphs is a pair of functions <math>f_0:G_0\to H_0</math> and <math>f_1:G_1\to H_1</math> such that sources map to sources and targets map to targets, or in other words:
-* Endoarrow, empty graph, discrete graph, complete graph, small graph, large graph, set theoretical traps, finite graph, simple graphs.
+* <math>s(f_1(e)) = f_0(s(e))</math>
+* <math>t(f_1(e)) = f_0(t(e))</math>
+By a ''path'' in a graph <math>G</math> from the node <math>x</math> to the node <math>y</math> of length <math>k</math>, we mean a sequence of edges <math>(f_1,f_2,\dots,f_k)</math> such that:
-* Homomorphism: Map edges to edges, vertices to vertices, respects source and target maps.
+* <math>s(f_k)=x</math>
+* <math>t(f_1)=y</math>
+* <math>t(f_i) = s(f_{i-1})</math> for all other <math>i</math>.
-* Paths: A path from the node <math>x</math> to the node <math>y</math> is some sequence of edges <math>(f_1,\dots,f_n)</math> with:
+Paths with start and end point identical are called ''closed''. For any node <math>x</math>, there is a unique closed path <math>()</math> starting and ending in <math>x</math> of length 0.
-** <math>s(f_n)=x</math>
-** <math>t(f_1)=y</math>
-** <math>t(f_i) = s(f_{i-1})</math> for all other <math>i</math>.
-* For any node <math>x</math>, there is a unique path from <math>x</math> to <math>x</math> of length 0: <math>()</math>.
+For any edge <math>f</math>, there is a unique path from <math>s(f)</math> to <math>t(f)</math> of length 1: <math>(f)</math>.
-* For any edge <math>f</math>, there is a unique path from <math>s(f)</math> to <math>t(f)</math> of length 1.
-* We denote by <math>G_k</math> the set of all paths in <math>G</math> of length <math>k</math>.
+We denote by <math>G_k</math> the set of paths in <math>G</math> of length <math>k</math>.
-* This is compatible with out previous definition of <math>G_0,G_1</math>.
 ===Categories===
@@ Line 119: / Line 138: @@
 ===Homework===
-For a passing mark, a written, acceptable solution to at least 2 of the 3 questions should be given no later than midnight before the next lecture.
+For a passing mark, a written, acceptable solution to at least 2 of the 4 questions should be given no later than midnight before the next lecture.
 # Prove the general associative law: that for any path, and any bracketing of that path, the same composition may be found.
@@ Line 125: / Line 144: @@
 ## If <math>g\circ u=g</math> for all <math>g:A\to B</math> in the category, then <math>u=1_A</math>.
 ## If <math>u\circ h=h</math> for all <math>h:B\to A</math> in the category, then <math>u=1_A</math>.
+## These two results characterize the objects in a category by the properties of their corresponding identity arrows completely.
 # For as many of the examples given as you can, prove that they really do form a category. Passing mark is at least 60% of the given examples.
 # For this question, all parts are required:
