Chapter 3 - Finite Automata

Finite Automata (FAs) are the simplest model of computation we will study. They are essentially programs that solve certain decision problems without using any variables or complex memory. They process input in a single pass, from left to right, and the result is known immediately after reading the last symbol.

3.1 Aims

Our goal here is not to provide a comprehensive course on Automata Theory. Instead, we use FAs as a didactic tool, a laboratory for introducing and understanding fundamental concepts of computation in a simple, visual way. We will explore:

Modeling Computation: How to formally describe a computation.
Core Concepts: We will define and build intuition for key ideas like Configuration, Computation Step, Simulation, Determinism, and Nondeterminism.
Proofs of Limitation: We will learn how to prove that a specific task is unsolvable by a given class of algorithms.

Grasping these concepts with FAs will make it much easier to understand them later in the more general and powerful context of Turing machines.

3.2 Representations of Finite Automata

To define a model of computation, we must answer a few basic questions:

What elementary operations are available?
How much memory is available, and how is it used?
How is input provided and output determined?

For FAs, the answer is stark: there is no memory besides the program itself and a pointer to the current instruction. This means no variables. The only changing piece of information is the program counter, which we will come to know as the automaton’s state.

A Program-like Representation

Let’s imagine an FA as a simple program. If the input alphabet is $Σ = {a_{1}, \dots, a_{k}}$ , the only allowed operation is a multi-way branch based on the next input symbol. Each line of the program corresponds to a state.

Consider a program A over $Σ_{b oo l} = {0, 1}$ with four lines (states 0, 1, 2, 3). The program starts at line 0. A set of lines $F$ are designated as “accepting” lines. If the program finishes on a line in $F$ , the input is accepted.

Example Program A:

States: ${0, 1, 2, 3}$
Accepting States: $F = {0, 3}$

Program Logic:

0: if input = 1 then goto 1 else goto 2
1: if input = 1 then goto 0 else goto 3
2: if input = 0 then goto 0 else goto 3
3: if input = 0 then goto 1 else goto 2

On input 1011, the execution trace is: Start at 0 → read 1 → goto 1 → read 0 → goto 3 → read 1 → goto 2 → read 1 → goto 3. The program ends at line 3. Since $3 \in F$ , the word 1011 is accepted.

Graphical Representation

This program-like structure can be visualized as a state transition diagram. Each state (line number) is a node. A directed edge from state i to j labeled with symbol a means if input = a then goto j when in state i.

The start state is marked with an incoming arrow.
Accepting states are marked with a double circle.

The diagram for Program A is:

Formal Definition

While intuitive, diagrams and programs are not ideal for formal proofs. The standard, rigorous way to define an FA is as a mathematical object.

Definition 3.1: Deterministic Finite Automaton (DFA)

A Deterministic Finite Automaton (DFA) is a 5-tuple $M = (Q, Σ, δ, q_{0}, F)$ , where:

$Q$ is a finite set of states.

$Σ$ is a finite set called the input alphabet.

$δ : Q \times Σ \to Q$ is the transition function.

$q_{0} \in Q$ is the start state.

$F \subseteq Q$ is the set of accepting states (or final states).

To describe the dynamics of a DFA, we define:

Configuration: A pair $(q, w) \in Q \times Σ^{*}$ , representing that the DFA is in state $q$ with the remaining input $w$ to be read.
Step: We write $(q, a w) ⊢_{M} (p, w)$ if the DFA takes a single step from configuration $(q, a w)$ to $(p, w)$ . This happens if and only if $δ (q, a) = p$ .
Computation: A sequence of configurations $C_{0}, C_{1}, \dots, C_{n}$ where $C_{i} ⊢_{M} C_{i + 1}$ for all $i$ . The computation on input $x$ is the sequence starting from the start configuration $(q_{0}, x)$ and ending in an end configuration $(q_{n}, λ)$ .
Acceptance: A DFA $M$ accepts a word $x$ if the computation on $x$ ends in a configuration $(q, λ)$ where $q \in F$ .
Language of a DFA: The language accepted by $M$ , denoted $L (M)$ , is the set of all words that $M$ accepts.
Regular Languages: The class of all languages accepted by some DFA is called the class of regular languages, denoted $L_{E A}$ .

For our example automaton $M$ , the computation on 1011 is: $(q_{0}, 1011) ⊢_{M} (q_{1}, 011) ⊢_{M} (q_{3}, 11) ⊢_{M} (q_{2}, 1) ⊢_{M} (q_{3}, λ)$ . Since $q_{3} \in F$ , the word is accepted.

The Meaning of States

The key to designing and understanding DFAs is to realize that each state represents a property of the prefix of the input read so far. The automaton has finite memory, embodied by its states. It partitions the infinite set of all possible strings $Σ^{*}$ into a finite number of equivalence classes, where all strings in a class drive the automaton to the same state.

For our example automaton, the states represent the parity of 0s and 1s seen so far:

$q_{0}$ : even0s, even1s

$q_{1}$ : even0s, odd1s

$q_{2}$ : odd0s, even1s

$q_{3}$ : odd0s, odd1s

The accepted language is $L (M) = {w \in {0, 1}^{*} ∣ ∣ w ∣_{0} is even and ∣ w ∣_{1} is even, OR ∣ w ∣_{0} is odd and ∣ w ∣_{1} is odd}$ . This is equivalent to ${w \in {0, 1}^{*} ∣ ∣ w ∣_{0} + ∣ w ∣_{1} is even}$ , i.e., all words of even length.

3.3 Simulations

A powerful technique in automata theory is to simulate multiple automata at once. This allows us to prove that the class of regular languages is closed under certain operations (union, intersection, complement).

Lemma 3.2: Closure Properties

If $L_{1}$ and $L_{2}$ are regular languages, then so are $L_{1} \cup L_{2}$ , $L_{1} \cap L_{2}$ , and $Σ^{*} ∖ L_{1}$ .

Proof Idea (Product Construction)

Let $M_{1} = (Q_{1}, Σ, δ_{1}, q_{01}, F_{1})$ and $M_{2} = (Q_{2}, Σ, δ_{2}, q_{02}, F_{2})$ be DFAs for $L_{1}$ and $L_{2}$ . We can construct a new DFA $M$ that runs $M_{1}$ and $M_{2}$ in parallel.

The states of $M$ are pairs $(q, p)$ where $q \in Q_{1}$ and $p \in Q_{2}$ .
The start state of $M$ is $(q_{01}, q_{02})$ .
The transition function is $δ ((q, p), a) = (δ_{1} (q, a), δ_{2} (p, a))$ .

The only difference for the various operations is the choice of accepting states $F$ :

For Intersection ( $L_{1} \cap L_{2}$ ): $F = F_{1} \times F_{2}$ . We accept if both $M_{1}$ and $M_{2}$ are in an accepting state.
For Union ( $L_{1} \cup L_{2}$ ): $F = (F_{1} \times Q_{2}) \cup (Q_{1} \times F_{2})$ . We accept if at least one of $M_{1}$ or $M_{2}$ is in an accepting state.
For Complement ( $Σ^{*} ∖ L_{1}$ ): We just flip the accepting and non-accepting states of $M_{1}$ . $F = Q_{1} ∖ F_{1}$ .

3.4 Proofs of Non-existence

How can we prove that a language is not regular? We must show that no DFA can accept it. The key is to exploit the DFA’s main weakness: its finite memory.

A DFA can only remember a finite amount of information about the input it has seen, because this information must be encoded in its current state, and there are only a finite number of states.

This leads to a crucial insight (the Pigeonhole Principle): if we feed a DFA an infinite number of different prefixes, it must eventually revisit a state.

Lemma 3.4: The Pumping Lemma for Regular Languages

For every regular language $L$ , there exists a constant $n_{L}$ (the pumping length) such that for any word $z \in L$ with $∣ z ∣ \geq n_{L}$ , $z$ can be decomposed into three parts, $z = uv w$ , satisfying:

$∣ uv ∣ \leq n_{L}$

$∣ v ∣ \geq 1$

For all $i \geq 0$ , the word $u v^{i} w$ is also in $L$ .

Intuition: Any sufficiently long word $z$ must cause the DFA to visit some state twice while reading the first $n_{L}$ symbols. The part of the string between these two visits is $v$ . Since we can traverse this loop ( $v$ ) any number of times (including zero) and still end up in the same state, the resulting string must also be accepted.

How to use it: To prove a language $L$ is not regular, we use a proof by contradiction.

Assume $L$ is regular. The Pumping Lemma gives us a pumping length $n_{L}$ .
Choose a clever word $z \in L$ such that $∣ z ∣ \geq n_{L}$ .
Show that for any possible decomposition $z = uv w$ that satisfies conditions (1) and (2), there is some $i$ for which $u v^{i} w \in / L$ .
This contradicts condition (3) of the lemma, so our initial assumption must be false. $L$ is not regular.

Example: Prove $L = {a^{n} b^{n} c^{n} ∣ n \in N}$ is not regular.

Assume $L$ is regular with pumping length $n_{L}$ .
Choose $z = a^{n_{L}} b^{n_{L}} c^{n_{L}}$ . Clearly, $z \in L$ and $∣ z ∣ \geq n_{L}$ .
By the lemma, $z = uv w$ with $∣ uv ∣ \leq n_{L}$ and $∣ v ∣ \geq 1$ . Because of the length constraint, $v$ must consist entirely of $a$ ‘s.
Let’s “pump” with $i = 2$ . The new word is $u v^{2} w = a^{n_{L} + ∣ v ∣} b^{n_{L}} c^{n_{L}}$ .
Since $∣ v ∣ \geq 1$ , this new word has more $a$ ‘s than $b$ ‘s or $c$ ‘s, so it is not in $L$ . This is a contradiction.
Therefore, $L$ is not regular.

3.5 Nondeterminism

What if we relax the rules of our automata? A Nondeterministic Finite Automaton (NFA) can have multiple possible moves from a given state on a given input symbol.

Definition 3.3: Nondeterministic Finite Automaton (NFA)

An NFA is a 5-tuple $M = (Q, Σ, δ, q_{0}, F)$ , where everything is the same as a DFA except the transition function:

$δ : Q \times (Σ \cup {λ}) \to P (Q)$

Here, $P (Q)$ is the power set of $Q$ . The function maps a state and an input symbol (or $λ$ ) to a set of possible next states.

How an NFA computes

An NFA accepts an input word $w$ if there exists at least one path of transitions from the start state to an accepting state that consumes the word $w$ . The computation can be seen as a tree of possibilities. If any branch of the tree leads to an accepting state, the word is accepted.

The Power of Nondeterminism

It might seem that NFAs are more powerful than DFAs. Surprisingly, they are not.

Satz 3.2: Equivalence of NFAs and DFAs

For every NFA, there exists an equivalent DFA (that accepts the same language).

Proof Idea (Subset Construction)

We can construct a DFA that simulates the NFA. The key idea is that the state of the DFA will correspond to the set of all possible states the NFA could be in at that moment.

States of DFA: The power set of the NFA’s states, $P (Q)$ .
Start State of DFA: The set containing the NFA’s start state, ${q_{0}}$ (and all states reachable from it via $λ$ -transitions).
Transition of DFA: If the NFA is in a set of states $P$ and reads symbol $a$ , the DFA transitions to the state corresponding to the set of all states reachable from any state in $P$ by reading $a$ .
Accepting States of DFA: Any state (which is a set of NFA states) that contains at least one of the NFA’s original accepting states.

The Cost of Determinism

While NFAs and DFAs are equivalent in power, converting an NFA with $n$ states to a DFA can, in the worst case, result in a DFA with $2^{n}$ states. This exponential blow-up is a fundamental theme in complexity theory. Nondeterminism can sometimes offer a much more concise way to describe a language, even if it doesn’t increase the ultimate computational power.

3.6 Summary

Finite Automata are a simple model of computation with finite memory, represented by their states.
They are used to recognize regular languages.
We can prove a language is not regular using the Pumping Lemma, which exploits the finite memory limitation.
Nondeterministic Finite Automata (NFAs) allow for multiple computation paths.
NFAs are computationally equivalent to DFAs, but converting an NFA to a DFA can lead to an exponential increase in the number of states. This highlights a fundamental trade-off between conciseness and determinism.

CS Notes

Explorer

Chapter 3 - Finite Automata

3.1 Aims

3.2 Representations of Finite Automata

A Program-like Representation

Graphical Representation

Formal Definition

3.3 Simulations

Proof Idea (Product Construction)

3.4 Proofs of Non-existence

3.5 Nondeterminism

How an NFA computes

The Power of Nondeterminism

Proof Idea (Subset Construction)

3.6 Summary

Table of Contents

Graph View