Lecture from: 08.04.2025 | Video: Homelab
Let’s continue our discussion on randomized algorithms, focusing on how we define and analyze them, especially the common types: Las Vegas and Monte Carlo algorithms. We’ll also see how to improve their reliability by repetition.
Classical vs. Randomized Algorithms
Classical (Deterministic) Algorithms
Think of a standard algorithm $A$ you’ve learned, like MergeSort or Dijkstra’s.
- Input: It takes an input $x$.
- Process: It performs a fixed sequence of operations based on $x$.
- Output: It produces an output $A(x)$.
Key characteristics of a classical algorithm:
- Correctness: The output $A(x)$ is always precise and correct for a given input $x$.
- Consistency: For the same input $x$, the output $A(x)$ is always the same. (The algorithm behaves like a mathematical function.)
- Deterministic Runtime: For the same input $x$, the runtime of algorithm $A$ is always the same.
- Reproducibility: The behavior of the algorithm is entirely reproducible. Give it the same input, and you get the same steps, same output, same runtime.
Randomized Algorithms
Randomized algorithms introduce an element of chance into the process.
- Input: They take an input $x$.
- Random Source: They also have access to a source of randomness, let’s call it $R$. This could be a sequence of random bits, random numbers drawn from a distribution, etc.
- Process: The algorithm’s operations can depend on both $x$ and the random values drawn from $R$.
- Output: The output is $A(x, R)$.
Consequences of using randomness:
- The output now depends on both the input $x$ and the specific random choices made during execution.
- This means the output might be:
- Sometimes correct.
- Sometimes almost correct (e.g., a close approximation).
- Sometimes fast (the runtime itself can be a random variable).
- Non-Reproducibility: Generally, if you run the algorithm twice with the same input $x$ but with different random choices from $R$ (which is typical unless you fix the random seed), you might get different outputs or different runtimes. However, the algorithm is still a deterministic function if you consider the pair $(x, R)$ as its combined input.
The Random Source
Where does this randomness come from?
- The random source $R$ provides random, independent bits or numbers, drawn according to some specified (often uniform) distribution.
Possible sources:
- Physical Random Number Generators: These leverage inherently random physical processes.
- Examples: Lotto numbers, Geiger counters (radioactive decay), thermal noise in circuits, quantum phenomena. These are good sources of “true” randomness but can be slow or impractical for direct use in algorithms.
- Deterministic (Pseudo-)Random Number Generators (PRNGs): These are algorithms that produce a sequence of numbers that “look” random and pass many statistical tests for randomness, but are actually generated deterministically from an initial seed value.
- Given a seed $s$, a PRNG deterministically produces a sequence of pseudo-random numbers $r_1, r_2, r_3, \ldots$.
- Advantage: If you know the seed, you can reproduce the exact same sequence of “random” numbers. This is invaluable for debugging randomized algorithms.
- Caveat: PRNGs are not truly random. A sophisticated adversary who knows the PRNG algorithm and can observe enough output might predict future “random” numbers. The quality of PRNGs varies widely.
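To illustrate the reproducibility advantage, here is a minimal Python sketch using the standard `random` module (a Mersenne Twister PRNG); `randomized_pick` is just a hypothetical stand-in for any routine that consumes randomness through an explicit generator object.

```python
import random

# Hypothetical randomized routine: it consumes randomness only via `rng`.
def randomized_pick(rng, items):
    return rng.choice(items)

# Fixing the seed makes every run reproducible -- handy for debugging.
rng1 = random.Random(42)   # seeded PRNG (Mersenne Twister)
rng2 = random.Random(42)   # same seed => identical "random" sequence
print([randomized_pick(rng1, "abcdef") for _ in range(5)])
print([randomized_pick(rng2, "abcdef") for _ in range(5)])  # identical output

# An unseeded PRNG (seeded from OS entropy/time) typically differs per run.
rng3 = random.Random()
print([randomized_pick(rng3, "abcdef") for _ in range(5)])
```

Passing the generator explicitly (rather than using the global `random` state) makes it easy to swap the seed, or the randomness source, without touching the algorithm itself.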
Our Assumption in Analysis
For theoretical analysis, we typically assume access to an ideal random source that provides perfectly random, independent bits/numbers as needed.
In practice, we use high-quality PRNGs. The potential discrepancy between theory (ideal randomness) and practice (pseudo-randomness) is something to be aware of, though often PRNGs are good enough for most applications.
Monte Carlo vs. Las Vegas Algorithms
Randomized algorithms are broadly categorized based on how randomness affects their correctness and runtime. The two main types are named after famous gambling locations: Monte Carlo and Las Vegas.
Monte Carlo Algorithms
- Correctness/Quality: The correctness or quality of the output is a random variable. The algorithm might produce an incorrect answer or an answer of varying quality with some probability.
- Runtime: Typically, the runtime of a Monte Carlo algorithm does not depend on the random choices (or is bounded deterministically). It usually runs for a predetermined number of steps.
- Goal:
- Always (or predictably) fast.
- Mostly correct or provides a good quality answer with high probability.
Las Vegas Algorithms
- Runtime: The runtime is a random variable. It might finish quickly on some random choices, and slowly on others.
- Correctness/Quality: The correctness or quality of the output does not depend on the random choices. If the algorithm produces an answer, that answer is guaranteed to be correct.
- Goal:
- Always correct/good (when an answer is given).
- Mostly fast (i.e., the expected runtime is good, or it finishes quickly with high probability).
Alternative View of Las Vegas Algorithms
Sometimes, a Las Vegas algorithm is designed such that it might explicitly output ”???” (or “I don’t know”) instead of a correct answer if it gets “unlucky” with its random choices within a certain time bound. The guarantee is that if it does give an answer other than ”???”, that answer is correct.
This leads to two common operational modes for Las Vegas algorithms:
- Repeat until an answer: If the algorithm can output ”???”, you simply run it repeatedly until it gives a definite (and thus correct) answer. The analysis then focuses on the expected number of repetitions.
- Abort after fixed time: Run the algorithm for a predetermined maximum time. If it produces an answer within this time, great. If not, it aborts and outputs ”???“.
A Las Vegas algorithm whose runtime is a random variable can be converted into one that sometimes outputs ’???’ by imposing a runtime cutoff (e.g., based on Markov’s inequality applied to its runtime).
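The two operational modes can be sketched in a few lines of Python. Here `las_vegas_try` is a hypothetical stand-in for one run of a Las Vegas algorithm: with probability $p$ it returns a (guaranteed correct) answer, otherwise it returns `None`, playing the role of “???”.

```python
import random

# Hypothetical single run of a Las Vegas algorithm: with probability p it
# returns a (guaranteed correct) answer, otherwise None (i.e., "???").
def las_vegas_try(rng, p=0.25):
    return "answer" if rng.random() < p else None

# Mode 1: repeat until a definite answer appears (runtime is a random variable).
def repeat_until_answer(rng):
    while True:
        result = las_vegas_try(rng)
        if result is not None:
            return result

# Mode 2: abort after a fixed number of attempts; may output "???".
def abort_after(rng, max_tries):
    for _ in range(max_tries):
        result = las_vegas_try(rng)
        if result is not None:
            return result
    return "???"

rng = random.Random(0)
print(repeat_until_answer(rng))
print(abort_after(rng, max_tries=3))
```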
Reducing Error Probability
One of the powerful features of randomized algorithms is that we can often decrease the probability of an undesirable outcome (like getting no answer, or getting a wrong answer) by simply running the algorithm multiple times.
Reducing “Failure to Answer” for Las Vegas Algorithms
Recall that a Las Vegas algorithm is always correct if it provides an answer, but it might sometimes fail to do so, perhaps by outputting a special symbol like ”???” or by exceeding a time limit.
Goal
Suppose we have a Las Vegas algorithm $A$. For any input $x$, it gives a correct answer with probability at least $p > 0$. This means the probability of it outputting "???" is at most $1 - p$. Our goal is to construct a new algorithm, $A^*$, that gives a correct answer with an even higher probability, specifically at least $1 - \varepsilon$, where $\varepsilon > 0$ is some small target failure probability.
Idea: Repetition
The natural approach is to run the original algorithm multiple times. If any of these runs gives a definite answer, we can use it. But how many repetitions are enough?
Theorem: Amplifying Success for Las Vegas Algorithms
Let $A$ be a randomized algorithm that never gives a false answer. However, it might sometimes output "???" (indicating no answer found). Let $\Pr[A(x) \neq \text{???}] \geq p > 0$ for every input $x$.
For any desired overall failure probability $\varepsilon > 0$, set $N := \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$:
Construct a new algorithm $A^*$ as follows:
- Repeatedly call $A(x)$.
- If any call to $A$ returns a value different from "???", then $A^*$ immediately stops and returns that value.
- If $A$ has been called $N$ times and all calls have resulted in "???", then $A^*$ stops and outputs "???".
Then, the probability that $A^*$ provides a correct answer for input $x$ is at least $1 - \varepsilon$. That is, $\Pr[A^*(x) \neq \text{???}] \geq 1 - \varepsilon$.
Proof
The algorithm $A^*$ fails to provide a correct answer if and only if every one of its $N$ independent calls to $A$ results in "???".
The probability that a single call to $A$ results in "???" is at most $1 - p$.
Since the calls are independent, the probability that all $N$ calls fail is at most $(1 - p)^N$.
We want this failure probability to be at most $\varepsilon$. So we need to find $N$ such that $(1 - p)^N \leq \varepsilon$.
We know that for any real number $y$, $1 + y \leq e^y$. Applying this with $y = -p$: $(1 - p)^N \leq e^{-pN}$.
So, it suffices to have $e^{-pN} \leq \varepsilon$.
Taking the natural logarithm of both sides (which is a monotonically increasing function, so it preserves the inequality): $-pN \leq \ln \varepsilon$.
Multiplying by $-\frac{1}{p}$ (and reversing the inequality sign): $N \geq \frac{1}{p} \ln \frac{1}{\varepsilon}$.
By choosing $N := \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$, we satisfy this condition.
Therefore, $\Pr[A^*(x) = \text{???}] \leq (1 - p)^N \leq e^{-pN} \leq \varepsilon$.
The probability that $A^*$ provides a correct answer is therefore at least $1 - \varepsilon$. This completes the proof.
Example: Number of Iterations
Let’s see how many iterations are needed for some practical values. Suppose our base Las Vegas algorithm has a success probability $p = \frac{1}{4}$ (it finds an answer 1 out of 4 times on average). Then $N = \left\lceil 4 \ln \frac{1}{\varepsilon} \right\rceil$.
Target Failure Prob. ($\varepsilon$) | Min. Iterations ($N$) | Success Prob. ($1 - \varepsilon$)
---|---|---
0.1 | 10 | 0.9
0.01 | 19 | 0.99
0.001 | 28 | 0.999
0.0001 | 37 | 0.9999
0.00001 | 47 | 0.99999
0.000001 | 56 | 0.999999
Key Observation: To decrease the failure probability by a constant factor (e.g., from $\varepsilon$ to $\varepsilon / 10$), the number of required iterations increases only by an additive constant amount (here $4 \ln 10 \approx 9.2$, i.e., about 9-10 extra iterations). This makes boosting the success probability of Las Vegas algorithms very efficient.
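As a quick sanity check, the table values can be reproduced with a short Python computation of $N = \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$, assuming $p = 1/4$ as in the example above:

```python
import math

def required_iterations(p, eps):
    # N = ceil((1/p) * ln(1/eps)) repetitions push the probability of
    # ending with "???" below eps (theorem above).
    return math.ceil((1.0 / p) * math.log(1.0 / eps))

p = 0.25  # base success probability from the example
for eps in [0.1, 0.01, 0.001, 0.0001, 0.00001, 0.000001]:
    print(f"eps = {eps:g}: N = {required_iterations(p, eps)}")
```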
Reducing Error for Monte Carlo Algorithms
For Monte Carlo algorithms, which may produce incorrect answers, reducing the error probability by simple repetition isn’t always straightforward.
General Case Limitation:
If a Monte Carlo algorithm is no better than random guessing (e.g., it flips a coin to decide between “JA” and “NEIN”, and its probability of being correct is exactly $\frac{1}{2}$), then simply repeating it and, say, taking a majority vote won’t improve things. The error probability remains $\frac{1}{2}$.
Error reduction for Monte Carlo algorithms is generally possible under two main conditions:
- The algorithm exhibits one-sided error.
- The algorithm’s probability of being correct is strictly greater than $\frac{1}{2}$ (i.e., it has some “edge” over random guessing).
Monte Carlo with One-Sided Error
This is common in decision problems where an error can only occur for one type of instance (e.g., it might falsely identify a “NO” instance as “YES”, but never a “YES” instance as “NO”).
Definition (One-Sided Error Example)
An algorithm $A$ has one-sided error if, for a decision problem (JA/NEIN):
- If the input $x$ is a JA-instance: $\Pr[A(x) = \text{JA}] = 1$. (It’s always correct.)
- If the input $x$ is a NEIN-instance: $\Pr[A(x) = \text{NEIN}] \geq p$. (It’s correct with probability at least $p$.) This implies it erroneously outputs JA with probability at most $1 - p$.
(The roles of JA and NEIN can be swapped depending on the specific algorithm).
Theorem: Error Reduction for One-Sided Monte Carlo
Let $A$ be a Monte Carlo algorithm with one-sided error as defined above: $\Pr[A(x) = \text{JA}] = 1$ for every JA-instance $x$, and $\Pr[A(x) = \text{NEIN}] \geq p$ for every NEIN-instance $x$.
For any desired overall error probability $\varepsilon > 0$, set $N := \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$:
Construct $A^*$ as follows:
- Repeatedly call $A(x)$.
- If any call to $A$ returns NEIN, then $A^*$ immediately stops and returns NEIN.
- If $A$ has been called $N$ times and all calls have resulted in JA, then $A^*$ stops and outputs JA.
Then, the probability that $A^*$ gives the correct answer for input $x$ is at least $1 - \varepsilon$. That is, $\Pr[A^*(x) \text{ is correct}] \geq 1 - \varepsilon$.
Proof
We analyze the two cases for the true nature of input $x$:
- Case 1: $x$ is a JA-instance. According to the properties of $A$, every call to $A$ on a JA-instance will output JA. Therefore, $A^*$ will see $N$ consecutive JAs and output JA (the third rule); it cannot stop earlier with NEIN, since that is impossible for JA-instances. So, $A^*$ correctly outputs JA: $\Pr[A^*(x) = \text{JA}] = 1$.
- Case 2: $x$ is a NEIN-instance. The algorithm $A^*$ makes an error if it outputs JA when $x$ is a NEIN-instance. This happens if and only if all $N$ independent calls to $A$ output JA. For a NEIN-instance, the probability that a single call to $A$ outputs JA (an error) is at most $1 - p$. So, $\Pr[A^*(x) = \text{JA}] \leq (1 - p)^N$. Using the same reasoning as in the Las Vegas proof, the choice $N = \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$ gives $(1 - p)^N \leq e^{-pN} \leq \varepsilon$. Thus, $\Pr[A^*(x) = \text{NEIN}] \geq 1 - \varepsilon$.
Since $A^*$ is correct with probability 1 for JA-instances and with probability at least $1 - \varepsilon$ for NEIN-instances, its overall probability of being correct is at least $1 - \varepsilon$.
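A minimal Python sketch of this amplification, with a hypothetical one-sided test standing in for $A$ (on JA-instances it always answers JA; on NEIN-instances it answers NEIN with probability exactly $p$):

```python
import math
import random

# Hypothetical one-sided Monte Carlo test: on a JA-instance it always says JA;
# on a NEIN-instance it says NEIN with probability p (and wrongly JA otherwise).
def monte_carlo_one_sided(rng, is_yes_instance, p=0.25):
    if is_yes_instance:
        return "JA"
    return "NEIN" if rng.random() < p else "JA"

def amplified(rng, is_yes_instance, p, eps):
    n = math.ceil((1.0 / p) * math.log(1.0 / eps))
    for _ in range(n):
        if monte_carlo_one_sided(rng, is_yes_instance, p) == "NEIN":
            return "NEIN"   # a NEIN answer is always trustworthy
    return "JA"             # N JAs in a row => error probability <= eps

rng = random.Random(1)
print(amplified(rng, is_yes_instance=False, p=0.25, eps=0.001))  # "NEIN" w.h.p.
```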
Monte Carlo with Two-Sided Error (Correctness Probability $> \frac{1}{2}$)
Now, consider a Monte Carlo algorithm that always gives a JA or NEIN answer, but it can be wrong in either direction. However, it’s better than a random guess: $\Pr[A(x) \text{ is correct}] \geq \frac{1}{2} + \delta$ for some $\delta > 0$.
Theorem: Majority Vote for Two-Sided Error Amplification
Let $A$ be a Monte Carlo algorithm such that $\Pr[A(x) \text{ is correct}] \geq \frac{1}{2} + \delta$ for some $\delta > 0$. For any desired overall error probability $\varepsilon > 0$, set $N := \left\lceil \frac{4}{\delta^2} \ln \frac{1}{\varepsilon} \right\rceil$.
Construct $A^*$ as follows:
- Call $A(x)$ a total of $N$ times independently.
- $A^*$ outputs the answer (JA or NEIN) that occurred most frequently among the $N$ trials (the majority vote). If there’s a tie, it can break it arbitrarily, e.g., output JA.
Then, $\Pr[A^*(x) \text{ is correct}] \geq 1 - \varepsilon$.
Proof
We analyze a Monte Carlo algorithm $A$ that returns the correct answer with probability at least $\frac{1}{2} + \delta$ for some known $\delta > 0$. We define a new algorithm $A^*$ which runs $A$ independently $N$ times and returns the majority answer.
We show that: $\Pr[A^*(x) \text{ is correct}] \geq 1 - \varepsilon$ whenever $N \geq \frac{4}{\delta^2} \ln \frac{1}{\varepsilon}$.
Setup
Let:
- $X_i := 1$ if the $i$-th run of $A$ returns the correct answer, and $X_i := 0$ otherwise,
- the $X_i$ be independent, with $\Pr[X_i = 1] \geq \frac{1}{2} + \delta$,
- $X := \sum_{i=1}^{N} X_i$, i.e., let $X$ be the number of correct answers in $N$ independent runs of $A$.
$A^*$ answers incorrectly only if at most half of the runs are correct. So
- $\Pr[A^*(x) \text{ wrong}] \leq \Pr\left[X \leq \frac{N}{2}\right]$.
We want to bound: $\Pr\left[X \leq \frac{N}{2}\right] \leq \varepsilon$.
Step 1: Expected Value of X
The expected number of correct runs is: $\mathbb{E}[X] = \sum_{i=1}^{N} \Pr[X_i = 1] \geq \left(\frac{1}{2} + \delta\right) N$.
We now show that: $\frac{N}{2} \leq (1 - \delta)\,\mathbb{E}[X]$.  (★)
Step 2: Verifying the Chernoff Condition
We expand the right-hand side of (★): $(1 - \delta)\,\mathbb{E}[X] \geq (1 - \delta)\left(\frac{1}{2} + \delta\right) N = \left(\frac{1}{2} + \frac{\delta}{2} - \delta^2\right) N$.
So: (★) holds as soon as $\frac{\delta}{2} - \delta^2 \geq 0$, i.e., $\delta \leq \frac{1}{2}$.
This holds for all $\delta \in \left(0, \frac{1}{2}\right]$, so (★) is valid.
Step 3: Applying Chernoff Bound
Using the Chernoff bound for the lower tail: $\Pr\big[X \leq (1 - \delta)\,\mathbb{E}[X]\big] \leq e^{-\delta^2 \mathbb{E}[X] / 2}$.
By (★), we have: $\Pr\left[X \leq \frac{N}{2}\right] \leq \Pr\big[X \leq (1 - \delta)\,\mathbb{E}[X]\big] \leq e^{-\delta^2 \mathbb{E}[X] / 2}$.
Step 4: Bounding the Exponent
We use: $\mathbb{E}[X] \geq \left(\frac{1}{2} + \delta\right) N \geq \frac{N}{2}$.
So: $e^{-\delta^2 \mathbb{E}[X] / 2} \leq e^{-\delta^2 N / 4}$.
We want this to be at most $\varepsilon$: $e^{-\delta^2 N / 4} \leq \varepsilon \iff N \geq \frac{4}{\delta^2} \ln \frac{1}{\varepsilon}$.
Thus, choosing: $N := \left\lceil \frac{4}{\delta^2} \ln \frac{1}{\varepsilon} \right\rceil$
ensures: $\Pr\left[X \leq \frac{N}{2}\right] \leq \varepsilon$.
Final Conclusion
With this value of $N$, the amplified algorithm satisfies: $\Pr[A^*(x) \text{ is correct}] \geq 1 - \varepsilon$.
This shows that any Monte Carlo algorithm with correctness probability at least $\frac{1}{2} + \delta$ can be boosted to failure probability at most $\varepsilon$ using $N = \left\lceil \frac{4}{\delta^2} \ln \frac{1}{\varepsilon} \right\rceil$ independent repetitions and majority vote.
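The majority-vote construction in code, again with a hypothetical two-sided test standing in for $A$ (correct with probability exactly $\frac{1}{2} + \delta$); this sketch breaks ties toward `False`, whereas the theorem allows ties to be broken arbitrarily:

```python
import math
import random

# Hypothetical two-sided Monte Carlo test: correct with probability 1/2 + delta.
def monte_carlo_two_sided(rng, truth, delta=0.1):
    return truth if rng.random() < 0.5 + delta else not truth

def majority_vote(rng, truth, delta, eps):
    # N = ceil((4 / delta^2) * ln(1 / eps)) repetitions suffice (theorem above).
    n = math.ceil((4.0 / delta**2) * math.log(1.0 / eps))
    yes_votes = sum(monte_carlo_two_sided(rng, truth, delta) for _ in range(n))
    return yes_votes > n / 2  # majority of the n runs; ties count as False

rng = random.Random(7)
print(majority_vote(rng, truth=True, delta=0.1, eps=0.01))   # True w.h.p.
```

Note how the number of repetitions grows like $1/\delta^2$: a small edge over random guessing suffices, but the smaller the edge, the more repetitions the majority vote needs.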
Randomized Algorithms for Optimization Problems
Randomized algorithms can also be applied to optimization problems (e.g., finding a maximum clique, minimum spanning tree, etc.).
Typical Scenario:
- The algorithm always produces a feasible (valid) solution.
- The quality of this solution is a random variable; it’s not necessarily the optimal one.
Suppose for a maximization problem, we want a solution with value at least $q(x)$ (our target quality for input $x$). Assume our base randomized algorithm $A$ achieves this target quality with probability at least $p > 0$: $\Pr[\mathrm{val}(A(x)) \geq q(x)] \geq p$, where $\mathrm{val}(\cdot)$ denotes the value of the returned solution.
Goal: Design an algorithm $A^*$ that achieves quality $q(x)$ with a higher probability, at least $1 - \varepsilon$.
Theorem: Amplification for Optimization by Repetition
Let $A$ be a randomized algorithm for an optimization problem (assume maximization without loss of generality). Suppose $\Pr[\mathrm{val}(A(x)) \geq q(x)] \geq p > 0$.
For any desired success probability $1 - \varepsilon$ (where $\varepsilon > 0$ is the failure probability), set $N := \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$:
Construct $A^*$ as follows:
- Call the original algorithm $A(x)$ a total of $N$ times independently.
- $A^*$ outputs the solution that has the best (highest) value among all $N$ solutions obtained.
Then, $\Pr[\mathrm{val}(A^*(x)) \geq q(x)] \geq 1 - \varepsilon$.
Proof
The algorithm $A^*$ fails to achieve a solution of quality at least $q(x)$ if and only if every one of the $N$ independent trials of $A$ produces a solution with value less than $q(x)$.
The probability that a single trial of $A$ fails to meet the target quality is at most $1 - p$.
Since the trials are independent, the probability that all $N$ trials fail is at most $(1 - p)^N$.
This is the same mathematical situation as in the Las Vegas amplification. With $N = \left\lceil \frac{1}{p} \ln \frac{1}{\varepsilon} \right\rceil$, we ensure that $(1 - p)^N \leq e^{-pN} \leq \varepsilon$.
Therefore, the probability that $A^*$ succeeds (i.e., at least one of the $N$ trials achieves the target quality, and thus the best of them does) is at least $1 - \varepsilon$.
This “repeat and take best” strategy is a common and effective way to boost the performance of randomized optimization algorithms.
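A minimal Python sketch of “repeat and take best”, with a hypothetical randomized heuristic whose solution value is simply a uniform random number (so that “reaching the target” has an easily controlled probability):

```python
import math
import random

# Hypothetical randomized heuristic for a maximization problem: it returns the
# value of a feasible solution; here that value is just uniform on [0, 100].
def randomized_heuristic(rng):
    return rng.uniform(0, 100)

def repeat_and_take_best(rng, p, eps):
    # If a single run reaches the target value with probability >= p, then
    # N = ceil((1/p) * ln(1/eps)) runs reach it with probability >= 1 - eps.
    n = math.ceil((1.0 / p) * math.log(1.0 / eps))
    return max(randomized_heuristic(rng) for _ in range(n))

rng = random.Random(3)
# Example: a single run hits value >= 90 with probability p = 0.1.
print(repeat_and_take_best(rng, p=0.1, eps=0.001))  # >= 90 with prob. >= 0.999
```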