Lecture from: 18.12.2024 | Video: Videos ETHZ

Let us briefly recap where we are. We were given a set M of formulas and wanted to see whether a formula F is logically entailed by the set of formulas, meaning whether M ⊨ F. In the extreme case, we can also ask whether F is a tautology, meaning it is logically entailed from no assumptions, i.e., {} ⊨ F (or simply ⊨ F). Note that the aim is to prove things with as few assumptions as possible.

We also introduced a second formalism where we syntactically derive things, denoted M ⊢_K F, in a calculus K. The idea is that syntactic derivation steps should indeed preserve logical entailment (if the premises are true, the derived formula should also be true). This property is called soundness. Conversely, we also want that if something can be logically entailed, there exists a finite sequence of derivation steps to show that we can derive it syntactically. This property is called completeness.

These concepts (soundness and completeness) are generally true for all logical calculi. Last time, we focused on propositional logic and said that instead of dealing with logical connectives like ∧, ∨, ¬, →, etc., we can convert our formula to Conjunctive Normal Form (CNF) and look at it as a clause set in the context of the resolution calculus.
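To make the clause-set view concrete, here is a small sketch (not from the lecture): a common DIMACS-style encoding represents a literal as a nonzero integer (−2 for ¬x₂), a clause as a frozenset of literals, and a CNF formula as a set of clauses. The helper name `clause_true` is my own.

```python
# Hypothetical encoding, not fixed by the lecture: a literal is a nonzero
# integer (-2 means "not x2"), a clause is a frozenset of literals, and a
# CNF formula is a set of clauses.
# (x1 v ¬x2) ∧ (x2 v x3) becomes:
cnf = {frozenset({1, -2}), frozenset({2, 3})}

def clause_true(clause, assignment):
    """A clause is true under an assignment (dict var -> bool) iff at
    least one of its literals is satisfied."""
    return any(assignment[abs(l)] == (l > 0) for l in clause)

a = {1: True, 2: True, 3: False}
print(all(clause_true(k, a) for k in cnf))  # True: a satisfies both clauses
```

A single satisfying assignment like `a` is exactly the short "yes" certificate mentioned above; for "no" answers no such one-line witness is known.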

We also observed that checking the satisfiability of a formula in CNF corresponds to the SAT problem. We are particularly interested in unsatisfiability, as it’s generally a harder problem (for satisfiability, we just need to provide a single interpretation that makes the clause set true).

The current belief is that SAT ∈ NP, but UNSAT ∉ NP (or more precisely, UNSAT is co-NP-complete), meaning there is likely no efficient proof system that can provide short, easily verifiable proofs of unsatisfiability for arbitrary CNF formulas (unless NP = co-NP, which is considered highly unlikely). This means given a clause set, while a “yes” answer to the SAT problem can be verified efficiently, a “no” answer is, in general, believed not to be verifiable in polynomial time.

Resolution Calculus

Let 𝒦 be a clause set. A clause K is entailed by the clause set 𝒦, written 𝒦 ⊨ K, if for any interpretation A that makes all clauses in 𝒦 true (i.e., A ⊨ 𝒦), K must also be true (i.e., A ⊨ K). In other words, if every clause in 𝒦 evaluates to 1 under A, then K must also evaluate to 1 under A.

We also have the property that if K ⊆ K′, then K ⊨ K′. This holds because if A ⊨ K, then at least one literal in K must be true under the interpretation A. Since K′ contains all the literals of K (and possibly more), that same literal will also be present in K′, and therefore A ⊨ K′. This shows that adding arbitrary literals to a clause results in a weaker (or equally strong) clause, as it becomes easier to satisfy. This is referred to as weakening.

If we have 𝒦′ ⊆ 𝒦, then 𝒦 ⊨ 𝒦′. More generally, if for every clause K ∈ 𝒦′ we have 𝒦 ⊨ K, then we write 𝒦 ⊨ 𝒦′. This means that any interpretation that satisfies all clauses in 𝒦 will also satisfy all clauses in 𝒦′. In terms of satisfiability, this also implies that if 𝒦 is satisfiable, then so is 𝒦′. The other way around is not true in general.

Resolution Rule

Recall the resolution rule. If we are given two clauses K₁ and K₂ such that there exists a literal L where L ∈ K₁ and ¬L ∈ K₂, then we can apply the resolution rule. The resolvent of K₁ and K₂ with respect to L is defined as:

res_L(K₁, K₂) = (K₁ \ {L}) ∪ (K₂ \ {¬L})

The resolvent can maximally have a size of |K₁| + |K₂| − 2. We write the application of the resolution rule as K₁, K₂ ⊢_Res res_L(K₁, K₂).

The resolution calculus Res has only this single rule.
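The rule is short enough to state in code. A minimal sketch, assuming the DIMACS-style integer encoding of literals (a clause is a frozenset of nonzero integers, −L for ¬L); the function name `resolvent` is my own:

```python
# A clause is a frozenset of nonzero integers, DIMACS style: -3 means "not x3".

def resolvent(k1, k2, lit):
    """Resolvent of k1 and k2 with respect to lit,
    assuming lit is in k1 and its negation is in k2."""
    assert lit in k1 and -lit in k2
    return (k1 - {lit}) | (k2 - {-lit})

# Resolving {x1, x2} and {-x1, x3} on x1 yields {x2, x3}:
print(resolvent(frozenset({1, 2}), frozenset({-1, 3}), 1))
```

Note that the result indeed has at most |K₁| + |K₂| − 2 literals, since exactly one literal is dropped from each parent clause (and the union may shrink further if literals coincide).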

Lemma 6.5: Soundness

If a clause K can be derived from a clause set 𝒦 using the resolution rule, then K is logically entailed by 𝒦. Formally: 𝒦 ⊢_Res K ⟹ 𝒦 ⊨ K.

This is a lemma, and we need to prove it.

It suffices to show that a single application of the resolution rule preserves entailment. Let K = res_L(K₁, K₂). We want to show that {K₁, K₂} ⊨ K. If this is true, we can extend the argument to any sequence of resolution steps, demonstrating 𝒦 ⊨ K for every derivable clause K.

Let us now justify the first part, i.e., show that {K₁, K₂} ⊨ K.

Let A be an arbitrary interpretation such that A ⊨ K₁ and A ⊨ K₂. Since K = res_L(K₁, K₂), there exists a literal L such that L ∈ K₁ and ¬L ∈ K₂, and K = (K₁ \ {L}) ∪ (K₂ \ {¬L}). We can perform a case distinction on the truth value of L under A:

  • Case 1: A(L) = 0: Since A ⊨ K₁ and A(L) = 0, there must exist another literal L′ ∈ K₁ such that L′ ≠ L and A(L′) = 1. Since L′ ∈ K₁ \ {L} and K₁ \ {L} ⊆ K, it follows that L′ ∈ K. Therefore, A ⊨ K.

  • Case 2: A(L) = 1: Then A(¬L) = 0. Since A ⊨ K₂ and A(¬L) = 0, there must exist another literal L′ ∈ K₂ such that L′ ≠ ¬L and A(L′) = 1. Since L′ ∈ K₂ \ {¬L} and K₂ \ {¬L} ⊆ K, it follows that L′ ∈ K. Therefore, A ⊨ K.

In both cases, A ⊨ K. Thus, under any interpretation A where A ⊨ K₁ and A ⊨ K₂, we also have A ⊨ K. Therefore, {K₁, K₂} ⊨ K. Because 𝒦 ⊨ {K₁, K₂} (the clauses K₁ and K₂ are in 𝒦 or were themselves derived from 𝒦), by transitivity 𝒦 ⊨ K.
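For a concrete clause pair, the entailment {K₁, K₂} ⊨ res_L(K₁, K₂) can also be verified by brute force over all interpretations. A small sketch, assuming the same integer encoding of literals; `satisfies` and `entails` are hypothetical helper names, not lecture notation:

```python
from itertools import product

def satisfies(clause, assignment):
    """A clause is true under an assignment (dict var -> bool) iff
    at least one of its literals is satisfied."""
    return any(assignment[abs(l)] == (l > 0) for l in clause)

def entails(premises, conclusion, variables):
    """{premises} |= conclusion: every interpretation that satisfies
    all premises must also satisfy the conclusion."""
    for values in product([False, True], repeat=len(variables)):
        a = dict(zip(variables, values))
        if all(satisfies(k, a) for k in premises) and not satisfies(conclusion, a):
            return False
    return True

k1, k2 = frozenset({1, 2}), frozenset({-1, 3})  # {x1, x2} and {-x1, x3}
res = frozenset({2, 3})                          # their resolvent on x1
print(entails([k1, k2], res, [1, 2, 3]))         # True, as Lemma 6.5 predicts
```

Note that `entails([k1], res, [1, 2, 3])` is `False`: the resolvent is only entailed by both parent clauses together, which is exactly what the case distinction above uses.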

Theorem 6.6: Completeness of Resolution

We want to show that if a clause set is unsatisfiable, then we can derive the empty clause ∅ using resolution.

Let 𝒦 be a clause set. Then 𝒦 is unsatisfiable if and only if 𝒦 ⊢_Res ∅. (Recall that ∅ denotes the empty clause.)

Proof:

"⇐": (Soundness, already proven in Lemma 6.5, but we recap the argument for clarity.) Assume 𝒦 ⊢_Res ∅. From Lemma 6.5 (soundness of the resolution rule), we know that 𝒦 ⊢_Res ∅ implies 𝒦 ⊨ ∅. The empty clause ∅ is unsatisfiable (vacuously false), since there exists no interpretation that can make it true. Since 𝒦 entails an unsatisfiable clause, 𝒦 must also be unsatisfiable.

"⇒": (Completeness, this is what we show by induction.) Assume 𝒦 is unsatisfiable. We want to prove that 𝒦 ⊢_Res ∅, using induction on the number of distinct propositional variables (atoms) appearing in 𝒦.

  • Base Case (Induction Base): Let n be the number of distinct propositional variables in 𝒦.

    • If n = 0, then 𝒦 is either ∅ (the empty set of clauses, which is satisfiable) or {∅} (contains the empty clause). Since we assumed 𝒦 is unsatisfiable, 𝒦 must be {∅}, and trivially 𝒦 ⊢_Res ∅.

    • If n = 1, 𝒦 contains only one propositional variable, say A. The possible clauses are {A}, {¬A}, and {A, ¬A}. The only unsatisfiable sets of clauses built from those are the sets containing both {A} and {¬A}, and any set containing ∅. If ∅ ∈ 𝒦, then we are already done. Otherwise, 𝒦 must contain both {A} and {¬A} (the clause {A, ¬A} is a tautology and cannot cause unsatisfiability). Applying resolution to {A} and {¬A} yields ∅. Thus, 𝒦 ⊢_Res ∅.

  • Induction Hypothesis (I.H.): Assume that for any unsatisfiable clause set 𝒦′ containing at most n distinct propositional variables, 𝒦′ ⊢_Res ∅.

  • Inductive Step: Let 𝒦 be an unsatisfiable clause set containing n + 1 distinct propositional variables. Let A be one of these variables. We construct two new clause sets, 𝒦₀ and 𝒦₁, as follows:

    • 𝒦₀: From 𝒦, remove all clauses containing ¬A, and remove A from the remaining clauses (this corresponds to setting A = 0).
    • 𝒦₁: From 𝒦, remove all clauses containing A, and remove ¬A from the remaining clauses (this corresponds to setting A = 1).

    Formally, 𝒦₀ = {K \ {A} : K ∈ 𝒦, ¬A ∉ K} and 𝒦₁ = {K \ {¬A} : K ∈ 𝒦, A ∉ K}.

    Since 𝒦 is unsatisfiable, both 𝒦₀ and 𝒦₁ are also unsatisfiable. (If 𝒦₀ or 𝒦₁ were satisfiable, we could extend that satisfying interpretation to satisfy 𝒦 by assigning the appropriate truth value to A: A = 0 for 𝒦₀, and A = 1 for 𝒦₁.) Furthermore, 𝒦₀ and 𝒦₁ each contain at most n distinct propositional variables (since we’ve eliminated A).

    By the induction hypothesis, 𝒦₀ ⊢_Res ∅ and 𝒦₁ ⊢_Res ∅.

    Now, we want to show that 𝒦 ⊢_Res {A} or 𝒦 ⊢_Res ∅. Since 𝒦₀ ⊢_Res ∅, there exists a sequence of resolution steps starting from clauses in 𝒦₀ that derives ∅. We can “lift” this derivation to a derivation from 𝒦: each initial clause K′ ∈ 𝒦₀ stems from a clause K ∈ 𝒦 with K′ = K \ {A}; replace K′ by K and carry the extra literal A through the same sequence of resolution steps. The same steps then derive {A} (instead of ∅) from clauses in 𝒦, or ∅ itself if no initial clause actually contained A, in which case we are done.

    Similarly, since 𝒦₁ ⊢_Res ∅, we can lift a derivation of ∅ from 𝒦₁ to a derivation of {¬A} (or ∅) from clauses in 𝒦.

    Finally, we can resolve {A} and {¬A} (which we derived from 𝒦) to obtain ∅. Therefore, 𝒦 ⊢_Res ∅.

This completes the proof by induction.
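Theorem 6.6 suggests a naive (worst-case exponential) decision procedure: saturate the clause set under resolution and check whether ∅ appears. A sketch under the same integer encoding of literals used earlier; the function name `derives_empty` is my own, and this is not how practical SAT solvers work:

```python
from itertools import combinations

def derives_empty(clauses):
    """Saturate a clause set under resolution; by Theorem 6.6 the empty
    clause frozenset() is derivable iff the set is unsatisfiable.
    Terminates because only finitely many clauses exist over the
    (finite) set of literals occurring in the input."""
    known = set(clauses)
    while True:
        new = set()
        for k1, k2 in combinations(known, 2):
            for lit in k1:
                if -lit in k2:  # complementary pair: form the resolvent
                    r = (k1 - {lit}) | (k2 - {-lit})
                    if r not in known:
                        new.add(r)
        if frozenset() in new:
            return True
        if not new:  # saturated: nothing further is derivable
            return frozenset() in known
        known |= new

# {x1}, {-x1, x2}, {-x2} is unsatisfiable, so the empty clause is derivable:
unsat = [frozenset({1}), frozenset({-1, 2}), frozenset({-2})]
print(derives_empty(unsat))  # True
```

On the example, resolving {x1} with {-x1, x2} gives {x2}, which resolves with {-x2} to ∅, mirroring the derivation the proof constructs.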

3-SAT

The 3-SAT problem is a special case of the SAT problem where each clause contains exactly three literals. A formula in 3-CNF is a conjunction of clauses, where each clause is a disjunction of three literals.

Surprisingly, 3-SAT is also NP-complete. This means that although it appears more restricted than the general SAT problem (where clauses can have any number of literals), it is equally difficult in terms of computational complexity. In other words, there is no known polynomial-time algorithm for 3-SAT (unless P = NP), and any algorithm that can solve 3-SAT in polynomial time could be adapted to solve any other problem in NP in polynomial time. This equivalence in difficulty is established by showing that any SAT instance (with arbitrary clause lengths) can be converted into a 3-SAT instance in polynomial time, such that the 3-SAT instance is satisfiable if and only if the original SAT instance is satisfiable.
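The clause-splitting step of this reduction can be sketched as follows: a clause of length k > 3 is chained into k − 2 clauses of length 3 using fresh auxiliary variables, and shorter clauses are padded. This is a standard construction, but the function name, the padding-by-repetition convention, and the tuple encoding (integer literals, −2 for ¬x₂) are illustrative choices of mine; input clauses are assumed nonempty.

```python
def to_3cnf(clauses, next_aux):
    """Equisatisfiable 3-CNF: pad short clauses, chain long ones with
    fresh auxiliary variables numbered from next_aux upward.
    Clauses are nonempty tuples/lists of integer literals."""
    out = []
    for clause in clauses:
        lits = list(clause)
        if len(lits) <= 3:
            while len(lits) < 3:      # pad by repeating a literal
                lits.append(lits[0])
            out.append(tuple(lits))
        else:
            # (l1 v l2 v y1), (-y1 v l3 v y2), ..., (-y_m v l_{k-1} v l_k)
            out.append((lits[0], lits[1], next_aux))
            for l in lits[2:-2]:
                out.append((-next_aux, l, next_aux + 1))
                next_aux += 1
            out.append((-next_aux, lits[-2], lits[-1]))
            next_aux += 1
    return out

# A 5-literal clause over x1..x5 becomes 3 clauses using fresh x6, x7:
print(to_3cnf([(1, 2, 3, 4, 5)], next_aux=6))
```

The transformation runs in polynomial time, and a satisfying assignment for the original clause forces a consistent setting of the chain variables, which is the core of the equisatisfiability argument.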

NP-Completeness

A problem is NP-complete if it satisfies two conditions:

  1. It is in NP: A problem is in the class NP (Nondeterministic Polynomial time) if a proposed solution (“yes” instance) can be verified in polynomial time. This means that given a potential solution, we can check whether it’s correct in a time that grows at most polynomially with the size of the input. For example, for SAT, given an assignment of truth values to variables, we can easily check in polynomial time if that assignment satisfies all clauses.

  2. It is NP-hard: A problem is NP-hard if every problem in NP can be reduced to it in polynomial time. This means that for any problem in NP, we can find a polynomial-time algorithm that transforms an instance of that problem into an instance of the NP-hard problem, such that the answer (“yes” or “no”) is preserved. In other words, the NP-hard problem is at least as difficult as any other problem in NP.

A problem that satisfies both of these conditions is called NP-complete. NP-complete problems are, in a sense, the “hardest” problems in NP. If we could find a polynomial-time algorithm for any NP-complete problem, we would have proven that P = NP, which would imply that all problems in NP have polynomial-time solutions.
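For SAT, the polynomial-time verification from condition 1 is simply evaluating the CNF under the proposed assignment, in time linear in the total number of literals. A minimal sketch, assuming the integer-literal clause encoding from earlier; `verify_sat_certificate` is a hypothetical name:

```python
def verify_sat_certificate(clauses, assignment):
    """Check in time proportional to the total number of literals that
    the proposed assignment (dict var -> bool) satisfies every clause."""
    return all(
        any(assignment[abs(l)] == (l > 0) for l in clause)
        for clause in clauses
    )

cnf = [frozenset({1, 2}), frozenset({-1, 3})]  # (x1 v x2) ∧ (-x1 v x3)
print(verify_sat_certificate(cnf, {1: True, 2: False, 3: True}))  # True
```

No analogous short certificate is known for UNSAT instances, which is exactly the SAT/UNSAT asymmetry discussed at the start of the lecture.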

This was the last lecture…