In
Boolean algebra
In mathematics and mathematical logic, Boolean algebra is a branch of algebra. It differs from elementary algebra in two ways. First, the values of the variable (mathematics), variables are the truth values ''true'' and ''false'', usually denot ...
, a
formula
In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a ''chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betwe ...
is in conjunctive normal form (CNF) or clausal normal form if it is a
conjunction of one or more
clauses
In language, a clause is a Constituent (linguistics), constituent or Phrase (grammar), phrase that comprises a semantic predicand (expressed or not) and a semantic Predicate (grammar), predicate. A typical clause consists of a subject (grammar), ...
, where a clause is a
disjunction
In logic, disjunction (also known as logical disjunction, logical or, logical addition, or inclusive disjunction) is a logical connective typically notated as \lor and read aloud as "or". For instance, the English language sentence "it is ...
of
literals; otherwise put, it is a product of sums or an AND of ORs.
In automated theorem proving, the notion "''clausal normal form''" is often used in a narrower sense, meaning a particular representation of a CNF formula as a set of sets of literals.
Definition
A logical formula is considered to be in CNF if it is a
conjunction of one or more
disjunctions of one or more
literals. As in
disjunctive normal form
In boolean logic, a disjunctive normal form (DNF) is a canonical normal form of a logical formula consisting of a disjunction of conjunctions; it can also be described as an OR of ANDs, a sum of products, or in philosophical logic a ''cluster c ...
(DNF), the only propositional operators in CNF are
or (
),
and (
), and
not (
). The ''not'' operator can only be used as part of a literal, which means that it can only precede a
propositional variable
In mathematical logic, a propositional variable (also called a sentence letter, sentential variable, or sentential letter) is an input variable (that can either be true or false) of a truth function. Propositional variables are the basic building ...
.
The following is a
context-free grammar
In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules
can be applied to a nonterminal symbol regardless of its context.
In particular, in a context-free grammar, each production rule is of the fo ...
for CNF:
: ''CNF''
''Disjunct''
''Disjunct''
''CNF''
: ''Disjunct''
''Literal''
''Literal''
''Disjunct''
: ''Literal''
''Variable''
''Variable''
Where ''Variable'' is any variable.
All of the following formulas in the variables
and
are in conjunctive normal form:
*
*
*
*
The following formulas are not in conjunctive normal form:
*
, since an AND is nested within a NOT
*
, since an OR is nested within a NOT
*
, since an AND is nested within an OR
Conversion to CNF
In
classical logic
Classical logic (or standard logic) or Frege–Russell logic is the intensively studied and most widely used class of deductive logic. Classical logic has had much influence on analytic philosophy.
Characteristics
Each logical system in this c ...
each
propositional formula can be converted to an
equivalent formula that is in CNF. This transformation is based on rules about
logical equivalence
In logic and mathematics, statements p and q are said to be logically equivalent if they have the same truth value in every model. The logical equivalence of p and q is sometimes expressed as p \equiv q, p :: q, \textsfpq, or p \iff q, depending ...
s:
double negation elimination
In propositional logic, the double negation of a statement states that "it is not the case that the statement is not true". In classical logic, every statement is logically equivalent to its double negation, but this is not true in intuitionis ...
,
De Morgan's laws
In propositional calculus, propositional logic and Boolean algebra, De Morgan's laws, also known as De Morgan's theorem, are a pair of transformation rules that are both Validity (logic), valid rule of inference, rules of inference. They are nam ...
, and the
distributive law
In mathematics, the distributive property of binary operations is a generalization of the distributive law, which asserts that the equality
x \cdot (y + z) = x \cdot y + x \cdot z
is always true in elementary algebra.
For example, in elementary ...
.
Basic algorithm
The algorithm to compute a CNF-equivalent of a given propositional formula
builds upon
in
disjunctive normal form (DNF): step 1.
Then
is converted to
by swapping ANDs with ORs and vice versa while negating all the literals. Remove all
.
Conversion by syntactic means
Convert to CNF the propositional formula
.
Step 1: Convert its negation to disjunctive normal form.
[see ]
,
where each
is a conjunction of literals
.
Step 2: Negate
.
Then shift
inwards by applying the
(generalized) De Morgan's equivalences until no longer possible.
where
Step 3: Remove all double negations.
Example
Convert to CNF the propositional formula
.
The (full) DNF equivalent of its negation is
Conversion by semantic means
A CNF equivalent of a formula can be derived from its
truth table
A truth table is a mathematical table used in logic—specifically in connection with Boolean algebra, Boolean functions, and propositional calculus—which sets out the functional values of logical expressions on each of their functional arg ...
. Again, consider the formula
.
The corresponding
truth table
A truth table is a mathematical table used in logic—specifically in connection with Boolean algebra, Boolean functions, and propositional calculus—which sets out the functional values of logical expressions on each of their functional arg ...
is
A CNF equivalent of
is
Each disjunction reflects an assignment of variables for which
evaluates to F(alse).
If in such an assignment a variable
* is T(rue), then the literal is set to
in the disjunction,
* is F(alse), then the literal is set to
in the disjunction.
Other approaches
Since all propositional formulas can be converted into an equivalent formula in conjunctive normal form, proofs are often based on the assumption that all formulae are CNF. However, in some cases this conversion to CNF can lead to an exponential explosion of the formula. For example, translating the non-CNF formula
into CNF produces a formula with
clauses:
Each clause contains either
or
for each
.
There exist transformations into CNF that avoid an exponential increase in size by preserving
satisfiability
In mathematical logic, a formula is ''satisfiable'' if it is true under some assignment of values to its variables. For example, the formula x+3=y is satisfiable because it is true when x=3 and y=6, while the formula x+1=x is not satisfiable over ...
rather than
equivalence. These transformations are guaranteed to only linearly increase the size of the formula, but introduce new variables. For example, the above formula can be transformed into CNF by adding variables
as follows:
An
interpretation satisfies this formula only if at least one of the new variables is true. If this variable is
, then both
and
are true as well. This means that every
model
A model is an informative representation of an object, person, or system. The term originally denoted the plans of a building in late 16th-century English, and derived via French and Italian ultimately from Latin , .
Models can be divided in ...
that satisfies this formula also satisfies the original one. On the other hand, only some of the models of the original formula satisfy this one: since the
are not mentioned in the original formula, their values are irrelevant to satisfaction of it, which is not the case in the last formula. This means that the original formula and the result of the translation are
equisatisfiable but not
equivalent.
An alternative translation, the
Tseitin transformation, includes also the clauses
. With these clauses, the formula implies
; this formula is often regarded to "define"
to be a name for
.
Maximum number of disjunctions
Consider a propositional formula with
variables,
.
There are
possible literals:
.
has
non-empty subsets.
This is the maximum number of disjunctions a CNF can have.
All truth-functional combinations can be expressed with
disjunctions, one for each row of the truth table.
In the example below they are underlined.
Example
Consider a formula with two variables
and
.
The longest possible CNF has
disjunctions:
This formula is a
contradiction
In traditional logic, a contradiction involves a proposition conflicting either with itself or established fact. It is often used as a tool to detect disingenuous beliefs and bias. Illustrating a general tendency in applied logic, Aristotle's ...
. It can be simplified to
or to
, which are also contradictions, as well as valid CNFs.
Computational complexity
An important set of problems in
computational complexity
In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations ...
involves finding assignments to the variables of a Boolean formula expressed in conjunctive normal form, such that the formula is true. The ''k''-SAT problem is the problem of finding a satisfying assignment to a Boolean formula expressed in CNF in which each disjunction contains at most ''k'' variables.
3-SAT is
NP-complete
In computational complexity theory, NP-complete problems are the hardest of the problems to which ''solutions'' can be verified ''quickly''.
Somewhat more precisely, a problem is NP-complete when:
# It is a decision problem, meaning that for any ...
(like any other ''k''-SAT problem with ''k''>2) while
2-SAT is known to have solutions in
polynomial time
In theoretical computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations p ...
. As a consequence, the task of converting a formula into a
DNF, preserving satisfiability, is
NP-hard
In computational complexity theory, a computational problem ''H'' is called NP-hard if, for every problem ''L'' which can be solved in non-deterministic polynomial-time, there is a polynomial-time reduction from ''L'' to ''H''. That is, assumi ...
;
dually, converting into CNF, preserving
validity, is also NP-hard; hence equivalence-preserving conversion into DNF or CNF is again NP-hard.
Typical problems in this case involve formulas in "3CNF": conjunctive normal form with no more than three variables per conjunct. Examples of such formulas encountered in practice can be very large, for example with 100,000 variables and 1,000,000 conjuncts.
A formula in CNF can be converted into an equisatisfiable formula in "''k''CNF" (for ''k''≥3) by replacing each conjunct with more than ''k'' variables
by two conjuncts
and
with a new variable, and repeating as often as necessary.
First-order logic
In first order logic, conjunctive normal form can be taken further to yield the clausal normal form of a logical formula, which can be then used to perform
first-order resolution
In mathematical logic and automated theorem proving, resolution is a rule of inference leading to a refutation-complete theorem-proving technique for sentences in propositional logic and first-order logic. For propositional logic, systematically ...
.
In resolution-based automated theorem-proving, a CNF formula
See below for an example.
Converting from first-order logic
To convert
first-order logic
First-order logic, also called predicate logic, predicate calculus, or quantificational logic, is a collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over ...
to CNF:
#Convert to
negation normal form.
## Eliminate implications and equivalences: repeatedly replace
with
; replace
with
. Eventually, this will eliminate all occurrences of
and
.
##Move NOTs inwards by repeatedly applying
De Morgan's law
In propositional logic and Boolean algebra, De Morgan's laws, also known as De Morgan's theorem, are a pair of transformation rules that are both valid rules of inference. They are named after Augustus De Morgan, a 19th-century British mathemat ...
. Specifically, replace
with
; replace
with
; and replace
with
; replace
with
;
with
. After that, a
may occur only immediately before a predicate symbol.
#Standardize variables
##For sentences like
which use the same variable name twice, change the name of one of the variables. This avoids confusion later when dropping quantifiers. For example,