computability theory Computability theory, also known as recursion theory, is a branch of mathematical logic, computer science, and the theory of computation that originated in the 1930s with the study of computable functions and Turing degrees. The field has since ex ...

, Rice's theorem states that all non-trivial semantic properties of programs are undecidable. A ''semantic'' property is one about the program's behavior (for instance, "does the program terminate for all inputs?"), unlike a syntactic property (for instance, "does the program contain an

if-then-else In computer science, conditionals (that is, conditional statements, conditional expressions and conditional constructs) are programming language constructs that perform different computations or actions or return different values depending on t ...

statement?"). A ''non-trivial'' property is one which is neither true for every program, nor false for every program. The theorem generalizes the undecidability of the

halting problem In computability theory (computer science), computability theory, the halting problem is the problem of determining, from a description of an arbitrary computer program and an input, whether the program will finish running, or continue to run for ...

. It has far-reaching implications on the feasibility of

static analysis Static analysis, static projection, or static scoring is a simplified analysis wherein the effect of an immediate change to a system is calculated without regard to the longer-term response of the system to that change. If the short-term effect i ...

of programs. It implies that it is impossible, for example, to implement a tool that checks whether any given program is correct, or even executes without error (it is possible to implement a tool that always overestimates or always underestimates, so in practice one has to decide what is less of a problem). The theorem is named after Henry Gordon Rice, who proved it in his doctoral dissertation of 1951 at

Syracuse University Syracuse University (informally 'Cuse or SU) is a Private university, private research university in Syracuse, New York, United States. It was established in 1870 with roots in the Methodist Episcopal Church but has been nonsectarian since 1920 ...

Introduction

Rice's theorem puts a theoretical bound on which types of

can be performed automatically. One can distinguish between the

syntax In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...

of a program, and its

semantics Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...

. The syntax is the detail of how the program is written, or its "intension", and the semantics is how the program behaves when run, or its "extension". Rice's theorem asserts that it is impossible to decide a property of programs which depends only on the semantics and not on the syntax, unless the property is trivial (true of all programs, or false of all programs). By Rice's theorem, it is impossible to write a program that automatically verifies for the absence of bugs in other programs, taking a program and a

specification A specification often refers to a set of documented requirements to be satisfied by a material, design, product, or service. A specification is often a type of technical standard. There are different types of technical or engineering specificati ...

as input, and checking whether the program satisfies the specification. This does not imply an impossibility to prevent ''certain types'' of bugs. For example, Rice's theorem implies that in dynamically typed programming languages which are

Turing-complete In computability theory, a system of data-manipulation rules (such as a model of computation, a computer's instruction set, a programming language, or a cellular automaton) is said to be Turing-complete or computationally universal if it can be ...

, it is impossible to verify the absence of type errors. On the other hand, statically typed programming languages feature a type system which statically prevents type errors. In essence, this should be understood as a feature of the ''syntax'' (taken in a broad sense) of those languages. In order to type check a program, its source code must be inspected; the operation does not depend merely on the hypothetical semantics of the program. In terms of general software verification, this means that although one cannot algorithmically check whether any given program satisfies a given specification, one can require programs to be annotated with extra information that proves the program is correct, or to be written in a particular restricted form that makes the verification possible, and only accept programs which are verified in this way. In the case of type safety, the former corresponds to type annotations, and the latter corresponds to

type inference Type inference, sometimes called type reconstruction, refers to the automatic detection of the type of an expression in a formal language. These include programming languages and mathematical type systems, but also natural languages in some bran ...

. Taken beyond type safety, this idea leads to correctness proofs of programs through proof annotations such as in

Hoare logic Hoare logic (also known as Floyd–Hoare logic or Hoare rules) is a formal system with a set of logical rules for reasoning rigorously about the correctness of computer programs. It was proposed in 1969 by the British computer scientist and l ...

. Another way of working around Rice's theorem is to search for methods which catch ''many'' bugs, without being complete. This is the theory of

abstract interpretation In computer science, abstract interpretation is a theory of sound approximation of the semantics of computer programs, based on monotonic functions over ordered sets, especially lattices. It can be viewed as a partial execution of a computer pro ...

. Yet another direction for verification is

model checking In computer science, model checking or property checking is a method for checking whether a finite-state model of a system meets a given specification (also known as correctness). This is typically associated with hardware or software syst ...

, which can only apply to finite-state programs, not to Turing-complete languages.

Formal statement

Let ''φ'' be an admissible numbering of partial computable functions. Let ''P'' be a subset of

\mathbb

. Suppose that: # ''P'' is ''non-trivial'': ''P'' is neither empty nor

\mathbb

itself. # ''P'' is ''extensional'': for all integers ''m'' and ''n'', if ''φ''_''m'' = ''φ''_''n'', then ''m'' ∈ ''P'' ⟺ ''n'' ∈ ''P''. Then ''P'' is undecidable. A more concise statement can be made in terms of index sets: The only decidable index sets are ∅ and

\mathbb

Examples

Given a program ''P'' which takes a natural number ''n'' and returns a natural number ''P''(''n''), the following questions are undecidable: * Does ''P'' terminate on a given ''n''? (This is the

.) * Does ''P'' terminate on 0? * Does ''P'' terminate on ''all'' ''n'' (i.e., is ''P'' total)? * Does ''P'' terminate and return 0 on every input? * Does ''P'' terminate and return 0 on some input? * Does ''P'' terminate and return the same value for all inputs? * Is ''P'' equivalent to a given program ''Q''?

Proof by Kleene's recursion theorem

Assume for contradiction that

P

is a non-trivial, extensional and computable set of natural numbers. There is a natural number

a \in P

and a natural number

b \notin P

. Define a function

Q

Q_e(x) = \varphi_b(x)

when

e \in P

and

Q_e(x) = \varphi_a(x)

when

e \notin P

. By

Kleene's recursion theorem In computability theory, Kleene's recursion theorems are a pair of fundamental results about the application of computable functions to their own descriptions. The theorems were first proved by Stephen Kleene in 1938 and appear in his 1952 ...

, there exists

e

such that

\varphi_e = Q_e

. Then, if

e \in P

, we have

\varphi_e = \varphi_b

, contradicting the extensionality of

P

since

b \notin P

, and conversely, if

e \notin P

, we have

\varphi_e = \varphi_a

, which again contradicts extensionality since

a \in P

Proof by reduction from the halting problem

Proof sketch

Suppose, for concreteness, that we have an algorithm for examining a program ''p'' and determining infallibly whether ''p'' is an implementation of the squaring function, which takes an integer ''d'' and returns ''d''². The proof works just as well if we have an algorithm for deciding any other non-trivial property of program behavior (i.e. a semantic and non-trivial property), and is given in general below. The claim is that we can convert our algorithm for identifying squaring programs into one that identifies functions that halt. We will describe an algorithm that takes inputs ''a'' and ''i'' and determines whether program ''a'' halts when given input ''i''. The algorithm for deciding this is conceptually simple: it constructs (the description of) a new program ''t'' taking an argument ''n'', which (1) first executes program ''a'' on input ''i'' (both ''a'' and ''i'' being hard-coded into the definition of ''t''), and (2) then returns the square of ''n''. If ''a''(''i'') runs forever, then ''t'' never gets to step (2), regardless of ''n''. Then clearly, ''t'' is a function for computing squares if and only if step (1) terminates. Since we have assumed that we can infallibly identify programs for computing squares, we can determine whether ''t'', which depends on ''a'' and ''i'', is such a program; thus we have obtained a program that decides whether program ''a'' halts on input ''i''. Note that our halting-decision algorithm never executes ''t'', but only passes its description to the squaring-identification program, which by assumption always terminates; since the construction of the description of ''t'' can also be done in a way that always terminates, the halting-decision cannot fail to halt either. halts (a,i) This method does not depend specifically on being able to recognize functions that compute squares; as long as ''some'' program can do what we are trying to recognize, we can add a call to ''a'' to obtain our ''t''. We could have had a method for recognizing programs for computing square roots, or programs for computing the monthly payroll, or programs that halt when given the input "Abraxas"; in each case, we would be able to solve the halting problem similarly.

Formal proof

For the formal proof, algorithms are presumed to define partial functions over strings and are themselves represented by strings. The partial function computed by the algorithm represented by a string ''a'' is denoted F_''a''. This proof proceeds by

reductio ad absurdum In logic, (Latin for "reduction to absurdity"), also known as (Latin for "argument to absurdity") or ''apagogical argument'', is the form of argument that attempts to establish a claim by showing that the opposite scenario would lead to absur ...

: we assume that there is a non-trivial property that is decided by an algorithm, and then show that it follows that we can decide the

, which is not possible, and therefore a contradiction. Let us now assume that ''P''(''a'') is an algorithm that decides some non-trivial property of F_''a''. Without loss of generality we may assume that ''P''(''no-halt'') = "no", with ''no-halt'' being the representation of an algorithm that never halts. If this is not true, then this holds for the algorithm ' that computes the negation of the property ''P''. Now, since ''P'' decides a non-trivial property, it follows that there is a string ''b'' that represents an algorithm F_''b'' and ''P''(''b'') = "yes". We can then define an algorithm ''H''(''a'', ''i'') as follows: :1. construct a string ''t'' that represents an algorithm ''T''(''j'') such that :* ''T'' first simulates the computation of F_''a''(''i''), :* then ''T'' simulates the computation of F_''b''(''j'') and returns its result. :2. return ''P''(''t''). We can now show that ''H'' decides the halting problem: * Assume that the algorithm represented by ''a'' halts on input ''i''. In this case F_''t'' = F_''b'' and, because ''P''(''b'') = "yes" and the output of ''P''(''x'') depends only on F_''x'', it follows that ''P''(''t'') = "yes" and, therefore ''H''(''a'', ''i'') = "yes". * Assume that the algorithm represented by ''a'' does not halt on input ''i''. In this case F_''t'' = F_''no-halt'', i.e., the partial function that is never defined. Since ''P''(''no-halt'') = "no" and the output of ''P''(''x'') depends only on F_''x'', it follows that ''P''(''t'') = "no" and, therefore ''H''(''a'', ''i'') = "no". Since the halting problem is known to be undecidable, this is a contradiction and the assumption that there is an algorithm ''P''(''a'') that decides a non-trivial property for the function represented by ''a'' must be false.

References

* * * {{Citation , last1=Rogers , first1=Hartley Jr. , author-link= Hartley Rogers Jr. , title=Theory of Recursive Functions and Effective Computability , publisher=

McGraw-Hill McGraw Hill is an American education science company that provides educational content, software, and services for students and educators across various levels—from K-12 to higher education and professional settings. They produce textbooks, ...

, year=1987 , edition= 2nd, at= §14.8 Articles containing proofs Theorems in theory of computation Theorems in the foundations of mathematics Undecidable problems