The Isabelle
automated theorem prover is a
higher-order logic (HOL) theorem prover, written in
Standard ML
Standard ML (SML) is a General-purpose programming language, general-purpose, High-level programming language, high-level, Modular programming, modular, Functional programming, functional programming language with compile-time type checking and t ...
and
Scala. As a
Logic for Computable Functions (LCF) style theorem prover, it is based on a small logical core (kernel) to increase the trustworthiness of proofs without requiring, yet supporting, explicit proof objects.
Isabelle is available inside a flexible system framework allowing for logically safe extensions, which comprise both theories and implementations for code-generating, documenting, and specific support for a variety of
formal methods
In computer science, formal methods are mathematics, mathematically rigorous techniques for the formal specification, specification, development, Program analysis, analysis, and formal verification, verification of software and computer hardware, ...
. It can be seen as an
integrated development environment
An integrated development environment (IDE) is a Application software, software application that provides comprehensive facilities for software development. An IDE normally consists of at least a source-code editor, build automation tools, an ...
(IDE) for formal methods. In recent years, a substantial number of theories and system extensions have been collected in the Isabelle ''Archive of Formal Proofs'' (Isabelle AFP).
Isabelle was named by
Lawrence Paulson after
Gérard Huet's daughter.
The Isabelle theorem prover is
free software
Free software, libre software, libreware sometimes known as freedom-respecting software is computer software distributed open-source license, under terms that allow users to run the software for any purpose as well as to study, change, distribut ...
, released under the revised
BSD license.
Features
Isabelle is generic: it provides a
meta-logic (a weak
type theory
In mathematics and theoretical computer science, a type theory is the formal presentation of a specific type system. Type theory is the academic study of type systems.
Some type theories serve as alternatives to set theory as a foundation of ...
), which is used to encode object logics like
first-order logic (FOL),
higher-order logic
In mathematics and logic, a higher-order logic (abbreviated HOL) is a form of logic that is distinguished from first-order logic by additional quantifiers and, sometimes, stronger semantics. Higher-order logics with their standard semantics are m ...
(HOL) or
Zermelo–Fraenkel set theory
In set theory, Zermelo–Fraenkel set theory, named after mathematicians Ernst Zermelo and Abraham Fraenkel, is an axiomatic system that was proposed in the early twentieth century in order to formulate a theory of sets free of paradoxes suc ...
(ZFC). The most widely used object logic is Isabelle/HOL, although significant set theory developments were completed in Isabelle/ZF. Isabelle's main proof method is a higher-order version of
resolution, based on higher-order
unification.
Though interactive, Isabelle features efficient automatic reasoning tools, such as a
term rewriting engine and a
tableaux prover, various decision procedures, and, through the Sledgehammer proof-automation interface, external
satisfiability modulo theories (SMT) solvers (including
CVC4) and
resolution-based
automated theorem provers (ATPs), including
E,
SPASS, and
Vampire
A vampire is a mythical creature that subsists by feeding on the Vitalism, vital essence (generally in the form of blood) of the living. In European folklore, vampires are undead, undead humanoid creatures that often visited loved ones and c ...
(the Metis proof method reconstructs resolution proofs generated by these ATPs). It also features two
model
A model is an informative representation of an object, person, or system. The term originally denoted the plans of a building in late 16th-century English, and derived via French and Italian ultimately from Latin , .
Models can be divided in ...
finders (
counterexample generators): Nitpick
[Jasmin Christian Blanchette, Mathias Fleury, Peter Lammich & Christoph Weidenbach]
"A Verified SAT Solver Framework with Learn, Forget, Restart, and Incrementality"
''Journal of Automated Reasoning'' 61:333–365 (2018). and Nunchaku.
Isabelle features locales which are modules that structure large proofs. A locale fixes types, constants, and assumptions within a specified scope
[ so that they do not have to be repeated for every lemma.
Isar ("intelligible semi-automated reasoning") is Isabelle's formal proof language. It is inspired by the Mizar system.][
]
Example proof
Isabelle allows proofs to be written in two different styles, the procedural and the declarative. Procedural proofs specify a series of tactics (theorem proving functions/procedures) to apply. While reflecting the procedure that a human mathematician might apply to proving a result, they are typically hard to read as they do not describe the outcome of these steps. Declarative proofs (supported by Isabelle's proof language, Isar), on the other hand, specify the actual mathematical operations to be performed, and are therefore more easily read and checked by humans.
The procedural style has been deprecated in recent versions of Isabelle.
For example, a declarative proof by contradiction
In logic, proof by contradiction is a form of proof that establishes the truth or the validity of a proposition by showing that assuming the proposition to be false leads to a contradiction.
Although it is quite freely used in mathematical pr ...
in Isar that the square root of two is not rational can be written as follows.
sqrt2_not_rational:
?x =
m n :: nat
sqrt_rat: lowest_terms:
(rule Rats_abs_nat_div_natE)
(auto simp add: power2_eq_square)
eq: of_nat_eq_iff power2_eq_square fastforce
simp
simp
-
k ..
eq simp
simp
simp
(rule gcd_greatest)
lowest_terms simp
False odd_one blast
Applications
Isabelle has been used to aid formal methods
In computer science, formal methods are mathematics, mathematically rigorous techniques for the formal specification, specification, development, Program analysis, analysis, and formal verification, verification of software and computer hardware, ...
for the specification, development and verification of software and hardware systems.
Isabelle has been used to formalize numerous theorems from mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
, like Gödel's completeness theorem
Gödel's completeness theorem is a fundamental theorem in mathematical logic that establishes a correspondence between semantics, semantic truth and syntactic Provability logic, provability in first-order logic.
The completeness theorem applies ...
, Gödel's theorem about the consistency of the axiom of choice
In mathematics, the axiom of choice, abbreviated AC or AoC, is an axiom of set theory. Informally put, the axiom of choice says that given any collection of non-empty sets, it is possible to construct a new set by choosing one element from e ...
, the prime number theorem
In mathematics, the prime number theorem (PNT) describes the asymptotic analysis, asymptotic distribution of the prime numbers among the positive integers. It formalizes the intuitive idea that primes become less common as they become larger by p ...
, correctness of security protocols, and properties of programming language semantics. Many of the formal proofs are, as mentioned, maintained in the Archive of Formal Proofs, which contains (as of 2019) at least 500 articles with over 2 million lines of proof in total.
* In 2009, the L4.verified project at NICTA
NICTA (formerly named National ICT Australia Ltd) was Australia's Information and Communications Technology (ICT) Research Centre of Excellence and is now known as CSIRO's Data61. The term "Centre of Excellence" is common marketing terminology ...
produced the first formal proof of functional correctness of a general-purpose operating system kernel:[
] the seL4 (secure embedded L4) microkernel
In computer science, a microkernel (often abbreviated as μ-kernel) is the near-minimum amount of software that can provide the mechanisms needed to implement an operating system (OS). These mechanisms include low-level address space management, ...
. The proof is constructed and checked in Isabelle/HOL and comprises over 200,000 lines of proof script to verify 7,500 lines of C. The verification covers code, design, and implementation, and the main theorem states that the C code correctly implements the formal specification of the kernel. The proof uncovered 144 bugs in an early version of the C code of the seL4 kernel, and about 150 issues in each of design and specification.
* The definition of the programming language Lightweight Java was proven type-sound in Isabelle.
Alternatives
Several languages and systems provide similar functions:
* Agda, written in Haskell
Haskell () is a general-purpose, statically typed, purely functional programming language with type inference and lazy evaluation. Designed for teaching, research, and industrial applications, Haskell pioneered several programming language ...
* Rocq (previously known as ''Coq''), written in OCaml
OCaml ( , formerly Objective Caml) is a General-purpose programming language, general-purpose, High-level programming language, high-level, Comparison of multi-paradigm programming languages, multi-paradigm programming language which extends the ...
* Lean, written in Lean itself and C++
* LEGO
Lego (, ; ; stylised as LEGO) is a line of plastic construction toys manufactured by the Lego Group, a privately held company based in Billund, Denmark. Lego consists of variously coloured interlocking plastic bricks made of acrylonitri ...
, written in Standard ML of New Jersey
* Mizar system, written in Free Pascal
* Metamath, written in ANSI C
* Prover9 Prover9 is an automated theorem proving, automated theorem prover for first-order logic, first-order and equational logic developed by William McCune.
Description
Prover9 is the successor of the Otter (theorem prover), Otter theorem prover also dev ...
, written in C, with a GUI written in Python
* Twelf, written in Standard ML
Standard ML (SML) is a General-purpose programming language, general-purpose, High-level programming language, high-level, Modular programming, modular, Functional programming, functional programming language with compile-time type checking and t ...
Notes
References
Further reading
* Lawrence C. Paulson
"The Foundation of a Generic Theorem Prover"
''Journal of Automated Reasoning'', Volume 5, Issue 3 (September 1989), pages: 363–397, .
* Lawrence C. Paulson and Tobias Nipkow
"Isabelle Tutorial and User's Manual"
1990.
* M. A. Ozols, K. A. Eastaughffe, and A. Cant
"DOVE: A Tool for Design Oriented Verification and Evaluation"
''Proceedings of AMAST 97'', M. Johnson, editor, Sydney, Australia. Lecture Notes in Computer Science (LNCS) Vol. 1349, Springer Verlag, 1997.
* Tobias Nipkow, Lawrence C. Paulson, Markus Wenzel
"Isabelle/HOL – A Proof Assistant for Higher-Order Logic"
2020.
External links
*
Isabelle on Stack Overflow
The Archive of Formal Proofs
IsarMathLib
{{ML programming
Proof assistants
Free theorem provers
Software using the BSD license