In
computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
, formal methods are
mathematically rigorous techniques for the
specification
A specification often refers to a set of documented requirements to be satisfied by a material, design, product, or service. A specification is often a type of technical standard.
There are different types of technical or engineering specificati ...
, development,
analysis
Analysis (: analyses) is the process of breaking a complex topic or substance into smaller parts in order to gain a better understanding of it. The technique has been applied in the study of mathematics and logic since before Aristotle (38 ...
, and
verification of
software
Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications.
The history of software is closely tied to the development of digital comput ...
and
hardware systems.
The use of formal methods for software and hardware design is motivated by the expectation that, as in other engineering disciplines, performing appropriate mathematical analysis can contribute to the reliability and robustness of a design.
Formal methods employ a variety of
theoretical computer science
Theoretical computer science is a subfield of computer science and mathematics that focuses on the Abstraction, abstract and mathematical foundations of computation.
It is difficult to circumscribe the theoretical areas precisely. The Associati ...
fundamentals, including
logic
Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the study of deductively valid inferences or logical truths. It examines how conclusions follow from premises based on the structure o ...
calculi,
formal language
In logic, mathematics, computer science, and linguistics, a formal language is a set of strings whose symbols are taken from a set called "alphabet".
The alphabet of a formal language consists of symbols that concatenate into strings (also c ...
s,
automata theory,
control theory
Control theory is a field of control engineering and applied mathematics that deals with the control system, control of dynamical systems in engineered processes and machines. The objective is to develop a model or algorithm governing the applic ...
,
program semantics
In programming language theory, semantics is the rigorous mathematical study of the meaning of programming languages. Semantics assigns computational meaning to valid strings in a programming language syntax. It is closely related to, and oft ...
,
type systems, and
type theory
In mathematics and theoretical computer science, a type theory is the formal presentation of a specific type system. Type theory is the academic study of type systems.
Some type theories serve as alternatives to set theory as a foundation of ...
.
Uses
Formal methods can be applied at various points through the
development process.
Specification
Formal methods may be used to give a formal description of the system to be developed, at whatever level of detail desired. Further formal methods may depend on this specification to synthesize a program or to verify the correctness of a system.
Alternatively, specification may be the only stage in which formal methods is used. By writing a specification, ambiguities in the informal requirements can be discovered and resolved. Additionally, engineers can use a formal specification as a reference to guide their development processes.
The need for formal specification systems has been noted for years. In the
ALGOL 58 report,
John Backus presented a formal notation for describing
programming language syntax, later named
Backus normal form then renamed
Backus–Naur form
In computer science, Backus–Naur form (BNF, pronounced ), also known as Backus normal form, is a notation system for defining the Syntax (programming languages), syntax of Programming language, programming languages and other Formal language, for ...
(BNF). Backus also wrote that a formal description of the meaning of syntactically valid ALGOL programs was not completed in time for inclusion in the report, stating that it "will be included in a subsequent paper." However, no paper describing the formal semantics was ever released.
Synthesis
Program synthesis is the process of automatically creating a program that conforms to a specification. Deductive synthesis approaches rely on a complete formal specification of the program, whereas inductive approaches infer the specification from examples. Synthesizers perform a search over the space of possible programs to find a program consistent with the specification. Because of the size of this search space, developing efficient search algorithms is one of the major challenges in program synthesis.
Verification
Formal verification is the use of software tools to prove properties of a formal specification, or to prove that a formal model of a system
implementation
Implementation is the realization of an application, execution of a plan, idea, scientific modelling, model, design, specification, Standardization, standard, algorithm, policy, or the Management, administration or management of a process or Goal ...
satisfies its specification.
Once a formal specification has been developed, the specification may be used as the basis for
proving properties of the specification, and by inference, properties of the system implementation.
Sign-off verification
Sign-off verification is the use of a formal verification tool that is highly trusted. Such a tool can replace traditional verification methods (the tool may even be certified).
Human-directed proof
Sometimes, the motivation for proving the
correctness of a system is not the obvious need for reassurance of the correctness of the system, but a desire to understand the system better. Consequently, some proofs of correctness are produced in the style of
mathematical proof
A mathematical proof is a deductive reasoning, deductive Argument-deduction-proof distinctions, argument for a Proposition, mathematical statement, showing that the stated assumptions logically guarantee the conclusion. The argument may use othe ...
: handwritten (or typeset) using
natural language
A natural language or ordinary language is a language that occurs naturally in a human community by a process of use, repetition, and change. It can take different forms, typically either a spoken language or a sign language. Natural languages ...
, using a level of informality common to such proofs. A "good" proof is one that is readable and understandable by other human readers.
Critics of such approaches point out that the
ambiguity
Ambiguity is the type of meaning (linguistics), meaning in which a phrase, statement, or resolution is not explicitly defined, making for several interpretations; others describe it as a concept or statement that has no real reference. A com ...
inherent in natural language allows errors to be undetected in such proofs; often, subtle errors can be present in the low-level details typically overlooked by such proofs. Additionally, the work involved in producing such a good proof requires a high level of mathematical sophistication and expertise.
Automated proof
In contrast, there is increasing interest in producing proofs of correctness of such systems by automated means. Automated techniques fall into three general categories:
*
Automated theorem proving
Automated theorem proving (also known as ATP or automated deduction) is a subfield of automated reasoning and mathematical logic dealing with proving mathematical theorems by computer programs. Automated reasoning over mathematical proof was a majo ...
, in which a system attempts to produce a formal proof from scratch, given a description of the system, a set of logical
axiom
An axiom, postulate, or assumption is a statement that is taken to be true, to serve as a premise or starting point for further reasoning and arguments. The word comes from the Ancient Greek word (), meaning 'that which is thought worthy or ...
s, and a set of
inference rules.
*
Model checking, in which a system verifies certain properties by means of an exhaustive search of all possible states that a system could enter during its execution.
*
Abstract interpretation, in which a system verifies an over-approximation of a behavioural property of the program, using a fixpoint computation over a (possibly complete)
lattice representing it.
Some
automated theorem provers require guidance as to which properties are "interesting" enough to pursue, while others work without human intervention. Model checkers can quickly get bogged down in checking millions of uninteresting states if not given a sufficiently abstract model.
Proponents of such systems argue that the results have greater mathematical certainty than human-produced proofs, since all the tedious details have been algorithmically verified. The training required to use such systems is also less than that required to produce good mathematical proofs by hand, making the techniques accessible to a wider variety of practitioners.
Critics note that some of those systems are like
oracle
An oracle is a person or thing considered to provide insight, wise counsel or prophetic predictions, most notably including precognition of the future, inspired by deities. If done through occultic means, it is a form of divination.
Descript ...
s: they make a pronouncement of truth, yet give no explanation of that truth. There is also the problem of "
verifying the verifier"; if the program that aids in the verification is itself unproven, there may be reason to doubt the soundness of the produced results. Some modern model checking tools produce a "proof log" detailing each step in their proof, making it possible to perform, given suitable tools, independent verification.
The main feature of the abstract interpretation approach is that it provides a sound analysis, i.e. no false negatives are returned. Moreover, it is efficiently scalable, by tuning the abstract domain representing the property to be analyzed, and by applying widening operators to get fast convergence.
Techniques
Formal methods includes a number of different techniques.
Specification languages
The design of a computing system can be expressed using a specification language, which is a formal language that includes a proof system. Using this proof system, formal verification tools can reason about the specification and establish that a system adheres to the specification.
Binary decision diagrams
A binary decision diagram is a data structure that represents a
Boolean function. If a Boolean formula
expresses that an execution of a program conforms to the specification, a binary decision diagram can be used to determine if
is a tautology; that is, it always evaluates to TRUE. If this is the case, then the program always conforms to the specification.
SAT solvers
A SAT solver is a program that can solve the
Boolean satisfiability problem, the problem of finding an assignment of variables that makes a given propositional formula evaluate to true. If a Boolean formula
expresses that a specific execution of a program conforms to the specification, then determining that
is unsatisfiable is equivalent to determining that all executions conform to the specification. SAT solvers are often used in bounded model checking, but can also be used in unbounded model checking.
Applications
Formal methods are applied in different areas of hardware and software, including
routers,
Ethernet switches,
routing protocol
A routing protocol specifies how routers communicate with each other to distribute information that enables them to select paths between nodes on a computer network. Routers perform the traffic directing functions on the Internet; data packet ...
s, security applications, and
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
microkernel
In computer science, a microkernel (often abbreviated as μ-kernel) is the near-minimum amount of software that can provide the mechanisms needed to implement an operating system (OS). These mechanisms include low-level address space management, ...
s such as
seL4. There are several examples in which they have been used to verify the functionality of the hardware and software used in
data centres.
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
used
ACL2, a theorem prover, in the
AMD x86 processor development process. Intel uses such methods to verify its hardware and
firmware
In computing
Computing is any goal-oriented activity requiring, benefiting from, or creating computer, computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both computer hardware, h ...
(permanent software programmed into a
read-only memory
Read-only memory (ROM) is a type of non-volatile memory used in computers and other electronic devices. Data stored in ROM cannot be electronically modified after the manufacture of the memory device. Read-only memory is useful for storing sof ...
).
Dansk Datamatik Center used formal methods in the 1980s to develop a compiler system for the
Ada programming language that went on to become a long-lived commercial product.
There are several other projects of
NASA
The National Aeronautics and Space Administration (NASA ) is an independent agencies of the United States government, independent agency of the federal government of the United States, US federal government responsible for the United States ...
in which formal methods are applied, such as
Next Generation Air Transportation System, Unmanned Aircraft System integration in National Airspace System, and Airborne Coordinated Conflict Resolution and Detection (ACCoRD).
B-Method with
Atelier B, is used to develop safety automatisms for the various subways installed throughout the world by
Alstom and
Siemens, and also for
Common Criteria
The Common Criteria for Information Technology Security Evaluation (referred to as Common Criteria or CC) is an international standard (International Organization for Standardization, ISO/International Electrotechnical Commission, IEC 15408) for co ...
certification and the development of system models by
ATMEL and
STMicroelectronics.
Formal verification has been frequently used in hardware by most of the well-known hardware vendors, such as IBM,
Intel
Intel Corporation is an American multinational corporation and technology company headquartered in Santa Clara, California, and Delaware General Corporation Law, incorporated in Delaware. Intel designs, manufactures, and sells computer compo ...
, and AMD. There are many areas of hardware, where Intel have used formal methods to verify the working of the products, such as parameterized verification of cache-coherent protocol, Intel Core i7 processor execution engine validation (using theorem proving,
BDDs, and symbolic evaluation), optimization for Intel IA-64 architecture using HOL light theorem prover, and verification of high-performance dual-port
gigabit Ethernet controller with support for
PCI express
PCI Express (Peripheral Component Interconnect Express), officially abbreviated as PCIe, is a high-speed standard used to connect hardware components inside computers. It is designed to replace older expansion bus standards such as Peripher ...
protocol and Intel advance management technology using Cadence. Similarly, IBM has used formal methods in the verification of power gates, registers, and functional verification of the IBM Power7 microprocessor.
In software development
In
software development
Software development is the process of designing and Implementation, implementing a software solution to Computer user satisfaction, satisfy a User (computing), user. The process is more encompassing than Computer programming, programming, wri ...
, formal methods are mathematical approaches to solving software (and hardware) problems at the requirements, specification, and design levels. Formal methods are most likely to be applied to safety-critical or security-critical software and systems, such as
avionics software. Software safety assurance standards, such as
DO-178C allows the usage of formal methods through supplementation, and
Common Criteria
The Common Criteria for Information Technology Security Evaluation (referred to as Common Criteria or CC) is an international standard (International Organization for Standardization, ISO/International Electrotechnical Commission, IEC 15408) for co ...
mandates formal methods at the highest levels of categorization.
For sequential software, examples of formal methods include the
B-Method, the specification languages used in
automated theorem proving
Automated theorem proving (also known as ATP or automated deduction) is a subfield of automated reasoning and mathematical logic dealing with proving mathematical theorems by computer programs. Automated reasoning over mathematical proof was a majo ...
,
RAISE, and the
Z notation.
In
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by Function application, applying and Function composition (computer science), composing Function (computer science), functions. It is a declarat ...
,
property-based testing has allowed the mathematical specification and testing (if not exhaustive testing) of the expected behaviour of individual functions.
The
Object Constraint Language (and specializations such as
Java Modeling Language) has allowed object-oriented systems to be formally specified, if not necessarily formally verified.
For concurrent software and systems,
Petri nets,
process algebra, and
finite-state machine
A finite-state machine (FSM) or finite-state automaton (FSA, plural: ''automata''), finite automaton, or simply a state machine, is a mathematical model of computation. It is an abstract machine that can be in exactly one of a finite number o ...
s (which are based on
automata theory; see also
virtual finite state machine or
event driven finite state machine) allow executable software specification and can be used to build up and validate application behaviour.
Another approach to formal methods in software development is to write a specification in some form of logic—usually a variation of
first-order logic
First-order logic, also called predicate logic, predicate calculus, or quantificational logic, is a collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over ...
—and then to directly execute the logic as though it were a program. The
OWL language, based on
description logic, is an example. There is also work on mapping some version of English (or another natural language) automatically to and from logic, as well as executing the logic directly. Examples are
Attempto Controlled English, and Internet Business Logic, which do not seek to control the vocabulary or syntax. A feature of systems that support bidirectional English–logic mapping and direct execution of the logic is that they can be made to explain their results, in English, at the business or scientific level.
Semi-formal methods
Semi-formal methods are formalisms and languages that are not considered fully "formal". It defers the task of completing the semantics to a later stage, which is then done either by human interpretation or by interpretation through software like code or test case
generators.
Some practitioners believe that the formal methods community has overemphasized full formalization of a specification or design. They contend that the
expressiveness of the languages involved, as well as the complexity of the systems being modelled, make full formalization a difficult and expensive task. As an alternative, various ''lightweight'' formal methods, which emphasize partial specification and focused application, have been proposed. Examples of this lightweight approach to formal methods include the
Alloy
An alloy is a mixture of chemical elements of which in most cases at least one is a metal, metallic element, although it is also sometimes used for mixtures of elements; herein only metallic alloys are described. Metallic alloys often have prop ...
object modelling notation, Denney's synthesis of some aspects of the
Z notation with
use case driven development, and the CSK
VDM Tools.
Formal methods and notations
There are a variety of formal methods and notations available.
Specification languages
*
Abstract State Machines (ASMs)
*
A Computational Logic for Applicative Common Lisp (ACL2)
*
Actor model
The actor model in computer science is a mathematical model of concurrent computation that treats an ''actor'' as the basic building block of concurrent computation. In response to a message it receives, an actor can: make local decisions, create ...
*
Alloy
An alloy is a mixture of chemical elements of which in most cases at least one is a metal, metallic element, although it is also sometimes used for mixtures of elements; herein only metallic alloys are described. Metallic alloys often have prop ...
*
ANSI/ISO C Specification Language (ACSL)
*
Autonomic System Specification Language (ASSL)
*
B-Method
*
CADP
*
Common Algebraic Specification Language (CASL)
*
Esterel
*
Java Modeling Language (JML)
*
Knowledge Based Software Assistant (KBSA)
*
Lustre
*
mCRL2
*
Perfect Developer
*
Petri nets
*
Predicative programming
*
Process calculi
**
CSP
**
LOTOS
**
π-calculus
In theoretical computer science, the -calculus (or pi-calculus) is a process calculus. The -calculus allows channel names to be communicated along the channels themselves, and in this matter, it is able to describe concurrent computations whose ...
*
RAISE
*
Rebeca Modeling Language
*
SPARK Ada
*
Specification and Description Language
*
TLA+
*
USL
*
VDM
**
VDM-SL
** VDM++
*
Z notation
Model checkers
* ESBMC
*
MALPAS Software Static Analysis Toolset – an industrial-strength model checker used for formal proof of safety-critical systems
*
PAT – a free model checker, simulator and refinement checker for concurrent systems and CSP extensions (e.g., shared variables, arrays, fairness)
*
SPIN
*
UPPAAL
Solvers and competitions
Many problems in formal methods are
NP-hard
In computational complexity theory, a computational problem ''H'' is called NP-hard if, for every problem ''L'' which can be solved in non-deterministic polynomial-time, there is a polynomial-time reduction from ''L'' to ''H''. That is, assumi ...
, but can be solved in cases arising in practice. For example, the Boolean satisfiability problem is
NP-complete
In computational complexity theory, NP-complete problems are the hardest of the problems to which ''solutions'' can be verified ''quickly''.
Somewhat more precisely, a problem is NP-complete when:
# It is a decision problem, meaning that for any ...
by the
Cook–Levin theorem, but
SAT solver
In computer science and formal methods, a SAT solver is a computer program which aims to solve the Boolean satisfiability problem (SAT). On input a formula over Boolean data type, Boolean variables, such as "(''x'' or ''y'') and (''x'' or not ''y'' ...
s can solve a variety of large instances. There are "solvers" for a variety of problems that arise in formal methods, and there are many periodic competitions to evaluate the state-of-the-art in solving such problems.
* The SAT competition is a yearly competition that compares SAT solvers. SAT solvers are used in formal methods tools such as
Alloy
An alloy is a mixture of chemical elements of which in most cases at least one is a metal, metallic element, although it is also sometimes used for mixtures of elements; herein only metallic alloys are described. Metallic alloys often have prop ...
.
*
CASC is a yearly competition of
automated theorem prover
Automated theorem proving (also known as ATP or automated deduction) is a subfield of automated reasoning and mathematical logic dealing with proving mathematical theorems by computer programs. Automated reasoning over mathematical proof was a ma ...
s.
*
SMT-COMP is a yearly competition of
SMT solvers, which are applied to
formal verification
In the context of hardware and software systems, formal verification is the act of proving or disproving the correctness of a system with respect to a certain formal specification or property, using formal methods of mathematics.
Formal ver ...
.
*
CHC-COMP is a yearly competition of solvers of
constrained Horn clauses, which have applications to formal verification.
*
QBFEVAL is a biennial competition of solvers for
true quantified Boolean formula
In computational complexity theory, the language TQBF is a formal language consisting of the true quantified Boolean formulas. A (fully) quantified Boolean formula is a formula in Quantification (logic), quantified propositional logic (also known ...
s, which have applications to
model checking.
*
SV-COMP is an annual competition for software verification tools.
*
SyGuS-COMP is an annual competition for
program synthesis
In computer science, program synthesis is the task to construct a computer program, program that provably correct, provably satisfies a given high-level formal specification. In contrast to program verification, the program is to be constructed rat ...
tools.
Organizations
*
BCS-FACS
*
Formal Methods Europe
*
Z User Group
See also
*
Abstract interpretation
*
Automated theorem proving
Automated theorem proving (also known as ATP or automated deduction) is a subfield of automated reasoning and mathematical logic dealing with proving mathematical theorems by computer programs. Automated reasoning over mathematical proof was a majo ...
*
Design by contract
*
Formal methods people
*
Formal science
Formal science is a branch of science studying disciplines concerned with abstract structures described by formal systems, such as logic, mathematics, statistics, theoretical computer science, artificial intelligence, information theory, game t ...
*
Formal specification
*
Formal verification
In the context of hardware and software systems, formal verification is the act of proving or disproving the correctness of a system with respect to a certain formal specification or property, using formal methods of mathematics.
Formal ver ...
*
Formal system
A formal system is an abstract structure and formalization of an axiomatic system used for deducing, using rules of inference, theorems from axioms.
In 1921, David Hilbert proposed to use formal systems as the foundation of knowledge in ma ...
*
Methodism
Methodism, also called the Methodist movement, is a Protestant Christianity, Christian Christian tradition, tradition whose origins, doctrine and practice derive from the life and teachings of John Wesley. George Whitefield and John's brother ...
*
Methodology
In its most common sense, methodology is the study of research methods. However, the term can also refer to the methods themselves or to the philosophical discussion of associated background assumptions. A method is a structured procedure for bri ...
*
Model checking
*
Scientific method
The scientific method is an Empirical evidence, empirical method for acquiring knowledge that has been referred to while doing science since at least the 17th century. Historically, it was developed through the centuries from the ancient and ...
*
Software engineering
Software engineering is a branch of both computer science and engineering focused on designing, developing, testing, and maintaining Application software, software applications. It involves applying engineering design process, engineering principl ...
*
Specification language
References
Further reading
*
Jonathan P. Bowen and Michael G. Hinchey, ''Formal Methods''. In Allen B. Tucker, Jr. (ed.), ''Computer Science Handbook'', 2nd edition, Section XI, ''Software Engineering'', Chapter 106, pages 106-1 – 106-25,
Chapman & Hall /
CRC Press
The CRC Press, LLC is an American publishing group that specializes in producing technical books. Many of their books relate to engineering, science and mathematics. Their scope also includes books on business, forensics and information technol ...
,
Association for Computing Machinery
The Association for Computing Machinery (ACM) is a US-based international learned society for computing. It was founded in 1947 and is the world's largest scientific and educational computing society. The ACM is a non-profit professional membe ...
, 2004.
* Hubert Garavel (editor) and Susanne Graf
''Formal Methods for Safe and Secure Computer Systems'' Bundesamt für Sicherheit in der Informationstechnik, BSI study 875, Bonn, Germany, December 2013.
* * Michael G. Hinchey, Jonathan P. Bowen, and Emil Vassev, ''Formal Methods''. In Philip A. Laplante (ed.), ''Encyclopedia of Software Engineering'',
Taylor & Francis
Taylor & Francis Group is an international company originating in the United Kingdom that publishes books and academic journals. Its parts include Taylor & Francis, CRC Press, Routledge, F1000 (publisher), F1000 Research and Dovepress. It i ...
, 2010, pages 308–320.
*
Marieke Huisman, Dilian Gurov, and Alexander Malkis
''Formal Methods: From Academia to Industrial Practice – A Travel Guide'' arXiv:2002.07279, 2020.
*
* Jean François Monin and
Michael G. Hinchey, ''Understanding formal methods'',
Springer
Springer or springers may refer to:
Publishers
* Springer Science+Business Media, aka Springer International Publishing, a worldwide publishing group founded in 1842 in Germany formerly known as Springer-Verlag.
** Springer Nature, a multinationa ...
, 2003, .
External links
Formal Methods Europe(FME)
Formal Methods WikiFormal methodsfro
Foldoc;Archival material
Formal methodkeyword on
Microsoft Academic Search via
Archive.org
Evidence on Formal Methods uses and impact on Industrysupported by th
DEPLOY project (EU FP7) in
Archive.org
{{DEFAULTSORT:Formal Methods
Software development philosophies
Theoretical computer science
Specification languages