HOME

TheInfoList



OR:

In
theoretical computer science computer science (TCS) is a subset of general computer science and mathematics that focuses on mathematical aspects of computer science such as the theory of computation, lambda calculus, and type theory. It is difficult to circumscribe the ...
, a Markov algorithm is a
string rewriting system In theoretical computer science and mathematical logic a string rewriting system (SRS), historically called a semi- Thue system, is a rewriting system over strings from a (usually finite) alphabet. Given a binary relation R between fixed strings ...
that uses
grammar In linguistics, the grammar of a natural language is its set of structural constraints on speakers' or writers' composition of clauses, phrases, and words. The term can also refer to the study of such constraints, a field that includes doma ...
-like rules to operate on strings of symbols. Markov algorithms have been shown to be
Turing-complete In computability theory, a system of data-manipulation rules (such as a computer's instruction set, a programming language, or a cellular automaton) is said to be Turing-complete or computationally universal if it can be used to simulate any ...
, which means that they are suitable as a general model of
computation Computation is any type of arithmetic or non-arithmetic calculation that follows a well-defined model (e.g., an algorithm). Mechanical or electronic devices (or, historically, people) that perform computations are known as ''computers''. An esp ...
and can represent any
mathematical expression In mathematics, an expression or mathematical expression is a finite combination of symbols that is well-formed according to rules that depend on the context. Mathematical symbols can designate numbers ( constants), variables, operations, f ...
from its simple notation. Markov algorithms are named after the Soviet mathematician
Andrey Markov, Jr. Andrey Andreyevich Markov (russian: Андре́й Андре́евич Ма́рков; St. Petersburg, September 22, 1903 – Moscow, October 11, 1979) was a Soviet mathematician, the son of the Russian mathematician Andrey Markov Sr, and ...
Refal is a
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
based on Markov algorithms.


Description

Normal algorithms are verbal, that is, intended to be applied to strings in different alphabets. The definition of any normal algorithm consists of two parts: the definition of the ''alphabet'' of the algorithm (the algorithm will be applied to strings of these alphabet symbols), and the definition of its ''scheme''. The scheme of a normal algorithm is a finite ordered set of so-called ''substitution formulas'', each of which can be ''simple'' or ''final''. Simple substitution formulas are represented by strings of the form L\to D, where L and D are two arbitrary strings in the alphabet of the algorithm (called, respectively, the left and right sides of the formula substitution). Similarly, final substitution formulas are represented by strings of the form L\to\cdot D, where L and D are two arbitrary strings in the alphabet of the algorithm. This assumes that the auxiliary characters \to and \to\cdot do not belong to the alphabet of the algorithm (otherwise two other characters to perform their role as the dividers of the left and right sides, which are not in the algorithm's alphabet, should be selected). Here is an example of a normal algorithm scheme in the five-letter alphabet , *abc: : \left\{\begin{matrix} , b&\to& ba, \\ ab&\to& ba\\ b&\to&\\ {*}, &\to& b*& \\ {*}&\to& c& \\ , c&\to& c\\ ac&\to& c, \\ c&\to\cdot\end{matrix}\right. The process of applying the normal algorithm to an arbitrary string V in the alphabet of this algorithm is a discrete sequence of elementary steps, consisting of the following. Let’s assume that V' is the word obtained in the previous step of the algorithm (or the original word V, if the current step is the first). If of the substitution formulas there is no left-hand side which is included in the V', then the algorithm terminates, and the result of its work is considered to be the string V'. Otherwise, the first of the substitution formulae whose left sides are included in V' is selected. If the substitution formula is of the form L\to\cdot D, then out of all of possible representations of the string V' of the form RLS (where R and S are arbitrary strings) the one with the shortest R is chosen. Then the algorithm terminates and the result of its work is considered to be RDS. However, if this substitution formula is of the form L\to D, then out of all of the possible representations of the string V' of the form of RLS the one with the shortest R is chosen, after which the string RDS is considered to be the result of the current step, subject to further processing in the next step. For example, the process of applying the algorithm described above to the word , *, , results in the sequence of words , b*, , ba, *, , a, *, , a, b*, aba, *, baa, *, aa, *, aa, c, aac, ac, and c, , , after which the algorithm stops with the result , , . For other examples, see below. Any normal algorithm is equivalent to some
Turing machine A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer alg ...
, and vice versaany
Turing machine A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer alg ...
is equivalent to some normal algorithm. A version of the Church-Turing thesis formulated in relation to the normal algorithm is called the "principle of normalization." Normal algorithms have proved to be a convenient means for the construction of many sections of
constructive mathematics In the philosophy of mathematics, constructivism asserts that it is necessary to find (or "construct") a specific example of a mathematical object in order to prove that an example exists. Contrastingly, in classical mathematics, one can prove th ...
. Moreover, inherent in the definition of a normal algorithm are a number of ideas used in programming languages aimed at handling symbolic informationfor example, in Refal.


Algorithm

The ''Rules'' are a sequence of pairs of strings, usually presented in the form of ''pattern'' → ''replacement''. Each rule may be either ordinary or terminating. Given an ''input'' string: #Check the Rules in order from top to bottom to see whether any of the ''patterns'' can be found in the ''input'' string. #If none is found, the algorithm stops. #If one (or more) is found, use the first of them to replace the leftmost occurrence of matched text in the ''input'' string with its ''replacement''. #If the rule just applied was a terminating one, the algorithm stops. #Go to step 1. Note that after each rule application the search starts over from the first rule.


Example

The following example shows the basic operation of a Markov algorithm.


Rules

#"A" -> "apple" #"B" -> "bag" #"S" -> "shop" #"T" -> "the" #"the shop" -> "my brother" #"a never used" -> ."terminating rule"


Symbol string

"I bought a B of As from T S."


Execution

If the algorithm is applied to the above example, the Symbol string will change in the following manner. #"I bought a B of As from T S." #"I bought a B of apples from T S." #"I bought a bag of apples from T S." #"I bought a bag of apples from T shop." #"I bought a bag of apples from the shop." #"I bought a bag of apples from my brother." The algorithm will then terminate.


Another example

These rules give a more interesting example. They rewrite binary numbers to their unary counterparts. For example, 101 will be rewritten to a string of 5 consecutive bars.


Rules

#", 0" -> "0, , " #"1" -> "0, " #"0" -> ""


Symbol string

"101"


Execution

If the algorithm is applied to the above example, it will terminate after the following steps. #"101" #"0, 01" #"00, , 1" #"00, , 0, " #"00, 0, , , " #"000, , , , , " #"00, , , , , " #"0, , , , , " #", , , , , "


See also

* Thue (programming language) *
Formal grammar In formal language theory, a grammar (when the context is not given, often called a formal grammar for clarity) describes how to form strings from a language's alphabet that are valid according to the language's syntax. A grammar does not describe ...


References

* Caracciolo di Forino, A. ''String processing languages and generalized Markov algorithms.'' In Symbol manipulation languages and techniques, D. G. Bobrow (Ed.), North-Holland Publ. Co., Amsterdam, The Netherlands, 1968, pp. 191–206. * Andrey Andreevich Markov (1903-1979) 1960. ''The Theory of Algorithms.'' American Mathematical Society Translations, series 2, 15, 1-14. (Translation from the Russian, Trudy Instituta im. Steklova 38 (1951) 176-189{{Cite journal , last=Kushner , first=Boris A. , date=1999-05-28 , title=Markov's constructive analysis; a participant's view , url=https://core.ac.uk/works/41825477 , journal=Theoretical Computer Science , language=en , volume=219 , issue=1-2 , pages=268, 284 , doi=10.1016/S0304-3975(98)00291-6)


External links


Yad Studio - Markov algorithms IDE and interpreter (Open Source)

Markov algorithm interpreter

Markov algorithm interpreter

Markov algorithm interpreters at Rosetta-Code
Theory of computation Rewriting systems Models of computation