In
formal language theory
In logic, mathematics, computer science, and linguistics, a formal language is a set of string (computer science), strings whose symbols are taken from a set called "#Definition, alphabet".
The alphabet of a formal language consists of symbol ...
, a context-sensitive language is a language that can be defined by a
context-sensitive grammar
A context-sensitive grammar (CSG) is a formal grammar in which the left-hand sides and right-hand sides of any Production (computer science), production rules may be surrounded by a context of terminal symbol, terminal and nonterminal symbols. Cont ...
(and equivalently by a
noncontracting grammar). Context-sensitive is known as type-1 in the
Chomsky hierarchy
The Chomsky hierarchy in the fields of formal language theory, computer science, and linguistics, is a containment hierarchy of classes of formal grammars. A formal grammar describes how to form strings from a formal language's alphabet that are v ...
of formal languages.
Computational properties
Computationally, a context-sensitive language is equivalent to a linear bounded
nondeterministic Turing machine
In theoretical computer science, a nondeterministic Turing machine (NTM) is a theoretical model of computation whose governing rules specify more than one possible action when in some given situations. That is, an NTM's next state is ''not'' comp ...
, also called a
linear bounded automaton
In computer science, a linear bounded automaton (plural linear bounded automata, abbreviated LBA) is a restricted form of Turing machine.
Operation
A linear bounded automaton is a Turing machine that satisfies the following three conditions:
* ...
. That is a non-deterministic Turing machine with a tape of only
cells, where
is the size of the input and
is a constant associated with the machine. This means that every formal language that can be decided by such a machine is a context-sensitive language, and every context-sensitive language can be decided by such a machine.
This set of languages is also known as NLINSPACE or NSPACE(''O''(''n'')), because they can be accepted using linear space on a non-deterministic Turing machine. The class LINSPACE (or DSPACE(''O''(''n''))) is defined the same, except using a
deterministic
Determinism is the metaphysical view that all events within the universe (or multiverse) can occur only in one possible way. Deterministic theories throughout the history of philosophy have developed from diverse and sometimes overlapping mo ...
Turing machine. Clearly LINSPACE is a subset of NLINSPACE, but it is not known whether LINSPACE = NLINSPACE.
Examples
One of the simplest context-sensitive but not context-free languages is
: the language of all strings consisting of occurrences of the symbol "a", then "b"s, then "c"s (abc, , , etc.). A superset of this language, called the Bach language, is defined as the set of all strings where "a", "b" and "c" (or any other set of three symbols) occurs equally often (, , etc.) and is also context-sensitive.
can be shown to be a context-sensitive language by constructing a linear bounded automaton which accepts . The language can easily be shown to be neither
regular nor
context-free by applying the respective
pumping lemmas for each of the language classes to .
Similarly:
is another context-sensitive language; the corresponding context-sensitive grammar can be easily projected starting with two context-free grammars generating sentential forms in the formats
and
and then supplementing them with a permutation production like
, a new starting symbol and standard syntactic sugar.
is another context-sensitive language (the "3" in the name of this language is intended to mean a ternary alphabet); that is, the "product" operation defines a context-sensitive language (but the "sum" defines only a context-free language as the grammar
and
shows). Because of the commutative property of the product, the most intuitive grammar for
is ambiguous. This problem can be avoided considering a somehow more restrictive definition of the language, e.g.
. This can be specialized to
and, from this, to
,
, etc.
is a context-sensitive language. The corresponding context-sensitive grammar can be obtained as a generalization of the context-sensitive grammars for
,
, etc.
is a context-sensitive language.
is a context-sensitive language (the "2" in the name of this language is intended to mean a binary alphabet). This was proved by Hartmanis using pumping lemmas for regular and context-free languages over a binary alphabet and, after that, sketching a linear bounded multitape automaton accepting
.
is a context-sensitive language (the "1" in the name of this language is intended to mean a unary alphabet). This was credited by A. Salomaa to Matti Soittola by means of a linear bounded automaton over a unary alphabet (pages 213–214, exercise 6.8) and also to Marti Penttonen by means of a context-sensitive grammar also over a unary alphabet (See: Formal Languages by A. Salomaa, page 14, Example 2.5).
An example of
recursive language
In mathematics, logic and computer science, a recursive (or ''decidable'') language is a recursive subset of the Kleene closure of an alphabet. Equivalently, a formal language is recursive if there exists a Turing machine that decides the forma ...
that is not context-sensitive is any recursive language whose decision is an
EXPSPACE-hard problem, say, the set of pairs of equivalent
regular expression
A regular expression (shortened as regex or regexp), sometimes referred to as rational expression, is a sequence of characters that specifies a match pattern in text. Usually such patterns are used by string-searching algorithms for "find" ...
s with exponentiation.
Properties of context-sensitive languages
* The
union,
intersection
In mathematics, the intersection of two or more objects is another object consisting of everything that is contained in all of the objects simultaneously. For example, in Euclidean geometry, when two lines in a plane are not parallel, their ...
, concatenation of two context-sensitive languages is context-sensitive, also the
Kleene plus of a context-sensitive language is context-sensitive.
* The complement of a context-sensitive language is itself context-sensitive
a result known as the
Immerman–Szelepcsényi theorem.
* Membership of a string in a language defined by an arbitrary context-sensitive grammar, or by an arbitrary deterministic context-sensitive grammar, is a
PSPACE-complete
In computational complexity theory, a decision problem is PSPACE-complete if it can be solved using an amount of memory that is polynomial in the input length (PSPACE, polynomial space) and if every other problem that can be solved in polynomial sp ...
problem.
See also
*
Linear bounded automaton
In computer science, a linear bounded automaton (plural linear bounded automata, abbreviated LBA) is a restricted form of Turing machine.
Operation
A linear bounded automaton is a Turing machine that satisfies the following three conditions:
* ...
*
List of parser generators for context-sensitive languages
*
Chomsky hierarchy
The Chomsky hierarchy in the fields of formal language theory, computer science, and linguistics, is a containment hierarchy of classes of formal grammars. A formal grammar describes how to form strings from a formal language's alphabet that are v ...
*
Indexed languages – a strict subset of the context-sensitive languages
*
Weir hierarchy
References
* Sipser, M. (1996), ''Introduction to the Theory of Computation'', PWS Publishing Co.
{{DEFAULTSORT:Context-Sensitive Language
Formal languages