Mildly Context-sensitive Language
   HOME

TheInfoList



OR:

In
computational linguistics Computational linguistics is an Interdisciplinarity, interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, comput ...
, the term mildly context-sensitive grammar formalisms refers to several
grammar formalism In formal language theory, a grammar (when the context is not given, often called a formal grammar for clarity) describes how to form strings from a language's alphabet that are valid according to the language's syntax. A grammar does not describe ...
s that have been developed in an effort to provide adequate descriptions of the syntactic structure of
natural language In neuropsychology, linguistics, and philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation. Natural languages ...
. Every mildly context-sensitive grammar formalism defines a class of mildly context-sensitive grammars (the grammars that can be specified in the formalism), and therefore also a class of mildly context-sensitive languages (the
formal language In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules. The alphabet of a formal language consists of symb ...
s generated by the grammars).


Background

By 1985, several researchers in descriptive and mathematical linguistics had provided evidence against the hypothesis that the syntactic structure of natural language can be adequately described by
context-free grammar In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules are of the form :A\ \to\ \alpha with A a ''single'' nonterminal symbol, and \alpha a string of terminals and/or nonterminals (\alpha can be empt ...
s.Riny Huybregts. "The Weak Inadequacy of Context-Free Phrase Structure Grammars". In Ger de Haan, Mieke Trommelen, and Wim Zonneveld, editors, ''Van periferie naar kern'', pages 81–99. Foris, Dordrecht, The Netherlands, 1984.Stuart M. Shieber.
Evidence Against the Context-Freeness of Natural Language
. ''Linguistics and Philosophy'', 8(3):333–343, 1985.
At the same time, the step to the next level of the
Chomsky hierarchy In formal language theory, computer science and linguistics, the Chomsky hierarchy (also referred to as the Chomsky–Schützenberger hierarchy) is a containment hierarchy of classes of formal grammars. This hierarchy of grammars was described by ...
, to context-sensitive grammars, appeared both unnecessary and undesirable. In an attempt to pinpoint the exact formal power required for the adequate description of natural language syntax,
Aravind Joshi Aravind Krishna Joshi (August 5, 1929 – December 31, 2017) was the Henry Salvatori Professor of Computer and Cognitive Science in the computer science department of the University of Pennsylvania. Joshi defined the tree-adjoining grammar form ...
characterized "grammars (and associated
language Language is a structured system of communication. The structure of a language is its grammar and the free components are its vocabulary. Languages are the primary means by which humans communicate, and may be conveyed through a variety of met ...
s) that are only slightly more powerful than context-free grammars (context-free languages)".Aravind K. Joshi.
Tree Adjoining Grammars: How Much Context-Sensitivity Is Required to Provide Reasonable Structural Descriptions?
. In David R. Dowty, Lauri Karttunen, and Arnold M. Zwicky, editors, ''Natural Language Parsing'', pages 206–250. Cambridge University Press, 1985.
He called these grammars ''mildly context-sensitive grammars'' and the associated languages ''mildly context-sensitive languages''. Joshi’s characterization of mildly context-sensitive grammars was biased toward his work on
tree-adjoining grammar Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi. Tree-adjoining grammars are somewhat similar to context-free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context-free gra ...
(TAG). However, together with his students Vijay Shanker and David Weir, Joshi soon discovered that TAGs are equivalent, in terms of the generated string languages, to the independently introduced
head grammar Head grammar (HG) is a grammar formalism introduced in Carl Pollard (1984) Pollard, C. 1984. ''Generalized Phrase Structure Grammars, Head Grammars, and Natural Language''. Ph.D. thesis, Stanford University, CA. as an extension of the context-f ...
(HG).David J. Weir, K. Vijay-Shanker, and Aravind K. Joshi.
The Relationship Between Tree Adjoining Grammars and Head Grammars
. In ''Proceedings of the 24th Annual Meeting of the Association for Computational Linguistics (ACL)'', pages 67–74, New York, USA, 1986.
This was followed by two similar equivalence results, for
linear indexed grammar Indexed grammars are a generalization of context-free grammars in that nonterminals are equipped with lists of ''flags'', or ''index symbols''. The language produced by an indexed grammar is called an indexed language. Definition Modern definitio ...
(LIG)K. Vijay-Shanker.
A Study of Tree Adjoining Grammars
. Ph.D. thesis, University of Pennsylvania, Philadelphia, USA, 1987.
and
combinatory categorial grammar Combinatory categorial grammar (CCG) is an efficiently parsable, yet linguistically expressive grammar formalism. It has a transparent interface between surface syntax and underlying semantic representation, including predicate–argument structur ...
(CCG),David J. Weir and Aravind K. Joshi.
Combinatory Categorial Grammars: Generative Power and Relationship to Linear Context-Free Rewriting Systems
. In ''Proceedings of the 26th Annual Meeting of the Association for Computational Linguistics (ACL)'', pages 278–285, Buffalo, USA, 1988.
which showed that the notion of mild context-sensitivity is a very general one and not tied to a specific formalism. The TAG-equivalent formalisms were generalized by the introduction of
linear context-free rewriting system Generalized context-free grammar (GCFG) is a grammar formalism that expands on context-free grammars by adding potentially non-context-free composition functions to rewrite rules. Head grammar (and its weak equivalents) is an instance of such a GC ...
s (LCFRS).K. Vijay-Shanker, David J. Weir, and Aravind K. Joshi.
Characterizing Structural Descriptions Produced by Various Grammatical Formalisms
. In ''Proceedings of the 25th Annual Meeting of the Association for Computational Linguistics (ACL)'', pages 104–111, Stanford, CA, USA, 1987.
David J. Weir.
Characterizing Mildly Context-Sensitive Grammar Formalisms
. Ph.D. thesis, University of Pennsylvania, Philadelphia, USA, 1988.
These grammars define an infinite hierarchy of string languages in between the context-free and the context-sensitive languages, with the languages generated by the TAG-equivalent formalisms at the lower end of the hierarchy. Independently of and almost simultaneously to LCFRS, Hiroyuki Seki et al. proposed the essentially identical formalism of multiple context-free grammar (MCFG).Hiroyuki Seki, Takashi Matsumura, Mamoru Fujii, and Tadao Kasami.
On Multiple Context-Free Grammars
. ''Theoretical Computer Science'', 88(2):191–229, 1991.
LCFRS/MCFG is sometimes regarded as the most general formalism for specifying mildly context-sensitive grammars. However, several authors have noted that some of the characteristic properties of the TAG-equivalent formalisms are not preserved by LCFRS/MCFG,Makoto Kanazawa.
The Pumping Lemma for Well-Nested Multiple Context-Free Languages
. In ''Developments in Language Theory. 13th International Conference, DLT 2009, Stuttgart, Germany, June 30–July 3, 2009. Proceedings'', volume 5583 of ''Lecture Notes in Computer Science'', pages 312–325, 2009.
and that there are languages that have the characteristic properties of mildly context-sensitivity but are not generated by LCFRS/MCFG.Laura Kallmeyer.
On Mildly Context-Sensitive Non-Linear Rewriting
. ''Research on Language and Computation'', 8(4):341–363, 2010.
Recent years have seen increased interest in the restricted class of ''well-nested'' linear context-free rewriting systems/multiple context-free grammars,Carlos Gómez-Rodríguez, Marco Kuhlmann, and Giorgio Satta.
Efficient Parsing of Well-Nested Linear Context-Free Rewriting Systems
. In ''Proceedings of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)'', pages 276–284, Los Angeles, USA, 2010.
which define a class of grammars that properly includes the TAG-equivalent formalisms but is properly included in the unrestricted LCFRS/MCFG hierarchy.


Characterization

Despite a considerable amount of work on the subject, there is no generally accepted formal definition of mild context-sensitivity. According to the original characterization by Joshi, a class of mildly context-sensitive grammars should have the following properties: # limited
cross-serial dependencies In linguistics, cross-serial dependencies (also called crossing dependencies by some authors.) occur when the lines representing the dependency relations between two series of words cross over each other.. They are of particular interest to ling ...
# constant growth # polynomial parsing In addition to these, it is understood that every class of mildly context-sensitive grammars should be able to generate all context-free languages. Joshi’s characterization is not a formal definition. He notes: Other authors have proposed alternative characterizations of mild context-sensitivity, some of which take the form of formal definitions. For example, Laura KallmeyerLaura Kallmeyer. ''Parsing Beyond Context-Free Grammars''. Springer, 2010. takes the perspective that mild context-sensitivity should be defined as a property of classes of languages rather than, as in Joshi’s characterization, classes of grammars. Such a language-based definition leads to a different notion of the concept than Joshi’s.


Cross-serial dependencies

The term ''
cross-serial dependencies In linguistics, cross-serial dependencies (also called crossing dependencies by some authors.) occur when the lines representing the dependency relations between two series of words cross over each other.. They are of particular interest to ling ...
'' refers to certain characteristic word ordering patterns, in particular to the verb–argument patterns observed in subordinate clauses in Dutch and Swiss German. These are the very patterns that can be used to argue against the context-freeness of natural language; thus requiring mildly context-sensitive grammars to model cross-serial dependencies means that these grammars must be more powerful than context-free grammars. Kallmeyer identifies the ability to model cross-serial dependencies with the ability to generate the ''copy language'' \mathit = \ and its generalizations to two or more copies of ''w'', up to some bound. These languages are not context-free, which can be shown using the
pumping lemma In the theory of formal language In logic, mathematics, computer science, and linguistics, a formal language consists of words whose letters are taken from an alphabet and are well-formed according to a specific set of rules. The alphabet of ...
.


Constant growth

A formal language is of ''constant growth'' if every string in the language is longer than the next shorter strings by at most a (language-specific) constant. Languages that violate this property are often considered to be beyond human capacity, although some authors have argued that certain phenomena in natural language do show a growth that cannot be bounded by a constant.Jens Michaelis and Marcus Kracht.
Semilinearity as a Syntactic Invariant
. In ''Logical Aspects of Computational Linguistics. First International Conference, LACL 1996, Nancy, France, September 23–25, 1996. Selected Papers'', volume 1328 of ''Lecture Notes in Computer Science'', pages 329–345. Springer, 1997.
Most mildly context-sensitive grammar formalisms (in particular, LCFRS/MCFG) actually satisfy a stronger property than constant growth called ''semilinearity''. A language is semilinear if its image under the Parikh-mapping (the mapping that "forgets" the relative position of the symbols in a string, effectively treating it as a bag of words) is a
regular language In theoretical computer science and formal language theory, a regular language (also called a rational language) is a formal language that can be defined by a regular expression, in the strict sense in theoretical computer science (as opposed to ...
. All semilinear languages are of constant growth, but not every language with constant growth is semilinear.


Polynomial parsing

A grammar formalism is said to have ''polynomial parsing'' if its membership problem can be solved in deterministic polynomial time. This is the problem to decide, given a grammar ''G'' written in the formalism and a string ''w'', whether ''w'' is generated by ''G'' – that is, whether ''w'' is "grammatical" according to ''G''. The time complexity of this problem is measured in terms of the combined size of ''G'' and ''w''. Under the view on mild context-sensitivity as a property of classes of languages, ''polynomial parsing'' refers to the language membership problem. This is the problem to decide, for a fixed language ''L'', whether a given string ''w'' belongs to ''L''. The time complexity of this problem is measured in terms of the length of ''w''; it ignores the question how ''L'' is specified. Note that both understandings of ''polynomial parsing'' are idealizations in the sense that for practical applications one is interested not only in the yes/no question whether a sentence is grammatical, but also in the syntactic structure that the grammar assigns to the sentence.


Formalisms

Over the years, a large number of grammar formalisms have been introduced that satisfy some or all of the characteristic properties put forth by Joshi. Several of them have alternative, automaton-based characterizations that are not discussed in this article; for example, the languages generated by tree-adjoining grammar can be characterized by
embedded pushdown automata An embedded pushdown automaton or EPDA is a computational model for parsing languages generated by tree-adjoining grammars (TAGs). It is similar to the context-free grammar-parsing pushdown automaton, but instead of using a plain stack to store sy ...
.


Formalisms equivalent to TAG

*
Tree-adjoining grammar Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi. Tree-adjoining grammars are somewhat similar to context-free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context-free gra ...
(TAG) *
Head grammar Head grammar (HG) is a grammar formalism introduced in Carl Pollard (1984) Pollard, C. 1984. ''Generalized Phrase Structure Grammars, Head Grammars, and Natural Language''. Ph.D. thesis, Stanford University, CA. as an extension of the context-f ...
(HG) Carl J. Pollard. "Generalized Phrase Structure Grammars, Head Grammars, and Natural Language". Ph.D. thesis, Stanford University, 1984.Kelly Roach.
Formal Properties of Head Grammars
. In Alexis Manaster-Ramer, editor, ''Mathematics of Language'', pages 293–347. John Benjamins, 1987.
*
Linear indexed grammar Indexed grammars are a generalization of context-free grammars in that nonterminals are equipped with lists of ''flags'', or ''index symbols''. The language produced by an indexed grammar is called an indexed language. Definition Modern definitio ...
(LIG)Gerald Gazdar.
Applicability of Indexed Grammars to Natural Language
. In Uwe Reyle and Christian Rohrer, editors, ''Natural Language Parsing and Linguistic Theories'', pages 69–94. D. Reidel, 1987.
*
Combinatory categorial grammar Combinatory categorial grammar (CCG) is an efficiently parsable, yet linguistically expressive grammar formalism. It has a transparent interface between surface syntax and underlying semantic representation, including predicate–argument structur ...
(CCG) * Well-nested LCFRS/MCFG of fan-out 2


Formalisms equivalent to general LCFRS/MCFG

*
Linear context-free rewriting system Generalized context-free grammar (GCFG) is a grammar formalism that expands on context-free grammars by adding potentially non-context-free composition functions to rewrite rules. Head grammar (and its weak equivalents) is an instance of such a GC ...
s (LCFRS) * Multiple context-free grammars (MCFG) * Multicomponent tree-adjoining grammars (MCTAG) * Minimalist grammars (MG)Jens Michaelis.
Derivational Minimalism Is Mildly Context-Sensitive
. In ''Logical Aspects of Computational Linguistics, Third International Conference, LACL 1998, Grenoble, France, December 14–16, 1998, Selected Papers'', volume 2014 of ''Lecture Notes in Computer Science'', pages 179–198. Springer, 1998.
* Simple (linear, non-erasing, non-combinatorial), positive
range concatenation grammars Range concatenation grammar (RCG) is a grammar formalism developed by Pierre Boullier in 1998 as an attempt to characterize a number of phenomena of natural language, such as Chinese numbers and German word order scrambling, which are outside the b ...
(sRCG)Pierre Boullier.
Range Concatenation Grammars
. In Harry C. Bunt, John Carroll, and Giorgio Satta, editors, ''New Developments in Parsing Technology'', volume 23 of ''Text, Speech and Language Technology'', pages 269–289. Kluwer Academic Publishers, 2004.


Formalisms equivalent to well-nested LCFRS/MCFG

* Non-duplicating macro grammarsMichael J. Fischer.
Grammars with Macro-Like Productions
. In ''Ninth Annual Symposium on Switching and Automata Theory'', pages 131–142, Schenectady, NY, USA, 1968.
* Coupled context-free grammars (CCFG)Günter Hotz and Gisela Pitsch. "On Parsing Coupled-Context-Free Languages". ''Theoretical Computer Science'', 161(1–2):205–233, 1996. * Well-nested linear context-free rewriting systems * Well-nested multiple context-free grammars


Relations among the formalisms

Linear context-free rewriting systems/multiple context-free grammars form a two-dimensional hierarchy of generative power with respect to two grammar-specific parameters called ''fan-out'' and ''rank''.Owen Rambow and Giorgio Satta.
A Two-Dimensional Hierarchy for Parallel Rewriting Systems
. Technical Report IRCS-94-02, University of Pennsylvania, Philadelphia, USA, 1994.
More precisely, the languages generated by LCFRS/MCFG with fan-out  and rank  are properly included in the class of languages generated by LCFRS/MCFG with rank  and fan-out ''f'', as well as the class of languages generated by LCFRS/MCFG with rank ''r'' and fan-out . In the presence of well-nestedness, this hierarchy collapses to a one-dimensional hierarchy with respect to fan-out; this is because every well-nested LCFRS/MCFG can be transformed into an equivalent well-nested LCFRS/MCFG with the same fan-out and rank 2. Within the LCFRS/MCFG hierarchy, the context-free languages can be characterized by the grammars with fan-out 1; for this fan-out there is no difference between general and well-nested grammars. The TAG-equivalent formalisms can be characterized as well-nested LCFRS/MCFG of fan-out 2.


See also

*
Tree-adjoining grammar Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi. Tree-adjoining grammars are somewhat similar to context-free grammars, but the elementary unit of rewriting is the tree rather than the symbol. Whereas context-free gra ...
*
Linear context-free rewriting system Generalized context-free grammar (GCFG) is a grammar formalism that expands on context-free grammars by adding potentially non-context-free composition functions to rewrite rules. Head grammar (and its weak equivalents) is an instance of such a GC ...
*
Range concatenation grammar Range concatenation grammar (RCG) is a grammar formalism developed by Pierre Boullier in 1998 as an attempt to characterize a number of phenomena of natural language, such as Chinese numbers and German word order scrambling, which are outside the b ...
*
Weir hierarchy An embedded pushdown automaton or EPDA is a computational model for parsing languages generated by tree-adjoining grammars (TAGs). It is similar to the context-free grammar-parsing pushdown automaton, but instead of using a plain stack to store sy ...


References

{{Formal languages and grammars Formal languages