HOME

TheInfoList



OR:

Combinatory categorial grammar (CCG) is an efficiently
parsable Parsing, syntax analysis, or syntactic analysis is the process of analyzing a string of symbols, either in natural language, computer languages or data structures, conforming to the rules of a formal grammar. The term ''parsing'' comes from L ...
, yet linguistically expressive grammar formalism. It has a transparent interface between surface syntax and underlying semantic representation, including predicate–argument structure, quantification and information structure. The formalism generates constituency-based structures (as opposed to dependency-based ones) and is therefore a type of
phrase structure grammar The term phrase structure grammar was originally introduced by Noam Chomsky as the term for grammar studied previously by Emil Post and Axel Thue (Post canonical systems). Some authors, however, reserve the term for more restricted grammars in the ...
(as opposed to a
dependency grammar Dependency grammar (DG) is a class of modern grammatical theories that are all based on the dependency relation (as opposed to the ''constituency relation'' of phrase structure) and that can be traced back primarily to the work of Lucien TesniĂ ...
). CCG relies on
combinatory logic Combinatory logic is a notation to eliminate the need for quantified variables in mathematical logic. It was introduced by Moses Schönfinkel and Haskell Curry, and has more recently been used in computer science as a theoretical model of comput ...
, which has the same expressive power as the
lambda calculus Lambda calculus (also written as ''λ''-calculus) is a formal system in mathematical logic for expressing computation based on function abstraction and application using variable binding and substitution. It is a universal model of computation ...
, but builds its expressions differently. The first linguistic and psycholinguistic arguments for basing the grammar on combinators were put forth by Steedman and Szabolcsi. More recent prominent proponents of the approach are Pauline Jacobson an
Jason Baldridge
In these new approaches, the
combinator Combinatory logic is a notation to eliminate the need for quantified variables in mathematical logic. It was introduced by Moses Schönfinkel and Haskell Curry, and has more recently been used in computer science as a theoretical model of comput ...
B (the compositor) is useful in creating long-distance dependencies, as in "Who do you think Mary is talking about?" and the combinator W (the duplicator) is useful as the lexical interpretation of reflexive pronouns, as in "Mary talks about herself". Together with I (the identity mapping) and C (the permutator) these form a set of primitive, non-interdefinable combinators. Jacobson interprets personal pronouns as the combinator I, and their binding is aided by a complex combinator Z, as in "Mary lost her way". Z is definable using W and B.


Parts of the formalism

The CCG formalism defines a number of combinators (application, composition, and type-raising being the most common). These operate on syntactically-typed lexical items, by means of
Natural deduction In logic and proof theory, natural deduction is a kind of proof calculus in which logical reasoning is expressed by inference rules closely related to the "natural" way of reasoning. This contrasts with Hilbert-style systems, which instead use axiom ...
style proofs. The goal of the proof is to find some way of applying the combinators to a sequence of lexical items until no lexical item is unused in the proof. The resulting type after the proof is complete is the type of the whole expression. Thus, proving that some sequence of words is a sentence of some language amounts to proving that the words reduce to the type ''S''.


Syntactic types

The syntactic type of a lexical item can be either a primitive type, such as ''S'', ''N'', or ''NP'', or complex, such as ''S\NP'', or ''NP/N''. The complex types, schematizable as ''X/Y'' and ''X\Y'', denote functor types that take an argument of type ''Y'' and return an object of type ''X''. A forward slash denotes that the argument should appear to the right, while a backslash denotes that the argument should appear on the left. Any type can stand in for the ''X'' and ''Y'' here, making syntactic types in CCG a recursive type system.


Application combinators

The application combinators, often denoted by ''>'' for forward application and ''<'' for backward application, apply a lexical item with a functor type to an argument with an appropriate type. The definition of application is given as: : \dfrac> : \dfrac<


Composition combinators

The composition combinators, often denoted by B_> for forward composition and B_< for backward composition, are similar to function composition from mathematics, and can be defined as follows: : \dfracB_> : \dfracB_<


Type-raising combinators

The type-raising combinators, often denoted as T_> for forward type-raising and T_< for backward type-raising, take argument types (usually primitive types) to functor types, which take as their argument the functors that, before type-raising, would have taken them as arguments. : \dfracT_> : \dfracT_<


Example

The sentence "the dog bit John" has a number of different possible proofs. Below are a few of them. The variety of proofs demonstrates the fact that in CCG, sentences don't have a single structure, as in other models of grammar. Let the types of these lexical items be : \text : NP/N \qquad \text : N \qquad \text : NP \qquad \text : (S\backslash NP)/NP We can perform the simplest proof (changing notation slightly for brevity) as: : \dfrac< Opting to type-raise and compose some, we could get a fully incremental, left-to-right proof. The ability to construct such a proof is an argument for the psycholinguistic plausibility of CCG, because listeners do in fact construct partial interpretations (syntactic and semantic) of utterances before they have been completed. : \dfrac>


Formal properties

CCGs are known to be able to generate the language (which is a non-context-free
indexed language Indexed languages are a class of formal languages discovered by Alfred Aho; they are described by indexed grammars and can be recognized by nested stack automata. Indexed languages are a proper subset of context-sensitive languages. They qualify as ...
). A grammar for this language can be found in Vijay-Shanker and Weir (1994). Vijay-Shanker and Weir (1994)Vijay-Shanker, K. and Weir, David J. 1994.
The Equivalence of Four Extensions of Context-Free Grammars
'. Mathematical Systems Theory 27(6): 511–546.
demonstrates that
Linear Indexed Grammars Linearity is the property of a mathematical relationship (''function'') that can be graphically represented as a straight line. Linearity is closely related to '' proportionality''. Examples in physics include rectilinear motion, the linear r ...
, Combinatory Categorial Grammars, Tree-adjoining Grammars, and Head Grammars are weakly equivalent formalisms, in that they all define the same string languages. Kuhlmann et al. (2015)Kuhlmann, M., Koller, A., and Satta, G. 2015.
Lexicalization and Generative Power in CCG
'. Computational Linguistics 41(2): 215-247.
show that this equivalence, and the ability of CCG to describe , rely crucially on the ability to restrict the use of the combinatory rules to certain categories, in ways not explained above.


See also

*
Categorial grammar Categorial grammar is a family of formalisms in natural language syntax that share the central assumption that syntactic constituents combine as functions and arguments. Categorial grammar posits a close relationship between the syntax and seman ...
*
Combinatory logic Combinatory logic is a notation to eliminate the need for quantified variables in mathematical logic. It was introduced by Moses Schönfinkel and Haskell Curry, and has more recently been used in computer science as a theoretical model of comput ...
*
Embedded pushdown automaton An embedded pushdown automaton or EPDA is a computational model for parsing languages generated by tree-adjoining grammars (TAGs). It is similar to the context-free grammar-parsing pushdown automaton, but instead of using a plain stack to store sy ...
*
Link grammar Link grammar (LG) is a theory of syntax by Davy Temperley and Daniel Sleator which builds relations between pairs of words, rather than constructing constituents in a phrase structure hierarchy. Link grammar is similar to dependency grammar, but d ...
*
Type shifter Type may refer to: Science and technology Computing * Typing, producing text via a keyboard, typewriter, etc. * Data type, collection of values used for computations. * File type * TYPE (DOS command), a command to display contents of a file. * ...


References

*Baldridge, Jason (2002),
Lexically Specified Derivational Control in Combinatory Categorial Grammar
" PhD Dissertation. Univ. of Edinburgh. *Curry, Haskell B. and Richard Feys (1958), Combinatory Logic, Vol. 1. North-Holland. *Jacobson, Pauline (1999), â
Towards a variable-free semantics
” Linguistics and Philosophy 22, 1999. 117–184 *Steedman, Mark (1987), â
Combinatory grammars and parasitic gaps
€ť. Natural Language and Linguistic Theory 5, 403–439. *Steedman, Mark (1996), Surface Structure and Interpretation. The MIT Press. *Steedman, Mark (2000), The Syntactic Process. The MIT Press. *Szabolcsi, Anna (1989),
Bound variables in syntax (are there any?)
" Semantics and Contextual Expression, ed. by Bartsch, van Benthem, and van Emde Boas. Foris, 294–318. *Szabolcsi, Anna (1992),
Combinatory grammar and projection from the lexicon
" Lexical Matters. CSLI Lecture Notes 24, ed. by Sag and Szabolcsi. Stanford, CSLI Publications. 241–269. *Szabolcsi, Anna (2003), â
Binding on the fly: Cross-sentential anaphora in variable-free semantics
€ť. Resource Sensitivity in Binding and Anaphora, ed. by Kruijff and Oehrle. Kluwer, 215–229.


Further reading

* Michael Moortgat,
Categorial Type Logics
', Chapter Two in J. van Benthem and A. ter Meulen (eds.) ''Handbook of Logic and Language''. Elsevier, 1997,
homepages.inf.ed.ac.uk


External links


The Combinatory Categorial Grammar Site

The ACL CCG wiki page
(likely to be more up-to-date than this one)
Semantic Parsing with Combinatory Categorial Grammars
– Tutorial describing general principles for building semantic parsers {{Formal semantics Grammar frameworks Combinatory logic Type theory