Sketch Engine
Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing since 2003. Its purpose is to enable people studying language behaviour (lexicographers, researchers in corpus linguistics, translators or language learners) to search large text collections according to complex and linguistically motivated queries. Sketch Engine gained its name after one of the key features, word sketches: one-page, automatic, corpus-derived summaries of a word's grammatical and collocational behaviour. Currently, it supports and provides corpora in over 90 languages. History of development Sketch Engine is a product of Lexical Computing, a company founded in 2003 by the lexicographer and research scientist Adam Kilgarriff. He started a collaboration with Pavel Rychlý, a computer scientist working at the Natural Language Processing Centre, Masaryk University, and the developer of Manatee and Bonito (two major parts of the software suite). Kilgarriff also introduced th ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Adam Kilgarriff
Adam Kilgarriff (12 February 1960 – 16 May 2015) was a corpus linguist, lexicographer, and co-author of Sketch Engine. Life His parents were booksellers. He spent one year as a volunteer in Kenya 1978–1979 then began studying at Cambridge University, graduating with a first class BA degree in philosophy and engineering in 1982. His first job was as a Housing Officer for the London and Quadrant Housing Trust. At the same time he studied at the South West London College. In 1987, he left his job and started an MSc in intelligent knowledge-based systems at the University of Sussex, from where he graduated the following year, continuing a DPhil in computational linguistics with thesis Polysemy (1992). In 2008 he made a return trip to Kenya with his old friend Raphael. He was also a participant in the Hastings Half Marathon for many years. In November 2014, he was diagnosed with stage 4 bowel cancer which he succumbed to in May 2015. After the diagnosis he started his own ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Corpus Manager
A corpus manager (corpus browser or corpus query system) is a tool for multilingual corpus analysis, which allows effective searching in corpora. A corpus manager usually represents a complex tool that allows one to perform searches for language forms or sequences. It may provide information about the context or allow the user to search by positional attributes, such as lemma, tag, etc. These are called concordances. Other features include the ability to search for collocations, frequency statistics as well as metadata information about the processed text. The narrower meaning of corpus manager refers only to the server side or the corpus query engine, whereas the client side is simply called the user interface. A corpus manager can be software installed on a personal computer or it might be provided as a web service. List of corpus managers * BNCweb – a web-based interface for the British National Corpus * CQPweb - a web-based interface for the study of a large variety o ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Word
A word is a basic element of language that carries semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguistics, linguists on its definition and numerous attempts to find specific criteria of the concept remain controversial. Different standards have been proposed, depending on the theoretical background and descriptive context; these do not converge on a single definition. Some specific definitions of the term "word" are employed to convey its different meanings at different levels of description, for example based on phonology, phonological, grammar, grammatical or orthography, orthographic basis. Others suggest that the concept is simply a convention used in everyday situations. The concept of "word" is distinguished from that of a morpheme, which is the smallest unit of language that has a meaning, even if it cannot stand on its own. Words a ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Concordance (publishing)
A concordance is an alphabetical list of the principal words used in a book or body of work, listing every instance of each word with its immediate context (language use)#Verbal context, context. Historically, concordances have been compiled only for works of special importance, such as the Vedas, Bible, Qur'an or the works of William Shakespeare, Shakespeare, James Joyce or classical Latin and Greek authors, because of the time, difficulty, and expense involved in creating a concordance in the pre-computer era. A concordance is more than an Subject indexing, index, with additional material such as commentary, definitions and topical cross-indexing which makes producing one a labor-intensive process even when assisted by computers. In the precomputing era, search engine technology, search technology was unavailable, and a concordance offered readers of long works such as the Bible something comparable to search results for every word that they would have been likely to search fo ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Thesaurus
A thesaurus (: thesauri or thesauruses), sometimes called a synonym dictionary or dictionary of synonyms, is a reference work which arranges words by their meanings (or in simpler terms, a book where one can find different words with similar meanings to other words), sometimes as a hierarchy of broader and narrower terms, sometimes simply as lists of synonyms and antonyms. They are often used by writers to help find the best word to express an idea: Synonym dictionaries have a long history. The word 'thesaurus' was used in 1852 by Peter Mark Roget for his ''Roget's Thesaurus''. While some works called "thesauri", such as ''Roget's Thesaurus'', group words in a hierarchical hypernymic taxonomy of concepts, others are organised alphabetically or in some other way. Most thesauri do not include definitions, but many dictionaries include listings of synonyms. Some thesauri and dictionary synonym notes characterise the distinctions between similar words, with notes on their " ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Word Sketch
A word sketch is a one-page, automatic, corpus-derived summary of a word’s grammatical and collocational behaviour. Word sketches were first introduced by the British corpus linguist Adam KilgarriffKilgarriff, Adam; Rychlý, Pavel; Smrž, Pavel; Tugwell, David (2004) The Sketch Engine. Information Technology, 2004 and exploited within the Sketch Engine corpus management system. They are an extension of the general collocation concept used in corpus linguistics in that they group collocations according to particular grammatical relations (e.g. subject, object, modifier etc.). The collocation candidates in a word sketch are sorted either by their frequency or using a lexicographic association score like Dice, T-score or MI-score. Since the introduction, word sketches have been used by lexicographers to develop modern corpus-based dictionaries by major publishing houses including Oxford English Dictionary, Macmillan English Dictionary and comprising dozens of languages including En ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Masaryk University
Masaryk University (MU) (; ) is the second largest university in the Czech Republic, a member of the Compostela Group and the Utrecht Network. Founded in 1919 in Brno, it now consists of ten faculties and 35,115 students. It is named after Tomáš Garrigue Masaryk, the first president of an independent Czechoslovakia as well as the leader of the movement for a second Czech university. In 1960 the university was renamed ''Jan Evangelista Purkyně University'' after Jan Evangelista Purkyně, a Czech biologist. In 1990, following the Velvet Revolution it regained its original name. Since 1922, over 171,000 students have graduated from the university. The Prime Minister of the Czech Republic Petr Fiala has been working part-time at Masaryk University for the entire duration of his mandate. He should therefore devote eight hours a week – thirty-two hours a month – to work for this university. Jan Fischer responded to this as follows: "I also cannot imagine that someone, in ad ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Word Sketch
A word sketch is a one-page, automatic, corpus-derived summary of a word’s grammatical and collocational behaviour. Word sketches were first introduced by the British corpus linguist Adam KilgarriffKilgarriff, Adam; Rychlý, Pavel; Smrž, Pavel; Tugwell, David (2004) The Sketch Engine. Information Technology, 2004 and exploited within the Sketch Engine corpus management system. They are an extension of the general collocation concept used in corpus linguistics in that they group collocations according to particular grammatical relations (e.g. subject, object, modifier etc.). The collocation candidates in a word sketch are sorted either by their frequency or using a lexicographic association score like Dice, T-score or MI-score. Since the introduction, word sketches have been used by lexicographers to develop modern corpus-based dictionaries by major publishing houses including Oxford English Dictionary, Macmillan English Dictionary and comprising dozens of languages including En ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Language Acquisition
Language acquisition is the process by which humans acquire the capacity to perceive and comprehend language. In other words, it is how human beings gain the ability to be aware of language, to understand it, and to produce and use words and sentence (linguistics), sentences to communicate. Language acquisition involves structures, rules, and representation. The capacity to successfully use language requires human beings to acquire a range of tools, including phonology, morphology (linguistics), morphology, syntax, semantics, and an extensive vocabulary. Language can be vocalized as in speech, or manual as in sign language, sign. Human language capacity is language processing in the brain, represented in the brain. Even though human language capacity is finite, one can say and understand an infinite number of sentences, which is based on a syntactic principle called recursion. Evidence suggests that every individual has three recursive Mechanisms of mindfulness meditation, mech ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''translating'' (a written text) and ''interpreting'' (oral or Sign language, signed communication between users of different languages); under this distinction, translation can begin only after the appearance of writing within a language community. A translator always risks inadvertently introducing source-language words, grammar, or syntax into the target-language rendering. On the other hand, such "spill-overs" have sometimes imported useful source-language calques and loanwords that have enriched target languages. Translators, including early translators of sacred texts, have helped shape the very languages into which they have translated. Becau ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Corpus Linguistics
Corpus linguistics is an empirical method for the study of language by way of a text corpus (plural ''corpora''). Corpora are balanced, often stratified collections of authentic, "real world", text of speech or writing that aim to represent a given linguistic variety. Today, corpora are generally machine-readable data collections. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. Large collections of text, though corpora may also be small in terms of running words, allow linguists to run quantitative analyses on linguistic concepts that may be difficult to test in a qualitative manner. The text-corpus method uses the body of texts in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other language ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |
|
Lexicographers
This list contains people who contributed to the field of lexicography, the theory and practice of compiling dictionaries. __NOTOC__ A * Maulvi Abdul Haq (India/Pakistan, 1872–1961) Baba-e-Urdu, English-Urdu dictionary *Ivar Aasen (Norway, 1813–1896) Norwegian language * Abu Amr Ishaq ibn Mirar al-Shaybani (Iraq, c. 738–828) Arabic * Ilia Abuladze (Georgia, 1901–1968) Old Georgian * Johann Christoph Adelung (Germany, 1732–1806) German language general dictionary * George J. Adler (Germany/US, 1821–1868) German/English * Robert Ainsworth (UK, 1660–1743) Latin * Adam Jack Aitken (UK, 1921–1998) Scots language * John Michael Allaby (UK, born 1933) English LSP * Anthony Allen (UK, late 17th century – 1754) obsolete English words * Robert Allen (UK, born 1944) English language general * Amerias (Greece, 3rd century BC) Ancient Macedonian * Ethan Allen Andrews (US, 1787–1858) Latin * Vladimir Anić (Croatia, 1930–2000) Croatian general * Vaman Shivram Apte (Ind ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   [Amazon] |