SkELL
   HOME
*





SkELL
SkELL (abbreviation of ''Sketch Engine for Language Learning'') is a free corpus-based web tool that allows language learners and teachers find authentic sentences for specific target word(s). For any word or a phrase, SkELL displays a concordance that lists example sentences drawn from a special text corpus crawled from the World Wide Web, which has been cleaned of spam and includes only high-quality texts covering everyday, standard, formal, and professional language. There are versions of SkELL for English, Russian, German, Italian, Czech and Estonian. SkELL is based on the commercial Sketch Engine corpus manager and the proprietary GDEX (Good Dictionary Examples) score that it implements. Features SkELL can provide three kinds of results for a query: * Examples: This page displays a concordance created by searching for the specified word or phrase in the reference corpus, taking any derived forms into account. * Word sketch: This page shows the most frequent collo ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Sketch Engine
Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing CZ s.r.o. since 2003. Its purpose is to enable people studying language behaviour ( lexicographers, researchers in corpus linguistics, translators or language learners) to search large text collections according to complex and linguistically motivated queries. Sketch Engine gained its name after one of the key features, word sketches: one-page, automatic, corpus-derived summaries of a word's grammatical and collocational behaviour. Currently, it supports and provides corpora in 90+ languages. History of development Sketch Engine is a product of Lexical Computing Limited, a company founded in 2003 by the lexicographer and research scientist Adam Kilgarriff. He started a collaboration with Pavel Rychlý, a computer scientist working at the Natural Language Processing Centre, Masaryk University, and the developer of Manatee and Bonito (two major parts of the software suite), and introduced ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Go (programming Language)
Go is a statically typed, compiled programming language designed at Google by Robert Griesemer, Rob Pike, and Ken Thompson. It is syntactically similar to C, but with memory safety, garbage collection, structural typing, and CSP-style concurrency. It is often referred to as Golang because of its former domain name, golang.org, but its proper name is Go. There are two major implementations: * Google's self-hosting "gc" compiler toolchain, targeting multiple operating systems and WebAssembly. * gofrontend, a frontend to other compilers, with the ''libgo'' library. With GCC the combination is gccgo; with LLVM the combination is gollvm. A third-party source-to-source compiler, GopherJS, compiles Go to JavaScript for front-end web development. History Go was designed at Google in 2007 to improve programming productivity in an era of multicore, networked machines and large codebases. The designers wanted to address criticism of other languages in use at Google, but keep ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Collocation
In corpus linguistics, a collocation is a series of words or terms that co-occur more often than would be expected by chance. In phraseology, a collocation is a type of compositional phraseme, meaning that it can be understood from the words that make it up. This contrasts with an idiom, where the meaning of the whole cannot be inferred from its parts, and may be completely unrelated. An example of a phraseological collocation is the expression ''strong tea''. While the same meaning could be conveyed by the roughly equivalent ''powerful tea'', this adjective does not modify ''tea'' frequently enough for English speakers to become accustomed to its co-occurrence and regard it as idiomatic or unmarked. (By way of counterexample, ''powerful'' is idiomatically preferred to ''strong'' when modifying a ''computer'' or a ''car''.) There are about six main types of collocations: adjective + noun, noun + noun (such as collective nouns), verb + noun, adverb ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

British National Corpus
The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time. It is used in corpus linguistic for analysis of corpora History The project to create the BNC involved the collaboration of three publishers (with the Oxford University Press as the lead collaborator, Longman and W. & R. Chambers), two universities (the University of Oxford and Lancaster University), and the British Library. The creation of the BNC started in 1991 under the management of the BNC consortium, and the project was finished by 1994. There have been no additions of new samples after 1994, but the BNC underwent slight revisions before the release of the second edition BNC World (2001) and the third edition BNC XML Edition (2007).
[...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


TenTen Corpus Family
The TenTen Corpus Family (also called TenTen corpora) is a set of comparable web text corpora, i.e. collections of texts that have been crawled from the World Wide Web and processed to match the same standards. These corpora are made available through the Sketch Engine corpus manager. There are TenTen corpora for more than 35 languages. Their target size is 10 billion (1010) words per language, which gave rise to the corpus family's name. In the creation of the TenTen corpora, data crawled from the World Wide Web are processed with natural language processing tools developed by the Natural Language Processing Centre at the Faculty of Informatics at Masaryk University ( Brno, Czech Republic) and by the Lexical Computing company (developer of the Sketch Engine). Corpus linguistics In corpus linguistics, a text corpus is a large and structured collection of texts that are electronically stored and processed. It is used to do hypothesis testing about languages, validating linguis ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

English Wikipedia
The English Wikipedia is, along with the Simple English Wikipedia, one of two English-language editions of Wikipedia, an online encyclopedia. It was founded on January 15, 2001, as Wikipedia's first edition, and, as of , has the most articles of any edition, at . As of , of articles in all Wikipedias belong to the English-language edition; this share was more than 50% in 2003. The edition's one-billionth edit was made on January 13, 2021. Articles The English Wikipedia has pioneered some ideas as conventions, policies or features which were later adopted by Wikipedia editions in some of the other languages. These ideas include "featured articles", the neutral-point-of-view policy, navigation templates, the sorting of short "stub" articles into sub-categories, dispute resolution mechanisms such as mediation and arbitration, and weekly collaborations. It surpassed six million articles on 23 January 2020. In November 2022, the total volume of the compressed texts of it ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Polysemy
Polysemy ( or ; ) is the capacity for a sign (e.g. a symbol, a morpheme, a word, or a phrase) to have multiple related meanings. For example, a word can have several word senses. Polysemy is distinct from ''monosemy'', where a word has a single meaning. Polysemy is distinct from homonymy—or homophony—which is an accidental similarity between two or more words (such as ''bear'' the animal, and the verb ''bear''); whereas homonymy is a mere linguistic coincidence, polysemy is not. In discerning whether a given set of meanings represent polysemy or homonymy, it is often necessary to look at the history of the word to see whether the two meanings are historically related. Dictionary writers often list polysemes (words or phrases with different, but related, senses) in the same entry (that is, under the same headword) and enter homonyms as separate headwords (usually with a numbering convention such as ''¹bear'' and ''²bear''). Polysemes A polyseme is a word or phrase wit ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Homonym
In linguistics, homonyms are words which are homographs (words that share the same spelling, regardless of pronunciation), or homophones (equivocal words, that share the same pronunciation, regardless of spelling), or both. Using this definition, the words ''row'' (propel with oars), ''row'' (a linear arrangement) and ''row'' (an argument) are homonyms because they are homographs (though only the first two are homophones): so are the words ''see'' (vision) and ''sea'' (body of water), because they are homophones (though not homographs). A more restrictive and technical definition requires that homonyms be simultaneously homographs ''and'' homophoneshomonym
''Random House Unabridged Dictionary'' at dictionary.com
– that is to say they have identical spelling ''and'' pronunciation, but with different meanings. Examples are the pair ''stalk'' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Grammar
In linguistics, the grammar of a natural language is its set of structure, structural constraints on speakers' or writers' composition of clause (linguistics), clauses, phrases, and words. The term can also refer to the study of such constraints, a field that includes domains such as phonology, morphology (linguistics), morphology, and syntax, often complemented by phonetics, semantics, and pragmatics. There are currently two different approaches to the study of grammar: traditional grammar and Grammar#Theoretical frameworks, theoretical grammar. Fluency, Fluent speakers of a variety (linguistics), language variety or ''lect'' have effectively internalized these constraints, the vast majority of which – at least in the case of one's First language, native language(s) – are language acquisition, acquired not by conscious study or language teaching, instruction but by hearing other speakers. Much of this internalization occurs during early childhood; learning a language later ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Lexicon
A lexicon is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Koine Greek language, Greek word (), neuter of () meaning 'of or for words'. Linguistic theories generally regard human languages as consisting of two parts: a lexicon, essentially a catalogue of a language's words (its wordstock); and a grammar, a system of rules which allow for the combination of those words into meaningful sentences. The lexicon is also thought to include bound morphemes, which cannot stand alone as words (such as most affixes). In some analyses, compound words and certain classes of idiomatic expressions, collocations and other phrases are also considered to be part of the lexicon. Dictionary, Dictionaries are lists of the lexicon, in alphabetical order, of a given language; usually, however, bound morphemes are not included. Size and organization Items in the le ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Hits Per Million
Hits or H.I.T.S. may refer to: Arts, entertainment, and media Music * ''H.I.T.S.'', 1991 album by New Kids on the Block * ''...Hits'' (Phil Collins album), 1998 * ''Hits'' (compilation series), 1984–2006; 2014 - a British compilation album series * ''Hits'' (Dru Hill album), 2005 * ''Hits'' (Mike + The Mechanics album), 1996 * ''Hits'' (Kylie Minogue album), 2011 - an EMI compilation album released only in Japan, Hong Kong, and the Philippines * ''Hits'' (Joni Mitchell album), 1996 * ''Hits'' (New Found Glory album), 2008 * ''Hits'' (Pulp album), 2002 * ''Hits'' (Seal album), 2009 * ''Hits'' (Mauro Scocco album), 1997 * ''Hits'' (Spice 1 album), 1998 * ''Hits'' (Billy Talent album), 2014 * ''Hits'' (The Beach Boys EP), 1966 * ''Hits'' (Tony! Toni! Toné! album), 1997 * '' Hits: Greatest and Others'', 1973 - album by Joan Baez * ''Hits 1979–1989'', 1989 - album by Rosanne Cash * ''Hits+'', 2000 - album by Kylie Minogue * ''Hits!'' (Kim Kay album), 2000 * ''Hits!' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Frequency (statistics)
In statistics, the frequency (or absolute frequency) of an event i is the number n_i of times the observation has occurred/recorded in an experiment or study. These frequencies are often depicted graphically or in tabular form. Types The cumulative frequency is the total of the absolute frequencies of all events at or below a certain point in an ordered list of events. The (or empirical probability) of an event is the absolute frequency normalized by the total number of events: : f_i = \frac = \frac. The values of f_i for all events i can be plotted to produce a frequency distribution. In the case when n_i = 0 for certain i, pseudocounts can be added. Depicting frequency distributions A frequency distribution shows us a summarized grouping of data divided into mutually exclusive classes and the number of occurrences in a class. It is a way of showing unorganized data notably to show results of an election, income of people for a certain region, sales of a product within ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]