SkELL
   HOME

TheInfoList



OR:

SkELL (abbreviation of ''Sketch Engine for Language Learning'') is a free corpus-based web tool that allows language learners and
teachers A teacher, also called a schoolteacher or formally an educator, is a person who helps students to acquire knowledge, competence, or virtue, via the practice of teaching. ''Informally'' the role of teacher may be taken on by anyone (e.g. whe ...
find authentic sentences for specific target word(s). For any word or a phrase, SkELL displays a
concordance Concordance may refer to: * Agreement (linguistics), a form of cross-reference between different parts of a sentence or phrase * Bible concordance, an alphabetical listing of terms in the Bible * Concordant coastline, in geology, where beds, or la ...
that lists example sentences drawn from a special text corpus crawled from the
World Wide Web The World Wide Web (WWW), commonly known as the Web, is an information system enabling documents and other web resources to be accessed over the Internet. Documents and downloadable media are made available to the network through web se ...
, which has been cleaned of
spam Spam may refer to: * Spam (food), a canned pork meat product * Spamming, unsolicited or undesired electronic messages ** Email spam, unsolicited, undesired, or illegal email messages ** Messaging spam, spam targeting users of instant messaging ( ...
and includes only high-quality texts covering everyday, standard, formal, and professional language. There are versions of SkELL for
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
, Russian,
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ger ...
, Italian, Czech and
Estonian Estonian may refer to: * Something of, from, or related to Estonia, a country in the Baltic region in northern Europe * Estonians, people from Estonia, or of Estonian descent * Estonian language * Estonian cuisine * Estonian culture See also

...
. SkELL is based on the commercial Sketch Engine corpus manager and the proprietary GDEX (Good Dictionary Examples) score that it implements.


Features

SkELL can provide three kinds of results for a query: * Examples: This page displays a
concordance Concordance may refer to: * Agreement (linguistics), a form of cross-reference between different parts of a sentence or phrase * Bible concordance, an alphabetical listing of terms in the Bible * Concordant coastline, in geology, where beds, or la ...
created by searching for the specified word or phrase in the reference corpus, taking any derived forms into account. * Word sketch: This page shows the most frequent collocates for the specified word. It is a simplified version of Sketch Engine's
word sketch A word sketch is a one-page, automatic, corpus-derived summary of a word’s grammatical and collocational behaviour. Word sketches were first introduced by the British corpus linguist Adam KilgarriffKilgarriff, Adam; Rychlý, Pavel; Smrž, Pavel; ...
function. * Similar words: This page contains visualization of similar (not necessarily just
synonym A synonym is a word, morpheme, or phrase that means exactly or nearly the same as another word, morpheme, or phrase in a given language. For example, in the English language, the words ''begin'', ''start'', ''commence'', and ''initiate'' are all ...
ous) words in a
word cloud A tag cloud (also known as a word cloud, wordle or weighted list in visual design) is a visual representation of text data, which is often used to depict keyword metadata on websites, or to visualize free form text. Tags are usually single word ...
, based on Sketch Engine's distributional thesaurus. The number of displayed lines in a concordance is limited to 40. However, the
frequency Frequency is the number of occurrences of a repeating event per unit of time. It is also occasionally referred to as ''temporal frequency'' for clarity, and is distinct from ''angular frequency''. Frequency is measured in hertz (Hz) which is eq ...
of the searched query in the reference corpus is indicated above the concordance as '' hits per million''.


Use

It has been suggested that SkELL can be used, for instance: * to obtain illustrative examples of target features,
lexical Lexical may refer to: Linguistics * Lexical corpus or lexis, a complete set of all words in a language * Lexical item, a basic unit of lexicographical classification * Lexicon, the vocabulary of a person, language, or branch of knowledge * Lexical ...
and
grammatical In linguistics, grammaticality is determined by the conformity to language usage as derived by the grammar of a particular variety (linguistics), speech variety. The notion of grammaticality rose alongside the theory of generative grammar, the go ...
; * to find authentic sentences for the target word(s); * to help students understand the meaning and/or usage of a word or phrase; * to help teachers wanting to use example sentences in a class; * to discover and explore collocates; * to create gap-fill exercises; * to have the students find and investigate examples/collocates; * to draw sentences to be used for translation exercises; * to teach various kinds of
homonym In linguistics, homonyms are words which are homographs (words that share the same spelling, regardless of pronunciation), or homophones (equivocal words, that share the same pronunciation, regardless of spelling), or both. Using this definition, ...
s and polysemous words;


Data

For each language, SkELL uses a dedicated text corpus, which can also be searched manually in the Sketch Engine using more powerful tools. For example, the English Corpus for SkELL includes a total of more than 57 million sentences that contain more than one billion words. It is based on the English Wikipedia (a special selection of 130,000 articles), a subset from the English web corpus enTenTen14, the whole of the British National Corpus, and free news sources. The English collection of
Project Gutenberg Project Gutenberg (PG) is a Virtual volunteering, volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the ...
used to be a part of the corpus as well, but was removed due to its too archaic language.


History

SkELL was first presented in 2014, when only
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
was supported. In 2015, support for Russian was added, and Czech has been supported since 2017.
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ger ...
, Italian and
Estonian Estonian may refer to: * Something of, from, or related to Estonia, a country in the Baltic region in northern Europe * Estonians, people from Estonia, or of Estonian descent * Estonian language * Estonian cuisine * Estonian culture See also

...
were added in 2018.


References

{{Reflist


External links


SkELL – corpus tool for language learnersSkELL: corpus examples for language learning
Vocabulary Concordances (publishing) Language learning software Language-learning websites Online dictionaries Online English dictionaries Russian dictionaries German dictionaries Italian dictionaries Czech dictionaries Estonian dictionaries Internet properties established in 2014 Czech educational websites