Slovenian National Corpus
   HOME
*





Slovenian National Corpus
Slovenian National Corpus FidaPLUS is the 621 million words (tokens) corpus of the Slovenian language, gathered from selected texts written in Slovenian of different genres and styles, mainly from books and newspapers. The FidaPLUS database is an upgrade of the older (FIDA) corpus, which was developed between 1997 and 2000, with added texts that were published up to 2006 and was the result of the applicative research project of the Faculty of Arts, Faculty of Social Sciences, both University of Ljubljana, and Jožef Stefan Institute's Department of Knowledge Technologies. Corpus is available via a corpus manager Sketch Engine Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing CZ s.r.o. since 2003. Its purpose is to enable people studying language behaviour ( lexicographers, researchers in corpus linguistics, translators or lan .... This version FidaPLUS corpus contains Word sketches, an automatic corpus-derived overview of word's gram ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Corpus Linguistics
Corpus linguistics is the study of language, study of a language as that language is expressed in its text corpus (plural ''corpora''), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other languages which have undergone a similar analysis. The first such corpora were manually derived from source texts, but now that work is automated. Corpora have not only been used for linguistics research, they have also been used to compile dictionaries (starting with ''The American Heritage Dictionary of the English Language'' in 1969) and grammar guides, such as ''A Compreh ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Slovenian Language
Slovene ( or ), or alternatively Slovenian (; or ), is a South Slavic language, a sub-branch that is part of the Balto-Slavic branch of the Indo-European language family. It is spoken by about 2.5 million speakers worldwide (excluding speakers of Kajkavian), mainly ethnic Slovenes, the majority of whom live in Slovenia, where it is the sole official language. As Slovenia is part of the European Union, Slovene is also one of its 24 official and working languages. Standard Slovene Standard Slovene is the national standard language that was formed in the 18th and 19th century, based on Upper and Lower Carniolan dialect groups, more specifically on language of Ljubljana and its adjacent areas. The Lower Carniolan dialect group was the dialect used in the 16th century by Primož Trubar for his writings, while he also used Slovene as spoken in Ljubljana, since he lived in the city for more than 20 years. It was the speech of Ljubljana that Trubar took as a foundation of what lat ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

University Of Ljubljana
The University of Ljubljana ( sl, Univerza v Ljubljani, , la, Universitas Labacensis), often referred to as UL, is the oldest and largest university in Slovenia. It has approximately 39,000 enrolled students. History Beginnings Although certain academies (notably of philosophy and theology) were established as Jesuit higher education in what is now Slovenia as early as the seventeenth century, the first university was founded in 1810 under the ''Écoles centrales'' of the French imperial administration of the Illyrian provinces. The chancellor of the university in Ljubljana during the French period was Joseph Walland (a.k.a. , 1763–1834), born in Upper Carniola. That university was disbanded in 1813, when Austria regained territorial control and reestablished the Imperial Royal Lyceum of Ljubljana as a higher-education institution. Quest for a national university During the second half of the 19th century, several political claims for the establishment of a Slovene-language u ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Jožef Stefan Institute
The Jožef Stefan Institute (IJS, JSI) ( sl, Institut "Jožef Stefan") is the largest research institute in Slovenia. The main research areas are physics, chemistry, molecular biology, biotechnology, information technologies, physics, reactor physics, energy and Natural environment, environment. At the beginning of 2013 the institute had 962 employees, of whom 404 were PhD scientists. The mission of the Jožef Stefan Institute is the accumulation and dissemination of knowledge at the frontiers of natural science and technology for the benefit of society at large through the pursuit of education, learning, research, and development of high technology at the highest international levels of excellence. History The institute was founded by the State Security Administration (Yugoslavia) in 1949 for atomic weapons research. Initially, the Vinča Nuclear Institute in Belgrade was established in 1948, followed by the Ruđer Bošković Institute in Zagreb in 1950 and the Jožef Stefan In ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Sketch Engine
Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing CZ s.r.o. since 2003. Its purpose is to enable people studying language behaviour ( lexicographers, researchers in corpus linguistics, translators or language learners) to search large text collections according to complex and linguistically motivated queries. Sketch Engine gained its name after one of the key features, word sketches: one-page, automatic, corpus-derived summaries of a word's grammatical and collocational behaviour. Currently, it supports and provides corpora in 90+ languages. History of development Sketch Engine is a product of Lexical Computing Limited, a company founded in 2003 by the lexicographer and research scientist Adam Kilgarriff. He started a collaboration with Pavel Rychlý, a computer scientist working at the Natural Language Processing Centre, Masaryk University, and the developer of Manatee and Bonito (two major parts of the software suite), and introduced ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Word Sketch
A word sketch is a one-page, automatic, corpus-derived summary of a word’s grammatical and collocational behaviour. Word sketches were first introduced by the British corpus linguist Adam KilgarriffKilgarriff, Adam; Rychlý, Pavel; Smrž, Pavel; Tugwell, David (2004) The Sketch Engine. Information Technology, 2004 and exploited within the Sketch Engine corpus management system. They are an extension of the general collocation concept used in corpus linguistics in that they group collocations according to particular grammatical relations (e.g. subject, object, modifier etc.). The collocation candidates in a word sketch are sorted either by their frequency or using a lexicographic association score like Dice, T-score or MI-score. Since the introduction, word sketches have been used by lexicographers to develop modern corpus-based dictionaries by major publishing houses including Oxford English Dictionary, Macmillan English Dictionary and comprising dozens of languages including ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Corpora
Corpus is Latin for "body". It may refer to: Linguistics * Text corpus, in linguistics, a large and structured set of texts * Speech corpus, in linguistics, a large set of speech audio files * Corpus linguistics, a branch of linguistics Music * ''Corpus'' (album), by Sebastian Santa Maria * Corpus Delicti (band), also known simply as Corpus Medicine * Corpus callosum, a structure in the brain * Corpus cavernosum (other), a pair of structures in human genitals * Corpus luteum, a temporary endocrine structure in mammals * Corpus gastricum, the Latin term referring to the body of the stomach * Corpus alienum, a foreign object originating outside the body * Corpus albicans * Corpora amylacea * Corpora arenacea Other uses * ''Corpus'' (Bernini), a 1650 sculpture of Christ by Gian Lorenzo Bernini * Corpus (museum), a human body themed museum in the Netherlands * Corpus Clock, a large sculptural clock * Corpus (dance troupe), a Canadian dance troupe * Corpus (typography) ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Slovene Language
Slovene ( or ), or alternatively Slovenian (; or ), is a South Slavic languages, South Slavic language, a sub-branch that is part of the Balto-Slavic languages, Balto-Slavic branch of the Indo-European languages, Indo-European language family. It is spoken by about 2.5 million speakers worldwide (excluding speakers of Kajkavian), mainly ethnic Slovenes, the majority of whom live in Slovenia, where it is the sole official language. As Slovenia is part of the European Union, Slovene is also one of its 24 Languages of the European Union, official and working languages. Standard Slovene Standard Slovene is the national standard language that was formed in the 18th and 19th century, based on Upper Carniolan dialect group, Upper and Lower Carniolan dialect groups, more specifically on language of Ljubljana and its adjacent areas. The Lower Carniolan dialect group was the dialect used in the 16th century by Primož Trubar for his writings, while he also used Slovene as spoken in Lju ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Online Databases
An online database is a database accessible from a local network or the Internet, as opposed to one that is stored locally on an individual computer or its attached storage (such as a CD). Online databases are hosted on websites, made available as software as a service products accessible via a web browser. They may be free or require payment, such as by a monthly subscription. Some have enhanced features such as collaborative editing and email notification. Cloud database A cloud database is a database that is run on and accessed via the Internet, rather than locally. So, rather than keep a customer information database at one location, a business may choose to have it hosted on the Internet so that all its departments or divisions can access and update it. Most database services offer web-based consoles, which the end user can use to provision and configure database instances. See also * List of online databases ** Bibliographic databases * Customer relationship management * List ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Applied Linguistics
Applied linguistics is an interdisciplinary field which identifies, investigates, and offers solutions to language-related real-life problems. Some of the academic fields related to applied linguistics are education, psychology, communication research, information science, natural language processing, anthropology, and sociology. Domain Applied linguistics is an interdisciplinary field. Major branches of applied linguistics include bilingualism and multilingualism, conversation analysis, contrastive linguistics, language assessment, literacies, discourse analysis, language pedagogy, second language acquisition, language planning and policy, interlinguistics, stylistics, language teacher education, forensic linguistics, and translation. Journals Major journals of the field include ''Research Methods in Applied Linguistics'', ''Annual Review of Applied Linguistics'', ''Applied Linguistics'', Studies in Second Language Acquisition, ''Applied Psycholinguistics'', ''Internat ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]