National Corpus Of Polish
   HOME
*





National Corpus Of Polish
The National Corpus of Polish (Polish : Narodowy Korpus Języka Polskiego NKJP) is the biggest and the most important corpus of the Polish language. A linguistic corpus is a collection of texts where one can find the typical use of a single word or a phrase, as well as their meaning and grammatical function. Description The National Corpus of Polish is a shared initiative of four institutions: Institute of Computer Science and the Institute of the Polish Language at the Polish Academy of Sciences, Polish Scientific Publishers PWN, and the Department of Computational and Corpus Linguistics at the University of Łódź. It has been registered as a research-development project of the Ministry of Science and Higher Education. The intended size of the whole National Corpus of Polish is over 1 billion words, of which a 300-million word subcorpus has been carefully balanced, and a manually-annotated 1-million corpus has been released under an open license. The corpus is accessible online ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Text Corpus
In linguistics, a corpus (plural ''corpora'') or text corpus is a language resource consisting of a large and structured set of texts (nowadays usually electronically stored and processed). In corpus linguistics, they are used to do statistical analysis and statistical hypothesis testing, hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory. In Search engine (computing), search technology, a corpus is the collection of documents which is being searched. Overview A corpus may contain texts in a single language (''monolingual corpus'') or text data in multiple languages (''multilingual corpus''). In order to make the corpora more useful for doing linguistic research, they are often subjected to a process known as annotation. An example of annotating a corpus is part-of-speech tagging, or ''POS-tagging'', in which information about each word's part of speech (verb, noun, adjective, etc.) is added to the corpus in the form o ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Polish Language
Polish (Polish: ''język polski'', , ''polszczyzna'' or simply ''polski'', ) is a West Slavic language of the Lechitic group written in the Latin script. It is spoken primarily in Poland and serves as the native language of the Poles. In addition to being the official language of Poland, it is also used by the Polish diaspora. There are over 50 million Polish speakers around the world. It ranks as the sixth most-spoken among languages of the European Union. Polish is subdivided into regional dialects and maintains strict T–V distinction pronouns, honorifics, and various forms of formalities when addressing individuals. The traditional 32-letter Polish alphabet has nine additions (''ą'', ''ć'', ''ę'', ''ł'', ''ń'', ''ó'', ''ś'', ''ź'', ''ż'') to the letters of the basic 26-letter Latin alphabet, while removing three (x, q, v). Those three letters are at times included in an extended 35-letter alphabet, although they are not used in native words. The traditional ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Polish Academy Of Sciences
The Polish Academy of Sciences ( pl, Polska Akademia Nauk, PAN) is a Polish state-sponsored institution of higher learning. Headquartered in Warsaw, it is responsible for spearheading the development of science across the country by a society of distinguished scholars and a network of research institutes. It was established in 1951, during the early period of the Polish People's Republic following World War II. History The Polish Academy of Sciences is a Polish state-sponsored institution of higher learning, headquartered in Warsaw, that was established by the merger of earlier science societies, including the Polish Academy of Learning (''Polska Akademia Umiejętności'', abbreviated ''PAU''), with its seat in Kraków, and the Warsaw Society of Friends of Learning (Science), which had been founded in the late 18th century. The Polish Academy of Sciences functions as a learned society acting through an elected assembly of leading scholars and research institutions. The Academy h ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Polish Scientific Publishers
Wydawnictwo Naukowe PWN (''Polish Scientific Publishers PWN''; until 1991 ''Państwowe Wydawnictwo Naukowe'' - ''National Scientific Publishers PWN'', PWN) is a Polish book publisher, founded in 1951, when it split from the Wydawnictwa Szkolne i Pedagogiczne. Adam Bromberg, who was the enterprise's director between 1953 and 1965, made it into communist Poland's largest publishing house. The printing house is best known as a publisher of encyclopedias, dictionaries and university handbooks. It is the leading Polish provider of scientific, educational and professional literature as well as works of reference. It authored the Wielka Encyklopedia Powszechna PWN, by then the largest Polish encyclopedia, as well as its successor, the Wielka Encyklopedia PWN, which was published between 2001 and 2005. There is also an online PWN encyclopedia – Internetowa encyklopedia PWN. Initially state-owned, since 1991 it has been a private company. The company is a member of International Associat ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


University Of Łódź
The University of Łódź (Polish: ''Uniwersytet Łódzki'', Latin: ''Universitas Lodziensis'') is a public research university founded in 1945 in Łódź, Poland, as a continuation of three higher education institutions functioning in Łódź in the interwar period — the Teacher Training Institute (1921–1928), the Higher School of Social and Economic Sciences (1924–1928) and the local division of the Free Polish University of Warsaw (1928–1939). The University of Łódź (alternative spelling: University of Lodz) is a fully accredited, state-owned, traditional university. It is one of 18 institutions of its type in Poland. It has more than 25,000 students and 2,600 teachers. Its international cooperation includes 385 partner institutions from all over the world. A range of BA, MA, and postgraduate courses held in English as a language of instruction are offered to Polish and overseas students. Reputation The university strives to maintain its high ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Ministry Of Science And Higher Education Of The Republic Of Poland
The Ministry of Science and Higher Education ( pl, Ministerstwo Nauki i Szkolnictwa Wyższego) in Poland was opened on 5 May 2006 by the Minister of Science and Higher Education, in replacement of several parts of the Ministry of Education and Science. The Minister of Science and Higher Education administers governmental activities in science and higher education and has a budget for scientific research provided by State funds. The Rada Nauki (Science Council) acts together with the Minister, in replacement of the Komitet Badań Naukowych (Science Research Council) which was closed in 2005. The headquarters of the ministry are located at ulica Wspólna 1/3, Warsaw Warsaw ( pl, Warszawa, ), officially the Capital City of Warsaw,, abbreviation: ''m.st. Warszawa'' is the capital and largest city of Poland. The metropolis stands on the River Vistula in east-central Poland, and its population is officia .... From 2020 Minister of Science and Higher Education is Przemysław ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Poliqarp
Poliqarp is an open source search engine designed to process text corpora, among others the National Corpus of Polish created at the Institute of Computer Science, Polish Academy of Sciences. Features * Custom query language * Two-level regular expressions: ** operating at the level of characters in words ** operating at the level of words in statements/paragraphs * Good performance * Compact corpus representation (compared to similar projects) * Portability across operating systems: Linux/BSD/Win32 * Lack of portability across endianness In computing, endianness, also known as byte sex, is the order or sequence of bytes of a word of digital data in computer memory. Endianness is primarily expressed as big-endian (BE) or little-endian (LE). A big-endian system stores the mos ... (current release works only on little endian devices) References {{reflist External links Polish corpus website (in English)Project website on SourceForgeSearch plugin for Firefox Information ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Corpora
Corpus is Latin for "body". It may refer to: Linguistics * Text corpus, in linguistics, a large and structured set of texts * Speech corpus, in linguistics, a large set of speech audio files * Corpus linguistics, a branch of linguistics Music * ''Corpus'' (album), by Sebastian Santa Maria * Corpus Delicti (band), also known simply as Corpus Medicine * Corpus callosum, a structure in the brain * Corpus cavernosum (other), a pair of structures in human genitals * Corpus luteum, a temporary endocrine structure in mammals * Corpus gastricum, the Latin term referring to the body of the stomach * Corpus alienum, a foreign object originating outside the body * Corpus albicans * Corpora amylacea * Corpora arenacea Other uses * ''Corpus'' (Bernini), a 1650 sculpture of Christ by Gian Lorenzo Bernini * Corpus (museum), a human body themed museum in the Netherlands * Corpus Clock, a large sculptural clock * Corpus (dance troupe), a Canadian dance troupe * Corpus (typography) ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Linguistic Research
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguistics is concerned with both the cognitive and social aspects of language. It is considered a scientific field as well as an academic discipline; it has been classified as a social science, natural science, cognitive science,Thagard, PaulCognitive Science, The Stanford Encyclopedia of Philosophy (Fall 2008 Edition), Edward N. Zalta (ed.). or part of the humanities. Traditional areas of linguistic analysis correspond to phenomena found in human linguistic systems, such as syntax (rules governing the structure of sentences); semantics (meaning); morphology (structure of words); phonetics (speech sounds and equivalent gestures in sign languages); phonology (the abstract sound system of a particular language); and pragmatics (how social contex ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]