HOME

TheInfoList



OR:

ETAP-3 is a proprietary linguistic processing system focusing on
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
and
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
. It was developed in
Moscow Moscow ( , US chiefly ; rus, links=no, Москва, r=Moskva, p=mɐskˈva, a=Москва.ogg) is the capital and largest city of Russia. The city stands on the Moskva River in Central Russia, with a population estimated at 13.0 million ...
,
Russia Russia (, , ), or the Russian Federation, is a List of transcontinental countries, transcontinental country spanning Eastern Europe and North Asia, Northern Asia. It is the List of countries and dependencies by area, largest country in the ...
at the Institute for Information Transmission Problems ( :ru:Институт проблем передачи информации им. А. А. Харкевича РАН). It is a
rule-based system In computer science, a rule-based system is used to store and manipulate knowledge to interpret information in a useful way. It is often used in artificial intelligence applications and research. Normally, the term ''rule-based system'' is appli ...
which uses the Meaning-Text Theory as its theoretical foundation. At present, there are several applications of ETAP-3, such as a
machine translation Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates t ...
tool, a converter of the
Universal Networking Language {{Advert, date=April 2021 Universal Networking Language (UNL) is a declarative formal language specifically designed to represent semantic data extracted from natural language texts. It can be used as a pivot language in interlingual machine tran ...
, an interactive learning tool for Russian language learners and a syntactically annotated corpus of Russian language. Demo versions of some of these tools ar
available online


Machine translation tool

The ETAP-3 machine translation tool can translate text from
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
into
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
and vice versa. It is a rule-based system, which makes it different from the most present-day systems that are predominantly statistical-based. The system makes a syntactical analysis of the input sentence, which can be visualized as a
syntax tree Syntax tree may refer to: * Abstract syntax tree, used in computer science * Concrete syntax tree A parse tree or parsing tree or derivation tree or concrete syntax tree is an ordered, rooted tree that represents the syntactic structure of a str ...
. The machine translation tool uses bilingual dictionaries which contain more than 100,000 lexical entries.


UNL converter

The UNL converter based on ETAP-3 can transform English and Russian sentences into their representations in UNL (''Universal Networking Language'') and generate English and Russian sentences from their UNL representations.


Russian language treebank

A syntactically annotated corpus (
treebank In linguistics, a treebank is a parsed text corpus that annotates syntactic or semantic sentence structure. The construction of parsed corpora in the early 1990s revolutionized computational linguistics, which benefitted from large-scale empiri ...
) is a part of
Russian National Corpus The Russian National Corpus (russian: Национальный корпус русского языка, , National Corpus of the Russian language) is a corpus of the Russian language that has been partially accessible through a query interface onl ...
. It contains 40,000 sentences (600,000 words) which are fully syntactically and morphologically annotated. The primary annotation was made by ETAP-3 and then manually verified by competent linguists. This makes the syntactically annotated corpus a reliable tool for linguistic research.


Lexical functions learning tool

The ETAP-3 system makes extensive use of lexical functions explored in the Meaning-Text Theory. For this reason, an interactive tool for Russian language learners aiming at the acquisition of lexical functions has been developed. Such learning tools are now being created for
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ger ...
,
Spanish Spanish might refer to: * Items from or related to Spain: **Spaniards are a nation and ethnic group indigenous to Spain **Spanish language, spoken in Spain and many Latin American countries **Spanish cuisine Other places * Spanish, Ontario, Cana ...
and
Bulgarian Bulgarian may refer to: * Something of, from, or related to the country of Bulgaria * Bulgarians, a South Slavic ethnic group * Bulgarian language, a Slavic language * Bulgarian alphabet * A citizen of Bulgaria, see Demographics of Bulgaria * Bul ...


References


External links

{{Official website, 1=http://cl.iitp.ru/etap3, name=Official website with demo-versions of linguistic tools Applications of artificial intelligence Computational linguistics Machine translation Natural language processing software