Universal Terminology eXchange

UTX (Universal Terminology eXchange) is a simple glossary format developed by AAMT (Asia-Pacific Association for Machine Translation). It is a tab-separated text format that contains minimal information, such as the source-language entry, the target-language entry, and the part of speech. UTX is intended to facilitate rapid creation and quick exchange of human-readable and machine-readable glossaries. Initially, UTX was created to absorb the differences between various user dictionary formats for machine translation. The scope of the format was later expanded to include other purposes, such as glossaries for human translation, natural language processing, thesauri, text-to-speech, and input methods. UTX could be used to improve the efficiency of localization for open source projects.
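As a rough illustration of how little is needed to read a tab-separated glossary of this kind, the following Python sketch parses lines into source, target, and part-of-speech fields. The column order, the treatment of lines beginning with `#` as header or comment lines, and the names `Entry`, `parse_glossary`, and `SAMPLE` are assumptions made for this example; the sketch does not implement the actual UTX specification.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    """One glossary row: source term, target term, part of speech."""
    source: str
    target: str
    pos: str


def parse_glossary(text: str) -> list[Entry]:
    """Parse tab-separated glossary text into a list of entries.

    Assumption: lines starting with '#' are header/comment lines and
    the first three columns are source, target, and part of speech.
    """
    entries = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and assumed header lines
        fields = line.split("\t")
        fields += [""] * (3 - len(fields))  # pad rows with missing columns
        source, target, pos = (f.strip() for f in fields[:3])
        entries.append(Entry(source, target, pos))
    return entries


# Hypothetical sample data, not taken from a real UTX file.
SAMPLE = (
    "#src\ttgt\tpos\n"
    "machine translation\t機械翻訳\tnoun\n"
    "glossary\t用語集\tnoun\n"
)

if __name__ == "__main__":
    for entry in parse_glossary(SAMPLE):
        print(entry.source, "->", entry.target, f"({entry.pos})")
```

Because each entry occupies one line of plain text, a file like this stays readable in any text editor or spreadsheet as well as to a program.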


UTX Converter

UTX Converter was developed by AAMT as an open source project and is available for free. It has the following functions:
* Functions for UTX
** Format check of a UTX file (UTX 1.11)
** Extraction of forbidden terms
** Extraction of pairs of forbidden terms and approved terms
** Extraction of pairs of non-standard terms and approved terms
* Conversion functions
** Conversion between UTX and a user dictionary (*.txt file) of ATLAS (Fujitsu)
** Conversion between UTX and a user dictionary (*.txt file) of The Honyaku (Toshiba)
** Conversion between UTX and a user dictionary (*.opt file for EJ, *.dic file for JE) of PC/MED/PAT/Legal Transer (Cross Language)
** Conversion from UTX to a text for MultiTerm import
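The extraction of forbidden and approved term pairs can be pictured with a short Python sketch. It assumes each glossary row carries a status label of `"approved"` or `"forbidden"` and that a forbidden term is paired with every approved term sharing the same source entry. Both the labels and the pairing rule are assumptions for illustration; the function name `forbidden_approved_pairs` is hypothetical and this is not how UTX Converter is known to work internally.

```python
from collections import defaultdict


def forbidden_approved_pairs(rows):
    """Return (forbidden_target, approved_target) pairs.

    `rows` is an iterable of (source, target, status) tuples.
    Assumption: terms are related when they share a source entry.
    """
    approved = defaultdict(list)
    forbidden = defaultdict(list)
    for source, target, status in rows:
        if status == "approved":
            approved[source].append(target)
        elif status == "forbidden":
            forbidden[source].append(target)

    pairs = []
    for source, bad_terms in forbidden.items():
        for bad in bad_terms:
            for good in approved.get(source, []):
                pairs.append((bad, good))
    return pairs


# Hypothetical rows, not taken from a real glossary.
ROWS = [
    ("user", "ユーザー", "approved"),
    ("user", "ユーザ", "forbidden"),
    ("server", "サーバー", "approved"),
]

if __name__ == "__main__":
    for bad, good in forbidden_approved_pairs(ROWS):
        print(f"replace {bad!r} with {good!r}")
```

A list of such pairs could drive a terminology check that flags each forbidden term in a translation and suggests the approved alternative.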


See also

* TBX
* Translation memory
* Terminology
* Computer-assisted translation


External links


* UTX Home
* (A predecessor to UTX, in Japanese)
* UTX mailing list
* Glossary Markup Language (GlossML): an open XML format for storing glossaries.