The Info List - ISO 639-1

--- Advertisement ---

ISO 639-1:2002, _Codes for the representation of names of languages — Part 1: Alpha-2 code_, is the first part of the ISO 639 series of international standards for language codes . Part 1 covers the registration of two-letter codes. There are 184 two-letter codes registered as of October 2015. The registered codes cover the world's major languages.

These codes are a useful international, and formal, shorthand for indicating languages. For example:

* Armenian is represented by hy (from the endonym հայերէն, _Hayeren_) * Chinese is represented by zh (from the endonym 中文, Zhōngwén_)_ * Dutch is represented by nl (from the endonym _Nederlands_) * English is represented by en * Esperanto is represented by eo * French is represented by fr * Georgian is represented by ka (from the endonym ქართული, _KArtuli_) * German is represented by de (from the endonym _Deutsch_) * Greek is represented by el (from the endonym ελληνικά, _ELliniká_) * Italian is represented by it * Japanese is represented by ja (even though its endonym is 日本語, _Nihongo_ ) * Korean is represented by ko (even though its endonym is 한국어, _Hangugeo_) * Kurdish is represented by ku (from the endonym کوردی, _KUrdî_) * Persian is represented by fa (from the endonym فارسی, _FArsi_) * Polish is represented by pl * Portuguese is represented by pt * Romanian is represented by ro * Russian is represented by ru * Spanish is represented by es (from the endonym _español_) * Swedish is represented by sv (from the endonym _svenska_) * Turkish is represented by tr * Urdu is represented by ur (from the endonym اُردو, _URdu_)

Many multilingual web sites—such as—use these codes to prefix URLs of specific language versions of their web sites: for example, en.Wikipedia.org is the English version of. See also IETF language tag . (Two-letter country-specific top-level-domain code suffixes are often different from these language-tag prefixes).

ISO 639, the original standard for language codes, was approved in 1967. It was split into parts, and in 2002 ISO 639-1 became the new revision of the original standard. The last code added was ht, representing Haitian Creole on 2003-02-26. The use of the standard was encouraged by IETF language tags , introduced in RFC 1766 in March 1995, and continued by RFC 3066 from January 2001 and RFC 4646 from September 2006. The current version is RFC 5646 from September 2009. Infoterm (International Information Center for Terminology) is the registration authority for ISO 639-1 codes.

New ISO 639-1 codes are not added if an ISO 639-2 code exists, so systems that use ISO 639-1 and 639-2 codes, with 639-1 codes preferred, do not have to change existing codes.

If an ISO 639-2 code that covers a group of languages is used, it might be overridden for some specific languages by a new ISO 639-1 code.

ISO 639-1 codes added after RFC publication in January 2001 ISO 639-1 ISO 639-2 NAME DATE ADDED PREVIOUSLY COVERED BY

io ido Ido 2002-01-15 art

wa wln Walloon 2002-01-29 roa

li lim Limburgish 2002-08-02 gem

ii iii Sichuan Yi 2002-10-14 sit

an arg Aragonese 2002-12-23 roa

ht hat Haitian Creole 2003-02-26 cpf

There is no specification on treatment of macrolanguages (see ISO 639-3 ).


* List of ISO 639-1 codes * ISO 3166-1 alpha-2 , a different set of two-letter codes used for countries


* ^ ISO 639 Joint Advisory Committee - Working principles for ISO 639 maintenance


* ISO 639 * ISO 639-1/RA * ISO 639-2