HOME
*





Language Resource
In linguistics and language technology, a language resource is a "[composition] of linguistic material used in the construction, improvement and/or evaluation of language processing applications, (...) in language and language-mediated research studies and applications."LD4LT (2020), The Metashare Ontology as Created by the LD4LT Community Group', W3C Community Group Linked Data for Language Technology (LD4LT), Development branch, version of Mar 10, 2020 According to Bird & Simons (2003), this includes # data, i.e. "any information that documents or describes a language, such as a published monograph, a computer data file, or even a shoebox full of handwritten index cards. The information could range in content from unanalyzed sound recordings to fully transcribed and annotated texts to a complete descriptive grammar", # tools, i.e., "computational resources that facilitate creating, viewing, querying, or otherwise using language data", and # advice, i.e., "any information about wh ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Linguistics
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguistics is concerned with both the cognitive and social aspects of language. It is considered a scientific field as well as an academic discipline; it has been classified as a social science, natural science, cognitive science,Thagard, PaulCognitive Science, The Stanford Encyclopedia of Philosophy (Fall 2008 Edition), Edward N. Zalta (ed.). or part of the humanities. Traditional areas of linguistic analysis correspond to phenomena found in human linguistic systems, such as syntax (rules governing the structure of sentences); semantics (meaning); morphology (structure of words); phonetics (speech sounds and equivalent gestures in sign languages); phonology (the abstract sound system of a particular language); and pragmatics (how social con ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Glottolog
''Glottolog'' is a bibliographic database of the world's lesser-known languages, developed and maintained first at the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany (between 2015 and 2020 at the Max Planck Institute for the Science of Human History in Jena, Germany). Its main curators include Harald Hammarström and Martin Haspelmath. Overview Sebastian Nordhoff and Harald Hammarström created the Glottolog/Langdoc project in 2011. The creation of ''Glottolog'' was partly motivated by the lack of a comprehensive language bibliography, especially in ''Ethnologue''. Glottolog provides a catalogue of the world's languages and language families and a bibliography on the world's less-spoken languages. It differs from the similar catalogue '' Ethnologue'' in several respects: * It tries to accept only those languages that the editors have been able to confirm both exist and are distinct. Varieties that have not been confirmed, but are inherited from anothe ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Linguistic Linked Open Data
In natural language processing, linguistics, and neighboring fields, Linguistic Linked Open Data (LLOD) describes a method and an interdisciplinary community concerned with creating, sharing, and (re-)using language resources in accordance with Linked Data principles. The Linguistic Linked Open Data Cloud was conceived and is being maintained by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation, but has been a point of focal activity for several W3C community groups, research projects, and infrastructure efforts since then. Definition and Development Linguistic Linked Open Data describes the publication of data for linguistics and natural language processing using the following principles: * Data should be openly licensed using licenses such as the Creative Commons licenses. * The elements in a dataset should be uniquely identified by means of a URI. * The URI should resolve, so users can access more information using web browsers. * Resolving an LLOD ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

The Open Definition
The Open Definition is a document published by the Open Knowledge Foundation (OKF) (previously Open Knowledge International) to define openness in relation to data and content. It specifies what licences for such material may and may not stipulate, in order to be considered open licences. The definition itself was derived from the Open Source Definition for software. OKI summarise the document as: The latest form of the document, published in November 2015, is version 2.1. The use of language in the document is conformant with RFC 2119. The document is available under a Creative Commons Attribution 4.0 International License, which itself meets the Open Definition. History * August 2005: Circulation of the first draft of the Open Definition, v0.1. * July 2006: publication of v1.0 * November 2009: publication of v1.1 * October 2014: publication of v2.0 * November 2015: publication of v2.1 See also * Berlin Declaration on Open Access to Knowledge in the Sciences and Hum ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Open Knowledge Foundation
Open Knowledge Foundation (OKF) is a global, non-profit network that promotes and shares information at no charge, including both content and data. It was founded by Rufus Pollock on 20 May 2004 in Cambridge, UK. It is incorporated in England and Wales as a private company limited by guarantee. Between May 2016 and May 2019 the organisation was named ''Open Knowledge International'', but decided in May 2019 to return to ''Open Knowledge Foundation''. Aims The aims of Open Knowledge Foundation are: *Promoting the idea of open knowledge, both what it is, and why it is a good idea. *Running open knowledge events, such as OKCon. *Working on open knowledge projects, such as Open Economics or Open Shakespeare. *Providing infrastructure, and potentially a home, for open knowledge projects, communities and resources. For example, the KnowledgeForge service and CKAN. *Acting at UK, European and international levels on open knowledge issues. People Renata Ávila Pinto joined as the n ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

OntoLex
OntoLex is the short name of a vocabulary for lexical resources in the web of data (OntoLex-Lemon) and the short name of the W3C community group that created it (W3C Ontology-Lexica Community Group). OntoLex-Lemon vocabulary The OntoLex-Lemon vocabulary represents a vocabulary for publishing lexical data as a knowledge graph, in an RDF format and/or as Linguistic Linked Open Data. Since its publication as a W3C Community report in 2016, it serves as ``a de facto standard to represent ontology-lexica on the Web´´. OntoLex-Lemon is a revision of the Lemon vocabulary originally proposed by McCrae et al. (2011). The core elements of OntoLex-Lemon, shown in Fig. 1, are: * lexical entry: unit of analysis of the lexicon, groups together one or more forms and one or more senses, resp. concepts. Can provide additional morphosyntactic information, e.g., one part of speech. Note that every lexical entry can have at most one part of speech, for representing groups of lexical entries w ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Resource Description Framework
The Resource Description Framework (RDF) is a World Wide Web Consortium (W3C) standard originally designed as a data model for metadata. It has come to be used as a general method for description and exchange of graph data. RDF provides a variety of syntax notations and data serialization formats with Turtle (Terse RDF Triple Language) currently being the most widely used notation. RDF is a directed graph composed of triple statements. An RDF graph statement is represented by: 1) a node for the subject, 2) an arc that goes from a subject to an object for the predicate, and 3) a node for the object. Each of the three parts of the statement can be identified by a URI. An object can also be a literal value. This simple, flexible data model has a lot of expressive power to represent complex situations, relationships, and other things of interest, while also being appropriately abstract. RDF was adopted as a W3C recommendation in 1999. The RDF 1.0 specification was published in 2004, th ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Linked Data
In computing, linked data (often capitalized as Linked Data) is structured data which is interlinked with other data so it becomes more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web pages only for human readers, it extends them to share information in a way that can be read automatically by computers. Part of the vision of linked data is for the Internet to become a global database. Tim Berners-Lee, director of the World Wide Web Consortium (W3C), coined the term in a 2006 design note about the Semantic Web project. Linked data may also be open data, in which case it is usually described as Linked Open Data. Principles In his 2006 "Linked Data" note, Tim Berners-Lee outlined four principles of linked data, paraphrased along the following lines: #Uniform Resource Identifiers (URIs) should be used to name and identify individual things. #HTTP URIs should be used to allow these thing ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




World Wide Web Consortium
The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web. Founded in 1994 and led by Tim Berners-Lee, the consortium is made up of member organizations that maintain full-time staff working together in the development of standards for the World Wide Web. , W3C had 459 members. W3C also engages in education and outreach, develops software and serves as an open forum for discussion about the Web. History The World Wide Web Consortium (W3C) was founded in 1994 by Tim Berners-Lee after he left the European Organization for Nuclear Research (CERN) in October 1994. It was founded at the Massachusetts Institute of Technology (MIT) Laboratory for Computer Science with support from the European Commission, and the Defense Advanced Research Projects Agency, which had pioneered the ARPANET, one of the predecessors to the Internet. It was located in Technology Square until 2004, when it moved, with the MIT Computer Science and Artificial ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO/TC 37
ISO/TC 37 is a technical committee within the International Organization for Standardization (ISO) that prepares standards and other documents concerning methodology and principles for terminology and language resources. Title: Terminology and other language and content resources Scope: Standardization of principles, methods and applications relating to terminology and other language and content resources in the contexts of multilingual communication and cultural diversity ISO/TC 37 is a so-called "horizontal committee", providing guidelines for all other technical committees that develop standards on how to manage their terminological problems. However, the standards developed by ISO/TC 37 are not restricted to ISO. Collaboration with industry is sought to ensure that the requirements and needs from all possible users of standards concerning terminology, language and structured content are duly and timely addressed. Involvement in standards development is open to all stake ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

International Organization For Standardization
The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in Article 3 of the ISO Statutes. ISO was founded on 23 February 1947, and (as of November 2022) it has published over 24,500 international standards covering almost all aspects of technology and manufacturing. It has 809 Technical committees and sub committees to take care of standards development. The organization develops and publishes standardization in all technical and nontechnical fields other than electrical and electronic engineering, which is handled by the IEC.Editors of Encyclopedia Britannica. 3 June 2021.International Organization for Standardization" ''Encyclopedia Britannica''. Retrieved 2022-04-26. It is headquartered in Geneva, Switzerland, and works in 167 countries . The three official languages of the ISO are English, Fren ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


OLAC
OLAC, the Open Language Archives Community, is an initiative to create a unified means of searching online databases of language resources for linguistic research. The information about resources is stored in XML format for easy searching. OLAC was founded in 2000, and is hosted at the Linguistic Data Consortium webserver at the University of Pennsylvania. OLAC advises on best practices in language archiving, and works to promote interoperation between language archives. Metadata The OLAC metadata set is based on the complete set of Dublin Core metadata terms DCMT, but the format allows for the use of extensions to express community-specific qualifiers. It is often contrasted to IMDI IMDI (ISLE Meta Data Initiative) is a metadata standard to describe multi-media and multi-modal language resources. The standard provides interoperability for browsable and searchable corpus structures and resource descriptions with help of specif ... (ISLE Metadata Initiative). Attributes The ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]