Subject Indexing

	Subject Indexing Subject indexing is the act of describing or classifying a document A document is a writing, written, drawing, drawn, presented, or memorialized representation of thought, often the manifestation of nonfiction, non-fictional, as well as fictional, content. The word originates from the Latin ', which denotes ... by index terms, keywords, or other symbols in order to indicate what different documents are '' about'', to summarize their contents or to increase findability. In other words, it is about identifying and describing the '' subject'' of documents. Indexes are constructed, separately, on three distinct levels: terms in a document such as a book; objects in a collection such as a library; and documents (such as books and articles) within a field of knowledge. Subject indexing is used in information retrieval especially to create bibliographic indexes to retrieve documents on a particular subject. Examples of academic indexing services are Zentralblatt MATH, Ch ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Document Classification Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a document to one or more Class (philosophy), classes or Categorization, categories. This may be done "manually" (or "intellectually") or algorithmically. The intellectual classification of documents has mostly been the province of library science, while the algorithmic classification of documents is mainly in information science and computer science. The problems are overlapping, however, and there is therefore interdisciplinary research on document classification. The documents to be classified may be texts, images, music, etc. Each kind of document possesses its special classification problems. When not otherwise specified, text classification is implied. Documents may be classified according to their Subject (documents), subjects or according to other attributes (such as document type, author, printing year etc.). In the rest of th ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Librarians A librarian is a person who professionally works managing information. Librarians' common activities include providing access to information, conducting research, creating and managing information systems, creating, leading, and evaluating educational programs, and providing instruction on information literacy to users. The role of the librarian has changed over time, with the past century in particular bringing many new media and technologies into play. From the earliest libraries in the ancient world to the modern information hub, there have been keepers and disseminators of the information held in data stores. Roles and responsibilities vary widely depending on the type of library, the specialty of the librarian, and the functions needed to maintain collections and make them available to its users. Education for librarianship has changed over time to reflect changing roles. History The ancient world The Sumerians were the first to train clerks to keep records of accounts. '' ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Indexing And Abstracting Service An abstracting service is a service that provides abstracts of publications, often on a subject or group of related subjects, usually on a subscription basis. An indexing service is a service that assigns descriptors and other kinds of access points to documents. The word indexing service is today mostly used for computer programs, but may also cover services providing back-of-the-book indexes, journal indexes, and related kinds of indexes. An indexing and abstracting service is a service that provides shortening or summarizing of documents and assigning of descriptors for referencing documents. The product is often an abstracts journal or a bibliographic index, which may be a subject bibliography or a bibliographic database. Guidelines for indexing and abstracting, including the evaluation of such services, are given in the literature of library and information science.Lancaster, F. W. (1991/1998/2003). ''Indexing and abstracting in theory and practice''. London: Librar ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Document Classification Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a document to one or more Class (philosophy), classes or Categorization, categories. This may be done "manually" (or "intellectually") or algorithmically. The intellectual classification of documents has mostly been the province of library science, while the algorithmic classification of documents is mainly in information science and computer science. The problems are overlapping, however, and there is therefore interdisciplinary research on document classification. The documents to be classified may be texts, images, music, etc. Each kind of document possesses its special classification problems. When not otherwise specified, text classification is implied. Documents may be classified according to their Subject (documents), subjects or according to other attributes (such as document type, author, printing year etc.). In the rest of th ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Birger Hjørland Birger Hjørland (born January 1, 1947, in Denmark) is a professor of knowledge organization at the Royal School of Library and Information Science (RSLIS) in Copenhagen. His main areas of study pertain to theory of library and information science and of knowledge organization. Hjørland has contributed important developments to domain analysis and concept theory. He has been cited as an anchor of North American knowledge organization studies, as well as an information science pioneer. Birger Hjørland started working at the RSLIS in 1976, then became a research librarian at The Royal Library in Copenhagen from 1978 to 1990. From 1990 he returned to RSLIS and became professor in 2001. He has done research and taught at RSLIS ever since. Many of his students have become research librarians or information specialists. He is a member of the editorial board of various journals and the editor-in-chief of the ''ISKO Encyclopedia of Knowledge Organization''. Scientific work Hjørland ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Precision (information Retrieval) In pattern recognition, information retrieval, object detection and classification (machine learning), precision and recall are performance metrics that apply to data retrieved from a collection, corpus or sample space. Precision (also called positive predictive value) is the fraction of relevant instances among the retrieved instances. Written as a formula: \text = \frac Recall (also known as sensitivity) is the fraction of relevant instances that were retrieved. Written as a formula: \text = \frac Both precision and recall are therefore based on relevance. Consider a computer program for recognizing dogs (the relevant element) in a digital photograph. Upon processing a picture which contains ten cats and twelve dogs, the program identifies eight dogs. Of the eight elements identified as dogs, only five actually are dogs ( true positives), while the other three are cats ( false positives). Seven dogs were missed ( false negatives), and seven cats were correctly excl ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Recall (information Retrieval) In pattern recognition, information information retrieval, retrieval, object detection and classification (machine learning), precision and recall are performance metrics that apply to data retrieved from a data store, collection, Text_corpus, corpus or sample space. Precision (also called positive predictive value) is the fraction of relevant instances among the retrieved instances. Written as a formula: \text = \frac Recall (also known as Sensitivity and specificity, sensitivity) is the fraction of relevant instances that were retrieved. Written as a formula: \text = \frac Both precision and recall are therefore based on Relevance (information retrieval), relevance. Consider a computer program for recognizing dogs (the relevant element) in a digital photograph. Upon processing a picture which contains ten cats and twelve dogs, the program identifies eight dogs. Of the eight elements identified as dogs, only five actually are dogs (True positive, true positives), while t ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Homograph A homograph (from the , and , ) is a word that shares the same written form as another word but has a different meaning. However, some dictionaries insist that the words must also be pronounced differently, while the Oxford English Dictionary says that the words should also be of "different origin". In this vein, ''The Oxford Guide to Practical Lexicography'' lists various types of homographs, including those in which the words are discriminated by being in a different ''word class'', such as ''hit'', the verb ''to strike'', and ''hit'', the noun ''a strike''. If, when spoken, the meanings may be distinguished by different pronunciations, the words are also heteronyms. Words with the same writing ''and'' pronunciation (i.e. are both homographs and homophones) are considered homonyms. However, in a broader sense the term "homonym" may be applied to words with the same writing ''or'' pronunciation. Homograph disambiguation is critically important in speech synthesis, natural ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Synonym A synonym is a word, morpheme, or phrase that means precisely or nearly the same as another word, morpheme, or phrase in a given language. For example, in the English language, the words ''begin'', ''start'', ''commence'', and ''initiate'' are all synonyms of one another: they are ''synonymous''. The standard test for synonymy is substitution: one form can be replaced by another in a sentence without changing its meaning. Words may often be synonymous in only one particular sense: for example, ''long'' and ''extended'' in the context ''long time'' or ''extended time'' are synonymous, but ''long'' cannot be used in the phrase ''extended family''. Synonyms with exactly the same meaning share a seme or denotational sememe, whereas those with inexactly similar meanings share a broader denotational or connotational sememe and thus overlap within a semantic field. The former are sometimes called cognitive synonyms and the latter, near-synonyms, plesionyms or poecilonyms. Lexic ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Concordance (publishing) A concordance is an alphabetical list of the principal words used in a book or body of work, listing every instance of each word with its immediate context (language use)#Verbal context, context. Historically, concordances have been compiled only for works of special importance, such as the Vedas, Bible, Qur'an or the works of William Shakespeare, Shakespeare, James Joyce or classical Latin and Greek authors, because of the time, difficulty, and expense involved in creating a concordance in the pre-computer era. A concordance is more than an Subject indexing, index, with additional material such as commentary, definitions and topical cross-indexing which makes producing one a labor-intensive process even when assisted by computers. In the precomputing era, search engine technology, search technology was unavailable, and a concordance offered readers of long works such as the Bible something comparable to search results for every word that they would have been likely to search fo ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Stop Words Stop words are the words in a stop list (or ''stoplist'' or ''negative dictionary'') which are filtered out ("stopped") before or after processing of natural language data (i.e. text) because they are deemed to have little semantic value or are otherwise insignificant for the task at hand. There is no single universal list of stop words used by all natural language processing (NLP) tools, nor any agreed upon rules for identifying stop words, and indeed not all tools even use such a list. Therefore, any group of words can be chosen as the stop words for a given purpose. The "general trend in nformation retrievalsystems over time has been from standard use of quite large stop lists (200–300 terms) to very small stop lists (7–12 terms) to no stop list whatsoever". History of stop words A predecessor concept was used in creating some concordances. For example, the first Hebrew concordance, Isaac Nathan ben Kalonymus's , contained a one-page list of unindexed words, with nonsu ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Natural Language A natural language or ordinary language is a language that occurs naturally in a human community by a process of use, repetition, and change. It can take different forms, typically either a spoken language or a sign language. Natural languages are distinguished from constructed and formal languages such as those used to program computers or to study logic. Defining natural language Natural languages include ones that are associated with linguistic prescriptivism or language regulation. ( Nonstandard dialects can be viewed as a wild type in comparison with standard languages.) An official language with a regulating academy such as Standard French, overseen by the , is classified as a natural language (e.g. in the field of natural language processing), as its prescriptive aspects do not make it constructed enough to be a constructed language or controlled enough to be a controlled natural language. Natural language are different from: * artificial and constructed la ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]