Canonicalization

	Canonicalization In computer science, canonicalization (sometimes standardization or normalization) is a process for converting data that has more than one possible representation into a "standard", "normal", or canonical form. This can be done to compare different representations for equivalence, to count the number of distinct data structures, to improve the efficiency of various algorithms by eliminating repeated calculations, or to make it possible to impose a meaningful sorting order. Usage cases Filenames Files in file systems may in most cases be accessed through multiple filenames. For instance in Unix-like systems, the string "/./" can be replaced by "/". In the C standard library, the function realpath() performs this task. Other operations performed by this function to canonicalize filenames are the handling of /.. components referring to parent directories, simplification of sequences of multiple slashes, removal of trailing slashes, and the resolution of symbolic links. Can ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Canonical Form In mathematics and computer science, a canonical, normal, or standard form of a mathematical object is a standard way of presenting that object as a mathematical expression. Often, it is one which provides the simplest representation of an object and which allows it to be identified in a unique way. The distinction between "canonical" and "normal" forms varies from subfield to subfield. In most fields, a canonical form specifies a ''unique'' representation for every object, while a normal form simply specifies its form, without the requirement of uniqueness. The canonical form of a positive integer in decimal representation is a finite sequence of digits that does not begin with zero. More generally, for a class of objects on which an equivalence relation is defined, a canonical form consists in the choice of a specific object in each class. For example: Jordan normal form is a canonical form for matrix similarity. The row echelon form is a canonical form, when one conside ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Canonical Link Element A canonical link element is an HTML element that helps webmasters prevent duplicate content issues in search engine optimization by specifying the "canonical" or "preferred" version of a web page. It is described in RFC 6596, which went live in April 2012. Purpose A major problem for search engines is to determine the original source for documents that are available on multiple URLs. Content duplication can happen in many ways, including: * Duplication due to -parameters * Duplication with multiple URLs due to CMS * Duplication due to accessibility on different hosts/protocols * Duplication due to print versions of websites Duplicate content issues occur when the same content is accessible from multiple URLs. For example, would be considered by search engines to be an entirely different page from , even though both URLs may reference the same content. In February 2009, Google, Yahoo and Microsoft announced support for the canonical link element, which can be inserted into ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	URL Canonicalization A canonical link element is an HTML element that helps webmasters prevent duplicate content issues in search engine optimization by specifying the "canonical" or "preferred" version of a web page. It is described in RFC 6596, which went live in April 2012. Purpose A major problem for search engines is to determine the original source for documents that are available on multiple URLs. Content duplication can happen in many ways, including: * Duplication due to -parameters * Duplication with multiple URLs due to CMS * Duplication due to accessibility on different hosts/protocols * Duplication due to print versions of websites Duplicate content issues occur when the same content is accessible from multiple URLs. For example, would be considered by search engines to be an entirely different page from , even though both URLs may reference the same content. In February 2009, Google, Yahoo and Microsoft announced support for the canonical link element, which can be inserted into th ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Search Engine Optimization Search engine optimization (SEO) is the process of improving the quality and quantity of website traffic to a website or a web page from search engines. SEO targets unpaid traffic (known as "natural" or " organic" results) rather than direct traffic or paid traffic. Unpaid traffic may originate from different kinds of searches, including image search, video search, academic search, news search, and industry-specific vertical search engines. As an Internet marketing strategy, SEO considers how search engines work, the computer-programmed algorithms that dictate search engine behavior, what people search for, the actual search terms or keywords typed into search engines, and which search engines are preferred by their targeted audience. SEO is performed because a website will receive more visitors from a search engine when websites rank higher on the search engine results page (SERP). These visitors can then potentially be converted into customers. History Webma ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Directory Traversal A directory traversal (or path traversal) attack exploits insufficient security validation or sanitization of user-supplied file names, such that characters representing "traverse to parent directory" are passed through to the operating system's file system API. An affected application can be exploited to gain unauthorized access to the file system. Directory traversal is also known as the ../ (dot dot slash) attack, directory climbing, and backtracking. Some forms of this attack are also canonicalization attacks. Example A typical example of a vulnerable application in PHP code is: TEMPLATE') include "/home/users/phpguru/templates/" . $template; An attack against this system could be to send the following HTTP request: GET /vulnerable.php HTTP/1.0 Cookie: TEMPLATE=../../../../../../../../../etc/ ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Canonical Link Element A canonical link element is an HTML element that helps webmasters prevent duplicate content issues in search engine optimization by specifying the "canonical" or "preferred" version of a web page. It is described in RFC 6596, which went live in April 2012. Purpose A major problem for search engines is to determine the original source for documents that are available on multiple URLs. Content duplication can happen in many ways, including: * Duplication due to -parameters * Duplication with multiple URLs due to CMS * Duplication due to accessibility on different hosts/protocols * Duplication due to print versions of websites Duplicate content issues occur when the same content is accessible from multiple URLs. For example, would be considered by search engines to be an entirely different page from , even though both URLs may reference the same content. In February 2009, Google, Yahoo and Microsoft announced support for the canonical link element, which can be inserted into ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Computer Science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (including the design and implementation of hardware and software). Computer science is generally considered an area of academic research and distinct from computer programming. Algorithms and data structures are central to computer science. The theory of computation concerns abstract models of computation and general classes of problems that can be solved using them. The fields of cryptography and computer security involve studying the means for secure communication and for preventing security vulnerabilities. Computer graphics and computational geometry address the generation of images. Programming language theory considers different ways to describe computational processes, and database theory concerns the management of repositories ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Duplicate Content Duplicate content is a term used in the field of search engine optimization to describe content that appears on more than one web page. The duplicate content can be substantial parts of the content within or across domains and can be either exactly duplicate or closely similar. When multiple pages contain essentially the same content, search engines such as Google and Bing can penalize or cease displaying the copying site in any relevant search results. Types Non-malicious Non-malicious duplicate content may include variations of the same page, such as versions optimized for normal HTML, mobile devices, or printer-friendliness, or store items that can be shown via multiple distinct URLs. Duplicate content issues can also arise when a site is accessible under multiple subdomains, such as with or without the "www." or where sites fail to handle the trailing slash of URLs correctly. Another common source of non-malicious duplicate content is pagination, in which content and/or cor ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Lexicography Lexicography is the study of lexicons, and is divided into two separate academic disciplines. It is the art of compiling dictionaries. * Practical lexicography is the art or craft of compiling, writing and editing dictionaries. * Theoretical lexicography is the scholarly study of semantic, orthographic, syntagmatic and paradigmatic features of lexemes of the lexicon ( vocabulary) of a language, developing theories of dictionary components and structures linking the data in dictionaries, the needs for information by users in specific types of situations, and how users may best access the data incorporated in printed and electronic dictionaries. This is sometimes referred to as 'metalexicography'. There is some disagreement on the definition of lexicology, as distinct from lexicography. Some use "lexicology" as a synonym for theoretical lexicography; others use it to mean a branch of linguistics pertaining to the inventory of words in a particular language. A person ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
picture info	Morphology (linguistics) In linguistics, morphology () is the study of words, how they are formed, and their relationship to other words in the same language. It analyzes the structure of words and parts of words such as stems, root words, prefixes, and suffixes. Morphology also looks at parts of speech, intonation and stress, and the ways context can change a word's pronunciation and meaning. Morphology differs from morphological typology, which is the classification of languages based on their use of words, and lexicology, which is the study of words and how they make up a language's vocabulary. While words, along with clitics, are generally accepted as being the smallest units of syntax, in most languages, if not all, many words can be related to other words by rules that collectively describe the grammar for that language. For example, English speakers recognize that the words ''dog'' and ''dogs'' are closely related, differentiated only by the plurality morpheme "-s", only found bound to ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Canonical XML Canonical XML is a normal form of XML, intended to allow relatively simple comparison of pairs of XML documents for equivalence; for this purpose, the Canonical XML transformation removes non-meaningful differences between the documents. Any XML document can be converted to Canonical XML. For example, XML permits whitespace to occur at various points within start-tags, and attributes to be specified in any order. Such differences are seldom if ever used to convey meaning, and so these forms are generally considered equivalent: <p class="a" secure="1"> <p secure = "1" class='a' > In converting an arbitrary XML document to Canonical XML, attributes are encoded in a normative order (alphabetical by name), and with normative spacing and quoting (though with all namespace declarations placed ahead of regular attributes, and namespaced attributes sorted by namespace rather than prefix or qualified name). Thus, the second form above would be converted ... [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]
	Noindex The noindex value of an HTML robots meta tag requests that automated Internet bots avoid indexing a web page.Robots and the META element Official W3 specification Reasons why one might want to use this meta tag include advising robots not to index a very large database, web pages that are very transitory, web pages that are under development, web pages that one wishes to keep slightly more private, or the printer and mobile-friendly versions of pages. Since the burden of honoring a website's noindex tag lies with the author of the search robot, sometimes these tags are ignored. Also the interpretation of the noindex tag is sometimes slightly different from one search engine company to the next. Noindexing entire pages [...More Info...] [...Related Items...] OR: [Wikipedia] [Google] [Baidu]

Noindexing entire pages