Silesia Corpus
The Silesia corpus is a collection of files intended for use as a benchmark for testing lossless data compression algorithms. It was created in 2003 as an alternative to the Canterbury corpus and Calgary corpus, based on concerns about how well these represented modern files. It contains various data types, including large text documents, executable files, and databases.
Contents
The corpus consists of 12 files, totaling 211 MB. The files were chosen to represent what the author considered to be data types likely to grow rapidly in size over time, such as computer programs and databases, along with more traditional compression benchmarks, such as large text files. Because it has a broader and more modern selection of data types, it is considered a better source of test data for compression algorithms than the Calgary corpus.
See also
* Data compression
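As a rough illustration of how a corpus such as this one is used as a benchmark, the sketch below compresses each file and reports its compression ratio. It is only a minimal example under stated assumptions: `zlib` from the Python standard library stands in for whatever algorithm is being tested, and the function names `load_corpus` and `compression_report`, as well as the directory name in the usage note, are hypothetical and not part of the corpus.

```python
import zlib
from pathlib import Path


def load_corpus(directory):
    """Read every regular file in `directory` into memory as {name: bytes}.

    `directory` is a hypothetical local folder holding the corpus files.
    """
    return {
        path.name: path.read_bytes()
        for path in sorted(Path(directory).iterdir())
        if path.is_file()
    }


def compression_report(files, level=6):
    """Return {name: (original_size, compressed_size, ratio)} using zlib.

    zlib is only a stand-in here; any lossless compressor could be swapped in.
    """
    report = {}
    for name, data in files.items():
        compressed = zlib.compress(data, level)
        # A lossless benchmark is only meaningful if the input is restored exactly.
        if zlib.decompress(compressed) != data:
            raise ValueError(f"round trip failed for {name}")
        ratio = len(data) / len(compressed)
        report[name] = (len(data), len(compressed), ratio)
    return report
```

Assuming the corpus files have been unpacked into a local folder (for example a hypothetical `silesia/`), `compression_report(load_corpus("silesia"))` would give one entry per file, which makes it easy to compare how the same algorithm behaves on text, executables, and database files.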
Computer File
A computer file is a resource for recording data on a computer storage device, primarily identified by its filename. Just as words can be written on paper, so too can data be written to a computer file. Files can be shared with and transferred between computers and mobile devices via removable media, networks, or the Internet. Different types of computer files are designed for different purposes. A file may be designed to store a written message, a document, a spreadsheet, an image, a video, a program, or any of a wide variety of other kinds of data. Certain files can store multiple data types at once. By using computer programs, a person can open, read, change, save, and close a computer file. Computer files may be reopened, modified, and copied an arbitrary number of times. Files are typically organized in a file syst ...
The Peasants
''The Peasants'' (''Chłopi'') is a novel written by the Polish author Władysław Reymont in four parts between 1904 and 1909. He started writing it in 1897, but because of a railway accident and health problems, it took seven years to complete. The first parts of the story were published in the weekly magazine ''Tygodnik Illustrowany''. The novel has been translated into at least 27 languages. Władysław Reymont received the 1924 Nobel Prize in Literature for this work.
Description
Each of the four parts represents a season in the life of the peasants – Autumn (published in 1904), Winter (published in 1904), Spring (published in 1906), and Summer (published in 1909). This division underlines the relationship of human life with nature.
Main characters
* Maciej Boryna – the richest man in the village and the main character of the novel.
* Antek Boryna – Maciej's son, husband of Hanka
* Hanka Boryna – Antek's wife and a mother of three children
* Jagna – a beautiful 19-year-o ...
Data Compression
In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression reduces bits by identifying and eliminating statistical redundancy. No information is lost in lossless compression. Lossy compression reduces bits by removing unnecessary or less important information. Typically, a device that performs data compression is referred to as an encoder, and one that performs the reversal of the process (decompression) as a decoder. The process of reducing the size of a data file is often referred to as data compression. In the context of data transmission, it is called source coding: encoding is done at the source of the data before it is stored or transmitted. Source coding should not be confused with channel coding, for error detection and correction or line coding, the means for mapping data onto a sig ...
X-Ray
An X-ray (also known in many languages as Röntgen radiation) is a form of high-energy electromagnetic radiation with a wavelength shorter than those of ultraviolet rays and longer than those of gamma rays. Roughly, X-rays have a wavelength ranging from 10 nanometers to 10 picometers, corresponding to frequencies in the range of 30 petahertz to 30 exahertz (3×10^16 Hz to 3×10^19 Hz) and photon energies in the range of 100 eV to 100 keV, respectively. X-rays were discovered in 1895 by the German scientist Wilhelm Conrad Röntgen, who named them ''X-radiation'' to signify an unknown type of radiation (Novelline, Robert (1997). ''Squire's Fundamentals of Radiology''. Harvard University Press. 5th edition). X-rays can penetrate many solid substances such as construction materials and living tissue, so X-ray radiography is widely used in medical diagnostics (e.g., checking for Bo ...
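The wavelength, frequency, and photon-energy ranges quoted above are linked by f = c/λ and E = hf. The following minimal sketch reproduces the conversion; the function names are hypothetical, and the constants are the standard SI values for the speed of light and the Planck constant (expressed in eV·s). The computed energies come out near 124 eV and 124 keV, which the text rounds to 100 eV and 100 keV.

```python
SPEED_OF_LIGHT = 299_792_458.0      # metres per second (exact SI value)
PLANCK_EV_S = 4.135667696e-15       # Planck constant in electronvolt-seconds


def frequency_hz(wavelength_m):
    """Frequency of electromagnetic radiation: f = c / wavelength."""
    return SPEED_OF_LIGHT / wavelength_m


def photon_energy_ev(wavelength_m):
    """Photon energy in electronvolts: E = h * f = h * c / wavelength."""
    return PLANCK_EV_S * frequency_hz(wavelength_m)


if __name__ == "__main__":
    # The two ends of the X-ray band given in the text: 10 nm and 10 pm.
    for label, wavelength in (("10 nm", 10e-9), ("10 pm", 10e-12)):
        print(label, f"{frequency_hz(wavelength):.3e} Hz",
              f"{photon_energy_ev(wavelength):.3e} eV")
```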
Webster's Dictionary
''Webster's Dictionary'' is any of the US English language dictionaries edited in the early 19th century by Noah Webster (1758–1843), a US lexicographer, as well as numerous related or unrelated dictionaries that have adopted the Webster's name in his honor. "''Webster's''" has since become a genericized trademark in the United States for US English dictionaries, and is widely used in dictionary titles. Merriam-Webster is the corporate heir to Noah Webster's original works, which are in the public domain.
Noah Webster's ''American Dictionary of the English Language''
Noah Webster (1758–1843), the author of the readers and spelling books which dominated the American market at the time, spent decades of research in compiling his dictionaries. His first dictionary, ''A Compendious Dictionary of the English Language'', appeared in 1806. In it, he popularized features which would become a hallmark of American English spelling (''center'' rather than ''centre'', ''honor'' rat ...
Smithsonian Astrophysical Observatory Star Catalog
The Smithsonian Astrophysical Observatory Star Catalog is an astrometric star catalogue created by the Smithsonian Institution, a research institute. It was published by the Smithsonian Astrophysical Observatory in 1966 and contains 258,997 stars. The catalogue was compiled from various previous astrometric catalogues, and contains only stars to about ninth magnitude for which accurate proper motions were known. Names in the SAO catalogue start with the letters SAO, followed by a number. The numbers are assigned following 18 ten-degree bands of declination, with stars sorted by right ascension within each band. The online version of the SAO Catalog was created by the HEASARC in March 2001 based on ADC/CDS Catalog I/131A, which itself is originally derived from a character-coded machine-readable version of the Smithsonian Astrophysical Observatory Star Catalog (SAO, SAO Staff 1966) prepared by T.A. Nagy in 1979, and subsequently modified over the next decade or so. Examples of S ...
Samba
Samba is a broad term for many of the rhythms that compose the better known Brazilian music genres that originated in the Afro-Brazilian communities of Bahia in the late 19th century and early 20th century. It is a name or prefix used for several rhythmic variants, such as samba urbano carioca (''urban Carioca samba'') and samba de roda (sometimes also called ''rural samba''), among many other forms of samba, mostly originating in the Rio de Janeiro and Bahia states. Having its roots in Brazilian folk traditions, especially those linked to the primitive rural samba of the colonial and imperial periods, it is considered one of the most important cultural phenomena in Brazil and one of the country's symbols. Present in the Portuguese language at least since the 19th century, the word "samba" was originally used to designate a "popular dance". Over time, its meaning has been extended to a "B ...
Władysław Reymont
Władysław Stanisław Reymont (born Rejment; 7 May 1867 – 5 December 1925) was a Polish novelist and the laureate of the 1924 Nobel Prize in Literature. His best-known work is the award-winning four-volume novel ''Chłopi'' (''The Peasants''). Born into an impoverished noble family, Reymont was educated to become a master tailor, but instead worked as a gateman at a railway station and then as an actor in a troupe. His intensive travels and voyages encouraged him to publish short stories, with notions of literary realism. Reymont's first successful and widely praised novel was ''The Promised Land'' from 1899, which brought attention to the bewildering social inequalities, poverty, conflictive multiculturalism and labour exploitation in the industrial city of Łódź (Lodz). The aim of the novel was to extensively emphasize the consequences of extreme industrialization and how it affects society as a whole. In 1900, Reymont was severely injured in a railway accident, wh ...
MySQL
MySQL is an open-source relational database management system (RDBMS). Its name is a combination of "My", the name of co-founder Michael Widenius's daughter My, and "SQL", the acronym for Structured Query Language. A relational database organizes data into one or more data tables in which data may be related to each other; these relations help structure the data. SQL is a language that programmers use to create, modify and extract data from the relational database, as well as control user access to the database. In addition to relational databases and SQL, an RDBMS like MySQL works with an operating system to implement a relational database in a computer's storage system, manages users, allows for network access and facilitates testing database integrity and creation of backups. MySQL is free and open-source software under the terms of the GNU General Public License, and is also available under a variety of proprietary licenses. MySQ ...
Lossless Data Compression
Lossless compression is a class of data compression that allows the original data to be perfectly reconstructed from the compressed data with no loss of information. Lossless compression is possible because most real-world data exhibits statistical redundancy. By contrast, lossy compression permits reconstruction only of an approximation of the original data, though usually with greatly improved compression rates (and therefore reduced media sizes). By operation of the pigeonhole principle, no lossless compression algorithm can shrink the size of all possible data: some data will get longer by at least one symbol or bit. Compression algorithms are usually effective for human- and machine-readable documents and cannot shrink the size of random data that contain no redundancy. Different algorithms exist that are designed either with a specific type of input data in mind or with speci ...
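The two claims above, exact reconstruction and the impossibility of shrinking every input, can be illustrated with a short sketch. It assumes `zlib` from the Python standard library as an example lossless compressor; the function names `roundtrip_sizes` and `count_shorter_bitstrings` are hypothetical. The counting function shows the pigeonhole argument: there are 2^n bit strings of length n but only 2^n − 1 strictly shorter ones, so no lossless scheme can map every length-n input to a shorter output.

```python
import os
import zlib


def roundtrip_sizes(data):
    """Compress and decompress `data`; return (original_size, compressed_size).

    Raises ValueError if the data is not reconstructed exactly.
    """
    compressed = zlib.compress(data, 9)
    if zlib.decompress(compressed) != data:
        raise ValueError("lossless round trip failed")
    return len(data), len(compressed)


def count_shorter_bitstrings(n):
    """Number of bit strings strictly shorter than n bits (lengths 0..n-1)."""
    return sum(2 ** k for k in range(n))


if __name__ == "__main__":
    redundant = b"abcd" * 25_000          # highly repetitive input
    random_bytes = os.urandom(100_000)    # no exploitable redundancy
    print("redundant:", roundtrip_sizes(redundant))
    print("random:   ", roundtrip_sizes(random_bytes))
    # 2**8 = 256 possible 8-bit inputs, but only 255 shorter outputs exist.
    print(2 ** 8, count_shorter_bitstrings(8))
```

On the repetitive input the compressed size is a small fraction of the original, while the random input typically comes out slightly larger than it went in because of the container overhead.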
OpenOffice
OpenOffice or open office may refer to:
Computing
Software
* OpenOffice.org (OOo), a discontinued open-source office software suite, originally based on StarOffice
* Apache OpenOffice (AOO), a derivative of OOo by the Apache Software Foundation, with contribution from IBM Lotus Symphony
Programming
* OpenOffice Basic (formerly known as StarOffice Basic or StarBasic or OOoBasic), a dialect of the programming language BASIC
File formats
* OpenDocument format (ODF), also known as ''Open Document Format for Office Applications'', a widely supported standard XML-based file format originating from OOo
* OpenOffice.org XML, a file format used by early versions of OpenOffice.org
* Office Open XML (OOXML), a competing file format from ...
Shared Library
In computing, a library is a collection of resources that can be leveraged during software development to implement a computer program. Commonly, a library consists of executable code such as compiled functions and classes, or a library can be a collection of source code. A resource library may contain data such as images and text. A library can be used by multiple, independent consumers (programs and other libraries). This differs from resources defined in a program, which can usually only be used by that program. When a consumer uses a library resource, it gains the value of the library without having to implement it itself. Libraries encourage software reuse in a modular fashion. Libraries can use other libraries, resulting in a hierarchy of libraries in a program. When writing code that uses a library, a programmer only needs to know how to use it, not its internal d ...