HOME

TheInfoList



OR:

Wikisource is an online wiki-based
digital library A digital library (also called an online library, an internet library, a digital repository, a library without walls, or a digital collection) is an online database of digital resources that can include text, still images, audio, video, digital ...
of free-content textual sources operated by the
Wikimedia Foundation The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as foundation (United States law), a charitable foundation. It is the host of Wikipedia, th ...
. Wikisource is the name of the project as a whole; it is also the name for each instance of that project, one for each language. The project's aim is to host all forms of free text, in many languages, and translations. Originally conceived as an archive to store useful or important historical texts, it has expanded to become a general-content library. The project officially began on November 24, 2003, under the name Project Sourceberg, a play on
Project Gutenberg Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the oldest digital li ...
. The name Wikisource was adopted later that year and it received its own
domain name In the Internet, a domain name is a string that identifies a realm of administrative autonomy, authority, or control. Domain names are often used to identify services provided through the Internet, such as websites, email services, and more. ...
. The project holds works that are either in the
public domain The public domain (PD) consists of all the creative work to which no Exclusive exclusive intellectual property rights apply. Those rights may have expired, been forfeited, expressly Waiver, waived, or may be inapplicable. Because no one holds ...
or
freely licensed Free content, libre content, libre information, or free information is any kind of creative work, such as a work of art, a book, a software, software program, or any other creative Media (communication), content for which there are very minimal ...
: professionally published works or historical source documents, not vanity products. Verification was initially made offline, or by trusting the reliability of other digital libraries. Now works are supported by online scans via the ProofreadPage extension, which ensures the reliability and accuracy of the project's texts. Some individual Wikisources, each representing a specific language, now only allow works backed up with scans. While the bulk of its collection are texts, Wikisource as a whole hosts other media, from comics to film to
audiobook An audiobook (or a talking book) is a recording of a book or other work being read out loud. A reading of the complete text is described as "unabridged", while readings of shorter versions are abridgements. Spoken audio has been available in sch ...
s. Some Wikisources allow user-generated annotations, subject to the specific policies of the Wikisource in question. The project has come under criticism for lack of reliability but it is also cited by organisations such as the
National Archives and Records Administration The National Archives and Records Administration (NARA) is an independent agency of the United States government within the executive branch, charged with the preservation and documentation of government and historical records. It is also task ...
. As of , there are Wikisource subdomains active for languages
Wikimedia The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as a charitable foundation. It is the host of Wikipedia, the eighth most visited website ...
's
MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...
API:Sitematrix. Retrieved from Data:Wikipedia statistics/meta.tab
comprising a total of articles and recently active editors.
Wikimedia The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as a charitable foundation. It is the host of Wikipedia, the eighth most visited website ...
's
MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...
API:Siteinfo. Retrieved from Data:Wikipedia statistics/data.tab


History

The original concept for Wikisource was as storage for useful or important historical texts. These texts were intended to support
Wikipedia Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...
articles, by providing primary evidence and original source texts, and as an archive in its own right. The collection was initially focused on important historical and cultural material, distinguishing it from other digital archives like Project Gutenberg. The project was originally called Project Sourceberg during its planning stages (a play on words for Project Gutenberg). In 2001, there was a dispute on Wikipedia regarding the addition of primary-source materials, leading to edit wars over their inclusion or deletion. Project Sourceberg was suggested as a solution to this. In describing the proposed project, user The Cunctator said, "It would be to Project Gutenberg what Wikipedia is to
Nupedia Nupedia was a multi-language online encyclopedia whose articles were written by volunteer contributors with relevant subject-matter expertise, reviewed by expert editors before publication, and licensed as free content. It was founded by Jimmy ...
", soon clarifying the statement with "we don't want to try to duplicate Project Gutenberg's efforts; rather, we want to complement them. Perhaps Project Sourceberg can mainly work as an interface for easily linking from Wikipedia to a Project Gutenberg file, and as an interface for people to easily submit new work to PG." Initial comments were skeptical, with
Larry Sanger Lawrence Mark Sanger (; born July 16, 1968) is an American Internet project developer and philosopher who co-founded Wikipedia along with Jimmy Wales. Sanger coined Wikipedia's name, and provided initial drafts for many of its early guidelines, ...
questioning the need for the project, writing "The hard question, I guess, is why we are reinventing the wheel, when Project Gutenberg already exists? We'd want to complement Project Gutenberg—how, exactly?", and
Jimmy Wales Jimmy Donal Wales (born August 7, 1966), also known as Jimbo Wales, is an American List of Internet entrepreneurs, Internet entrepreneur and former Trader (finance), financial trader. He is a Founders of Wikipedia, co-founder of the non-profi ...
adding "like Larry, I'm interested that we think it over to see what we can add to Project Gutenberg. It seems unlikely that primary sources should in general be editable by anyone — I mean, Shakespeare is Shakespeare, unlike our commentary on his work, which is whatever we want it to be." The project began its activity at ps.wikipedia.org. The contributors understood the "PS" subdomain to mean either "primary sources" or Project Sourceberg. However, this resulted in Project Sourceberg occupying the subdomain of the
Pashto Wikipedia Wikipedia is a free content, free multilingualism, multilingual open source, open-source wiki-based online encyclopedia open collaboration, edited and maintained by a Wikipedia community, community of volunteer editors, started on 15 January 2001 ...
(the ISO language code of the
Pashto language Pashto ( , ; , ) is an eastern Iranian language in the Indo-European language family, natively spoken in northwestern Pakistan and southern and eastern Afghanistan. It has official status in Afghanistan and the Pakistani province of Khyb ...
is "ps"). Project Sourceberg officially launched on November 24, 2003, when it received its own temporary URL, at sources.wikipedia.org, and all texts and discussions hosted on ps.wikipedia.org were moved to the temporary address. A vote on the project's name changed it to Wikisource on December 6, 2003. Despite the change in name, the project did not move to its permanent URL ( http://wikisource.org/) until July 23, 2004.


Logo and slogan

Since Wikisource was initially called "Project Sourceberg", its first logo was a picture of an
iceberg An iceberg is a piece of fresh water ice more than long that has broken off a glacier or an ice shelf and is floating freely in open water. Smaller chunks of floating glacially derived ice are called "growlers" or "bergy bits". Much of an i ...
. Two votes conducted to choose a successor were inconclusive, and the original logo remained until 2006. Finally, for both legal and technical reasons—because the picture's license was inappropriate for a Wikimedia Foundation logo and because a photo cannot scale properly—a stylized vector iceberg inspired by the original picture was mandated to serve as the project's logo. The first prominent use of Wikisource's slogan—''The Free Library''—was at the project's multilingual portal, when it was redesigned based upon the Wikipedia portal on August 27, 2005, (historical version). As in the Wikipedia portal the Wikisource slogan appears around the logo in the project's ten largest languages. Clicking on the portal's central images (the iceberg logo in the center and the "Wikisource" heading at the top of the page) links to a '' list of translations'' for ''Wikisource'' and ''The Free Library'' in 60 languages.


Tools built

A
MediaWiki MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker,mailarchive:wikipedia-l/2001-August/000382.html, Magnus Manske's announc ...
extension called ProofreadPage was developed for Wikisource by developer ThomasV to improve the vetting of transcriptions by the project. This displays pages of scanned works side by side with the text relating to that page, allowing the text to be
proofread Proofreading is a phase in the process of publishing where galley proofs are compared against the original manuscripts or graphic artworks, to identify transcription errors in the typesetting process. In the past, proofreaders would place corre ...
and its accuracy later verified independently by any other editor. Once a book, or other text, has been scanned, the raw images can be modified with
image processing An image or picture is a visual representation. An image can be two-dimensional, such as a drawing, painting, or photograph, or three-dimensional, such as a carving or sculpture. Images may be displayed through other media, including a pr ...
software to correct for page rotations and other problems. The retouched images can then be converted into a
PDF Portable document format (PDF), standardized as ISO 32000, is a file format developed by Adobe Inc., Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, computer hardware, ...
or DjVu file and uploaded to either Wikisource or
Wikimedia Commons Wikimedia Commons, or simply Commons, is a wiki-based Digital library, media repository of Open content, free-to-use images, sounds, videos and other media. It is a project of the Wikimedia Foundation. Files from Wikimedia Commons can be used ...
. This system assists editors in ensuring the accuracy of texts on Wikisource. The original page scans of completed works remain available to any user so that errors may be corrected later and readers may check texts against the originals. ProofreadPage also allows greater participation, since access to a physical copy of the original work is not necessary to be able to contribute to the project once images have been uploaded.


Milestones

Within two weeks of the project's official start at sources.wikipedia.org, over 1,000 pages had been created, with approximately 200 of these being designated as actual articles. On January 4, 2004, Wikisource welcomed its 100th registered user. In early July, 2004 the number of articles exceeded 2,400, and more than 500 users had registered. On April 30, 2005, there were 2667 registered users (including 18 administrators) and almost 19,000 articles. The project passed its 96,000th edit that same day. On November 27, 2005, the English Wikisource passed 20,000 text-units in its third month of existence, already holding more texts than did the entire project in April (before the move to language subdomains). On May 10, 2006, the /fr.wikisource.org/w/index.php?title=Portail:Philosophie&oldid=91377 first Wikisource Portal/span> was created. On February 14, 2008, the English Wikisource passed 100,000 text-units with Chapter LXXIV of '' Six Months at the White House'', a memoir by painter
Francis Bicknell Carpenter Francis Bicknell Carpenter (August 6, 1830 – May 23, 1900) was an American painter born in Homer (town), New York, Homer, New York. Carpenter is best known for his painting ''First Reading of the Emancipation Proclamation of President Lincoln ...
. In November, 2011, 250,000 text-units milestone was passed.


Library contents

Wikisource collects and stores in digital format previously published texts; including novels, non-fiction works, letters, speeches, constitutional and historical documents, laws and a range of other documents. All texts collected are either free of copyright or released under the Creative Commons Attribution/Share-Alike License. Texts in all languages are welcomed, as are translations. In addition to texts, Wikisource hosts material such as
comics a Media (communication), medium used to express ideas with images, often combined with text or other visual information. It typically the form of a sequence of Panel (comics), panels of images. Textual devices such as speech balloons, Glo ...
,
film A film, also known as a movie or motion picture, is a work of visual art that simulates experiences and otherwise communicates ideas, stories, perceptions, emotions, or atmosphere through the use of moving images that are generally, sinc ...
s, recordings and spoken-word works. All texts held by Wikisource must have been previously published; the project does not host "
vanity press A vanity press or vanity publisher, sometimes also subsidy publisher, is a book printer that is paid by authors to Self-published, self-publish their books. A vanity press charges fees in advance and does not contribute to the development of the ...
" books or documents produced by its contributors. A scanned source is preferred on many Wikisources and required on some. Most Wikisources will, however, accept works transcribed from offline sources or acquired from other digital libraries. The requirement for prior publication can also be waived in a small number of cases if the work is a source document of notable historical importance. The legal requirement for works to be licensed or free of copyright remains constant.


Annotations and translations – the difference to Wikibooks

The only original pieces accepted by Wikisource are annotations and translations. Wikisource, and its sister project
Wikibooks Wikibooks (previously called ''Wikimedia Free Textbook Project'' and ''Wikimedia-Textbooks'') is a wiki-based Wikimedia project hosted by the Wikimedia Foundation for the creation of free content digital textbooks and annotated texts that anyon ...
, has the capacity for annotated editions of texts. On Wikisource, the annotations are supplementary to the original text, which remains the primary objective of the project. By contrast, on Wikibooks the annotations are primary, with the original text as only a reference or supplement, if present at all. Annotated editions are more popular on the German Wikisource. The project also accommodates translations of texts provided by its users. A significant translation on the English Wikisource is the Wiki Bible project, intended to create a new, "laissez-faire translation" of
The Bible The Bible is a collection of religious texts that are central to Christianity and Judaism, and esteemed in other Abrahamic religions such as Islam. The Bible is an anthology (a compilation of texts of a variety of forms) originally writte ...
.


Structure


Language subdomains

A separate Hebrew version of Wikisource ( he.wikisource.org) was created in August 2004. The need for a language-specific
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
website derived from the difficulty of typing and editing Hebrew texts in a
left-to-right A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
environment (Hebrew is written right-to-left). In the ensuing months, contributors in other languages including
German German(s) may refer to: * Germany, the country of the Germans and German things **Germania (Roman era) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizenship in Germany, see also Ge ...
requested their own wikis, but a December vote on the creation of separate language domains was inconclusive. Finally, a second vote that ended May 12, 2005, supported the adoption of separate language subdomains at Wikisource by a large margin, allowing each language to host its texts on its own wiki. An initial wave of 14 languages was set up on August 23, 2005. The new languages did not include English, but the code en: was temporarily set to redirect to the main website ( wikisource.org). At this point the Wikisource community, through a mass project of manually sorting thousands of pages and categories by language, prepared for a second wave of page imports to local wikis. On September 11, 2005, the wikisource.org wiki was reconfigured to enable the English version, along with 8 other languages that were created early that morning and late the night before. Three more languages were created on March 29, 2006, and then another large wave of 14 language domains was created on June 2, 2006. Languages without subdomains are locally incubated. , 182 languages are hosted locally. As of , there are Wikisource subdomains for languages of which are active and are closed. The active sites have articles and the closed sites have articles. There are registered users of which are recently active.


wikisource.org

During the move to language subdomains, the community requested that the main wikisource.org website remain a functioning wiki, in order to serve three purposes: # ''To be a multilingual coordination site for the entire Wikisource project in all languages.'' In practice, use of the website for multilingual coordination has not been heavy since the conversion to language domains. Nevertheless, there is some policy activity at the
Scriptorium A scriptorium () was a writing room in medieval European monasteries for the copying and illuminating of manuscripts by scribes. The term has perhaps been over-used—only some monasteries had special rooms set aside for scribes. Often they ...
, and multilingual updates for news and language milestones at pages such as Wikisource:2007. # ''To be a home for texts in languages without their own subdomains, each with its own local main page for self-organization.'' As a language incubator, the wiki currently provides a home for over 30 languages that do not yet have their own language subdomains. Some of these are very active, and have built libraries with hundreds of texts (such as
Volapük Volapük (; , 'Language of the World', or lit. 'World Speak') is a constructed language created in 1879 and 1880 by Johann Martin Schleyer, a Roman Catholic priest in Baden, Germany, who believed that God told him to create an international lang ...
). # ''To provide direct, ongoing support by a local wiki community for a dynamic multilingual portal at its Main Page, for users who go to http://wikisource.org.'' The current Main Page portal was created on August 26, 2005, by ThomasV, who based it upon the Wikipedia portal. The idea of a project-specific coordination wiki, first realized at Wikisource, also took hold in another Wikimedia project, namely at
Wikiversity Wikiversity is a Wikimedia Foundation project that supports learning communities, their learning materials, and resulting activities. It differs from Wikipedia in that it offers tutorials and other materials for the fostering of learning, rather ...
's Beta Wiki. Like wikisource.org, it serves Wikiversity coordination in all languages, and as a language incubator, but unlike Wikisource, its
Main Page Welcome to Wikipedia, the free content, free encyclopedia that Help:Introduction to Wikipedia, anyone can edit. Special:Statistics, active editors Special:Statistics, articles in English language, English Did you know ... In the ...
does not serve as its multilingual portal.


Reception

Wikipedia co-founder
Larry Sanger Lawrence Mark Sanger (; born July 16, 1968) is an American Internet project developer and philosopher who co-founded Wikipedia along with Jimmy Wales. Sanger coined Wikipedia's name, and provided initial drafts for many of its early guidelines, ...
criticised Wikisource and sister project
Wiktionary Wiktionary (, ; , ; rhyming with "dictionary") is a multilingual, web-based project to create a free content dictionary of terms (including words, phrases, proverbs, linguistic reconstructions, etc.) in all natural languages and in a number o ...
in 2011, after he left the project, saying that their collaborative nature and technology means that there is no oversight by experts, and alleging that their content is therefore not reliable.
Bart D. Ehrman Bart Denton Ehrman (born October 5, 1955) is an American New Testament scholar focusing on textual criticism of the New Testament, the historical Jesus, and the origins and development of early Christianity. He has written and edited 30 books ...
, a New Testament scholar and professor of religious studies at the
University of North Carolina at Chapel Hill The University of North Carolina at Chapel Hill (UNC, UNC–Chapel Hill, or simply Carolina) is a public university, public research university in Chapel Hill, North Carolina, United States. Chartered in 1789, the university first began enrolli ...
, has criticised the English Wikisource's project to create a user-generated translation of the Bible saying "Democratization isn't necessarily good for scholarship."
Richard Elliott Friedman Richard Elliott Friedman (born May 5, 1946) is an American biblical scholar, theologian, and translator who currently serves as the Ann and Jay Davis Professor of Jewish Studies at the University of Georgia. Life and career Friedman was born in ...
, an Old Testament scholar and professor of Jewish studies at the
University of Georgia The University of Georgia (UGA or Georgia) is a Public university, public Land-grant university, land-grant research university with its main campus in Athens, Georgia, United States. Chartered in 1785, it is the oldest public university in th ...
, identified errors in the translation of the
Book of Genesis The Book of Genesis (from Greek language, Greek ; ; ) is the first book of the Hebrew Bible and the Christian Old Testament. Its Hebrew name is the same as its incipit, first word, (In the beginning (phrase), 'In the beginning'). Genesis purpor ...
as of 2008. In 2010, Wikimedia France signed an agreement with the (National Library of France) to add scans from its own ''Gallica'' digital library to French Wikisource. Fourteen hundred public domain French texts were added to the Wikisource library as a result via upload to the
Wikimedia Commons Wikimedia Commons, or simply Commons, is a wiki-based Digital library, media repository of Open content, free-to-use images, sounds, videos and other media. It is a project of the Wikimedia Foundation. Files from Wikimedia Commons can be used ...
. The quality of the transcriptions, previously automatically generated by
optical character recognition Optical character recognition or optical character reader (OCR) is the electronics, electronic or machine, mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo ...
(OCR), was expected to be improved by Wikisource's human proofreaders. In 2011, the English Wikisource received many high-quality scans of documents from the US
National Archives and Records Administration The National Archives and Records Administration (NARA) is an independent agency of the United States government within the executive branch, charged with the preservation and documentation of government and historical records. It is also task ...
(NARA) as part of their efforts "to increase the accessibility and visibility of its holdings." Processing and upload to Commons of these documents, along with many images from the NARA collection, was facilitated by a NARA
Wikimedian in residence A Wikipedian in residence or Wikimedian in residence (WiR) is a Wikipedia editor, a Wikipedian (or Wikimedian), who accepts a placement with an institution, typically an art gallery, library, archive, museum, cultural institution, learned societ ...
, Dominic McDevitt-Parks. Many of these documents have been transcribed and proofread by the Wikisource community and are featured as links in the National Archives' own online catalog.


See also

*
Internet Archive The Internet Archive is an American 501(c)(3) organization, non-profit organization founded in 1996 by Brewster Kahle that runs a digital library website, archive.org. It provides free access to collections of digitized media including web ...
– non-profit digital library *
Open Library Open Library is an online project intended to create "one web page for every book ever published". Created by Aaron Swartz, Brewster Kahle, Alexis Rossi, Anand Chitipothu, and Rebecca Hargrave Malamud, Open Library is a project of the Internet ...
– an online database and repository of books, created by the Internet Archive


References


External links

Wikisource * * Wikipedia:List of Wikisources * Wikisource:For Wikipedians About Wikisource * Danny Wool on Wikisource (
Wikimedia Foundation The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered there as foundation (United States law), a charitable foundation. It is the host of Wikipedia, th ...
article). * A personal perspective on the history of Wikisource by Angela Beesley * Early discussions and plans for the project (Meta) {{Authority control Aggregation-based digital libraries Ebook suppliers Internet properties established in 2003 Multilingual websites Proofreading Wikimedia projects Articles containing video clips