OCRFeeder
   HOME
*



picture info

OCRFeeder
OCRFeeder is an optical character recognition suite for GNOME, which also supports virtually any command-line OCR engine, such as CuneiForm, GOCR, Ocrad and Tesseract. It converts paper documents to digital document files and can serve to make them accessible to visually impaired users. OCRFeeder is free and open-source software subject to the terms of the GNU General Public License (GPL) version 3 or later. It is available for Linux and other Unix-like operating systems. History OCRFeeder was started as a master's thesis in computer science by Joaquim Rocha, who was later hired by Igalia, S.L. and continued development there. The first version was published in March 2009. The OCRFeeder project was initially published and hosted on Google Code, temporarily used Gitorious and now uses the GNOME infrastructure. Since 5 April 2010 a software package is included in the official Debian repositories. Version 0.7 from July 30, 2010 brought image pre-processing features, 0.7.1 (N ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Tesseract (software)
Tesseract is an optical character recognition engine for various operating systems. It is free software, released under the Apache License. Originally developed by Hewlett-Packard as proprietary software in the 1980s, it was released as open source in 2005 and development has been sponsored by Google since 2006.Announcing Tesseract OCR
- The official Google blog
In 2006, Tesseract was considered one of the most accurate open-source OCR engines available.


History

The Tesseract engine was originally developed as proprietary software at Hewlett Packard labs in Bristol, England and Greeley, Colorado between 1985 and 1994, with more changes made in 1996 to port to Windows, and some migration from C (programming lan ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Igalia
Igalia is a private, worker-owned, employee-run cooperative model consultancy focused on open source software. Based in A Coruña, Galicia (Spain), Igalia is known for its contributions and commitments to both open-source and open standards. Igalia's primary focus is on open source solutions for a large set of hardware and software platforms centering on browsers, graphics, multimedia, compilers, device drivers, virtualization, embedded Linux, and device drivers. Active projects Igalia is the current core maintainer of several projects, including: * Two official WebKit ports ** WebKit WPE, a WebKit port optimized for embedded devices ** WebKitGTK, the GTK port of the WebKit web rendering engine used in GNOME desktop applications. * GNOME Web (also known as Epiphany), the GNOME web browser * Orca, a screen reader that provides access to the graphical Linux desktop via user-customizable combinations of speech and braille. * Wolvic, a web browser built for extended reality. Iga ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Ocrad
Ocrad is an optical character recognition program and part of the GNU Project. It is free software licensed under the GNU GPL. Based on a feature extraction method, it reads images in portable pixmap formats known as Portable anymap and produces text in byte (8-bit) or UTF-8 formats. Also included is a layout analyser, able to separate the columns or blocks of text normally found on printed pages. User interface Ocrad can be used as a stand-alone command-line application or as a back-end to other programs. Kooka, which was the KDE environment's default scanning application until KDE 4, can use Ocrad as its OCR engine. Since conversion to newer Qt versions, current versions of KDE no longer contain Kooka; development continues in the KDE git repository. Ocrad can be also used as an OCR engine in OCRFeeder. History Ocrad has been developed by Antonio Diaz Diaz since 2003. Version 0.7 was released in February 2004, 0.14 in February 2006 and 0.18 in May 2009. It is written in ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


GOCR
GOCR (or JOCR) is a free optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap or PCX) into text files. Features GOCR claims it can handle single-column sans-serif fonts of 20–60 pixels in height. It reports trouble with serif fonts, overlapping characters, handwritten text, heterogeneous fonts, noisy images, large angles of skew, and text in anything other than a Latin alphabet. GOCR can also translate barcodes. User interface GOCR can be used as a stand-alone command-line application, or as a back-end to other programs. It comes with a gocr.tcl graphic interface. GOCR can be also used as an OCR engine in OCRFeeder. Development Version 0.3.0 was released in December 2000, 0.3.5 in February 2002, and 0.37 in August 2002. Between version 0.40 (March 2005) and 0.43 (December 2006), the recognition engine was gradually replaced with a vector version. Version 0.48 was released in A ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Swedish Language
Swedish ( ) is a North Germanic language spoken predominantly in Sweden and in parts of Finland. It has at least 10 million native speakers, the fourth most spoken Germanic language and the first among any other of its type in the Nordic countries overall. Swedish, like the other Nordic languages, is a descendant of Old Norse, the common language of the Germanic peoples living in Scandinavia during the Viking Era. It is largely mutually intelligible with Norwegian and Danish, although the degree of mutual intelligibility is largely dependent on the dialect and accent of the speaker. Written Norwegian and Danish are usually more easily understood by Swedish speakers than the spoken languages, due to the differences in tone, accent, and intonation. Standard Swedish, spoken by most Swedes, is the national language that evolved from the Central Swedish dialects in the 19th century and was well established by the beginning of the 20th century. While distinct regional varieties ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Master's Thesis
A thesis ( : theses), or dissertation (abbreviated diss.), is a document submitted in support of candidature for an academic degree or professional qualification presenting the author's research and findings.International Standard ISO 7144: DocumentationâPresentation of theses and similar documents International Organization for Standardization, Geneva, 1986. In some contexts, the word "thesis" or a cognate is used for part of a bachelor's or master's course, while "dissertation" is normally applied to a doctorate. This is the typical arrangement in American English. In other contexts, such as within most institutions of the United Kingdom and Republic of Ireland, the reverse is true. The term graduate thesis is sometimes used to refer to both master's theses and doctoral dissertations. The required complexity or quality of research of a thesis or dissertation can vary by country, university, or program, and the required minimum study period may thus vary significantly in du ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Joaquim Rocha Introducing OCRFeeder
Joaquim is the Portuguese and Catalan version of Joachim and may refer to: * Alberto Joaquim Chipande, politician * Eduardo Joaquim Mulémbwè, politician * Joaquim Agostinho (1943–1984), Portuguese professional bicycle racer * Joaquim Amat-Piniella (1913–1974), Catalan writer and politician * Joaquim Antonio (Callado) da Silva (1848–1880), Brazilian composer and flutist * Joaquim António de Aguiar (1792–1884), Portuguese politician * Joaquim Arcoverde de Albuquerque Cavalcanti (1850–1930), first Cardinal to be born in Latin America * Joaquim Augusto Mouzinho de Albuquerque (1855–1902), Portuguese soldier * Joaquim Benedito Barbosa Gomes, first black Supreme Federal Tribunal justice in Brazil * Joaquim Carvalho (born 1937), Portuguese football goalkeeper * Joaquim Chissano (born 1939), President of Mozambique * Joaquim Cruz (born 1963), former Brazilian athlete * Joaquim de Almeida (born 1957), Portuguese actor * Joaquim Ferraz (born 1974), Portuguese footballer * Jo ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Free And Open-source Software
Free and open-source software (FOSS) is a term used to refer to groups of software consisting of both free software and open-source software where anyone is freely licensed to use, copy, study, and change the software in any way, and the source code is openly shared so that people are encouraged to voluntarily improve the design of the software. This is in contrast to proprietary software, where the software is under restrictive copyright licensing and the source code is usually hidden from the users. FOSS maintains the software user's civil liberty rights (see the Four Essential Freedoms, below). Other benefits of using FOSS can include decreased software costs, increased security and stability (especially in regard to malware), protecting privacy, education, and giving users more control over their own hardware. Free and open-source operating systems such as Linux and descendants of BSD are widely utilized today, powering millions of servers, desktops, smartphones (e.g., ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




CuneiForm (software)
CuneiForm Cognitive OpenOCR is a freely distributed open-source OCR system developed by Russian software company Cognitive Technologies. CuneiForm OCR was developed by Cognitive Technologies as a commercial product in 1993. The system came with the most popular models of scanners, MFPs and software in Russia and the rest of the world: Corel Draw, Hewlet-Packard, Epson, Xerox, Samsung, Brother, Mustek, OKI, Canon, Olivetti, etc. In 2008 Cognitive Technologies opened the program's source codes. Features CuneiForm is a system developed for transforming the electronic copies of paper documents and image files into an editable form without changing the structure and the original document fonts in automatic or semi-automatic mode. The system includes two components for single and batch processing of electronic documents. The list of languages supported by the system: Besides, the system supports a mixture of Russian and English. Recognition of other mixed languages is only suppo ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

GNOME
A gnome is a mythological creature and diminutive spirit in Renaissance magic and alchemy, first introduced by Paracelsus in the 16th century and later adopted by more recent authors including those of modern fantasy literature. Its characteristics have been reinterpreted to suit the needs of various story tellers, but it is typically said to be a small humanoid that lives underground. Diminutive statues of gnomes introduced as lawn ornaments during the 19th century grew in popularity during the 20th century and came to be known as garden gnomes. History Origins The word comes from Renaissance Latin ''gnomus'', which first appears in ''A Book on Nymphs, Sylphs, Pygmies, and Salamanders, and on the Other Spirits'' by Paracelsus, published posthumously in Nysa in 1566 (and again in the Johannes Huser edition of 1589–1591 from an autograph by Paracelsus). The term may be an original invention of Paracelsus, possibly deriving the term from Latin ''gēnomos'' (itself represen ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Optical Character Recognition
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo) or from subtitle text superimposed on an image (for example: from a television broadcast). Widely used as a form of data entry from printed paper data records – whether passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static-data, or any suitable documentation – it is a common method of digitizing printed texts so that they can be electronically edited, searched, stored more compactly, displayed on-line, and used in machine processes such as cognitive computing, machine translation, (extracted) text-to-speech, key data and text mining. OCR is a field of research in pattern recognition, artificial intellig ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Free Software
Free software or libre software is computer software distributed under terms that allow users to run the software for any purpose as well as to study, change, and distribute it and any adapted versions. Free software is a matter of liberty, not price; all users are legally free to do what they want with their copies of a free software (including profiting from them) regardless of how much is paid to obtain the program.Selling Free Software
(gnu.org)
Computer programs are deemed "free" if they give end-users (not just the developer) ultimate control over the software and, subsequently, over their devices. The right to study and modify a computer program entails that