CiteSeerx (originally called CiteSeer) is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science. Many[who?] consider it to be the first academic paper search engine and the first automated citation indexing system. CiteSeer holds a United States patent # 6289342, titled "Autonomous citation indexing and literature browsing using citation context," granted on September 11, 2001. Stephen R. Lawrence, C. Lee Giles, Kurt D. Bollacker are the inventors of this patent assigned to NEC Laboratories America, Inc. This patent was filed on May 20, 1998, which has its roots (Priority) to January 05, 1998. A continuation patent was also granted to the same inventors and also assigned to NEC Labs on this invention i.e. US Patent # 6738780 granted on May 18, 2004 and was filed on May 16, 2001. CiteSeer is considered as a predecessor of academic search tools such as Google Scholar and Microsoft Academic Search. CiteSeer-like engines and archives usually only harvest documents from publicly available websites and do not crawl publisher websites. For this reason, authors whose documents are freely available are more likely to be represented in the index. CiteSeer's goal is to improve the dissemination and access of academic and scientific literature. As a non-profit service that can be freely used by anyone, it has been considered as part of the open access movement that is attempting to change academic and scientific publishing to allow greater access to scientific literature. CiteSeer freely provided Open Archives Initiative metadata of all indexed documents and links indexed documents when possible to other sources of metadata such as DBLP and the ACM Portal. To promote open data, CiteSeerx shares its data for non-commercial purposes under a Creative Commons license. The name can be construed to have at least two explanations. As a pun, a 'sightseer' is a tourist who looks at the sights, so a 'cite seer' would be a researcher who looks at cited papers. Another is a 'seer' is a prophet and a 'cite seer' is a prophet of citations. CiteSeer changed its name to ResearchIndex at one point and then changed it back.
1.1 CiteSeer and CiteSeer.IST 1.2 CiteSeerx
2 Current features
2.1 Automated information extraction 2.2 Focused crawling 2.3 Usage 2.4 Data
3 Other SeerSuite-based search engines 4 See also 5 References 6 Further reading 7 External links
History CiteSeer and CiteSeer.IST CiteSeer was created by researchers Lee Giles, Kurt Bollacker and Steve Lawrence in 1997 while they were at the NEC Research Institute (now NEC Labs), Princeton, New Jersey, USA. CiteSeer's goal was to actively crawl and harvest academic and scientific documents on the web and use autonomous citation indexing to permit querying by citation or by document, ranking them by citation impact. At one point, it was called ResearchIndex. CiteSeer became public in 1998 and had many new features unavailable in academic search engines at that time. These included:
Autonomous Citation Indexing automatically created a citation index that can be used for literature search and evaluation. Citation statistics and related documents were computed for all articles cited in the database, not just the indexed articles. Reference linking allowing browsing of the database using citation links. Citation context showed the context of citations to a given paper, allowing a researcher to quickly and easily see what other researchers have to say about an article of interest. Related documents were shown using citation and word based measures and an active and continuously updated bibliography is shown for each document.
After NEC, in 2004 it was hosted as CiteSeer.IST on the World Wide Web
at the College of Information Sciences and Technology, The
Pennsylvania State University, and had over 700,000 documents. For
enhanced access, performance and research, similar versions of
CiteSeer were supported at universities such as the Massachusetts
Institute of Technology,
University of Zürich
arXiv Google Scholar Microsoft Academic Semantic Scholar Collection of Computer Science Bibliographies DBLP (Digital Bibliography & Library Project) List of academic databases and search engines Arnetminer Disciplinary repository Research Papers in Economics (RePEc)
^ " CiteSeerX Data Policy". Retrieved 2015-11-10. ^ a b "About CiteSeerX". Retrieved 2010-05-07. ^ "The CiteSeerX Team". Pennsylvania State University. Archived from the original on 2010-01-26. Retrieved 2010-07-24. ^ "Ranking Web of World Repositories: Top 800 Repositories". Cybermetrics Lab. July 2010. Archived from the original on 2010-07-24. Retrieved 2010-07-24. ^ "About CiteSeerX Data". Pennsylvania State University. Retrieved 2012-01-25.
Giles, C. Lee; Bollacker, Kurt D.; Lawrence, Steve (1998). "CiteSeer: an automatic citation indexing system". Proceedings of the third ACM conference on Digital libraries: 89–98. CiteSeerX 10.1.1.30.6847 . doi:10.1145/276675.276685. ISBN 0-89791-965-3.
Official website of CiteSeerx CiteSeerX on GitHub SeerSuite on SourceFo