HOME

TheInfoList



OR:

Archie is a tool for indexing
FTP The File Transfer Protocol (FTP) is a standard communication protocol used for the transfer of computer files from a server to a client on a computer network. FTP is built on a client–server model architecture using separate control and data ...
archives, allowing users to more easily identify specific files. It is considered the first
Internet The Internet (or internet) is the global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a '' network of networks'' that consists of private, pub ...
search engine A search engine is a software system designed to carry out web searches. They search the World Wide Web in a systematic way for particular information specified in a textual web search query. The search results are generally presented in a ...
. The original implementation was written in 1990 by Alan Emtage, then a postgraduate student at
McGill University McGill University (french: link=no, Université McGill) is an English-language public research university located in Montreal, Quebec, Canada. Founded in 1821 by royal charter granted by King George IV,Frost, Stanley Brice. ''McGill University ...
in
Montreal Montreal ( ; officially Montréal, ) is the second-most populous city in Canada and most populous city in the Canadian province of Quebec. Founded in 1642 as '' Ville-Marie'', or "City of Mary", it is named after Mount Royal, the triple-pe ...
, Canada. Archie has since been superseded by other, more sophisticated search engines, including Jughead and Veronica. These were in turn superseded by search engines like
Yahoo! Yahoo! (, styled yahoo''!'' in its logo) is an American web services provider. It is headquartered in Sunnyvale, California and operated by the namesake company Yahoo! Inc. (2017–present), Yahoo Inc., which is 90% owned by investment funds ma ...
in 1995 and
Google Google LLC () is an American multinational technology company focusing on search engine technology, online advertising, cloud computing, computer software, quantum computing, e-commerce, artificial intelligence, and consumer electronics. I ...
in 1997. Work on Archie ceased in the late 1990s. A legacy Archie server is still maintained active for historic purposes in Poland at
University of Warsaw The University of Warsaw ( pl, Uniwersytet Warszawski, la, Universitas Varsoviensis) is a public university in Warsaw, Poland. Established in 1816, it is the largest institution of higher learning in the country offering 37 different fields of ...
's
Interdisciplinary Centre for Mathematical and Computational Modelling Interdisciplinary Centre for Mathematical and Computational Modelling (ICM) is a supercomputing and research data centre at the University of Warsaw in Poland. See also * Open access in Poland Open access scholarly communication of Poland ca ...
.


Origin

Archie began as a project for students and volunteer staff at the McGill University School of Computer Science in 1987, when Peter Deutsch (systems manager for the School), Alan Emtage, and Bill Heelan were asked to connect the School to the Internet. The name derives from the word "archive" without the v. Emtage has said that contrary to popular belief, there was no association with the
Archie Comics Archie Comic Publications, Inc., is an American comic book publisher headquartered in Pelham, New York.Jughead and Veronica were named after characters from the comics. Anarchie, one of the earliest graphical FTP clients was named for its ability to perform Archie searches.


How Archie worked

The earliest versions of Archie would simply search a list of public anonymous
File Transfer Protocol The File Transfer Protocol (FTP) is a standard communication protocol used for the transfer of computer files from a server to a client on a computer network. FTP is built on a client–server model architecture using separate control and data ...
(FTP) sites using the
Telnet Telnet is an application protocol used on the Internet or local area network to provide a bidirectional interactive text-oriented communication facility using a virtual terminal connection. User data is interspersed in-band with Telnet control ...
protocol and create an index of the FTP files. FTP is essentially a way to transfer files between computers. To view the contents of a file, it had first to be downloaded. The indexes are updated on a regular basis (contacting each roughly once a month, so as not to waste too many resources of the remote servers) and requested a listing. These listings were stored in local files to be searched using the
Unix Unix (; trademarked as UNIX) is a family of multitasking, multiuser computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, and ot ...
command. The developers populated the engine's servers with databases of anonymous FTP host directories. This was used to find specific file titles since the list was plugged in to a searchable database of FTP sites. Archie did not recognize natural language requests nor index the content inside the files. Therefore, users had to know the title of the file they wanted. The ability to index the content inside the files was first introduced by
Gopher Pocket gophers, commonly referred to simply as gophers, are burrowing rodents of the family Geomyidae. The roughly 41 speciesSearch results for "Geomyidae" on thASM Mammal Diversity Database are all endemic to North and Central America. They are ...
.


Development

Emtage and Heelan wrote a script allowing people to log in and search collected information using the
Telnet Telnet is an application protocol used on the Internet or local area network to provide a bidirectional interactive text-oriented communication facility using a virtual terminal connection. User data is interspersed in-band with Telnet control ...
protocol at the host "archie.mcgill.ca" 32.206.2.3 Later, more efficient front- and back-ends were developed, and the system spread from a local tool, to a network-wide resource, and a popular service available from multiple sites around the
Internet The Internet (or internet) is the global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a '' network of networks'' that consists of private, pub ...
. The collected data would be exchanged between the neighbouring Archie servers. The servers could be accessed in multiple ways: using a local client (such as ''archie'' or ''xarchie'');
telnet Telnet is an application protocol used on the Internet or local area network to provide a bidirectional interactive text-oriented communication facility using a virtual terminal connection. User data is interspersed in-band with Telnet control ...
ting to a server directly; sending queries by
electronic mail Electronic mail (email or e-mail) is a method of exchanging messages ("mail") between people using electronic devices. Email was thus conceived as the electronic (digital) version of, or counterpart to, mail, at a time when "mail" meant ...
; and later via a
World Wide Web The World Wide Web (WWW), commonly known as the Web, is an information system enabling documents and other web resources to be accessed over the Internet. Documents and downloadable media are made available to the network through web se ...
interface. At the zenith of its fame the Archie search engine accounted for 50% of Montreal Internet traffic. In 1992, Emtage along with Deutsch and some financial help of McGill University formed Bunyip Information Systems the world's first company expressly founded for and dedicated to providing Internet information services with a licensed commercial version of the Archie search engine used by millions of people worldwide. Heelan followed them into Bunyip soon after, where he together with Bibi Ali and
Sandro Mazzucato Sandro is an Italian, Portuguese, Spanish, Swiss, Georgian and Croatian given name, often a diminutive of Alessandro or Alexander. It is also a surname. Sandro may refer to: Given name or nickname Sports * Sandro (footballer, born 1973), Brazi ...
was a part of so-called Archie Group. The group significantly updated the archie database and indexed web-pages. Work on the search engine ceased in the late 1990s.


See also

* Jughead * Veronica *
Wide area information server Wide Area Information Server (WAIS) is a client–server text searching system that uses the ANSI Standard Z39.50 Information Retrieval Service Definition and Protocol Specifications for Library Applications" (Z39.50:1988) to search index databa ...


References


Further reading

*Archie—A Darwinian Development Process. Peter Deutsch.
IEEE Internet Computing ''IEEE Internet Computing'' is a bimonthly peer-reviewed scientific journal published by the IEEE Computer Society. It covers all aspects of emerging and maturing Internet technologies. The editor-in-chief is George Pallis (University of Cyprus). ...
, January/February 2000, 4(1):69-71. Part of Millennial Forecasts, . *P. Deutsch, A. Emtage, A. Marine
''How to Use Anonymous FTP''
(RFC1635, May 1994)


External links



- search seems to be dead (timeout) {{DEFAULTSORT:Archie Search Engine Internet Standards Unix Internet software Internet search engines History of the Internet