Archie search engine

Archie is a tool for indexing FTP archives, allowing users to more easily identify specific files. It is considered the first Internet search engine. The original implementation was written in 1990 by Alan Emtage, then a postgraduate student at McGill University in Montreal, Canada. Archie has since been superseded by other, more sophisticated search engines, including Jughead and Veronica. These were in turn superseded by search engines like Yahoo! in 1995 and Google in 1997. Work on Archie ceased in the late 1990s. A legacy Archie server is still kept running for historical purposes in Poland, at the University of Warsaw's Interdisciplinary Centre for Mathematical and Computational Modelling.


Origin

Archie began as a project for students and volunteer staff at the McGill University School of Computer Science in 1987, when Peter Deutsch (systems manager for the School), Alan Emtage, and Bill Heelan were asked to connect the School to the Internet. The name derives from the word "archive" without the "v". Emtage has said that, contrary to popular belief, there was no association with Archie Comics. Jughead and Veronica, however, were named after characters from the comics. Anarchie, one of the earliest graphical FTP clients, was named for its ability to perform Archie searches.


How Archie worked

The earliest versions of Archie would simply search a list of public anonymous File Transfer Protocol (FTP) sites using the Telnet protocol and create an index of the FTP files. FTP is essentially a way to transfer files between computers; to view the contents of a file, the file first had to be downloaded. The index was updated on a regular basis: Archie contacted each site roughly once a month, so as not to waste too many resources of the remote servers, and requested a listing. These listings were stored in local files to be searched using a Unix command. The developers populated the engine's servers with databases of anonymous FTP host directories. Because the listings were plugged into a searchable database of FTP sites, this could be used to find specific file titles. Archie did not recognize natural language requests, nor did it index the content inside the files, so users had to know the title of the file they wanted. The ability to index the content inside the files was first introduced by Gopher.
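
The idea of storing per-site listings and matching queries against file names only can be illustrated with a small sketch. This is not Archie's actual code: the class name `FilenameIndex`, the host names, and the sample paths are all hypothetical, and a regular-expression match stands in for the Unix command search mentioned above.

```python
import re
from dataclasses import dataclass, field


@dataclass
class FilenameIndex:
    """Toy index (hypothetical): maps each FTP host to the paths it listed."""

    listings: dict = field(default_factory=dict)

    def update_host(self, host: str, listing_text: str) -> None:
        """Replace a host's stored listing with a freshly fetched one.

        In this sketch a listing is plain text with one file path per line.
        """
        paths = [line.strip() for line in listing_text.splitlines() if line.strip()]
        self.listings[host] = paths

    def search(self, pattern: str) -> list:
        """Return (host, path) pairs whose file name matches the pattern.

        Only the final path component is matched; file contents are never
        read, so the user has to know (part of) the file's name.
        """
        regex = re.compile(pattern, re.IGNORECASE)
        hits = []
        for host in sorted(self.listings):
            for path in self.listings[host]:
                name = path.rsplit("/", 1)[-1]
                if regex.search(name):
                    hits.append((host, path))
        return hits


# Made-up listings standing in for the periodic fetch from each site.
index = FilenameIndex()
index.update_host("ftp.example.org", "/pub/gnu/emacs-18.59.tar.Z\n/pub/README\n")
index.update_host(
    "ftp.example.net",
    "/mirrors/tex/latex.tar.Z\n/mirrors/gnu/emacs-18.59.tar.Z\n",
)

results = index.search(r"emacs")
for host, path in results:
    print(f"{host}:{path}")
```

Calling `update_host` again for the same host overwrites its earlier listing, which loosely mirrors the roughly monthly refresh described above. A query such as `gnu` returns nothing here, because the match is against file names rather than directory names or file contents.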


Development

Emtage and Heelan wrote a script allowing people to log in and search the collected information using the Telnet protocol at the host "archie.mcgill.ca" (32.206.2.3). Later, more efficient front- and back-ends were developed, and the system grew from a local tool into a network-wide resource and a popular service available from multiple sites around the Internet. The collected data would be exchanged between neighbouring Archie servers. The servers could be accessed in multiple ways: using a local client (such as ''archie'' or ''xarchie''); telnetting to a server directly; sending queries by electronic mail; and, later, via a World Wide Web interface. At the zenith of its fame, the Archie search engine accounted for 50% of Montreal Internet traffic.

In 1992, Emtage and Deutsch, with some financial help from McGill University, formed Bunyip Information Systems, the world's first company expressly founded for and dedicated to providing Internet information services. It offered a licensed commercial version of the Archie search engine, which was used by millions of people worldwide. Heelan followed them to Bunyip soon after, where he, together with Bibi Ali and Sandro Mazzucato, was part of the so-called Archie Group. The group significantly updated the Archie database and indexed web pages. Work on the search engine ceased in the late 1990s.


See also

* Jughead
* Veronica
* Wide area information server


Further reading

* Archie - A Darwinian Development Process. Peter Deutsch. IEEE Internet Computing, January/February 2000, 4(1):69-71. Part of Millennial Forecasts.
* P. Deutsch, A. Emtage, A. Marine. ''How to Use Anonymous FTP'' (RFC 1635, May 1994).

