Faceted search
   HOME

TheInfoList



OR:

Faceted search is a technique that involves augmenting traditional search techniques with a faceted navigation system, allowing users to narrow down search results by applying multiple filters based on
faceted classification A faceted classification is a classification scheme used in organizing knowledge into a systematic order. A faceted classification uses semantic categories, either general or subject-specific, that are combined to create the full classification ent ...
of the items. It is sometimes referred to as a '' parametric search'' technique. A faceted classification system classifies each information element along multiple explicit dimensions, called facets, enabling the classifications to be accessed and ordered in multiple ways rather than in a single, pre-determined, taxonomic order. Facets correspond to properties of the information elements. They are often derived by analysis of the text of an item using
entity extraction Named-entity recognition (NER) (also known as (named) entity identification, entity chunking, and entity extraction) is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre ...
techniques or from pre-existing fields in a database such as author, descriptor, language, and format. Thus, existing web-pages, product descriptions or online collections of articles can be augmented with navigational facets. Faceted search interfaces were first developed in the academic world by
Ben Shneiderman Ben Shneiderman (born August 21, 1947) is an American computer scientist, a Distinguished University Professor in the University of Maryland Department of Computer Science, which is part of the University of Maryland College of Computer, Mathem ...
, Steven Pollitt, Marti Hearst, and
Gary Marchionini Gary Marchionini is an American information scientist and educator at the University of North Carolina at Chapel Hill (1998-present). Work Gary Marchionini is a leader in defining theory of human information interaction and exploratory search ...
in the 1990s and 2000s. The most well-known of these efforts was the Flamenco research project at
University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant u ...
led by Marti Hearst. Concurrently, there was development of commercial faceted search systems, notably Endeca and Spotfire. Within the academic community, faceted search has attracted interest primarily among library and information science researchers, and to some extent among
computer science Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includi ...
researchers specializing in information retrieval.


Mass market use

Faceted search has become a popular technique in commercial search applications, particularly for online retailers and libraries. An increasing number of enterprise search vendors provide software for implementing faceted search applications. Online retail catalogs pioneered the earliest applications of faceted search, reflecting both the faceted nature of product data (most products have a type, brand, price, etc.) and the ready availability of the data in retailers' existing information-systems. In the early 2000s retailers started using faceted search, in part due to published studies that evaluated user search experience on popular sites. , among the 50 largest US-based online retailers, 40% had implemented faceted search.Smashing Magazine: The Current State of E-Commerce Search
Retrieved on 2014-08-27.
Examples include the filtering options that appear in the left column on
amazon.com Amazon.com, Inc. ( ) is an American multinational technology company focusing on e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence. It has been referred to as "one of the most influential econo ...
or
Google Shopping Google Shopping, formerly Google Product Search, Google Products and Froogle, is a Google service created by Craig Nevill-Manning which allows users to search for products on online shopping websites and compare prices between different vendor ...
after a keyword search has been performed.


Libraries and information science

In 1933, the noted librarian Ranganathan proposed a
faceted classification A faceted classification is a classification scheme used in organizing knowledge into a systematic order. A faceted classification uses semantic categories, either general or subject-specific, that are combined to create the full classification ent ...
system for library materials, known as colon classification. In the pre-computer era, he did not succeed in replacing the pre-coordinated
Dewey Decimal Classification The Dewey Decimal Classification (DDC), colloquially known as the Dewey Decimal System, is a proprietary library classification system which allows new books to be added to a library in their appropriate location based on subject. Section 4.1 ...
system. Modern online library catalogs, also known as
online public access catalog The online public access catalog (OPAC), now frequently synonymous with ''library catalog'', is an online database of materials held by a library or group of libraries. Online catalogs have largely replaced the analog card catalogs previously u ...
s (OPAC), have increasingly adopted faceted search interfaces. Noted examples include the North Carolina State University library catalog (part of the Triangle Research Libraries Network) and the OCLC Open
WorldCat WorldCat is a union catalog that itemizes the collections of tens of thousands of institutions (mostly libraries), in many countries, that are current or past members of the OCLC global cooperative. It is operated by OCLC, Inc. Many of the O ...
system. The
CiteSeerX CiteSeerX (formerly called CiteSeer) is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science. CiteSeer's goal is to improve the dissemination and access of ac ...
projectCiteSeerX
Citeseerx.ist.psu.edu. Retrieved on 2013-07-21.
at the Pennsylvania State University allows faceted search for academic documents and continues to expand into other facets such as table search.


See also

*
Enterprise search Enterprise search is the practice of making content from multiple enterprise-type sources, such as databases and intranets, searchable to a defined audience. "Enterprise search" is used to describe the software of search information within an ente ...
* Exploratory search *
Faceted classification A faceted classification is a classification scheme used in organizing knowledge into a systematic order. A faceted classification uses semantic categories, either general or subject-specific, that are combined to create the full classification ent ...
*
Human–computer information retrieval Human–computer information retrieval (HCIR) is the study and engineering of information retrieval techniques that bring human intelligence into the search process. It combines the fields of human-computer interaction (HCI) and information retri ...
*
Information extraction Information extraction (IE) is the task of automatically extracting structured information from unstructured and/or semi-structured machine-readable documents and other electronically represented sources. In most of the cases this activity concer ...
* NoSQL * Voxound


References

{{DEFAULTSORT:Faceted Search Information retrieval techniques Information retrieval genres