Freebase was a large collaborative
knowledge base
In computer science, a knowledge base (KB) is a set of sentences, each sentence given in a knowledge representation language, with interfaces to tell new sentences and to ask questions about what is known, where either of these interfaces migh ...
consisting of
data
Data ( , ) are a collection of discrete or continuous values that convey information, describing the quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted for ...
composed mainly by its
community
A community is a social unit (a group of people) with a shared socially-significant characteristic, such as place, set of norms, culture, religion, values, customs, or identity. Communities may share a sense of place situated in a given g ...
members. It was an
online collection of structured data harvested from many sources, including individual, user-submitted
wiki
A wiki ( ) is a form of hypertext publication on the internet which is collaboratively edited and managed by its audience directly through a web browser. A typical wiki contains multiple pages that can either be edited by the public or l ...
contributions.
[
] Freebase aimed to create a global resource that allowed people (and machines) to access common information more effectively. It was developed by the American software company
Metaweb and run publicly beginning in March 2007. Metaweb was acquired by
Google
Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
in a private sale announced on 16 July 2010. Google's
Knowledge Graph
In knowledge representation and reasoning, a knowledge graph is a knowledge base that uses a Graph (discrete mathematics), graph-structured data model or topology to represent and operate on data. Knowledge graphs are often used to store interl ...
is powered in part by Freebase.
During its existence, Freebase data was available for
commercial
Commercial may refer to:
* (adjective for) commerce, a system of voluntary exchange of products and services
** (adjective for) trade, the trading of something of economic value such as goods, services, information or money
* a dose of advertising ...
and
non-commercial
A non-commercial (also spelled noncommercial) activity is an activity that is not carried out in the interest of Profit (economics), profit. The opposite is Commerce, commercial, something that primarily serves profit interests and is focused on bu ...
use under a
Creative Commons Attribution License, and an open
API
An application programming interface (API) is a connection between computers or between computer programs. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how to build ...
,
RDF endpoint, and a database dump is provided for programmers.
On 16 December 2014, Google announced that it would shut down Freebase over the succeeding six months and help with the move of the data from Freebase to
Wikidata
Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain ...
.
On 16 December 2015, Google officially announced the Knowledge Graph API, which is meant to be a replacement to the Freebase API. Freebase.com was officially shut down on 2 May 2016.
Both Graphd and MQL, the graph database and JSON-based query language developed by Metaweb for Freebase, are open-sourced by Google under the Apache 2.0 license, and are available on GitHub. Graphd is open-sourced on 8 September 2018. MQL is open-sourced on 4 August 2020.
Overview
On 3 March 2007 Metaweb announced Freebase, describing it as "an open shared database of the world's knowledge", and "a massive, collaboratively edited database of cross-linked data". Often understood as a
database model
A database model is a type of data model that determines the logical structure of a database. It fundamentally determines in which manner data can be stored, organized and manipulated. The most popular example of a database model is the relatio ...
using Wikipedia-turned-database or
entity-relationship model, Freebase provided an interface that allowed non-programmers to fill in structured data, or
metadata
Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive ...
, of general information and to categorize or connect data items in meaningful,
semantic
Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...
ways.
Described by
Tim O'Reilly upon the launch, "Freebase is the bridge between the bottom up vision of
Web 2.0 collective intelligence
Collective intelligence (CI) is shared or group intelligence (GI) that Emergence, emerges from the collaboration, collective efforts, and competition of many individuals and appears in consensus decision making. The term appears in sociobiolog ...
and the more structured world of the
semantic web
The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.
To enable the encoding o ...
".
Freebase contained data harvested from sources such as
Wikipedia
Wikipedia is a free content, free Online content, online encyclopedia that is written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and La ...
,
NNDB
The Notable Names Database (NNDB) is an online database of biographical details of over 40,000 people. Soylent Communications, a sole proprietorship that also hosted the later defunct Rotten.com, describes NNDB as an "intelligence aggregator" ...
,
Fashion Model Directory and
MusicBrainz
MusicBrainz is a MetaBrainz project that aims to create a collaborative music database that is similar to the freedb project. MusicBrainz was founded in response to the restrictions placed on the CDDB, Compact Disc Database (CDDB), a database for ...
, as well as data contributed by its users. The structured data was licensed under the
Creative Commons Attribution License,
and a
JSON
JSON (JavaScript Object Notation, pronounced or ) is an open standard file format and electronic data interchange, data interchange format that uses Human-readable medium and data, human-readable text to store and transmit data objects consi ...
-based
HTTP
HTTP (Hypertext Transfer Protocol) is an application layer protocol in the Internet protocol suite model for distributed, collaborative, hypermedia information systems. HTTP is the foundation of data communication for the World Wide Web, wher ...
API is provided to programmers for developing applications on any platform to utilize the Freebase data. The
source code
In computing, source code, or simply code or source, is a plain text computer program written in a programming language. A programmer writes the human readable source code to control the behavior of a computer.
Since a computer, at base, only ...
for the Metaweb application itself is proprietary.
Freebase ran on a database infrastructure created in-house by Metaweb that use a
graph
Graph may refer to:
Mathematics
*Graph (discrete mathematics), a structure made of vertices and edges
**Graph theory, the study of such graphs and their properties
*Graph (topology), a topological space resembling a graph in the sense of discret ...
model: Instead of using tables and keys to define data structures, Freebase defined its data structure as a set of
nodes and a set of links that established relationships between the nodes. Because its data structure was non-hierarchical, Freebase could model much more complex relationships between individual elements than a conventional
database
In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
, and was open for users to enter new objects and relationships into the underlying graph. Queries to the database are made in Metaweb Query Language (MQL) and served by a
triplestore
A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject– predicate– object, like "Bob is 35" (i.e., Bob's age measured in years i ...
called graphd.
Development
Danny Hillis first described his idea for creating a knowledge web he called Aristotle in a paper in 2000, but he said he did not try to build the system until he had recruited technical experts.
Veda Hlubinka-Cook, an expert in
parallel computing
Parallel computing is a type of computing, computation in which many calculations or Process (computing), processes are carried out simultaneously. Large problems can often be divided into smaller ones, which can then be solved at the same time. ...
,
became Metaweb's Executive Vice President for Product.
Kurt Bollacker brought deep expertise in distributed systems, database design, and information retrieval to his role as Chief Scientist at Metaweb.
John Giannandrea, formerly Chief Technologist at
Tellme Networks
Tellme Networks, Inc. was an American company founded in 1999 by Mike McCue and Angus Davis, which specialized in telephone-based applications. Its headquarters were in Mountain View, California.
Tellme Networks was acquired by Microsoft on Mar ...
and Chief Technologist of the Web browser group at
Netscape
Netscape Communications Corporation (originally Mosaic Communications Corporation) was an American independent computer services company with headquarters in Mountain View, California, and then Dulles, Virginia. Its Netscape web browser was o ...
/AOL, was Chief Technology Officer.
Originally accessible by invitation only, Freebase opened full anonymous read access to the public in its
alpha
Alpha (uppercase , lowercase ) is the first letter of the Greek alphabet. In the system of Greek numerals, it has a value of one. Alpha is derived from the Phoenician letter ''aleph'' , whose name comes from the West Semitic word for ' ...
stage of development and later required registration only for data contributions.
On 29 October 2008, at the
International Semantic Web Conference
The International Semantic Web Conference (ISWC) is a series of academic conferences and the premier international forum for the Semantic Web, Linked Data and Knowledge Graph Community. Here, scientists, industry specialists, and practitioners m ...
, Freebase released its RDF service for generating RDF representations of Freebase topics, allowing Freebase to be used as
linked data
In computing, linked data is structured data which is interlinked with other data so it becomes more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web ...
.
Organization and policy
Freebase's subjects are called "topics", and the data stored about them depended on their "type", as to how they were classified. For example, an entry for
Arnold Schwarzenegger
Arnold Alois Schwarzenegger (born July30, 1947) is an Austrian and American actor, businessman, former politician, and former professional bodybuilder, known for his roles in high-profile action films. Governorship of Arnold Schwarzenegger, ...
, the former governor of California, would be entered as a topic that would include a variety of types describing him as an actor, bodybuilder, and politician. , Freebase had approximately 44 million topics and 2.4 billion facts.
Freebase's types are themselves user-editable.
Each type had a number of defined predicates, called "properties".
like the W3C
The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web. Founded in 1994 by Tim Berners-Lee, the consortium is made up of member organizations that maintain full-time staff working together in ...
approach to the semantic web, which starts with controlled ontologies, Metaweb adopts a folksonomy
Folksonomy is a classification system in which end users apply public tags to online items, typically to make those items easier for themselves or others to find later. Over time, this can give rise to a classification system based on those tag ...
approach, in which people can add new categories (much like tags), in a messy sprawl of potentially overlapping assertions.
However, Freebase differed from the wiki model in many ways. User-created types were not adopted in the "public commons" until promoted by a Metaweb employee. Also, users could not modify each other's types. The reason Freebase could not open up permissions of schemas is that external applications relied on them; thus, changing a type's schema – for instance by deleting a property or changing a simple property – might have broken queries for API users and even within Freebase itself, for example in saved views.
Discontinuation
On 16 December 2014, the Freebase team officially announced
that the website and the
API
An application programming interface (API) is a connection between computers or between computer programs. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how to build ...
would be shut down by 30 June 2015. Google provided an update on 16 December 2015 that they would discontinue the Freebase API and widget three months after a Suggest widget replacement was launched in early 2016.
See also
*
BabelNet
*
Cyc
*
DBpedia
DBpedia (from "DB" for "database") is a project aiming to extract structured content from the information created in the Wikipedia project. This structured information is made available on the World Wide Web using OpenLink Virtuoso. DBpedia a ...
*
Entity–relationship model
An entity–relationship model (or ER model) describes interrelated things of interest in a specific domain of knowledge. A basic ER model is composed of entity types (which classify the things of interest) and specifies relationships that can e ...
*
True Knowledge
*
YAGO
*
Knowledge Vault
*
Wikidata
Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain ...
References
External links
{{DEFAULTSORT:Freebase (Database)
Semantic wikis
Knowledge bases
Creative Commons-licensed databases
Google Search
Internet properties established in 2007