Michael Stonebraker
   HOME

TheInfoList



OR:

Michael Ralph Stonebraker (born October 11, 1943) is a
computer scientist A computer scientist is a person who is trained in the academic study of computer science. Computer scientists typically work on the theoretical side of computation, as opposed to the hardware side on which computer engineers mainly focus (a ...
specializing in
database systems In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage. The design of databases spa ...
. Through a series of academic prototypes and commercial startups, Stonebraker's research and products are central to many
relational databases A relational database is a (most commonly digital) database based on the relational model of data, as proposed by E. F. Codd in 1970. A system used to maintain relational databases is a relational database management system (RDBMS). Many relatio ...
. He is also the founder of many database companies, including
Ingres Corporation Actian is a computer software company headquartered in Sunnyvale, California that provides data management software. In July 2018, Actian was acquired by HCL Technologies and Sumeru Equity Partners for $330 million. On December 31, 2021, HCL Techn ...
, Illustra, Paradigm4, StreamBase Systems
Tamr
Vertica and VoltDB, and served as chief technical officer of
Informix IBM Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) offerings. The Informix products were originally developed by Informix Corporation, whose ...
. For his contributions to database research, Stonebraker received the 2014
Turing Award The ACM A. M. Turing Award is an annual prize given by the Association for Computing Machinery (ACM) for contributions of lasting and major technical importance to computer science. It is generally recognized as the highest distinction in compu ...
, often described as "the Nobel Prize for computing." Stonebraker's career can be broadly divided into two phases: his time at
University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant un ...
when he focused on
relational database management system A relational database is a (most commonly digital) database based on the relational model of data, as proposed by E. F. Codd in 1970. A system used to maintain relational databases is a relational database management system (RDBMS). Many relati ...
s such as
Ingres Jean-Auguste-Dominique Ingres ( , ; 29 August 1780 – 14 January 1867) was a French Neoclassical painter. Ingres was profoundly influenced by past artistic traditions and aspired to become the guardian of academic orthodoxy against the a ...
and
Postgres PostgreSQL (, ), also known as Postgres, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance. It was originally named POSTGRES, referring to its origins as a successor to the Ingr ...
, and at
Massachusetts Institute of Technology The Massachusetts Institute of Technology (MIT) is a private land-grant research university in Cambridge, Massachusetts. Established in 1861, MIT has played a key role in the development of modern technology and science, and is one of th ...
(MIT) where he developed more novel data management techniques such as C-Store, H-Store and
SciDB SciDB is a column-oriented database management system (DBMS) designed for multidimensional data management and analytics common to scientific, geospatial, financial, and industrial applications. It is developed by Paradigm4 and co-created by Mic ...
. Stonebraker is currently a professor emeritus at UC Berkeley and an adjunct professor at MIT'
Computer Science and Artificial Intelligence Laboratory
He is also known as an editor for the book ''Readings in Database Systems''.


Life

Stonebraker grew up in
Milton, New Hampshire Milton is a town in Strafford County, New Hampshire, United States. The population was 4,482 at the 2020 census. A manufacturing, resort and residential town, Milton includes the village of Milton Mills. The primary village in town, where 593 peop ...
. He earned his B.S.E. in
electrical engineering Electrical engineering is an engineering discipline concerned with the study, design, and application of equipment, devices, and systems which use electricity, electronics, and electromagnetism. It emerged as an identifiable occupation in the l ...
from
Princeton University Princeton University is a private research university in Princeton, New Jersey. Founded in 1746 in Elizabeth as the College of New Jersey, Princeton is the fourth-oldest institution of higher education in the United States and one of the ...
in 1965, and his
M.S. A Master of Science ( la, Magisterii Scientiae; abbreviated MS, M.S., MSc, M.Sc., SM, S.M., ScM or Sc.M.) is a master's degree in the field of science awarded by universities in many countries or a person holding such a degree. In contrast to ...
and Ph.D. from the
University of Michigan , mottoeng = "Arts, Knowledge, Truth" , former_names = Catholepistemiad, or University of Michigania (1817–1821) , budget = $10.3 billion (2021) , endowment = $17 billion (2021)As o ...
in 1967 and 1971 respectively. His awards include the
IEEE John von Neumann Medal The IEEE John von Neumann Medal was established by the IEEE Board of Directors in 1990 and may be presented annually "for outstanding achievements in computer-related science and technology." The achievements may be theoretical, technological, or ...
and the first SIGMOD Edgar F. Codd Innovations Award. In 1994 he was inducted as a
Fellow A fellow is a concept whose exact meaning depends on context. In learned or professional societies, it refers to a privileged member who is specially elected in recognition of their work and achievements. Within the context of higher education ...
of the
Association for Computing Machinery The Association for Computing Machinery (ACM) is a US-based international learned society for computing. It was founded in 1947 and is the world's largest scientific and educational computing society. The ACM is a non-profit professional member ...
. In 1997, he was elected a member of the
National Academy of Engineering The National Academy of Engineering (NAE) is an American nonprofit, non-governmental organization. The National Academy of Engineering is part of the National Academies of Sciences, Engineering, and Medicine, along with the National Academy of ...
for the development and commercialization of relational and object-relational database systems. In March 2015 it was announced he won the 2014 ACM Turing Award. In September 2015, he won the 2015 Commonwealth Award, chosen by council members of MassTLC.Geller, Jessica
"PTC Chief Heppelman named CEO of the year by Mass. tech council."
betaBoston. The Boston Globe. Sept. 16, 2015


The Berkeley years (1971–2000)

Stonebraker joined
University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant un ...
as an assistant professor in 1971, and taught in the computer science department for twenty-nine years. It was there that he did his early pioneering work on relational databases.


Ingres

In 1973, Stonebraker and his colleague
Eugene Wong Eugene Wong (born December 24, 1934 in Nanking, China) is a Chinese-American computer scientist and mathematician. Wong's career has spanned academia, university administration, government and the private sector. Together with Michael Stonebra ...
started researching relational database systems after reading a series of seminal papers published by Edgar F. Codd on the relational data model. Their project, known as
Ingres Jean-Auguste-Dominique Ingres ( , ; 29 August 1780 – 14 January 1867) was a French Neoclassical painter. Ingres was profoundly influenced by past artistic traditions and aspired to become the guardian of academic orthodoxy against the a ...
(Interactive Graphics and Retrieval System), was one of the first systems (along with System R from IBM) to demonstrate that it was possible to build a practical and efficient implementation of the relational model. A number of key ideas from INGRES are still widely used in relational systems, including the use of
B-tree In computer science, a B-tree is a self-balancing tree data structure that maintains sorted data and allows searches, sequential access, insertions, and deletions in logarithmic time. The B-tree generalizes the binary search tree, allowing for ...
s, primary-copy replication, the query rewrite approach to views and
integrity constraints Data integrity is the maintenance of, and the assurance of, data accuracy and consistency over its entire life-cycle and is a critical aspect to the design, implementation, and usage of any system that stores, processes, or retrieves data. The ter ...
, and the idea of rules/triggers for integrity checking in an RDBMS. Additionally, much experimental work was done that provided insights into how to build a locking system that could provide satisfactory transaction performance. By the mid-1970s, Stonebraker's team produced, using a rotating team of student programmers, a usable relational database system. At the time Ingres was considered "low end" compared to IBM's System R, as it ran on
Unix Unix (; trademarked as UNIX) is a family of multitasking, multiuser computer operating systems that derive from the original AT&T Unix, whose development started in 1969 at the Bell Labs research center by Ken Thompson, Dennis Ritchie, ...
-based
Digital Equipment Corporation Digital Equipment Corporation (DEC ), using the trademark Digital, was a major American company in the computer industry from the 1960s to the 1990s. The company was co-founded by Ken Olsen and Harlan Anderson in 1957. Olsen was president un ...
machines as opposed to the "
big iron "Big Iron" is a country ballad written and performed by Marty Robbins, originally released as an album track on ''Gunfighter Ballads and Trail Songs'' in September 1959, then as a single in February 1960 with the song "Saddle Tramp" as the B-sid ...
"
IBM mainframe IBM mainframes are large computer systems produced by IBM since 1952. During the 1960s and 1970s, IBM dominated the large computer market. Current mainframe computers in IBM's line of business computers are developments of the basic design of th ...
s. By the early 1980s, however, the performance and capabilities of these low-end machines were seriously threatening IBM's mainframe market, and with the threat came the ability of Ingres to become a viable, "real" product for a large number of applications. Ingres used a variation of the
BSD license BSD licenses are a family of permissive free software licenses, imposing minimal restrictions on the use and distribution of covered software. This is in contrast to copyleft licenses, which have share-alike requirements. The original BSD lice ...
for a nominal fee, and soon a number of companies took advantage of this to create commercial versions of Ingres . These included Stonebraker, who with fellow Berkeley professors Larry Rowe and Eugene Wong helped found Relational Technology, Inc., later called Ingres Corporation. Subsequently, sold to
Computer Associates CA Technologies, formerly known as CA, Inc. and Computer Associates International, Inc., is an American multinational corporation headquartered in New York City. It is primarily known for its business-to-business (B2B) software with a product po ...
, Ingres was re-established as an independent company in 2005, and later renamed Actian. Other startups based on Ingres include
Sybase Sybase, Inc. was an enterprise software and services company. The company produced software to manage and analyze information in relational databases, with facilities located in California and Massachusetts. Sybase was acquired by SAP in 2010; ...
, founded by Robert Epstein, a student on the project, and Britton Lee, Inc. Sybase's code was later used as a basis for
Microsoft SQL Server Microsoft SQL Server is a relational database management system developed by Microsoft. As a database server, it is a software product with the primary function of storing and retrieving data as requested by other software applications—which ...
.


Postgres

After founding Relational Technology, Stonebraker and Rowe began a "post-Ingres" effort, to address the limitations of the relational model. The new project was named POSTGRES (POST inGRES), and was designed to add support for complex data types to database systems and improve end-to-end performance of data-intensive applications. Postgres provided an object relational programming model in which fields could be complex datatypes, and where users could register new types as well as scalar and aggregate functions over those types. POSTGRES was extensible in a number of other ways, making it easy for programmers to modify or add to the optimizer, query language, runtime, and indexing frameworks. These features improved both database programmability and performance, and made it possible to push large portions of a number of applications inside the database, including
geographic information systems A geographic information system (GIS) is a type of database containing geographic data (that is, descriptions of phenomena for which location is relevant), combined with software tools for managing, analyzing, and visualizing those data. In a ...
and time series processing. This had the effect of substantially broadening the commercial database market. POSTGRES was also offered using a BSD-like license, and the code forms the basis of today's
free software Free software or libre software is computer software distributed under terms that allow users to run the software for any purpose as well as to study, change, and distribute it and any adapted versions. Free software is a matter of liberty, n ...
,
PostgreSQL PostgreSQL (, ), also known as Postgres, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance. It was originally named POSTGRES, referring to its origins as a successor to the ...
. Stonebraker also led an effort to commercialize the code, creating Illustra which was purchased by
Informix IBM Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) offerings. The Informix products were originally developed by Informix Corporation, whose ...
. PostgreSQL has been used as the basis for a number of other startup companies, including
Aster Data Systems Aster Data Systems was a data management and analysis software company headquartered in San Carlos, California. It was founded in 2005 and acquired by Teradata in 2011. History Aster Data was co-founded in 2005 by Stanford University graduate s ...
,
EnterpriseDB EnterpriseDB (EDB), a privately held company based in Massachusetts, provides software and services based on the open-source database PostgreSQL (also known as Postgres), and is one of the largest contributors to Postgres. EDB develops and int ...
, and
Greenplum Greenplum is a big data technology based on MPP architecture and the Postgres open source database technology. The technology was created by a company of the same name headquartered in San Mateo, California around 2005. Greenplum was acquired ...
.
Informix IBM Informix is a product family within IBM's Information Management division that is centered on several relational database management system (RDBMS) offerings. The Informix products were originally developed by Informix Corporation, whose ...
acquired Illustra in 1996, and Stonebraker became Informix's CTO, a position he held until September 2000. Informix integrated Illustra's O–R mapping and DataBlades into the 7.x OnLine product, resulting in Informix Universal Server (IUS), or more generally, Version 9.


Mariposa and Cohera

After the Postgres project, Stonebraker initiated the Mariposa project which became the basis of Cohera Corporation. Mariposa built a federated database over an economic model of resource trading, in which data distributed across multiple organizations could be integrated and queried from a single relational interface, governed by site-specific policies that would charge for data processing and storage. These economic policies allowed traditional ideas in query optimization to be carried out over competing sites, and also served as the basis for data storage, replication and movement within a federation. Cohera's initial mission was to commercialize Mariposa, but eventually focused on a business-to-business catalog management application on the core federated data integration engine. Cohera's intellectual property was purchased by
PeopleSoft PeopleSoft, Inc. is a company that provides human resource management systems (HRMS), Financial Management Solutions (FMS), supply chain management (SCM), customer relationship management (CRM), and enterprise performance management (EPM) software ...
in 2001, and used as the basis of PeopleSoft's Enterprise Catalog Management. PeopleSoft was in turn purchased by
Oracle Corporation Oracle Corporation is an American multinational computer technology corporation headquartered in Austin, Texas. In 2020, Oracle was the third-largest software company in the world by revenue and market capitalization. The company sells da ...
in 2004.


The MIT years (2001–present)

Stonebraker became an adjunct professor at MIT in 2001, where he began another series of research projects and founded a number of companies.


Aurora and StreamBase

In the Aurora Project, Stonebraker, along with colleagues from
Brandeis University , mottoeng = "Truth even unto its innermost parts" , established = , type = Private research university , accreditation = NECHE , president = Ronald D. Liebowitz , p ...
,
Brown University Brown University is a private research university in Providence, Rhode Island. Brown is the seventh-oldest institution of higher education in the United States, founded in 1764 as the College in the English Colony of Rhode Island and Providenc ...
, and MIT, focused on data management for streaming data, using a new data model and query language. Unlike relational systems, which "pull" data and process it a record at a time, in Aurora, data is "pushed", arriving asynchronously from external data sources (such as stock ticks, news feeds, or sensors.) The output is itself a stream of results (such as windowed averages) that are sent to users. Stonebraker co-founded StreamBase Systems in 2003 to commercialize the technology behind Aurora.


C-Store and Vertica

In the C-Store project, started in 2005, Stonebraker, along with colleagues from Brandeis, Brown, MIT, and
University of Massachusetts Boston The University of Massachusetts Boston (stylized as UMass Boston) is a public research university in Boston, Massachusetts. It is the only public research university in Boston and the third-largest campus in the five-campus University of Massa ...
, developed a parallel,
shared-nothing A shared-nothing architecture (SN) is a distributed computing architecture in which each update request is satisfied by a single node (processor/memory/storage unit) in a computer cluster. The intent is to eliminate contention among nodes. Nodes do ...
column-oriented DBMS for data warehousing. By dividing and storing data in columns, C-Store is able to perform less I/O and get better compression ratios than conventional database systems that store data in rows.(Print edition title: Database Pioneer Rethinks How Data is Organized. Stonebraker explained that it's because similar data items are side-by-side: Name,Name,Name,Name vs. Name,Address,Zip,Phone#. In 2005, Stonebraker co-founded Vertica to commercialize the technology behind C-Store.


Morpheus and Goby

In 2006, Stonebraker started the Morpheus project, along with researchers from the University of Florida. Morpheus is a
data integration Data integration involves combining data residing in different sources and providing users with a unified view of them. This process becomes significant in a variety of situations, which include both commercial (such as when two similar companies ...
system which relies on a collection of "transforms" to
mediate Mediate may refer to: * "Mediate" (song), by INXS * Domenic Mediate (born 1982), professional soccer player *Rocco Mediate (born 1962), professional golfer *A common misspelling of the website Mediaite Mediaite is a news website focusing on pol ...
between data sources. Each transform provides a queryable interface to particular web site or service, and Morpheus makes it possible to search for and compose multiple transforms to provide a new service or a unified view of several services. In 2009, Stonebraker co-founded Goby, a local search company based on ideas from Morpheus, for people to explore new things to do in free time.


H-Store and VoltDB

In 2007, with researchers from
Brown University Brown University is a private research university in Providence, Rhode Island. Brown is the seventh-oldest institution of higher education in the United States, founded in 1764 as the College in the English Colony of Rhode Island and Providenc ...
, MIT, and
Yale University Yale University is a private research university in New Haven, Connecticut. Established in 1701 as the Collegiate School, it is the third-oldest institution of higher education in the United States and among the most prestigious in the w ...
, Stonebraker started the H-Store project. H-Store is a distributed main-memory
online transaction processing In online transaction processing (OLTP), information systems typically facilitate and manage transaction-oriented applications. This is contrasted with online analytical processing. The term "transaction" can have two different meanings, both of w ...
(OLTP) system designed to provide very high throughput on transaction processing workloads. In 2009, Stonebraker co-founded, and then served as an adviser to, VoltDB a commercial startup based on ideas from the H-Store project.


SciDB

In 2008, along with
David DeWitt David J. DeWitt (July 20, 1948) is a computer scientist specializing in database management system research at the Massachusetts Institute of Technology. Prior to moving to MIT, DeWitt was the John P. Morgridge Professor (Emeritus) of Computer ...
and researchers from Brown, MIT,
Portland State University Portland State University (PSU) is a public research university in Portland, Oregon. It was founded in 1946 as a post-secondary educational institution for World War II veterans. It evolved into a four-year college over the following two dec ...
,
SLAC SLAC National Accelerator Laboratory, originally named the Stanford Linear Accelerator Center, is a United States Department of Energy National Laboratory operated by Stanford University under the programmatic direction of the U.S. Departme ...
, the
University of Washington The University of Washington (UW, simply Washington, or informally U-Dub) is a public research university in Seattle, Washington. Founded in 1861, Washington is one of the oldest universities on the West Coast; it was established in Seatt ...
, and the
University of Wisconsin–Madison A university () is an institution of higher (or tertiary) education and research which awards academic degrees in several academic disciplines. Universities typically offer both undergraduate and postgraduate programs. In the United Stat ...
, Stonebraker started
SciDB SciDB is a column-oriented database management system (DBMS) designed for multidimensional data management and analytics common to scientific, geospatial, financial, and industrial applications. It is developed by Paradigm4 and co-created by Mic ...
an open-source DBMS specially designed for scientific research applications. He founded Paradigm4 with Marilyn Matz, who became CEO. Paradigm4 developed SciDB, used mostly by life sciences and financial markets.
Novartis Novartis AG is a Swiss-American multinational pharmaceutical corporation based in Basel, Switzerland and Cambridge, Massachusetts, United States (global research).name="novartis.com">https://www.novartis.com/research-development/research-lo ...
, Foundation Medicine, and the
National Institutes of Health The National Institutes of Health, commonly referred to as NIH (with each letter pronounced individually), is the primary agency of the United States government responsible for biomedical and public health research. It was founded in the lat ...
are some of the company's clients.


NoSQL

In 2010 and 2011, Stonebraker criticized the
NoSQL A NoSQL (originally referring to "non- SQL" or "non-relational") database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. Such databases have existed ...
movement.


Notable students

Stonebraker trained more than 30 students, including: * Daniel Abadi, co-founder of Hadapt (acquired by
Teradata Teradata Corporation is an American software company that provides cloud database and analytics-related software, products, and services. The company was formed in 1979 in Brentwood, California, as a collaboration between researchers at Caltech ...
) * Anupam Bhide, founder and CEO of Calsoft * Michael J. Carey, professor at UC Irvine * Robert Epstein, co-founder of
Sybase Sybase, Inc. was an enterprise software and services company. The company produced software to manage and analyze information in relational databases, with facilities located in California and Massachusetts. Sybase was acquired by SAP in 2010; ...
and Britton Lee * Paula Hawthorn, co-founder of Britton Lee *
Marti Hearst Marti Hearst is a professor in the School of Information at the University of California, Berkeley. She did early work in corpus-based computational linguistics, including some of the first work in automating sentiment analysis, and word sense d ...
, professor at UC Berkeley * Joseph M. Hellerstein, professor at UC Berkeley * Clifford A. Lynch, executive director of the
Coalition for Networked Information The Coalition for Networked Information (CNI) is an organization whose mission is to promote networked information technology as a way to further the advancement of intellectual collaboration and productivity. Overview The Coalition for Network ...
* John Newton, founder of
Documentum Documentum is an enterprise content management platform, now owned by OpenText, as well as the name of the software company that originally developed the technology. EMC acquired Documentum for $1.7 billion in December, 2003. The Documentum pl ...
and
Alfresco Alfresco may refer to: * ''Al fresco'', or fresco, a technique of mural painting * Al fresco dining * Alfresco Software, an open-source content-management system * ''Alfresco'' (TV series), a 1980s British television comedy series * ''Al fresco ...
* Mike Olson, former CEO of Sleepycat Software and founding CEO of
Cloudera Cloudera, Inc. is an American software company providing enterprise data management systems that make significant use of Apache Hadoop. As of January 31, 2021, the company had approximately 1,800 customers. History Cloudera, Inc. was formed on J ...
* Margo Seltzer, professor at the
University of British Columbia The University of British Columbia (UBC) is a public research university with campuses near Vancouver and in Kelowna, British Columbia. Established in 1908, it is British Columbia's oldest university. The university ranks among the top thr ...
, founder and former CTO of Sleepycat Software * Dale Skeen, founder of
Tibco TIBCO Software Inc. is an American business intelligence software company founded in 1997 in Palo Alto, California. It has headquarters in Palo Alto, California, and offices in North America, Europe, Asia, the Middle East, Africa and South A ...
, founder and CEO of Vitria


Selected works

* *


References


External links

* * *, a series of recent interviews and comments about and by Stonebraker. * *, a new search engine to find fun things to do in one's free time (co-founded by Stonebraker) * {{DEFAULTSORT:Stonebraker, Michael American computer scientists Database researchers American computer businesspeople Princeton University alumni University of Michigan alumni Fellows of the Association for Computing Machinery Foreign Members of the Russian Academy of Sciences Living people 1943 births Informix Turing Award laureates People from Newburyport, Massachusetts People from Milton, New Hampshire UC Berkeley College of Engineering faculty