HOME

TheInfoList



OR:

Databricks is an American enterprise software company founded by the creators of Apache Spark. Databricks develops a web-based platform for working with Spark, that provides automated
cluster management Within cluster and parallel computing, a cluster manager is usually backend graphical user interface (GUI) or command-line interface (CLI) software that runs on a set of cluster nodes that it manages (in some cases it runs on a different server or ...
and IPython-style
notebooks A notebook is a small book often used for writing. Notebook or The Notebook may also refer to: Computing *Laptop, a type of personal computer * Google Notebook, a discontinued online application * Notebook interface, a type of programming envir ...
.


History

Databricks grew out of the
AMPLab AMPLAB was a University of California, Berkeley lab focused on big data analytics located in Soda Hall. The name stands for the Algorithms, Machines and People Lab. It has been publishing papers since 2008 and was officially launched in 2011. The ...
project at
University of California, Berkeley The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant u ...
that was involved in making Apache Spark, an open-source distributed computing framework built atop Scala. The company was founded by
Ali Ghodsi Ali Ghodsi is an Iranian-Swedish computer scientist and entrepreneur specializing in distributed systems and big data. He is a co-founder and CEO of Databricks and an adjunct professor at UC Berkeley. Ideas from his academic research in the area of ...
, Andy Konwinski, Arsalan Tavakoli-Shiraji,
Ion Stoica Ion Stoica is a Romanian-American computer scientist specializing in distributed systems, cloud computing and computer networking. He is a professor of computer science at the University of California, Berkeley and co-director of AMPLab. He co-fo ...
,
Matei Zaharia Matei Zaharia is a Romanian-Canadian computer scientist, educator and the creator of Apache Spark. As of April 2022, Forbes ranked him and Ion Stoica as the 3rd- richest people in Romania with a net worth of $1.6 billion. Biography Zaharia g ...
, Patrick Wendell, and
Reynold Xin Reynold Xin is a computer scientist and engineer specializing in big data, distributed systems, and cloud computing. He is a co-founder and Chief Architect of Databricks. He is best known for his work on Apache Spark, which is the top open-sourc ...
. In November 2017, the company was announced as a first-party service on Microsoft Azure via the integration Azure Databricks. The company develops Delta Lake, an open-source project to bring reliability to data lakes for machine learning and other data science use cases. In June 2020, Databricks acquired Redash, an open-source tool designed to help data scientists and analysts visualize and build interactive dashboards of their data. In February 2021 together with
Google Cloud Google Cloud Platform offers numerous integrated cloud-computing services, including compute, network, and storage. Products Past and present products under the Google Cloud platform include: Current * Google Cloud Datastore, a NoSQL databa ...
, Databricks provided integration with the Google
Kubernetes Kubernetes (, commonly stylized as K8s) is an open-source container orchestration system for automating software deployment, scaling, and management. Google originally designed Kubernetes, but the Cloud Native Computing Foundation now maintains ...
Engine and Google's
BigQuery BigQuery is a fully managed, serverless data warehouse that enables scalable analysis over petabytes of data. It is a ''Platform as a Service'' ( PaaS) that supports querying using ANSI SQL. It also has built-in machine learning capabilities. B ...
platform. Fortune ranked Databricks as one of the best large "Workplaces for Millennials" in 2021. At the time, the company said more than 5,000 organizations used its products. In August 2021, Databricks finished their eighth round of funding by raising $1.6 billion and valuing the company at $38 billion. In October 2021, Databricks made its second acquisition of German no-code company 8080 Labs. 8080 Labs makes bamboolib, a data exploration tool that does not require coding to use.


Funding

In September 2013, Databricks announced it raised $13.9 million from
Andreessen Horowitz Andreessen Horowitz (also called a16z, legal name AH Capital Management, LLC) is a private American venture capital firm, founded in 2009 by Marc Andreessen and Ben Horowitz. The company is headquartered in Menlo Park, California. Andreessen H ...
and said it aimed to offer an alternative to Google's
MapReduce MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster. A MapReduce program is composed of a ''map'' procedure, which performs filtering ...
system. Microsoft was a noted investor of Databricks in 2019, participating in the company's Series E at an unspecified amount. The company has raised $1.9 billion in funding, including a $1 billion Series G led by Franklin Templeton at a $28 billion post-money valuation in February 2021. Other investors include
Amazon Web Services Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon that provides on-demand cloud computing platforms and APIs to individuals, companies, and governments, on a metered pay-as-you-go basis. These cloud computing web services provide d ...
,
CapitalG CapitalG (formerly Google Capital) is the independent growth fund under Alphabet Inc. Alphabet Inc. is an American multinational technology conglomerate holding company headquartered in Mountain View, California. It was created through a r ...
(a growth equity firm under Alphabet, Inc.) and Salesforce Ventures.


Products

Databricks develops and sells a cloud data platform using the marketing term "lakehouse", a portmanteau based on the terms "
data warehouse In computing, a data warehouse (DW or DWH), also known as an enterprise data warehouse (EDW), is a system used for reporting and data analysis and is considered a core component of business intelligence. DWs are central repositories of integra ...
" and "
data lake A data lake is a system or repository of data stored in its natural/raw format, usually object blobs or files. A data lake is usually a single store of data including raw copies of source system data, sensor data, social data etc., and transform ...
". Databricks' lakehouse is based on the open source Apache Spark framework that allows analytical queries against semi-structured data without a traditional database schema. In October 2022, Lakehouse received FedRAMP authorized status for use with the U.S. federal government and contractors. Databricks' Delta Engine launched in June 2020 as a new query engine that layers on top of Delta Lake to boost query performance. It is compatible with Apache Spark and MLflow, which are also open source projects from Databricks. In November 2020, Databricks introduced Databricks SQL (previously known as SQL Analytics) for running
business intelligence Business intelligence (BI) comprises the strategies and technologies used by enterprises for the data analysis and management of business information. Common functions of business intelligence technologies include reporting, online analytical p ...
and analytics reporting on top of data lakes. Analysts can query data sets directly with standard SQL or use product connectors to integrate directly with business intelligence tools like
Tableau Tableau (French for 'little table' literally, also used to mean 'picture'; tableaux or, rarely, tableaus) may refer to: Arts * ''Tableau'', a series of four paintings by Piet Mondrian titled '' Tableau I'' through to ''Tableau IV'' * ''Tableau vi ...
,
Qlik Qlik ronounced "klik"(formerly known as Qliktech) provides a business analytics platform. The software company was founded in 1993 in Lund, Sweden and is now based in King of Prussia, Pennsylvania, United States. The company's main products ...

SigmaComputing
Looker ''Looker'' is a 1981 American science fiction film written and directed by Michael Crichton and starring Albert Finney, Susan Dey, and James Coburn. The film is a suspense/science-fiction piece that comments upon and satirizes media, advertisin ...
, and
ThoughtSpot ThoughtSpot, Inc. is a technology company that produces business intelligence analytics search software. The company is based in Mountain View, California, and was founded in 2012. History ThoughtSpot was founded in 2012 by a team of engineers ...
. Databricks offers a platform for other workloads, including machine learning, data storage and processing, streaming analytics, and business intelligence. The company has also created Delta Lake, MLflow and Koalas, open source projects that span data engineering, data science and
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
. In addition to building the Databricks platform, the company has co-organized
massive open online courses A massive open online course (MOOC ) or an open online course is an online course aimed at unlimited participation and open access via the Web. In addition to traditional course materials, such as filmed lectures, readings, and problem sets, man ...
about Spark and a conference for the Spark community called the Data + AI Summit, formerly known as Spark Summit.


Operations

Databricks is headquartered in
San Francisco San Francisco (; Spanish for " Saint Francis"), officially the City and County of San Francisco, is the commercial, financial, and cultural center of Northern California. The city proper is the fourth most populous in California and 17th ...
. It also has operations in
Canada Canada is a country in North America. Its ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world's second-largest country by tot ...
, the
United Kingdom The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom (UK) or Britain, is a country in Europe, off the north-western coast of the European mainland, continental mainland. It comprises England, Scotlan ...
,
Netherlands ) , anthem = ( en, "William of Nassau") , image_map = , map_caption = , subdivision_type = Sovereign state , subdivision_name = Kingdom of the Netherlands , established_title = Before independence , established_date = Spanish Netherl ...
,
Singapore Singapore (), officially the Republic of Singapore, is a sovereign island country and city-state in maritime Southeast Asia. It lies about one degree of latitude () north of the equator, off the southern tip of the Malay Peninsula, bor ...
, Australia,
Germany Germany,, officially the Federal Republic of Germany, is a country in Central Europe. It is the second most populous country in Europe after Russia, and the most populous member state of the European Union. Germany is situated betwe ...
,
France France (), officially the French Republic ( ), is a country primarily located in Western Europe. It also comprises of overseas regions and territories in the Americas and the Atlantic, Pacific and Indian Oceans. Its metropolitan area ...
, Japan, China,
South Korea South Korea, officially the Republic of Korea (ROK), is a country in East Asia, constituting the southern part of the Korean Peninsula and sharing a land border with North Korea. Its western border is formed by the Yellow Sea, while its eas ...
,
India India, officially the Republic of India (Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the so ...
, and
Brazil Brazil ( pt, Brasil; ), officially the Federative Republic of Brazil (Portuguese: ), is the largest country in both South America and Latin America. At and with over 217 million people, Brazil is the world's fifth-largest country by area ...
.


References

{{reflist, 30em Big data companies Companies based in San Francisco Privately held companies based in California Software companies based in the San Francisco Bay Area Software companies established in 2013 Software companies of the United States