HOME

TheInfoList



OR:

Apache Kudu is a
free and open source Free and open-source software (FOSS) is a term used to refer to groups of software consisting of both free software and open-source software where anyone is freely licensed to use, copy, study, and change the software in any way, and the source ...
column-oriented data store of the
Apache Hadoop Apache Hadoop () is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage ...
ecosystem. It is compatible with most of the data processing frameworks in the
Hadoop Apache Hadoop () is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage an ...
environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data. The
open source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
project to build Apache Kudu began as internal project at
Cloudera Cloudera, Inc. is an American software company providing enterprise data management systems that make significant use of Apache Hadoop. As of January 31, 2021, the company had approximately 1,800 customers. History Cloudera, Inc. was formed on J ...
. The first version Apache Kudu 1.0 was released 19 September 2016.


Comparison with other storage engines

Kudu was designed and optimized for
OLAP Online analytical processing, or OLAP (), is an approach to answer multi-dimensional analytical (MDA) queries swiftly in computing. OLAP is part of the broader category of business intelligence, which also encompasses relational databases, repor ...
workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation. Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/
Bigtable Bigtable is a fully managed wide-column and key-value NoSQL database service for large analytical and operational workloads as part of the Google Cloud portfolio. History Bigtable development began in 2004.. It is now used by a number of Googl ...
".


See also

*
List of column-oriented DBMSes This article is a list of Column-oriented DBMS, column-oriented database management system software. Free and open-source software (FOSS) Platform as a Service (PaaS) *Amazon Redshift * Microsoft Azure SQL Data Warehouse * Google BigQuery ...


References


External links

*
Apache Kudu GitHub repository
{{DEFAULTSORT:Kudu
Kudu The kudus are two species of antelope of the genus '' Tragelaphus'': * Lesser kudu, ''Tragelaphus imberbis'', of eastern Africa * Greater kudu, ''Tragelaphus strepsiceros'', of eastern and southern Africa The two species look similar, th ...
Software using the Apache license C++ software Free system software Free software Free database management systems Cloud computing Online analytical processing Data warehousing Data warehousing products Data analysis software Distributed data stores Structured storage