YCSB
   HOME

TheInfoList



OR:

The Yahoo! Cloud Serving Benchmark (YCSB) is an open-source specification and program suite for evaluating retrieval and maintenance capabilities of computer programs. It is often used to compare the relative performance of
NoSQL A NoSQL (originally referring to "non- SQL" or "non-relational") database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. Such databases have existed ...
database management systems. The original benchmark was developed by workers in the research division of
Yahoo! Yahoo! (, styled yahoo''!'' in its logo) is an American web services provider. It is headquartered in Sunnyvale, California and operated by the namesake company Yahoo Inc., which is 90% owned by investment funds managed by Apollo Global Man ...
who released it in 2010 with the stated goal of "facilitating performance comparisons of the new generation of cloud data serving systems", particularly for transaction-processing workloads which differed from ones measured by benchmarks designed for more traditional database management systems. YCSB was contrasted with the TPC-H benchmark from the
Transaction Processing Performance Council In online transaction processing (OLTP), information systems typically facilitate and manage transaction-oriented applications. This is contrasted with online analytical processing. The term "transaction" can have two different meanings, both of wh ...
, with YCSB being called a
big data Though used sometimes loosely partly because of a lack of formal definition, the interpretation that seems to best describe Big data is the one associated with large body of information that we could not comprehend when used only in smaller am ...
benchmark while TPC-H is a
decision support system A decision support system (DSS) is an information system that supports business or organizational decision-making activities. DSSs serve the management, operations and planning levels of an organization (usually mid and higher management) and h ...
benchmark. YCSB was used by DBMS vendors for "benchmark marketing". It has been used in scholarly or tutorial discussions, particularly for
Apache HBase HBase is an open-source non-relational distributed database modeled after Google's Bigtable and written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File Syst ...
. It has been used for multiple-product comparisons by industry observers such as
Network World International Data Group (IDG, Inc.) is a market intelligence and demand generation company focused on the technology industry. IDG, Inc.’s mission is centered around supporting the technology industry through research, data, marketing technol ...
(comparing
Cassandra Cassandra or Kassandra (; Ancient Greek: Κασσάνδρα, , also , and sometimes referred to as Alexandra) in Greek mythology was a Trojan priestess dedicated to the god Apollo and fated by him to utter true prophecies but never to be believe ...
,
MongoDB MongoDB is a source-available cross-platform document-oriented database program. Classified as a NoSQL database program, MongoDB uses JSON-like documents with optional schemas. MongoDB is developed by MongoDB Inc. and licensed under the Serve ...
, and
Riak Riak (pronounced "ree-ack" ) is a distributed NoSQL key-value data store based on Amazon's Dynamo paper, including its "tunable AP" approach, that is tunable consistency, to the tradeoffs imposed by the CAP Theorem. Riak offers high availability, ...
), Thumbtack Technologies (comparing Aerospike, Cassandra,
Couchbase Couchbase Server, originally known as Membase, is an open-source, distributed (shared-nothing architecture) multi-model NoSQL document-oriented database software package optimized for interactive applications. These applications may serve many ...
, and MongoDB), and the
Polytechnic Institute An institute of technology (also referred to as: technological university, technical university, university of technology, technological educational institute, technical college, polytechnic university or just polytechnic) is an institution of te ...
and
University of Coimbra The University of Coimbra (UC; pt, Universidade de Coimbra, ) is a Public university, public research university in Coimbra, Portugal. First established in Lisbon in 1290, it went through a number of relocations until moving permanently to Coi ...
(comparing Cassandra, HBase,
Elasticsearch Elasticsearch is a search engine based on the Lucene library. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java and is dual-l ...
, MongoDB, Oracle NoSQL,
OrientDB OrientDB is an open source NoSQL database management system written in Java (programming language), Java. It is a Multi-model database, supporting Graph database, graph, Document-oriented database, document, Key-value database, key/value, and Obj ...
,
Redis Redis (; Remote Dictionary Server) is an in-memory data structure store, used as a distributed, in-memory key–value database, cache and message broker, with optional durability. Redis supports different kinds of abstract data structures, su ...
, Scalaris,
Tarantool Tarantool is an in-memory computing platform with a flexible data schema, best used for creating high-performance applications. Two main parts of it are an in-memory database and a Lua application server. Tarantool maintains data in memory and ...
, and
Voldemort Lord Voldemort ( , in the films) is a sobriquet for Tom Marvolo Riddle, a Character (arts), character and the main antagonist in J. K. Rowling's series of ''Harry Potter'' novels. The character first appeared in ''Harry Potter and the Ph ...
).
SanDisk Corporation SanDisk is a brand for flash memory products, including memory cards and readers, USB flash drives, solid-state drives, and digital audio players, manufactured and marketed by Western Digital. The original company, SanDisk Corporation was acquire ...
published results measured on the
Oracle NoSQL Database Oracle NoSQL Database is a NoSQL-type distributed key-value database from Oracle Corporation. It provides transactional semantics for data manipulation, horizontal scalability, and simple administration and monitoring. Oracle NoSQL Database Cl ...
.


Implementations


Original Java Implementation

GoLang Implementation

C++ Implementation for LevelDB, RocksDB, LMDB Embedded Key-Value Stores

C++ Implementation for LevelDB, RocksDB, LMDB, WiredTiger, and UDisk Embedded Key-Value Stores


References

{{Reflist 2010 software Benchmarks (computing) Yahoo!