HOME

TheInfoList



OR:

ScyllaDB is an
open-source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
distributed Distribution may refer to: Mathematics *Distribution (mathematics), generalized functions used to formulate solutions of partial differential equations *Probability distribution, the probability of a particular value or value range of a varia ...
NoSQL A NoSQL (originally referring to "non- SQL" or "non-relational") database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. Such databases have existed ...
wide-column
data store In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpret ...
. It was designed to be compatible with
Apache Cassandra Cassandra is a free and open-source, distributed, wide-column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. Cass ...
while achieving significantly higher throughputs and lower latencies. It supports the same protocols as Cassandra ( CQL and
Thrift Thrift may refer to: * Frugality * A savings and loan association in the United States * Apache Thrift, a remote procedure call (RPC) framework * Thrift (plant), a plant in the genus ''Armeria'' * Syd Thrift (1929–2006), American baseball exec ...
) and the same file formats (SSTable), but is a completely rewritten implementation, using the C++20 language replacing Cassandra's Java, and the
Seastar Starfish or sea stars are star-shaped echinoderms belonging to the class Asteroidea (). Common usage frequently finds these names being also applied to ophiuroids, which are correctly referred to as brittle stars or basket stars. Starfish a ...
asynchronous programming library replacing classic Linux programming techniques such as threads, shared memory and mapped files. In addition to implementing Cassandra's protocols, ScyllaDB also implements the
Amazon DynamoDB Amazon DynamoDB is a fully managed proprietary NoSQL database service that supports key–value and document data structures and is offered by Amazon.com as part of the Amazon Web Services portfolio. DynamoDB exposes a similar data model to and ...
API. ScyllaDB uses a sharded design on each node, meaning that each
CPU A central processing unit (CPU), also called a central processor, main processor or just processor, is the electronic circuitry that executes instructions comprising a computer program. The CPU performs basic arithmetic, logic, controlling, and ...
core handles a different subset of data. Cores do not share data, but rather communicate explicitly when they need to. The ScyllaDB authors claim that this design allows ScyllaDB to achieve much better performance on modern NUMA
SMP SMP may refer to: Organisations * Scale Model Products, 1950s, acquired by Aluminum Model Toys * School Mathematics Project, UK developer of mathematics textbooks * '' Sekolah Menengah Pertama'', "junior high school" in Indonesia * Shanghai Mun ...
machines, and to scale very well with the number of cores. They have measured as much as 2 million requests per second on a single machine, and also claim that a ScyllaDB cluster can serve as many requests as a Cassandra cluster 10 times its size - and do so with lower latencies. Independent testing has not always been able to confirm such 10-fold throughput improvements, and sometimes measured smaller speedups, such as 2x. A 2017 benchmark from
Samsung The Samsung Group (or simply Samsung) ( ko, 삼성 ) is a South Korean multinational manufacturing conglomerate headquartered in Samsung Town, Seoul, South Korea. It comprises numerous affiliated businesses, most of them united under the ...
observed the 10x speedup on high-end machines - the Samsung benchmark reported that ScyllaDB outperformed Cassandra on a cluster of 24-core machines by a margin of 10–37x depending on the YCSB workload. ScyllaDB is available on-premise, on major public cloud providers, or as a DBaaS (ScyllaDB Cloud).


History

ScyllaDB was started in December 2014 by the
startup A startup or start-up is a company or project undertaken by an entrepreneur to seek, develop, and validate a scalable business model. While entrepreneurship refers to all new businesses, including self-employment and businesses that never intend t ...
Cloudius Systems (later renamed ScyllaDB Inc.), previously known for having created
OSv OSV may be: * OSV-96, a Russian anti-materiel rifle * Object–subject–verb word order * Offshore vessel * Old Sturbridge Village * Open-source voting * ''Our Sunday Visitor Our Sunday Visitor (OSV) is a Catholic publishing company in Hunti ...
. ScyllaDB was released as open source in September 2015,"Cassandra Rewritten In C++, Ten Times Faster"
September 22, 2015,
Slashdot ''Slashdot'' (sometimes abbreviated as ''/.'') is a social news website that originally advertised itself as "News for Nerds. Stuff that Matters". It features news stories concerning science, technology, and politics that are submitted and evalu ...
under the AGPL license. Employees of ScyllaDB Inc. remain the primary coders behind Scylla, but its development is open to the public and uses public
GitHub GitHub, Inc. () is an Internet hosting service for software development and version control using Git. It provides the distributed version control of Git plus access control, bug tracking, software feature requests, task management, co ...
repositories and public mailing lists.


References

{{reflist


External links


Scylla public GitHub repository
with source code repository and bug tracker
ScyllaDB Inc. homepage

ScyllaDB another contender to the open source NoSQL database crown

How Scylla Scaled to One Billion Rows a Second

ClickHouse Cost-Efficiency in Action: Analyzing 500 Billion Rows on an Intel NUC
Distributed data stores NoSQL Free database management systems Software that was rewritten in C++ Software using the GNU AGPL license