A vector clock is a
data structure
In computer science, a data structure is a data organization and storage format that is usually chosen for Efficiency, efficient Data access, access to data. More precisely, a data structure is a collection of data values, the relationships amo ...
used for determining the
partial ordering of events in a
distributed system and detecting
causality violations. Just as in
Lamport timestamps, inter-process messages contain the state of the sending process's
logical clock. A vector clock of a system of ''N'' processes is an
array/vector of ''N'' logical clocks, one clock per process; a local "largest possible values" copy of the global clock-array is kept in each process.
Denote
as the vector clock maintained by process
, the clock updates proceed as follows:
* Initially all clocks are zero.
* Each time a process experiences an internal event, it increments its own
logical clock in the vector by one. For instance, upon an event at process
, it updates
.
* Each time a process sends a message, it increments its own logical clock in the vector by one (as in the bullet above, but not twice for the same event) then it pairs the message with a copy of its own vector and finally sends the pair.
* Each time a process receives a message-vector clock pair, it increments its own logical clock in the vector by one and updates each element in its vector by taking the maximum of the value in its own vector clock and the value in the vector in the received pair (for every element). For example, if process
receives a message
from
, it first increments its own logical clock in the vector by one
and then updates its entire vector by setting
.
History
Lamport originated the idea of logical
Lamport clock
The Lamport timestamp algorithm is a simple logical clock algorithm used to determine the order of events in a distributed computer system. As different nodes or processes will typically not be perfectly synchronized, this algorithm is used to prov ...
s in 1978.
However, the logical clocks in that paper were scalars, not vectors. The generalization to vector time was developed several times, apparently independently, by different authors in the early 1980s.
At least 6 papers contain the concept.
The papers canonically cited in reference to vector clocks are Colin Fidge’s and
Friedemann Mattern’s 1988 works,
as they (independently) established the name "vector clock" and the mathematical properties of vector clocks.
[
]
Partial ordering property
Vector clocks allow for the partial causal ordering of events. Defining the following:
* denotes the vector clock of event , and denotes the component of that clock for process .
*