In
multi-threaded computer programming
Computer programming or coding is the composition of sequences of instructions, called computer program, programs, that computers can follow to perform tasks. It involves designing and implementing algorithms, step-by-step specifications of proc ...
, a function is thread-safe when it can be invoked or accessed concurrently by multiple threads without causing unexpected behavior,
race condition
A race condition or race hazard is the condition of an electronics, software, or other system where the system's substantive behavior is dependent on the sequence or timing of other uncontrollable events, leading to unexpected or inconsistent ...
s, or data corruption.
As in the multi-threaded context where a program executes several threads simultaneously in a shared
address space and each of those threads has access to every other thread's
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembe ...
, thread-safe functions need to ensure that all those threads behave properly and fulfill their design specifications without unintended interaction.
There are various strategies for making thread-safe data structures.
Levels of thread safety
Different vendors use slightly different terminology for thread-safety, but the most commonly used thread-safety terminology are:
*Not thread safe: Data structures should not be accessed simultaneously by different threads.
*Thread safe, serialization: Uses a single mutex for all resources to guarantee the thread to be free of
race conditions when those resources are accessed by multiple threads simultaneously.
*Thread safe, MT-safe: Uses a mutex for every single resource to guarantee the thread to be free of
race conditions when those resources are accessed by multiple threads simultaneously.
Thread safety guarantees usually also include design steps to prevent or limit the risk of different forms of
deadlocks, as well as optimizations to maximize concurrent performance. However, deadlock-free guarantees cannot always be given, since deadlocks can be caused by
callbacks and violation of
architectural layering independent of the library itself.
Software libraries can provide certain thread-safety guarantees. For example, concurrent reads might be guaranteed to be thread-safe, but concurrent writes might not be. Whether a program using such a library is thread-safe depends on whether it uses the library in a manner consistent with those guarantees.
Implementation approaches
Listed are two classes of approaches for avoiding
race conditions to achieve thread-safety.
The first class of approaches focuses on avoiding shared state and includes:
;
Re-entrancy: Writing code in such a way that it can be partially executed by a thread, executed by the same thread, or simultaneously executed by another thread and still correctly complete the original execution. This requires the saving of
state information in variables local to each execution, usually on a stack, instead of in
static or
global
Global may refer to:
General
*Globe, a spherical model of celestial bodies
*Earth, the third planet from the Sun
Entertainment
* ''Global'' (Paul van Dyk album), 2003
* ''Global'' (Bunji Garlin album), 2007
* ''Global'' (Humanoid album), 198 ...
variables or other non-local state. All non-local states must be accessed through atomic operations and the data-structures must also be reentrant.
;
Thread-local storage: Variables are localized so that each thread has its own private copy. These variables retain their values across
subroutines and other code boundaries and are thread-safe since they are local to each thread, even though the code which accesses them might be executed simultaneously by another thread.
;
Immutable objects: The state of an object cannot be changed after construction. This implies both that only read-only data is shared and that inherent thread safety is attained. Mutable (non-const) operations can then be implemented in such a way that they create new objects instead of modifying the existing ones. This approach is characteristic of
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by Function application, applying and Function composition (computer science), composing Function (computer science), functions. It is a declarat ...
and is also used by the ''string'' implementations in Java, C#, and Python. (See
Immutable object.)
The second class of approaches are synchronization-related, and are used in situations where shared state cannot be avoided:
;
Mutual exclusion: Access to shared data is ''serialized'' using mechanisms that ensure only one thread reads or writes to the shared data at any time. Incorporation of mutual exclusion needs to be well thought out, since improper usage can lead to side-effects like
deadlocks,
livelocks, and
resource starvation
In computer science, resource starvation is a problem encountered in concurrent computing where a process is perpetually denied necessary resources
''Resource'' refers to all the materials available in our environment which are Technology, te ...
.
;
Atomic operations: Shared data is accessed by using atomic operations which cannot be interrupted by other threads. This usually requires using special
machine language instructions, which might be available in a
runtime library. Since the operations are atomic, the shared data is always kept in a valid state, no matter how other threads access it. Atomic operations form the basis of many thread locking mechanisms, and are used to implement mutual exclusion primitives.
Examples
In the following piece of
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
code, the Java keyword
synchronized makes the method thread-safe:
class Counter
In the
C programming language, each thread has its own stack. However, a
static variable
In computer programming, a static variable is a variable that has been allocated "statically", meaning that its lifetime (or "extent") is the entire run of the program. This is in contrast to shorter-lived automatic variables, whose storage is ...
is not kept on the stack; all threads share simultaneous access to it. If multiple threads overlap while running the same function, it is possible that a static variable might be changed by one thread while another is midway through checking it. This difficult-to-diagnose
logic error, which may compile and run properly most of the time, is called a
race condition
A race condition or race hazard is the condition of an electronics, software, or other system where the system's substantive behavior is dependent on the sequence or timing of other uncontrollable events, leading to unexpected or inconsistent ...
. One common way to avoid this is to use another shared variable as a
"lock" or "mutex" (from mutual exclusion).
In the following piece of C code, the function is thread-safe, but not reentrant:
# include
int increment_counter ()
In the above,
increment_counter
can be called by different threads without any problem since a mutex is used to synchronize all access to the shared
counter
variable. But if the function is used in a reentrant interrupt handler and a second interrupt arises while the mutex is locked, the second routine will hang forever. As interrupt servicing can disable other interrupts, the whole system could suffer.
The same function can be implemented to be both thread-safe and reentrant using the lock-free
atomics in
C++11
C++11 is a version of a joint technical standard, ISO/IEC 14882, by the International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC), for the C++ programming language. C++11 replaced the prior vers ...
:
# include
int increment_counter ()
See also
*
Concurrency control
In information technology and computer science, especially in the fields of computer programming, operating systems, multiprocessors, and databases, concurrency control ensures that correct results for concurrent operations are generated, whil ...
*
Concurrent data structure
*
Exception safety
*
Priority inversion
*
ThreadSafe
References
External links
*
*
*
*{{cite web, url=http://www.thinkingparallel.com/2006/10/15/a-short-guide-to-mastering-thread-safety/, title=A Short Guide to Mastering Thread-Safety, last=Suess, first=Michael, date=15 October 2006, website=Thinking Parallel, access-date=2012-01-22
Threads (computing)
Programming language topics