Sparse is a computer software tool designed to find possible coding faults in the
Linux kernel
The Linux kernel is a Free and open-source software, free and open source Unix-like kernel (operating system), kernel that is used in many computer systems worldwide. The kernel was created by Linus Torvalds in 1991 and was soon adopted as the k ...
. Unlike
other such tools, this
static analysis tool was initially designed to only flag constructs that were likely to be of interest to
kernel developers, such as the mixing of pointers to user and kernel
address space
In computing, an address space defines a range of discrete addresses, each of which may correspond to a network host, peripheral device, disk sector, a memory cell or other logical or physical entity.
For software programs to save and retrieve ...
s.
Sparse checks for known problems and allows the developer to include annotations in the code that convey information about
data type
In computer science and computer programming, a data type (or simply type) is a collection or grouping of data values, usually specified by a set of possible values, a set of allowed operations on these values, and/or a representation of these ...
s, such as the address space that pointers point to and the locks that a function acquires or releases.
Linus Torvalds
Linus Benedict Torvalds ( , ; born 28 December 1969) is a Finnish software engineer who is the creator and lead developer of the Linux kernel. He also created the distributed version control system Git.
He was honored, along with Shinya Yam ...
started writing Sparse in 2003. Josh Triplett was its maintainer from 2006, a role taken over by Christopher Li in 2009
and by Luc Van Oostenryck in November 2018.
Sparse is released under the
MIT License
The MIT License is a permissive software license originating at the Massachusetts Institute of Technology (MIT) in the late 1980s. As a permissive license, it puts very few restrictions on reuse and therefore has high license compatibility.
Unl ...
.
Annotations
Some of the checks performed by Sparse require annotating the source code using the
__attribute__
GCC extension, or the Sparse-specific
__context__
specifier.
Sparse defines the following list of attributes:
*
address_space(''num'')
*
bitwise
*
force
*
context(''expression'',''in_context'',''out_context'')
When an API is defined with a macro, the specifier
__attribute__((context(''...'')))
can be replaced by
__context__(''...'')
.
Linux kernel definitions
The Linux kernel defines the following short forms as pre-processor macros in file
linux/compiler.han
linux/types.h(when building without the
__CHECKER__
flag, all these annotations are removed from the code):
#ifdef __CHECKER__
# define __user __attribute__((noderef, address_space(1)))
# define __kernel __attribute__((address_space(0)))
# define __safe __attribute__((safe))
# define __force __attribute__((force))
# define __nocast __attribute__((nocast))
# define __iomem __attribute__((noderef, address_space(2)))
# define __must_hold(x) __attribute__((context(x,1,1)))
# define __acquires(x) __attribute__((context(x,0,1)))
# define __releases(x) __attribute__((context(x,1,0)))
# define __acquire(x) __context__(x,1)
# define __release(x) __context__(x,-1)
# define __cond_lock(x,c) ((c) ? () : 0)
# define __percpu __attribute__((noderef, address_space(3)))
#ifdef CONFIG_SPARSE_RCU_POINTER
# define __rcu __attribute__((noderef, address_space(4)))
#else
# define __rcu
#endif
extern void __chk_user_ptr(const volatile void __user *);
extern void __chk_io_ptr(const volatile void __iomem *);
#else
# define __user
# define __kernel
# define __safe
# define __force
# define __nocast
# define __iomem
# define __chk_user_ptr(x) (void)0
# define __chk_io_ptr(x) (void)0
# define __builtin_warning(x, y...) (1)
# define __must_hold(x)
# define __acquires(x)
# define __releases(x)
# define __acquire(x) (void)0
# define __release(x) (void)0
# define __cond_lock(x,c) (c)
# define __percpu
# define __rcu
#endif
#ifdef __CHECKER__
# define __bitwise __attribute__((bitwise))
#else
# define __bitwise
#endif
Examples
The types
__le32
and
__be32
represent 32-bit integer types with different
endianness
file:Gullivers_travels.jpg, ''Gulliver's Travels'' by Jonathan Swift, the novel from which the term was coined
In computing, endianness is the order in which bytes within a word (data type), word of digital data are transmitted over a data comm ...
. However, the C language does not allow to specify that variables of these types should not be mixed. The
bitwise
attribute is used to mark these types as restricted, so Sparse will give a warning if variables of these types or other integer variables are mixed:
typedef __u32 __bitwise __le32;
typedef __u32 __bitwise __be32;
To mark valid conversions between restricted types, a casting with the
force
attribute is used to avoid Sparse giving a warning.
See also
*
List of tools for static code analysis
References
Further reading
*
*
*
*
*
External links
Official documentation ''Linux Kernel Documentation''
*
*{{man, 1, cgcc, Linux, Compiler wrapper to run Sparse after compiling
Static program analysis tools
Free software testing tools
Software using the MIT license