A region of interest (often abbreviated ROI) is a sample within a
data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the ...
identified for a particular purpose.
The concept of a ROI is commonly used in many application areas. For example, in
medical imaging
Medical imaging is the technique and process of imaging the interior of a body for clinical analysis and medical intervention, as well as visual representation of the function of some organs or tissues (physiology). Medical imaging seeks to rev ...
, the boundaries of a tumor may be defined on an image or in a volume, for the purpose of measuring its size. The endocardial border may be defined on an image, perhaps during different phases of the cardiac cycle, for example, end-systole and end-diastole, for the purpose of assessing cardiac function. In
geographical information system
A geographic information system (GIS) is a type of database containing geographic data (that is, descriptions of phenomena for which location is relevant), combined with software tools for managing, analyzing, and visualizing those data. In a br ...
s (GIS), a ROI can be taken literally as a polygonal selection from a 2D map. In
computer vision
Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate tasks that the hum ...
and
optical character recognition
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scen ...
, the ROI defines the
borders
A border is a geographical boundary.
Border, borders, The Border or The Borders may also refer to:
Arts, entertainment and media Film and television
* ''Border'' (1997 film), an Indian Hindi-language war film
* ''Border'' (2018 Swedish film), ...
of an object under consideration. In many applications, symbolic (textual) labels are added to a ROI, to describe its content in a compact manner. Within a ROI may lie individual ''
points of interest
A point of interest (POI) is a specific point location that someone may find useful or interesting. An example is a point on the Earth representing the location of the Eiffel Tower, or a point on Mars representing the location of its highest m ...
'' (POIs).
Examples of regions of interest
* 1D dataset: a time or frequency interval on a waveform
* 2D dataset: the boundaries of an object on an image
* 3D dataset: the contours or surfaces outlining an object (sometimes known as the Volume of Interest (VOI)) in a volume
* 4D dataset: the outline of an object at or during a particular time interval in a time-volume
A ROI is a form of
Annotation
An annotation is extra information associated with a particular point in a document or other piece of information. It can be a note that includes a comment or explanation. Annotations are sometimes presented in the margin of book pages. For anno ...
, often associated with categorical or quantitative information (e.g., measurements like volume or mean intensity), expressed as text or in a structured form.
There are three fundamentally different means of encoding a ROI:
* As an integral part of the sample data set, with a unique or masking value that may or may not be outside the normal range of normally occurring values and which tags individual data cells
* As separate, purely graphic information, such as with vector or bitmap (rasterized) drawing elements, perhaps with some accompanying plain
(unstructured) text in the format of the data itself
* As a separate structured
semantic
Semantics (from grc, σημαντικός ''sēmantikós'', "significant") is the study of reference, meaning, or truth. The term can be used to refer to subfields of several distinct disciplines, including philosophy, linguistics and comput ...
information (such as coded
value type In computer programming, data types can be divided into two categories: value types (or by-value types) and reference types (or by-reference types). Value types are completely represented by their meaning, while reference types are references to ano ...
s) with a set of
spatial and/or temporal coordinates
Medical imaging
Medical imaging standards such as
DICOM
Digital Imaging and Communications in Medicine (DICOM) is the standard for the communication and management of medical imaging information and related data. DICOM is most commonly used for storing and transmitting medical images enabling the integ ...
provide general and application-specific mechanisms to support various use-cases.
For DICOM images (two or more dimensions):
* Burned in graphics and text may occur within the normal
pixel
In digital imaging, a pixel (abbreviated px), pel, or picture element is the smallest addressable element in a raster image, or the smallest point in an all points addressable display device.
In most digital display devices, pixels are the smal ...
value range (e.g., as the maximum white value) (deprecated)
* Bitmap (rasterized) overlay graphics and text may be present in unused high bits of the pixel data or in a separate attribute (deprecated)
* Vector graphics may be encoded in separate image attributes as curves (deprecated)
* Unstructured vector graphics and text as well as bitmap (rasterized) overlay graphics may be encoded in a separate object as a presentation state that references the image object to which it is to be applied
* Structured data may be encoded in a separate object as a structured report in the form of a tree of name-value pairs of coded or text concepts possibly associated with derived quantitative information can reference spatial and/or temporal coordinates that in turn reference the image objects to which they apply
* Reference locations may be encoded as
fiducials in the form of spatial coordinates with an associated coded purpose, either as pixel coordinates by reference to specific images or as coordinates in a named patient-relative 3D Cartesian space
* Pixels (possibly non-contiguous) may be classified into
segments encoded in a segmentation object as either binary or probabilistic values in a raster (which is not required to have the same spatial sampling or extent as the images from which the segmentation was derived); these are usually referenced by other objects containing structured content (structured reports)
For DICOM radiotherapy:
* Contours of objects may be defined as structure sets, either as pixel coordinates by reference to specific images or as coordinates in a named patient-relative 3D Cartesian space (these are also used for non-RT applications)
For DICOM time-based waveforms:
* Burned in values may occur with the waveform (deprecated)
* Annotations may be encoded in a separate attribute can select multiple time points or a range of time points, either by sample number or specified time
* Structured data may be encoded in a separate object as a structured report in the form of a tree of name-value pairs of coded or text concepts possibly associated with derived quantitative information can reference temporal coordinates that in turn reference the waveform objects to which they apply
HL7
Health Level Seven or HL7 refers to a set of international standards for transfer of clinical and administrative data between software applications used by various healthcare providers. These standards focus on the application layer, which is "la ...
Clinical Document Architecture
The HL7 Clinical Document Architecture (CDA) is an XML-based markup standard intended to specify the encoding, structure and semantics of clinical documents for exchange. In November 2000, HL7 published Release 1.0. The organization published R ...
also has a subset of mechanisms similar to (and intended to be compatible with) DICOM for referencing image-related spatial coordinates as observations; it allows for a circle, ellipse,
polyline
In geometry, a polygonal chain is a connected series of line segments. More formally, a polygonal chain is a curve specified by a sequence of points (A_1, A_2, \dots, A_n) called its vertices. The curve itself consists of the line segments co ...
or point to be defined as integer pixel-relative coordinates referencing an external multi-media image object, which may be of a consumer rather than medical image format (e.g., a
GIF
The Graphics Interchange Format (GIF; or , see pronunciation) is a bitmap image format that was developed by a team at the online services provider CompuServe led by American computer scientist Steve Wilhite and released on 15 June 1987. ...
,
PNG or
JPEG
JPEG ( ) is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and imag ...
).
Document analysis systems
In
Optical Character Recognition
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scen ...
(OCR) and
Document Layout Analysis, regions of interest (ROIs) hierarchically encompass pages, text or graphical blocks, down to individual line-strip images, word and character image boxes. The de facto standard in archives and libraries is the tuplet , usually in the form of a *.tif file and its accompanying *.xml file.
Other 2D applications
As far as non-medical standards are concerned, in addition to the purely graphic markup languages (such as
PostScript
PostScript (PS) is a page description language in the electronic publishing and desktop publishing realm. It is a dynamically typed, concatenative programming language. It was created at Adobe Systems by John Warnock, Charles Geschke, Doug Br ...
or
PDF
Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. ...
) and vector graphic (such as
SVG) and 3D (such as
VRML
VRML (Virtual Reality Modeling Language, pronounced ''vermal'' or by its initials, originally—before 1995—known as the Virtual Reality Markup Language) is a standard file format for representing 3-dimensional (3D) interactive vector graphi ...
) drawing file formats that are widely available, and which carry no specific ROI semantics, some standards such as
JPEG 2000
JPEG 2000 (JP2) is an image compression standard and coding system. It was developed from 1997 to 2000 by a Joint Photographic Experts Group committee chaired by Touradj Ebrahimi (later the JPEG president), with the intention of superseding the ...
specifically provide mechanisms to label and/or compress to a different degree of fidelity, what they refer to as regions of interest.
References
{{reflist
Medical imaging
Optical character recognition
Geographic information systems