Visual analytics is a multidisciplinary science and technology field that emerged from
information visualization
Data and information visualization (data viz/vis or info viz/vis) is the practice of designing and creating Graphics, graphic or visual Representation (arts), representations of a large amount of complex quantitative and qualitative data and i ...
and
scientific visualization
Scientific visualization ( also spelled scientific visualisation) is an interdisciplinary branch of science concerned with the visualization of scientific phenomena. Michael Friendly (2008)"Milestones in the history of thematic cartography, st ...
. It focuses on how analytical
reasoning
Reason is the capacity of consciously applying logic by drawing valid conclusions from new or existing information, with the aim of seeking the truth. It is associated with such characteristically human activities as philosophy, religion, scien ...
can be facilitated by interactive
visual interfaces.
Overview
Visual analytics is "the science of analytical reasoning facilitated by interactive visual interfaces."
[ James J. Thomas and Kristin A. Cook (Ed.) (2005)]
''Illuminating the Path: The R&D Agenda for Visual Analytics''
National Visualization and Analytics Center. It can attack certain problems whose size, complexity, and need for closely coupled human and machine analysis may make them otherwise intractable.
[Robert Kosara (2007)]
''Visual Analytics''
ITCS 4122/5122, Fall 2007. Retrieved 28 June 2008. Visual analytics advances science and technology developments in analytical reasoning, interaction, data transformations and representations for computation and visualization, analytic reporting, and technology transition. As a research agenda, visual analytics brings together several scientific and technical communities from computer science, information visualization, cognitive and perceptual sciences, interactive design, graphic design, and social sciences.
Visual analytics integrates new computational and theory-based tools with innovative interactive techniques and
visual representations to enable human-information discourse. The design of the tools and techniques is based on
cognitive
Cognition is the "mental action or process of acquiring knowledge and understanding through thought, experience, and the senses". It encompasses all aspects of intellectual functions and processes such as: perception, attention, thought, ...
,
design
A design is the concept or proposal for an object, process, or system. The word ''design'' refers to something that is or has been intentionally created by a thinking agent, and is sometimes used to refer to the inherent nature of something ...
, and
perceptual
Perception () is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous syste ...
principles. This science of analytical reasoning provides the reasoning framework upon which one can build both strategic and tactical visual analytics technologies for threat analysis, prevention, and response. Analytical reasoning is central to the analyst’s task of applying human judgments to reach conclusions from a combination of evidence and assumptions.
Visual analytics has some overlapping goals and techniques with
information visualization
Data and information visualization (data viz/vis or info viz/vis) is the practice of designing and creating Graphics, graphic or visual Representation (arts), representations of a large amount of complex quantitative and qualitative data and i ...
and
scientific visualization
Scientific visualization ( also spelled scientific visualisation) is an interdisciplinary branch of science concerned with the visualization of scientific phenomena. Michael Friendly (2008)"Milestones in the history of thematic cartography, st ...
. There is currently no clear consensus on the boundaries between these fields, but broadly speaking the three areas can be distinguished as follows:
* Scientific visualization deals with data that has a natural geometric structure (e.g., MRI data, wind flows).
* Information visualization handles abstract data structures such as trees or graphs.
* Visual analytics is especially concerned with coupling interactive visual representations with underlying analytical processes (e.g., statistical procedures,
data mining
Data mining is the process of extracting and finding patterns in massive data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and ...
techniques) such that high-level, complex activities can be effectively performed (e.g., sense making, reasoning, decision making).
Visual analytics seeks to marry techniques from information visualization with techniques from computational transformation and analysis of data. Information visualization forms part of the direct interface between user and machine, amplifying human cognitive capabilities in six basic ways:
# by increasing cognitive resources, such as by using a visual resource to expand human working memory,
# by reducing search, such as by representing a large amount of data in a small space,
# by enhancing the recognition of patterns, such as when information is organized in space by its time relationships,
# by supporting the easy perceptual inference of relationships that are otherwise more difficult to induce,
# by perceptual monitoring of a large number of potential events, and
# by providing a manipulable medium that, unlike static diagrams, enables the exploration of a space of parameter values
These capabilities of information visualization, combined with computational data analysis, can be applied to analytic reasoning to support the sense-making process.
History
As an interdisciplinary approach, visual analytics has its roots in information visualization, cognitive sciences, and computer science. The term and scope of the field was defined in the early 2000s trough researchers such as
John Stasko,
Pak Chung Wong,
Ben Shneiderman
Ben Shneiderman (born August 21, 1947) is an American computer scientist, a Distinguished University Professor in the University of Maryland Department of Computer Science, which is part of the University of Maryland College of Computer, Mathem ...
,
Jim Thomas, and
David S. Ebert, and most notably through the creation
United States Department of Homeland Security
The United States Department of Homeland Security (DHS) is the U.S. United States federal executive departments, federal executive department responsible for public security, roughly comparable to the Interior minister, interior, Home Secretary ...
's creation of the
National Visualization and Analytics Center (NVAC) at
Pacific Northwest National Laboratory
Pacific Northwest National Laboratory (PNNL) is one of the United States Department of Energy national laboratories, managed by the Department of Energy's (DOE) Office of Science. The main campus of the laboratory is in Richland, Washington ...
in 2004, whose charter was to mitigate information overload after the
September 11 2001 attacks in the intelligence community. Their research work determined core challenges and positioned visual analytics as a new research domain.
In 2006, the IEEE VIS community by
Pak Chung Wong and
Daniel A. Keim launched the annual ''IEEE Conference on Visual Analytics Science and Technology (VAST)'', providing a dedicated venue for VA research, which in 2020 merged to form the
IEEE Visualization conference. The domain was further refined as part of the European Commissions
FP7 VisMaster program in the late 2000s.
Topics
Scope
Visual analytics is a multidisciplinary field that includes the following focus areas:
* Analytical reasoning techniques that enable users to obtain deep insights that directly support assessment, planning, and decision making
* Data representations and transformations that convert all types of conflicting and dynamic data in ways that support visualization and analysis
* Techniques to support production, presentation, and dissemination of the results of an analysis to communicate information in the appropriate context to a variety of audiences.
* Visual representations and
interaction techniques
[A. Kerren and F. Schreiber. Toward the Role of Interaction in Visual Analytics. In Proceedings of the 2012 Winter Simulation Conference (WSC '12), pages 420:1-420:13, Berlin, Germany, 2012. IEEE Computer Society Press.] that take advantage of the human eye’s broad bandwidth pathway into the mind to allow users to see, explore, and understand large amounts of information at once.
Analytical reasoning techniques
Analytical reasoning techniques are the method by which users obtain deep insights that directly support situation assessment, planning, and decision making. Visual analytics must facilitate high-quality human judgment with a limited investment of the analysts’ time. Visual analytics tools must enable diverse analytical tasks such as:
* Understanding past and present situations quickly, as well as the trends and events that have produced current conditions
* Identifying possible alternative futures and their warning signs
* Monitoring current events for emergence of warning signs as well as unexpected events
* Determining indicators of the intent of an action or an individual
* Supporting the decision maker in times of crisis.
These tasks will be conducted through a combination of individual and collaborative analysis, often under extreme time pressure. Visual analytics must enable hypothesis-based and scenario-based analytical techniques, providing support for the analyst to reason based on the available evidence.
Data representations
Data representations are structured forms suitable for computer-based transformations. These structures must exist in the original data or be derivable from the data themselves. They must retain the information and knowledge content and the related context within the original data to the greatest degree possible. The structures of underlying data representations are generally neither accessible nor intuitive to the user of the visual analytics tool. They are frequently more complex in nature than the original data and are not necessarily smaller in size than the original data. The structures of the data representations may contain hundreds or thousands of dimensions and be unintelligible to a person, but they must be transformable into lower-dimensional representations for visualization and analysis.
Theories of visualization
Theories of visualization include:
*
Jacques Bertin
Jacques Bertin (27 July 1918 – 3 May 2010) was a French cartographer and theorist, known from his book ''Sémiologie Graphique'' (''Semiology of Graphics''), published in 1967. This monumental work, based on his experience as a cartographer an ...
's ''
Semiology of Graphics'' (1967)
*
Nelson Goodman
Henry Nelson Goodman (7 August 1906 – 25 November 1998) was an American philosopher, known for his work on counterfactuals, mereology, the problem of induction, irrealism, and aesthetics.
Life and career
Goodman was born in Somerville, Ma ...
's ''Languages of Art'' (1977)
*
Jock D. Mackinlay's ''Automated design of optimal visualization'' (APT) (1986)
*
Leland Wilkinson
Leland Wilkinson (November 5, 1944 – December 10, 2021) was an American statistician and computer scientist at and adjunct professor of computer science at University of Illinois at Chicago. Wilkinson developed the SYSTAT statistical package ...
's ''Grammar of Graphics'' (1998)
Visual representations
Visual representations translate data into a visible form that highlights important features, including commonalities and anomalies. These visual representations make it easy for users to perceive salient aspects of their data quickly. Augmenting the cognitive reasoning process with perceptual reasoning through visual representations permits the analytical reasoning process to become faster and more focused.
File:Nlsy97_jobsatis.jpg, Bar chart representing the relative distribution of job satisfaction
File:NLSYpie.jpg, Pie chart representing the relative distribution of job satisfaction according to the National Longitudinal Survey of Youth, 1997 cohort, survey year 2008
Process
The input for the data sets used in the visual analytics process are
heterogeneous
Homogeneity and heterogeneity are concepts relating to the uniformity of a substance, process or image. A homogeneous feature is uniform in composition or character (i.e., color, shape, size, weight, height, distribution, texture, language, i ...
data sources (i.e., the internet, newspapers, books, scientific experiments,
expert system
In artificial intelligence (AI), an expert system is a computer system emulating the decision-making ability of a human expert.
Expert systems are designed to solve complex problems by reasoning through bodies of knowledge, represented mainly as ...
s). From these rich sources, the data sets ''S = S
1, ..., S
m'' are chosen, whereas each ''S
i , i ∈ (1, ..., m)'' consists of
attributes A
i1, ..., A
ik. The goal or output of the process is insight ''I''. Insight is either directly obtained from the set of created visualizations ''V'' or through confirmation of
hypotheses
A hypothesis (: hypotheses) is a proposed explanation for a phenomenon. A scientific method, scientific hypothesis must be based on observations and make a testable and reproducible prediction about reality, in a process beginning with an educ ...
''H'' as the results of automated analysis methods. This formalization of the visual analytics process is illustrated in the following figure. Arrows represent the transitions from one set to another one.
More formally the visual analytics process is a
transformation
Transformation may refer to:
Science and mathematics
In biology and medicine
* Metamorphosis, the biological process of changing physical form after birth or hatching
* Malignant transformation, the process of cells becoming cancerous
* Trans ...
''F: S → I'', whereas ''F'' is a concatenation of functions ''f ∈ '' defined as follows:
''D
W'' describes the basic data
pre-processing functionality with ''D
W : S → S and W ∈ '' including data transformation functions ''D
T'', data cleaning functions ''D
C'', data selection functions ''D
SL'' and data integration functions ''D
I'' that are needed to make analysis functions applicable to the data set.
''V
W, W ∈ '' symbolizes the visualization functions, which are either functions visualizing data ''V
S : S → V'' or functions visualizing hypotheses ''V
H : H → V''.
''H
Y, Y ∈ '' represents the hypotheses generation process. We distinguish between functions that generate hypotheses from data ''H
S : S → H'' and functions that generate hypotheses from visualizations ''H
V : V → H''.
Moreover, user interactions ''U
Z, Z ∈ '' are an integral part of the visual analytics process. User interactions can either effect only visualizations ''U
V : V → V'' (i.e., selecting or zooming), or can effect only hypotheses ''U
H : H → H'' by generating a new hypotheses from given ones. Furthermore, insight can be concluded from visualizations ''U
CV : V → I'' or from hypotheses ''U
CH : H → I''.
The typical data pre-processing applying data cleaning, data integration and data transformation functions is defined as ''D
P = D
T(D
I(D
C(S
1, ..., S
n)))''. After the pre-processing step either automated analysis methods ''H
S = '' (i.e., statistics, data mining, etc.) or visualization methods ''V
S : S → V, V
S = '' are applied to the data, in order to reveal patterns as shown in the figure above.
[Daniel A. Keim, Florian Mansmann, Jörn Schneidewind, Jim Thomas, and Hartmut Ziegler (2008). "Visual Analytics: Scope and Challenges"]
In general the following paradigm is used to process the data:
''Analyse First – Show the Important – Zoom, Filter and Analyse Further – Details on Demand''
[Keim D. A, Mansmann F, Schneidewind J, Thomas J, Ziegler H: ''Visual analytics: Scope and challenges''. ''Visual Data Mining:'' 2008, S. 82.]
See also
Related subjects
*
Cartography
Cartography (; from , 'papyrus, sheet of paper, map'; and , 'write') is the study and practice of making and using maps. Combining science, aesthetics and technique, cartography builds on the premise that reality (or an imagined reality) can ...
*
Computational visualistics
*
Critical thinking
Critical thinking is the process of analyzing available facts, evidence, observations, and arguments to make sound conclusions or informed choices. It involves recognizing underlying assumptions, providing justifications for ideas and actions, ...
*
Decision-making
In psychology, decision-making (also spelled decision making and decisionmaking) is regarded as the Cognition, cognitive process resulting in the selection of a belief or a course of action among several possible alternative options. It could be ...
*
Google Analytics
Google Analytics is a web analytics service offered by Google that tracks and reports website traffic and also mobile app traffic and events, currently as a platform inside the Google Marketing Platform brand. Google launched the service in N ...
*
Interaction design
Interaction design, often abbreviated as IxD, is "the practice of designing interactive digital products, environments, systems, and services." While interaction design has an interest in form (similar to other design fields), its main area of foc ...
*
Interactive visual analysis
*
Interactivity
Across the many fields concerned with interactivity, including information science, computer science, human-computer interaction, communication, and industrial design, there is little agreement over the meaning of the term "interactivity", but ...
*
Social network analysis software
Social network analysis (SNA) software is software which facilitates quantitative analysis of behavior, quantitative or qualitative research, qualitative social network analysis, analysis of social networks, by describing features of a network eit ...
*
Software visualization
Software consists of computer programs that instruct the execution of a computer. Software also includes design documents and specifications.
The history of software is closely tied to the development of digital computers in the mid-20th cen ...
*
Starlight Information Visualization System
*
Text analytics
Text mining, text data mining (TDM) or text analytics is the process of deriving high-quality information from plain text, text. It involves "the discovery by computer of new, previously unknown information, by automatically extracting information ...
*
Traffic analysis
Traffic analysis is the process of intercepting and examining messages in order to deduce information from patterns in communication. It can be performed even when the messages are encrypted. In general, the greater the number of messages observ ...
*
Visual reasoning
Related scientists
*
Cecilia R. Aragon
*
Robert E. Horn
*
Daniel A. Keim
*
Theresa-Marie Rhyne
*
Lawrence J. Rosenblum
*
Ben Shneiderman
Ben Shneiderman (born August 21, 1947) is an American computer scientist, a Distinguished University Professor in the University of Maryland Department of Computer Science, which is part of the University of Maryland College of Computer, Mathem ...
*
John Stasko
*
Jim Thomas
*
Pak Chung Wong
Related software
*
imc FAMOS
FAMOS (short for fast analysis and monitoring of signals) is a software for post-processing test & measurement data in time and frequency domain, and visually displaying measurement results. The program was introduced in 1987 by the German compa ...
(1987), graphical data analysis
References
Further reading
* Boris Kovalerchuk and James Schwing (2004). ''Visual and Spatial Analysis: Advances in Data Mining, Reasoning, and Problem Soving''
* Guoping Qiu (2007). ''Advances in Visual Information Systems: 9th International Conference (VISUAL). ''
* IEEE, Inc. Staff (2007). ''Visual Analytics Science and Technology (VAST), A Symposium of the IEEE 2007.''
* May Yuan, Kathleen and Stewart Hornsby (2007). ''Computation and Visualization for Understanding Dynamics in Geographic Domains. ''
* Daniel Keim, Gennady Andrienko, Jean-Daniel Fekete, Carsten Görg, Jörn Kohlhammer, and Guy Melançon (2008). ''Visual Analytics: Definition, Process, and Challenges''. In Andreas Kerren, John T. Stasko, Jean-Daniel Fekete, and Chris North (Eds.), Information Visualization - Human-Centered Issues and Perspectives, pages 154-175, Lecture Notes in Computer Science 4950, Springer Berlin Heidelberg.
Mastering the Visualization Age: Solving Problems with Visual Analytics(2010) (pdf)
* Kawa Nazemi (2014). Adaptive Semantics Visualization. Eurographics Associatio
TU Darmstadt Dissertation. Eurographics.
External links
*
{{Visualization
Computational science
Computer graphics
Infographics
Visualization (graphics)
Scientific visualization
Cartography
Types of analytics
Big data