HOME

TheInfoList



OR:

A geographic information system (GIS) consists of integrated computer hardware and
software Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications. The history of software is closely tied to the development of digital comput ...
that store, manage, analyze, edit, output, and visualize geographic data. Much of this often happens within a
spatial database A spatial database is a general-purpose database (usually a relational database) that has been enhanced to include spatial data that represents objects defined in a geometric space, along with tools for querying and analyzing such data. Most ...
; however, this is not essential to meet the definition of a GIS. In a broader sense, one may consider such a system also to include human users and support staff, procedures and workflows, the
body of knowledge A body of knowledge (BOK or BoK) is the complete set of concepts, terms and activities that make up a professional domain, as defined by the relevant learned society or professional association.Oliver, G.R. (2012). ''Foundations of the Assumed Bus ...
of relevant concepts and methods, and institutional organizations. The uncounted plural, ''geographic information systems'', also abbreviated GIS, is the most common term for the industry and profession concerned with these systems. The academic discipline that studies these systems and their underlying geographic principles, may also be abbreviated as GIS, but the unambiguous
GIScience Geographic information science (GIScience, GISc) or geoinformation science is a scientific discipline at the crossroads of computational science, social science, and natural science that studies geographic information, including how it represent ...
is more common. GIScience is often considered a subdiscipline of
geography Geography (from Ancient Greek ; combining 'Earth' and 'write', literally 'Earth writing') is the study of the lands, features, inhabitants, and phenomena of Earth. Geography is an all-encompassing discipline that seeks an understanding o ...
within the branch of
technical geography Technical geography is the branch of geography that involves using, studying, and creating tools to obtain, analyze, interpret, understand, and communicate spatial information. The other branches of geography, most commonly limited to human geo ...
. Geographic information systems are utilized in multiple technologies, processes, techniques and methods. They are attached to various operations and numerous applications, that relate to: engineering, planning, management, transport/logistics, insurance, telecommunications, and business, as well as the natural sciences such as forestry, ecology, and Earth science. For this reason, GIS and
location intelligence In business intelligence, location intelligence (LI), or spatial intelligence, is the process of deriving meaningful insight from geospatial data relationships to solve a particular problem. It involves layering multiple data sets spatially and/o ...
applications are at the foundation of location-enabled services, which rely on geographic analysis and visualization. GIS provides the ability to relate previously unrelated information, through the use of location as the "key index variable". Locations and extents that are found in the Earth's
spacetime In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three dimensions of space and the one dimension of time into a single four-dimensional continuum. Spacetime diagrams are useful in visualiz ...
are able to be recorded through the date and time of occurrence, along with x, y, and z
coordinate In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...
s; representing,
longitude Longitude (, ) is a geographic coordinate that specifies the east- west position of a point on the surface of the Earth, or another celestial body. It is an angular measurement, usually expressed in degrees and denoted by the Greek lett ...
(''x''),
latitude In geography, latitude is a geographic coordinate system, geographic coordinate that specifies the north-south position of a point on the surface of the Earth or another celestial body. Latitude is given as an angle that ranges from −90° at t ...
(''y''), and
elevation The elevation of a geographic location (geography), ''location'' is its height above or below a fixed reference point, most commonly a reference geoid, a mathematical model of the Earth's sea level as an equipotential gravitational equipotenti ...
(''z''). All Earth-based, spatial–temporal, location and extent references should be relatable to one another, and ultimately, to a "real" physical location or extent. This key characteristic of GIS has begun to open new avenues of scientific inquiry and studies.


History and development

While digital GIS dates to the mid-1960s, when
Roger Tomlinson Roger Frank Tomlinson (17 November 1933 – 7 February 2014) was an English-Canadian geographer and the primary originator of modern geographic information systems (GIS), and has been acknowledged as the "father of GIS." Biography Roger T ...
first coined the phrase "geographic information system", many of the geographic concepts and methods that GIS automates date back decades earlier. One of the first known instances in which spatial analysis was used came from the field of
epidemiology Epidemiology is the study and analysis of the distribution (who, when, and where), patterns and Risk factor (epidemiology), determinants of health and disease conditions in a defined population, and application of this knowledge to prevent dise ...
in the (1832). French
cartographer Cartography (; from , 'papyrus, sheet of paper, map'; and , 'write') is the study and practice of making and using maps. Combining science, aesthetics and technique, cartography builds on the premise that reality (or an imagined reality) can ...
and geographer
Charles Picquet Charles is a masculine given name predominantly found in English and French speaking countries. It is from the French form ''Charles'' of the Proto-Germanic name (in runic alphabet) or ''*karilaz'' (in Latin alphabet), whose meaning was ...
created a map outlining the forty-eight districts in Paris, using
halftone Halftone is the reprographic technique that simulates continuous tone, continuous-tone imagery through the use of dots, varying either in size or in spacing, thus generating a gradient-like effect.Campbell, Alastair. ''The Designer's Lexicon''. ...
color gradients, to provide a visual representation for the number of reported deaths due to
cholera Cholera () is an infection of the small intestine by some Strain (biology), strains of the Bacteria, bacterium ''Vibrio cholerae''. Symptoms may range from none, to mild, to severe. The classic symptom is large amounts of watery diarrhea last ...
per every 1,000 inhabitants. In 1854,
John Snow John Snow (15 March 1813 – 16 June 1858) was an English physician and a leader in the development of anaesthesia and medical hygiene. He is considered one of the founders of modern epidemiology and early germ theory, in part because of hi ...
, an epidemiologist and physician, was able to determine the source of a cholera outbreak in London through the use of spatial analysis. Snow achieved this through plotting the residence of each casualty on a map of the area, as well as the nearby water sources. Once these points were marked, he was able to identify the water source within the cluster that was responsible for the outbreak. This was one of the earliest successful uses of a geographic methodology in pinpointing the source of an outbreak in epidemiology. While the basic elements of
topography Topography is the study of the forms and features of land surfaces. The topography of an area may refer to the landforms and features themselves, or a description or depiction in maps. Topography is a field of geoscience and planetary sci ...
and theme existed previously in
cartography Cartography (; from , 'papyrus, sheet of paper, map'; and , 'write') is the study and practice of making and using maps. Combining science, aesthetics and technique, cartography builds on the premise that reality (or an imagined reality) can ...
, Snow's map was unique due to his use of cartographic methods, not only to depict, but also to analyze clusters of geographically dependent phenomena. The early 20th century saw the development of
photozincography Photozincography, sometimes referred to as heliozincography but essentially the same process, known commercially as zinco, is the photographic process developed by Sir Henry James FRS (1803–1877) in the mid-nineteenth century. This method ...
, which allowed maps to be split into layers, for example one layer for vegetation and another for water. This was particularly used for printing contours – drawing these was a labour-intensive task but having them on a separate layer meant they could be worked on without the other layers to confuse the draughtsman. This work was initially drawn on glass plates, but later
plastic film Plastic film is a thin continuous polymeric material. Thicker plastic material is often called a "sheet". These thin plastic membranes are used to separate areas or volumes, to hold items, to act as barriers, or as printable surfaces. Plast ...
was introduced, with the advantages of being lighter, using less storage space and being less brittle, among others. When all the layers were finished, they were combined into one image using a large process camera. Once color printing came in, the layers idea was also used for creating separate printing plates for each color. While the use of layers much later became one of the typical features of a contemporary GIS, the photographic process just described is not considered a GIS in itself – as the maps were just images with no database to link them to. Two additional developments are notable in the early days of GIS: Ian McHarg's publication ''Design with Nature'' and its map overlay method and the introduction of a street network into the U.S. Census Bureau's DIME ( Dual Independent Map Encoding) system. The first publication detailing the use of computers to facilitate cartography was written by
Waldo Tobler Waldo Rudolph Tobler (November 16, 1930 – February 20, 2018) was an United States, American-Switzerland, Swiss geographer and cartographer. Tobler is regarded as one of the most influential geographers and cartographers of the late 20th centur ...
in 1959. Further
computer hardware Computer hardware includes the physical parts of a computer, such as the central processing unit (CPU), random-access memory (RAM), motherboard, computer data storage, graphics card, sound card, and computer case. It includes external devices ...
development spurred by
nuclear weapon A nuclear weapon is an explosive device that derives its destructive force from nuclear reactions, either fission (fission or atomic bomb) or a combination of fission and fusion reactions (thermonuclear weapon), producing a nuclear exp ...
research led to more widespread general-purpose computer "mapping" applications by the early 1960s. In 1963, the world's first true operational GIS was developed in
Ottawa, Ontario Ottawa is the capital city of Canada. It is located in the southern Ontario, southern portion of the province of Ontario, at the confluence of the Ottawa River and the Rideau River. Ottawa borders Gatineau, Gatineau, Quebec, and forms the cor ...
, Canada, by the federal Department of Forestry and Rural Development. Developed by
Roger Tomlinson Roger Frank Tomlinson (17 November 1933 – 7 February 2014) was an English-Canadian geographer and the primary originator of modern geographic information systems (GIS), and has been acknowledged as the "father of GIS." Biography Roger T ...
, it was called the Canada Geographic Information System (CGIS) and was used to store, analyze, and manipulate data collected for the Canada Land Inventory, an effort to determine the land capability for rural Canada by mapping information about
soil Soil, also commonly referred to as earth, is a mixture of organic matter, minerals, gases, water, and organisms that together support the life of plants and soil organisms. Some scientific definitions distinguish dirt from ''soil'' by re ...
s, agriculture, recreation, wildlife,
waterfowl Anseriformes is an order of birds also known as waterfowl that comprises about 180 living species of birds in three families: Anhimidae (three species of screamers), Anseranatidae (the magpie goose), and Anatidae, the largest family, which i ...
,
forestry Forestry is the science and craft of creating, managing, planting, using, conserving and repairing forests and woodlands for associated resources for human and Natural environment, environmental benefits. Forestry is practiced in plantations and ...
and land use at a scale of 1:50,000. A rating classification factor was also added to permit analysis. CGIS was an improvement over "computer mapping" applications as it provided capabilities for data storage, overlay, measurement, and
digitizing Digitization is the process of converting information into a digital (i.e. computer-readable) format.Collins Dictionary. (n.d.). Definition of 'digitize'. Retrieved December 15, 2021, from https://www.collinsdictionary.com/dictionary/english ...
/scanning. It supported a national coordinate system that spanned the continent, coded lines as arcs having a true embedded
topology Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...
and it stored the attribute and locational information in separate files. As a result of this, Tomlinson has become known as the "father of GIS", particularly for his use of overlays in promoting the spatial analysis of convergent geographic data. CGIS lasted into the 1990s and built a large digital land resource database in Canada. It was developed as a
mainframe A mainframe computer, informally called a mainframe or big iron, is a computer used primarily by large organizations for critical applications like bulk data processing for tasks such as censuses, industry and consumer statistics, enterpris ...
-based system in support of federal and provincial resource planning and management. Its strength was continent-wide analysis of complex
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s. The CGIS was never available commercially. In 1964, Howard T. Fisher formed the Laboratory for Computer Graphics and Spatial Analysis at the
Harvard Graduate School of Design The Harvard Graduate School of Design (GSD) is the graduate school of design at Harvard University, a private research university in Cambridge, Massachusetts. It offers master's and doctoral programs in architecture, landscape architecture, urba ...
(LCGSA 1965–1991), where a number of important theoretical concepts in spatial data handling were developed, and which by the 1970s had distributed seminal software code and systems, such as SYMAP, GRID, and ODYSSEY, to universities, research centers and corporations worldwide. These programs were the first examples of general-purpose GIS software that was not developed for a particular installation, and was very influential on future commercial software, such as
Esri Environmental Systems Research Institute, Inc., doing business as Esri (), is an American Multinational corporation, multinational geographic information system (GIS) software company headquartered in Redlands, California. It is best known for ...
ARC/INFO, released in 1983. Working in the Harvard Lab, Tom Waugh developed his vector-based Geographic Information Mapping and Manipulation System (GIMMS) software from 1969. He returned to the
University of Edinburgh The University of Edinburgh (, ; abbreviated as ''Edin.'' in Post-nominal letters, post-nominals) is a Public university, public research university based in Edinburgh, Scotland. Founded by the City of Edinburgh Council, town council under th ...
and this software was sold commercially from 1973. By 1977 it was used at 300 sites worldwide. This can be considered the first globally-used GIS which anticipated some key characteristics of the Harvard Odyssey system by nearly five years and ARC/INFO by a decade . By the late 1970s, two public domain GIS systems (
MOSS Mosses are small, non-vascular plant, non-vascular flowerless plants in the taxonomic phylum, division Bryophyta (, ) ''sensu stricto''. Bryophyta (''sensu lato'', Wilhelm Philippe Schimper, Schimp. 1879) may also refer to the parent group bryo ...
and
GRASS GIS ''Geographic Resources Analysis Support System'' (commonly termed ''GRASS GIS'') is a geographic information system (GIS) software suite used for geospatial data management and analysis, image processing, producing graphics and maps, spatial and ...
) were in development, and by the early 1980s, M&S Computing (later
Intergraph Intergraph Corporation was an American software development and services company, which now forms part of Hexagon AB. It provides enterprise engineering and geospatially powered software to businesses, governments, and organizations around the w ...
) along with Bentley Systems Incorporated for the CAD platform, Environmental Systems Research Institute (
ESRI Environmental Systems Research Institute, Inc., doing business as Esri (), is an American Multinational corporation, multinational geographic information system (GIS) software company headquartered in Redlands, California. It is best known for ...
), CARIS (Computer Aided Resource Information System), and ERDAS (Earth Resource Data Analysis System) emerged as commercial vendors of GIS software, successfully incorporating many of the CGIS features, combining the first-generation approach to separation of spatial and attribute information with a second-generation approach to organizing attribute data into database structures. In 1986, Mapping Display and Analysis System (MIDAS), the first desktop GIS product, was released for the
DOS DOS (, ) is a family of disk-based operating systems for IBM PC compatible computers. The DOS family primarily consists of IBM PC DOS and a rebranded version, Microsoft's MS-DOS, both of which were introduced in 1981. Later compatible syste ...
operating system. This was renamed in 1990 to MapInfo for Windows when it was ported to the
Microsoft Windows Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
platform. This began the process of moving GIS from the research department into the business environment. By the end of the 20th century, the rapid growth in various systems had been consolidated and standardized on relatively few platforms and users were beginning to explore viewing GIS data over the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
, requiring data format and transfer standards. More recently, a growing number of free, open-source GIS packages run on a range of operating systems and can be customized to perform specific tasks. The major trend of the
21st century The 21st century is the current century in the ''Anno Domini'' or Common Era, in accordance with the Gregorian calendar. It began on 1 January 2001, and will end on 31 December 2100. It is the first century of the 3rd millennium. The rise of a ...
has been the integration of GIS capabilities with other
Information technology Information technology (IT) is a set of related fields within information and communications technology (ICT), that encompass computer systems, software, programming languages, data processing, data and information processing, and storage. Inf ...
and
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
infrastructure, such as
relational database A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
s,
cloud computing Cloud computing is "a paradigm for enabling network access to a scalable and elastic pool of shareable physical or virtual resources with self-service provisioning and administration on-demand," according to International Organization for ...
,
software as a service Software as a service (SaaS ) is a cloud computing service model where the provider offers use of application software to a client and manages all needed physical and software resources. SaaS is usually accessed via a web application. Unlike o ...
(SAAS), and
mobile computing Mobile computing is human–computer interaction in which a computer is expected to be transported during normal usage and allow for transmission of data, which can include voice and video transmissions. Mobile computing involves mobile commun ...
.


GIS software

The distinction must be made between a singular ''geographic information system'', which is a single installation of software and data for a particular use, along with associated hardware, staff, and institutions (e.g., the GIS for a particular city government); and ''
GIS software A GIS software program is a computer program to support the use of a geographic information system, providing the ability to create, store, manage, query, Spatial analysis, analyze, and Cartographic design, visualize Geographic data and informati ...
'', a general-purpose
application program Application software is any computer program that is intended for end-user use not operating, administering or programming the computer. An application (app, application program, software application) is any program that can be categorized as ...
that is intended to be used in many individual geographic information systems in a variety of application domains. Starting in the late 1970s, many software packages have been created specifically for GIS applications. Esri's
ArcGIS ArcGIS is a family of client, server and online geographic information system (GIS) software developed and maintained by Esri. ArcGIS was first released in 1982 as ARC/INFO, a command line-based GIS. ARC/INFO was later merged into ArcGIS De ...
, which includes
ArcGIS Pro ArcGIS Pro is desktop GIS software developed by Esri Environmental Systems Research Institute, Inc., doing business as Esri (), is an American Multinational corporation, multinational geographic information system (GIS) software company headq ...
and the legacy software
ArcMap ArcMap is the former main component of Esri's ArcGIS suite of geospatial processing programs. Used primarily to view, edit, create, and analyze geospatial data. ArcMap allows the user to explore data within a data set, symbolize features accordi ...
, currently dominates the GIS market. Other examples of GIS include
Autodesk Autodesk, Inc. is an American multinational software corporation that provides software products and services for the architecture, engineering, construction, manufacturing, media, education, and entertainment industries. Autodesk is headquarte ...
and
MapInfo Professional MapInfo Pro is a desktop geographic information system (GIS) software developed by Precisely, used for mapping and location analysis. It was formerly developed by Pitney Bowes Software and the MapInfo Corporation. History Version 4 of the p ...
and open-source programs such as
QGIS QGIS is a geographic information system (GIS) software that is free and open-source. QGIS supports Windows, macOS, and Linux. It supports viewing, editing, printing, and analysis of geospatial data in a range of data formats. Its name comes from ...
,
GRASS GIS ''Geographic Resources Analysis Support System'' (commonly termed ''GRASS GIS'') is a geographic information system (GIS) software suite used for geospatial data management and analysis, image processing, producing graphics and maps, spatial and ...
, MapGuide, and Hadoop-GIS. These and other desktop GIS applications include a full suite of capabilities for entering, managing, analyzing, and visualizing geographic data, and are designed to be used on their own. Starting in the late 1990s with the emergence of the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
, as computer network technology progressed, GIS infrastructure and data began to move to
server Server may refer to: Computing *Server (computing), a computer program or a device that provides requested information for other programs or devices, called clients. Role * Waiting staff, those who work at a restaurant or a bar attending custome ...
s, providing another mechanism for providing GIS capabilities. This was facilitated by standalone software installed on a server, similar to other server software such as HTTP servers and
relational database management system A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
s, enabling clients to have access to GIS data and processing tools without having to install specialized desktop software. These networks are known as distributed GIS. This strategy has been extended through the Internet and development of
cloud-based Cloud computing is "a paradigm for enabling network access to a scalable and elastic pool of shareable physical or virtual resources with self-service provisioning and administration on-demand," according to International Organization for ...
GIS platforms such as ArcGIS Online and GIS-specialized
software as a service Software as a service (SaaS ) is a cloud computing service model where the provider offers use of application software to a client and manages all needed physical and software resources. SaaS is usually accessed via a web application. Unlike o ...
(SAAS). The use of the Internet to facilitate distributed GIS is known as Internet GIS. An alternative approach is the integration of some or all of these capabilities into other software or
information technology Information technology (IT) is a set of related fields within information and communications technology (ICT), that encompass computer systems, software, programming languages, data processing, data and information processing, and storage. Inf ...
architectures. One example is a spatial extension to Object-relational database software, which defines a geometry datatype so that spatial data can be stored in relational tables, and extensions to
SQL Structured Query Language (SQL) (pronounced ''S-Q-L''; or alternatively as "sequel") is a domain-specific language used to manage data, especially in a relational database management system (RDBMS). It is particularly useful in handling s ...
for spatial analysis operations such as overlay. Another example is the proliferation of geospatial libraries and
application programming interface An application programming interface (API) is a connection between computers or between computer programs. It is a type of software Interface (computing), interface, offering a service to other pieces of software. A document or standard that des ...
s (e.g.,
GDAL The Geospatial Data Abstraction Library (GDAL) is a computer software library for reading and writing raster and vector geospatial data formats (e.g. shapefile), and is released under the permissive X/MIT style free software license by the ...
, Leaflet,
D3.js D3.js (also known as D3, short for Data-Driven Documents) is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics (SVG), HTML5, and Cascading Style Sheets (CSS) stan ...
) that extend programming languages to enable the incorporation of GIS data and processing into custom software, including
web mapping Web mapping or an online mapping is the process of using, creating, and distributing maps on the World Wide Web (the Web), usually through the use of Web GIS, Web geographic information systems (Web GIS). A web map or an online map is both served ...
sites and
location-based service Location-based service (LBS) is a general term denoting software service (economics), services which use geographic data and information to provide services or information to users. LBS can be used in a variety of contexts, such as health, indoor ...
s in
smartphone A smartphone is a mobile phone with advanced computing capabilities. It typically has a touchscreen interface, allowing users to access a wide range of applications and services, such as web browsing, email, and social media, as well as multi ...
s.


Geospatial data management

The core of any GIS is a
database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
that contains representations of geographic phenomena, modeling their ''geometry'' (location and shape) and their ''properties'' or ''attributes''. A GIS database may be stored in a variety of forms, such as a collection of separate data files or a single spatially-enabled
relational database A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
. Collecting and managing these data usually constitutes the bulk of the time and financial resources of a project, far more than other aspects such as analysis and mapping.


Aspects of geographic data

GIS uses spatio-temporal (
space-time In physics, spacetime, also called the space-time continuum, is a mathematical model that fuses the three-dimensional space, three dimensions of space and the one dimension of time into a single four-dimensional continuum (measurement), continu ...
) location as the key index variable for all other information. Just as a relational database containing text or numbers can relate many different tables using common key index variables, GIS can relate otherwise unrelated information by using location as the key index variable. The key is the location and/or extent in space-time. Any variable that can be located spatially, and increasingly also temporally, can be referenced using a GIS. Locations or extents in Earth space–time may be recorded as dates/times of occurrence, and x, y, and z
coordinate In geometry, a coordinate system is a system that uses one or more numbers, or coordinates, to uniquely determine and standardize the position of the points or other geometric elements on a manifold such as Euclidean space. The coordinates are ...
s representing,
longitude Longitude (, ) is a geographic coordinate that specifies the east- west position of a point on the surface of the Earth, or another celestial body. It is an angular measurement, usually expressed in degrees and denoted by the Greek lett ...
,
latitude In geography, latitude is a geographic coordinate system, geographic coordinate that specifies the north-south position of a point on the surface of the Earth or another celestial body. Latitude is given as an angle that ranges from −90° at t ...
, and
elevation The elevation of a geographic location (geography), ''location'' is its height above or below a fixed reference point, most commonly a reference geoid, a mathematical model of the Earth's sea level as an equipotential gravitational equipotenti ...
, respectively. These GIS coordinates may represent other quantified systems of temporo-spatial reference (for example, film frame number, stream gage station, highway mile-marker, surveyor benchmark, building address, street intersection, entrance gate, water depth sounding, POS or CAD drawing origin/units). Units applied to recorded temporal-spatial data can vary widely (even when using exactly the same data, see
map projection In cartography, a map projection is any of a broad set of Transformation (function) , transformations employed to represent the curved two-dimensional Surface (mathematics), surface of a globe on a Plane (mathematics), plane. In a map projection, ...
s), but all Earth-based spatial–temporal location and extent references should, ideally, be relatable to one another and ultimately to a "real" physical location or extent in space–time. Related by accurate spatial information, an incredible variety of real-world and projected past or future data can be analyzed, interpreted and represented. This key characteristic of GIS has begun to open new avenues of scientific inquiry into behaviors and patterns of real-world information that previously had not been systematically
correlated In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistic ...
.


Data modeling

GIS data represents phenomena that exist in the real world, such as roads, land use, elevation, trees, waterways, and states. The most common types of phenomena that are represented in data can be divided into two conceptualizations: discrete objects (e.g., a house, a road) and continuous fields (e.g., rainfall amount or population density). Other types of geographic phenomena, such as events (e.g., location of
World War II World War II or the Second World War (1 September 1939 – 2 September 1945) was a World war, global conflict between two coalitions: the Allies of World War II, Allies and the Axis powers. World War II by country, Nearly all of the wo ...
battles), processes (e.g., extent of
suburbanization Suburbanization (American English), also spelled suburbanisation (British English), is a population shift from historic core cities or rural areas into suburbs. Most suburbs are built in a formation of (sub)urban sprawl. As a consequence ...
), and masses (e.g., types of
soil Soil, also commonly referred to as earth, is a mixture of organic matter, minerals, gases, water, and organisms that together support the life of plants and soil organisms. Some scientific definitions distinguish dirt from ''soil'' by re ...
in an area) are represented less commonly or indirectly, or are modeled in analysis procedures rather than data. Traditionally, there are two broad methods used to store data in a GIS for both kinds of abstractions mapping references: raster images and
vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...
. Points, lines, and polygons represent vector data of mapped location attribute references. A new hybrid method of storing data is that of identifying point clouds, which combine three-dimensional points with
RGB The RGB color model is an additive color model in which the red, green, and blue primary colors of light are added together in various ways to reproduce a broad array of colors. The name of the model comes from the initials of the three ...
information at each point, returning a 3D color image. GIS thematic maps then are becoming more and more realistically visually descriptive of what they set out to show or determine.


Data acquisition

GIS data acquisition includes several methods for gathering spatial data into a GIS database, which can be grouped into three categories: ''primary data capture'', the direct measurement phenomena in the field (e.g.,
remote sensing Remote sensing is the acquisition of information about an physical object, object or phenomenon without making physical contact with the object, in contrast to in situ or on-site observation. The term is applied especially to acquiring inform ...
, the
global positioning system The Global Positioning System (GPS) is a satellite-based hyperbolic navigation system owned by the United States Space Force and operated by Mission Delta 31. It is one of the global navigation satellite systems (GNSS) that provide ge ...
); ''secondary data capture'', the extraction of information from existing sources that are not in a GIS form, such as paper maps, through
digitization Digitization is the process of converting information into a digital (i.e. computer-readable) format.Collins Dictionary. (n.d.). Definition of 'digitize'. Retrieved December 15, 2021, from https://www.collinsdictionary.com/dictionary/english/ ...
; and ''
data transfer Data communication, including data transmission and data reception, is the transfer of data, transmitted and received over a point-to-point or point-to-multipoint communication channel. Examples of such channels are copper wires, optical ...
'', the copying of existing GIS data from external sources such as government agencies and private companies. All of these methods can consume significant time, finances, and other resources.


Primary data capture

Survey data can be directly entered into a GIS from digital data collection systems on survey instruments using a technique called
coordinate geometry In mathematics, analytic geometry, also known as coordinate geometry or Cartesian geometry, is the study of geometry using a coordinate system. This contrasts with synthetic geometry. Analytic geometry is used in physics and engineering, and als ...
(COGO). Positions from a global navigation satellite system (
GNSS A satellite navigation or satnav system is a system that uses satellites to provide autonomous geopositioning. A satellite navigation system with global coverage is termed global navigation satellite system (GNSS). , four global systems are op ...
) like the
Global Positioning System The Global Positioning System (GPS) is a satellite-based hyperbolic navigation system owned by the United States Space Force and operated by Mission Delta 31. It is one of the global navigation satellite systems (GNSS) that provide ge ...
can also be collected and then imported into a GIS. A current trend in data collection gives users the ability to utilize field computers with the ability to edit live data using wireless connections or disconnected editing sessions. The current trend is to utilize applications available on smartphones and PDAs in the form of mobile GIS. This has been enhanced by the availability of low-cost mapping-grade GPS units with decimeter accuracy in real time. This eliminates the need to post process, import, and update the data in the office after fieldwork has been collected. This includes the ability to incorporate positions collected using a
laser rangefinder A laser rangefinder, also known as a laser telemeter or laser distance meter, is a rangefinder that uses a laser beam to determine the distance to an object. The most common form of laser rangefinder operates on the time of flight principle by ...
. New technologies also allow users to create maps as well as analysis directly in the field, making projects more efficient and mapping more accurate. Remotely sensed data also plays an important role in data collection and consist of sensors attached to a platform. Sensors include cameras, digital scanners and
lidar Lidar (, also LIDAR, an acronym of "light detection and ranging" or "laser imaging, detection, and ranging") is a method for determining ranging, ranges by targeting an object or a surface with a laser and measuring the time for the reflected li ...
, while platforms usually consist of aircraft and
satellite A satellite or an artificial satellite is an object, typically a spacecraft, placed into orbit around a celestial body. They have a variety of uses, including communication relay, weather forecasting, navigation ( GPS), broadcasting, scient ...
s. In England in the mid-1990s, hybrid kite/balloons called helikites first pioneered the use of compact airborne digital cameras as airborne geo-information systems. Aircraft measurement software, accurate to 0.4 mm, was used to link the photographs and measure the ground. Helikites are inexpensive and gather more accurate data than aircraft. Helikites can be used over roads, railways and towns where
unmanned aerial vehicle An unmanned aerial vehicle (UAV) or unmanned aircraft system (UAS), commonly known as a drone, is an aircraft with no human pilot, crew, or passengers onboard, but rather is controlled remotely or is autonomous.De Gruyter Handbook of Dron ...
s (UAVs) are banned. Recently, aerial data collection has become more accessible with miniature UAVs and drones. For example, the Aeryon Scout was used to map a 50-acre area with a
ground sample distance In remote sensing, ground sample distance (GSD) in a digital photo of the ground from air or space is the distance between pixel centers measured on the ground. For example, in an image with a one-meter GSD, adjacent pixels image locations are 1 me ...
of in only 12 minutes. The majority of digital data currently comes from
photo interpretation A photograph (also known as a photo, or more generically referred to as an ''image'' or ''picture'') is an image created by light falling on a photosensitive surface, usually photographic film or an electronic image sensor. The process and pra ...
of aerial photographs. Soft-copy workstations are used to digitize features directly from stereo pairs of digital photographs. These systems allow data to be captured in two and three dimensions, with elevations measured directly from a stereo pair using principles of
photogrammetry Photogrammetry is the science and technology of obtaining reliable information about physical objects and the environment through the process of recording, measuring and interpreting photographic images and patterns of electromagnetic radiant ima ...
. Analog aerial photos must be scanned before being entered into a soft-copy system, for high-quality digital cameras this step is skipped. Satellite
remote sensing Remote sensing is the acquisition of information about an physical object, object or phenomenon without making physical contact with the object, in contrast to in situ or on-site observation. The term is applied especially to acquiring inform ...
provides another important source of spatial data. Here satellites use different sensor packages to passively measure the reflectance from parts of the
electromagnetic spectrum The electromagnetic spectrum is the full range of electromagnetic radiation, organized by frequency or wavelength. The spectrum is divided into separate bands, with different names for the electromagnetic waves within each band. From low to high ...
or radio waves that were sent out from an active sensor such as radar. Remote sensing collects raster data that can be further processed using different bands to identify objects and classes of interest, such as land cover.


Secondary data capture

The most common method of data creation is
digitization Digitization is the process of converting information into a digital (i.e. computer-readable) format.Collins Dictionary. (n.d.). Definition of 'digitize'. Retrieved December 15, 2021, from https://www.collinsdictionary.com/dictionary/english/ ...
, where a
hard copy In information handling, the U.S. Federal Standard 1037C (Glossary of Telecommunication Terms) defines a hard copy as a permanent reproduction, or copy, in the form of a physical object, of any media suitable for direct use by a person (in par ...
map or survey plan is transferred into a digital medium through the use of a CAD program, and geo-referencing capabilities. With the wide availability of ortho-rectified imagery (from satellites, aircraft, Helikites and UAVs), heads-up digitizing is becoming the main avenue through which geographic data is extracted. Heads-up digitizing involves the tracing of geographic data directly on top of the aerial imagery instead of by the traditional method of tracing the geographic form on a separate digitizing tablet (heads-down digitizing). Heads-down digitizing, or manual digitizing, uses a special magnetic pen, or stylus, that feeds information into a computer to create an identical, digital map. Some tablets use a mouse-like tool, called a puck, instead of a stylus. The puck has a small window with cross-hairs which allows for greater precision and pinpointing map features. Though heads-up digitizing is more commonly used, heads-down digitizing is still useful for digitizing maps of poor quality. Existing data printed on paper or PET film maps can be
digitized Digitization is the process of converting information into a digital (i.e. computer-readable) format.Collins Dictionary. (n.d.). Definition of 'digitize'. Retrieved December 15, 2021, from https://www.collinsdictionary.com/dictionary/english ...
or scanned to produce digital data. A digitizer produces
vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...
data as an operator traces points, lines, and polygon boundaries from a map. Scanning a map results in raster data that could be further processed to produce vector data. When data is captured, the user should consider if the data should be captured with either a relative accuracy or absolute accuracy, since this could not only influence how information will be interpreted but also the cost of data capture. After entering data into a GIS, the data usually requires editing, to remove errors, or further processing. For vector data it must be made "topologically correct" before it can be used for some advanced analysis. For example, in a road network, lines must connect with nodes at an intersection. Errors such as undershoots and overshoots must also be removed. For scanned maps, blemishes on the source map may need to be removed from the resulting raster. For example, a fleck of dirt might connect two lines that should not be connected.


Projections, coordinate systems, and registration

The earth can be represented by various models, each of which may provide a different set of coordinates (e.g., latitude, longitude, elevation) for any given point on the Earth's surface. The simplest model is to assume the earth is a perfect sphere. As more measurements of the earth have accumulated, the models of the earth have become more sophisticated and more accurate. In fact, there are models called datums that apply to different areas of the earth to provide increased accuracy, like North American Datum of 1983 for U.S. measurements, and the
World Geodetic System The World Geodetic System (WGS) is a standard used in cartography, geodesy, and satellite navigation including GPS. The current version, WGS 84, defines an Earth-centered, Earth-fixed coordinate system and a geodetic datum, and also describ ...
for worldwide measurements. The latitude and longitude on a map made against a local datum may not be the same as one obtained from a
GPS receiver A satellite navigation (satnav) device or GPS device is a device that uses satellites of the Global Positioning System (GPS) or similar global navigation satellite systems (GNSS). A satnav device can determine the user's geographic coordinat ...
. Converting coordinates from one datum to another requires a datum transformation such as a Helmert transformation, although in certain situations a simple
translation Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
may be sufficient. In popular GIS software, data projected in latitude/longitude is often represented as a
Geographic coordinate system A geographic coordinate system (GCS) is a spherical coordinate system, spherical or geodetic coordinates, geodetic coordinate system for measuring and communicating position (geometry), positions directly on Earth as latitude and longitude. ...
. For example, data in latitude/longitude if the datum is the '
North American Datum The North American Datum (NAD) is the Geodetic datum#Horizontal datum, horizontal datum now used to define the Geodesy, geodetic network in North America. A datum is a formal description of the shape of the Earth along with an "anchor" point fo ...
of 1983' is denoted by 'GCS North American 1983'.


Data quality

While no digital model can be a perfect representation of the real world, it is important that GIS data be of a high quality. In keeping with the principle of
homomorphism In algebra, a homomorphism is a morphism, structure-preserving map (mathematics), map between two algebraic structures of the same type (such as two group (mathematics), groups, two ring (mathematics), rings, or two vector spaces). The word ''homo ...
, the data must be close enough to reality so that the results of GIS procedures correctly correspond to the results of real world processes. This means that there is no single standard for data quality, because the necessary degree of quality depends on the scale and purpose of the tasks for which it is to be used. Several elements of data quality are important to GIS data: ;
Accuracy Accuracy and precision are two measures of ''observational error''. ''Accuracy'' is how close a given set of measurements (observations or readings) are to their ''true value''. ''Precision'' is how close the measurements are to each other. The ...
:The degree of similarity between a represented measurement and the actual value; conversely, ''error'' is the amount of difference between them. In GIS data, there is concern for accuracy in representations of location (''positional accuracy''), property (''attribute accuracy''), and time. For example, the US 2020 Census says that the population of
Houston Houston ( ) is the List of cities in Texas by population, most populous city in the U.S. state of Texas and in the Southern United States. Located in Southeast Texas near Galveston Bay and the Gulf of Mexico, it is the county seat, seat of ...
on April 1, 2020 was 2,304,580; if it was actually 2,310,674, this would be an error and thus a lack of attribute accuracy. ; Precision :The degree of refinement in a represented value. In a quantitative property, this is the number of significant digits in the measured value. An imprecise value is vague or ambiguous, including a range of possible values. For example, if one were to say that the population of Houston on April 1, 2020 was "about 2.3 million," this statement would be imprecise, but likely accurate because the correct value (and many incorrect values) are included. As with accuracy, representations of location, property, and time can all be more or less precise. '' Resolution'' is a commonly used expression of positional precision, especially in raster data sets. Scale is closely related to precision in maps, as it dictates a desirable level of spatial precision, but is problematic in GIS, where a data set can be shown at a variety of display scales (including scales that would not be appropriate for the quality of the data). ;
Uncertainty Uncertainty or incertitude refers to situations involving imperfect or unknown information. It applies to predictions of future events, to physical measurements that are already made, or to the unknown, and is particularly relevant for decision ...
:A general acknowledgement of the presence of error and imprecision in geographic data. That is, it is a degree of general doubt, given that it is difficult to know exactly how much error is present in a data set, although some form of estimate may be attempted (a confidence interval being such an estimate of uncertainty). This is sometimes used as a collective term for all or most aspects of data quality. ; Vagueness or fuzziness :The degree to which an aspect (location, property, or time) of a phenomenon is inherently imprecise, rather than the imprecision being in a measured value. For example, the spatial extent of the
Houston Houston ( ) is the List of cities in Texas by population, most populous city in the U.S. state of Texas and in the Southern United States. Located in Southeast Texas near Galveston Bay and the Gulf of Mexico, it is the county seat, seat of ...
metropolitan area A metropolitan area or metro is a region consisting of a densely populated urban area, urban agglomeration and its surrounding territories which share Industry (economics), industries, commercial areas, Transport infrastructure, transport network ...
is vague, as there are places on the outskirts of the city that are less connected to the central city (measured by activities such as
commuting Commuting is periodically recurring travel between a place of residence and place of work or study, where the traveler, referred to as a commuter, leaves the boundary of their home community. By extension, it can sometimes be any regular o ...
) than places that are closer. Mathematical tools such as
fuzzy set theory Fuzzy or Fuzzies may refer to: Music * Fuzzy (band), a 1990s Boston indie pop band * Fuzzy (composer), Danish composer Jens Vilhelm Pedersen (born 1939) * ''Fuzzy'' (album), 1993 debut album of American rock band Grant Lee Buffalo * "Fuzzy", a ...
are commonly used to manage vagueness in geographic data. ;Completeness :The degree to which a data set represents all of the actual features that it purports to include. For example, if a layer of "roads in
Houston Houston ( ) is the List of cities in Texas by population, most populous city in the U.S. state of Texas and in the Southern United States. Located in Southeast Texas near Galveston Bay and the Gulf of Mexico, it is the county seat, seat of ...
" is missing some actual streets, it is incomplete. ;Currency :The most recent point in time at which a data set claims to be an accurate representation of reality. This is a concern for the majority of GIS applications, which attempt to represent the world "at present," in which case older data is of lower quality. ;
Consistency In deductive logic, a consistent theory is one that does not lead to a logical contradiction. A theory T is consistent if there is no formula \varphi such that both \varphi and its negation \lnot\varphi are elements of the set of consequences ...
:The degree to which the representations of the many phenomena in a data set correctly correspond with each other. Consistency in topological relationships between spatial objects is an especially important aspect of consistency. For example, if all of the lines in a street network were accidentally moved 10 meters to the East, they would be inaccurate but still consistent, because they would still properly connect at each intersection, and
network analysis Network analysis can refer to: * Network theory, the analysis of relations through mathematical graphs ** Social network analysis, network theory applied to social relations * Network analysis (electrical circuits) See also *Network planning and d ...
tools such as shortest path would still give correct results. ;
Propagation of uncertainty In statistics, propagation of uncertainty (or propagation of error) is the effect of variables' uncertainties (or errors, more specifically random errors) on the uncertainty of a function based on them. When the variables are the values of ex ...
:The degree to which the quality of the results of
Spatial analysis Spatial analysis is any of the formal Scientific technique, techniques which study entities using their topological, geometric, or geographic properties, primarily used in Urban design, Urban Design. Spatial analysis includes a variety of techni ...
methods and other processing tools derives from the quality of input data. For example,
interpolation In the mathematics, mathematical field of numerical analysis, interpolation is a type of estimation, a method of constructing (finding) new data points based on the range of a discrete set of known data points. In engineering and science, one ...
is a common operation used in many ways in GIS; because it generates estimates of values between known measurements, the results will always be more precise, but less certain (as each estimate has an unknown amount of error). The quality of a dataset is very dependent upon its sources, and the methods used to create it. Land surveyors have been able to provide a high level of positional accuracy utilizing high-end GPS equipment, but GPS locations on the average smartphone are much less accurate. Common datasets such as digital terrain and aerial imagery are available in a wide variety of levels of quality, especially spatial precision. Paper maps, which have been digitized for many years as a data source, can also be of widely varying quality. A quantitative analysis of maps brings accuracy issues into focus. The electronic and other equipment used to make measurements for GIS is far more precise than the machines of conventional map analysis. All geographical data are inherently inaccurate, and these inaccuracies will propagate through GIS operations in ways that are difficult to predict.


Raster-to-vector translation

Data restructuring can be performed by a GIS to convert data into different formats. For example, a GIS may be used to convert a satellite image map to a vector structure by generating lines around all cells with the same classification, while determining the cell spatial relationships, such as adjacency or inclusion. More advanced data processing can occur with
image processing An image or picture is a visual representation. An image can be two-dimensional, such as a drawing, painting, or photograph, or three-dimensional, such as a carving or sculpture. Images may be displayed through other media, including a pr ...
, a technique developed in the late 1960s by
NASA The National Aeronautics and Space Administration (NASA ) is an independent agencies of the United States government, independent agency of the federal government of the United States, US federal government responsible for the United States ...
and the private sector to provide contrast enhancement, false color rendering and a variety of other techniques including use of two dimensional
Fourier transforms In mathematics, the Fourier transform (FT) is an integral transform that takes a function (mathematics), function as input then outputs another function that describes the extent to which various Frequency, frequencies are present in the origin ...
. Since digital data is collected and stored in various ways, the two data sources may not be entirely compatible. So a GIS must be able to convert geographic data from one structure to another. In so doing, the implicit assumptions behind different ontologies and classifications require analysis. Object ontologies have gained increasing prominence as a consequence of
object-oriented programming Object-oriented programming (OOP) is a programming paradigm based on the concept of '' objects''. Objects can contain data (called fields, attributes or properties) and have actions they can perform (called procedures or methods and impl ...
and sustained work by Barry Smith and co-workers.


Spatial ETL

Spatial ETL tools provide the data processing functionality of traditional
extract, transform, load Extract, transform, load (ETL) is a three-phase computing process where data is ''extracted'' from an input source, ''transformed'' (including cleaning), and ''loaded'' into an output data container. The data can be collected from one or mor ...
 (ETL) software, but with a primary focus on the ability to manage spatial data. They provide GIS users with the ability to translate data between different standards and proprietary formats, whilst geometrically transforming the data en route. These tools can come in the form of add-ins to existing wider-purpose software such as
spreadsheet A spreadsheet is a computer application for computation, organization, analysis and storage of data in tabular form. Spreadsheets were developed as computerized analogs of paper accounting worksheets. The program operates on data entered in c ...
s.


Spatial analysis

GIS spatial analysis is a rapidly changing field, and GIS packages are increasingly including analytical tools as standard built-in facilities, as optional toolsets, as add-ins or 'analysts'. In many instances these are provided by the original software suppliers (commercial vendors or collaborative non commercial development teams), while in other cases facilities have been developed and are provided by third parties. Furthermore, many products offer software development kits (SDKs), programming languages and language support, scripting facilities and/or special interfaces for developing one's own analytical tools or variants. The increased availability has created a new dimension to
business intelligence Business intelligence (BI) consists of strategies, methodologies, and technologies used by enterprises for data analysis and management of business information. Common functions of BI technologies include Financial reporting, reporting, online an ...
termed " spatial intelligence" which, when openly delivered via intranet, democratizes access to geographic and social network data.
Geospatial intelligence In the United States, geospatial intelligence (GEOINT) is intelligence about the human activity on Earth derived from the exploitation and analysis of imagery, signals, or signatures with geospatial information. GEOINT describes, assesses, and visu ...
, based on GIS spatial analysis, has also become a key element for security. GIS as a whole can be described as conversion to a vectorial representation or to any other digitisation process. Geoprocessing is a GIS operation used to manipulate spatial data. A typical geoprocessing operation takes an input
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
, performs an operation on that dataset, and returns the result of the operation as an output dataset. Common geoprocessing operations include geographic feature overlay, feature selection and analysis,
topology Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...
processing, raster processing, and data conversion. Geoprocessing allows for definition, management, and analysis of information used to form decisions.


Terrain analysis

Many geographic tasks involve the
terrain Terrain (), alternatively relief or topographical relief, is the dimension and shape of a given surface of land. In physical geography, terrain is the lay of the land. This is usually expressed in terms of the elevation, slope, and orientati ...
, the shape of the surface of the earth, such as
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...
, earthworks, and
biogeography Biogeography is the study of the species distribution, distribution of species and ecosystems in geography, geographic space and through evolutionary history of life, geological time. Organisms and biological community (ecology), communities o ...
. Thus, terrain data is often a core dataset in a GIS, usually in the form of a raster
Digital elevation model A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, Natural satellite, moon, or asteroid. A "global DEM" refer ...
(DEM) or a Triangulated irregular network (TIN). A variety of tools are available in most GIS software for analyzing terrain, often by creating derivative datasets that represent a specific aspect of the surface. Some of the most common include: * Slope or grade is the steepness or gradient of a unit of terrain, usually measured as an angle in degrees or as a percentage. * Aspect can be defined as the direction in which a unit of terrain faces. Aspect is usually expressed in degrees from north. * Cut and fill is a computation of the difference between the surface before and after an excavation project to estimate costs. * Hydrological modeling can provide a spatial element that other hydrological models lack, with the analysis of variables such as slope, aspect and watershed or
catchment area A catchment area in human geography, is the area from which a location, such as a city, service or institution, attracts a population that uses its services and economic opportunities. Catchment areas may be defined based on from where people are ...
. Terrain analysis is fundamental to hydrology, since water always flows down a slope. As basic terrain analysis of a
digital elevation model A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, Natural satellite, moon, or asteroid. A "global DEM" refer ...
(DEM) involves calculation of slope and aspect, DEMs are very useful for hydrological analysis. Slope and aspect can then be used to determine direction of
surface runoff Surface runoff (also known as overland flow or terrestrial runoff) is the unconfined flow of water over the ground surface, in contrast to ''channel runoff'' (or ''stream flow''). It occurs when excess rainwater, stormwater, meltwater, or other ...
, and hence flow accumulation for the formation of streams, rivers and lakes. Areas of divergent flow can also give a clear indication of the boundaries of a catchment. Once a flow direction and accumulation matrix has been created, queries can be performed that show contributing or dispersal areas at a certain point. More detail can be added to the model, such as terrain roughness, vegetation types and soil types, which can influence infiltration and evapotranspiration rates, and hence influencing surface flow. One of the main uses of hydrological modeling is in environmental contamination research. Other applications of hydrological modeling include groundwater and surface water mapping, as well as flood risk maps. * Viewshed analysis predicts the impact that terrain has on the visibility between locations, which is especially important for wireless communications. *
Shaded relief Terrain cartography or relief mapping is the depiction of the shape of the surface of the Earth on a map, using one or more of several techniques that have been developed. Terrain or relief is an essential aspect of physical geography, and as su ...
is a depiction of the surface as if it were a three dimensional model lit from a given direction, which is very commonly used in maps. Most of these are generated using algorithms that are discrete simplifications of
vector calculus Vector calculus or vector analysis is a branch of mathematics concerned with the differentiation and integration of vector fields, primarily in three-dimensional Euclidean space, \mathbb^3. The term ''vector calculus'' is sometimes used as a ...
. Slope, aspect, and surface curvature in terrain analysis are all derived from neighborhood operations using elevation values of a cell's adjacent neighbours. Each of these is strongly affected by the level of detail in the terrain data, such as the resolution of a DEM, which should be chosen carefully.


Proximity analysis

Distance is a key part of solving many geographic tasks, usually due to the friction of distance. Thus, a wide variety of analysis tools have analyze distance in some form, such as buffers, Voronoi or Thiessen polygons,
Cost distance analysis In spatial analysis and geographic information systems, cost distance analysis or cost path analysis is a method for determining one or more optimal routes of travel through unconstrained (two-dimensional) space.de Smith, Michael, Paul Longley, M ...
, and
network analysis Network analysis can refer to: * Network theory, the analysis of relations through mathematical graphs ** Social network analysis, network theory applied to social relations * Network analysis (electrical circuits) See also *Network planning and d ...
.


Data analysis

It is difficult to relate
wetlands A wetland is a distinct semi-aquatic ecosystem whose groundcovers are flooded or saturated in water, either permanently, for years or decades, or only seasonally. Flooding results in oxygen-poor ( anoxic) processes taking place, especially ...
maps to
rainfall Rain is a form of precipitation where water droplets that have condensed from atmospheric water vapor fall under gravity. Rain is a major component of the water cycle and is responsible for depositing most of the fresh water on the Earth. ...
amounts recorded at different points such as airports, television stations, and schools. A GIS, however, can be used to depict two- and three-dimensional characteristics of the Earth's surface, subsurface, and atmosphere from information points. For example, a GIS can quickly generate a map with isopleth or
contour line A contour line (also isoline, isopleth, isoquant or isarithm) of a Function of several real variables, function of two variables is a curve along which the function has a constant value, so that the curve joins points of equal value. It is a ...
s that indicate differing amounts of rainfall. Such a map can be thought of as a rainfall contour map. Many sophisticated methods can estimate the characteristics of surfaces from a limited number of point measurements. A two-dimensional contour map created from the surface modeling of rainfall point measurements may be overlaid and analyzed with any other map in a GIS covering the same area. This GIS derived map can then provide additional information - such as the viability of
water power Hydropower (from Ancient Greek -, "water"), also known as water power or water energy, is the use of falling or fast-running water to produce electricity or to power machines. This is achieved by converting the gravitational potential or kin ...
potential as a
renewable energy Renewable energy (also called green energy) is energy made from renewable resource, renewable natural resources that are replenished on a human lifetime, human timescale. The most widely used renewable energy types are solar energy, wind pow ...
source. Similarly, GIS can be used to compare other
renewable energy Renewable energy (also called green energy) is energy made from renewable resource, renewable natural resources that are replenished on a human lifetime, human timescale. The most widely used renewable energy types are solar energy, wind pow ...
resources to find the best geographic potential for a region. Additionally, from a series of three-dimensional points, or
digital elevation model A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, Natural satellite, moon, or asteroid. A "global DEM" refer ...
, isopleth lines representing elevation contours can be generated, along with slope analysis,
shaded relief Terrain cartography or relief mapping is the depiction of the shape of the surface of the Earth on a map, using one or more of several techniques that have been developed. Terrain or relief is an essential aspect of physical geography, and as su ...
, and other elevation products. Watersheds can be easily defined for any given reach, by computing all of the areas contiguous and uphill from any given point of interest. Similarly, an expected
thalweg In geography, hydrography, and fluvial geomorphology, a thalweg or talweg () is the line or curve of lowest elevation within a valley or watercourse. Normally only the horizontal position of the curve is considered (as viewed on a map); the c ...
of where surface water would want to travel in intermittent and permanent streams can be computed from elevation data in the GIS.


Topological modeling

A GIS can recognize and analyze the spatial relationships that exist within digitally stored spatial data. These
topological Topology (from the Greek words , and ) is the branch of mathematics concerned with the properties of a geometric object that are preserved under continuous deformations, such as stretching, twisting, crumpling, and bending; that is, wit ...
relationships allow complex spatial modelling and analysis to be performed. Topological relationships between geometric entities traditionally include adjacency (what adjoins what), containment (what encloses what), and proximity (how close something is to something else).


Geometric networks

Geometric networks are linear networks of objects that can be used to represent interconnected features, and to perform special spatial analysis on them. A geometric network is composed of edges, which are connected at junction points, similar to
graphs Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties * Graph (topology), a topological space resembling a graph in the sense of discre ...
in mathematics and computer science. Just like graphs, networks can have weight and flow assigned to its edges, which can be used to represent various interconnected features more accurately. Geometric networks are often used to model road networks and
public utility A public utility company (usually just utility) is an organization that maintains the infrastructure for a public service (often also providing a service using that infrastructure). Public utilities are subject to forms of public control and ...
networks, such as electric, gas, and water networks. Network modeling is also commonly employed in
transportation planning Transportation planning is the process of defining future policies, goals, investments, and spatial planning designs to prepare for future needs to move people and goods to destinations. As practiced today, it is a collaborative process that i ...
,
hydrology Hydrology () is the scientific study of the movement, distribution, and management of water on Earth and other planets, including the water cycle, water resources, and drainage basin sustainability. A practitioner of hydrology is called a hydro ...
modeling, and
infrastructure Infrastructure is the set of facilities and systems that serve a country, city, or other area, and encompasses the services and facilities necessary for its economy, households and firms to function. Infrastructure is composed of public and pri ...
modeling.


Cartographic modeling

Dana Tomlin coined the term ''cartographic modeling'' in his PhD dissertation (1983); he later used it in the title of his book, ''Geographic Information Systems and Cartographic Modeling'' (1990). Cartographic modeling refers to a process where several thematic
layers Layer or layered may refer to: Arts, entertainment, and media * ''Layers'' (Kungs album) * ''Layers'' (Les McCann album) * ''Layers'' (Royce da 5′9″ album) *“Layers”, the title track of Royce da 5′9″’s sixth studio album * Layer, a ...
of the same area are produced, processed, and analyzed. Tomlin used raster layers, but the overlay method (see below) can be used more generally. Operations on map layers can be combined into algorithms, and eventually into simulation or optimization models.


Map overlay

The combination of several spatial datasets (points, lines, or
polygons In geometry, a polygon () is a plane figure made up of line segments connected to form a closed polygonal chain. The segments of a closed polygonal chain are called its '' edges'' or ''sides''. The points where two edges meet are the polygon' ...
) creates a new output vector dataset, visually similar to stacking several maps of the same region. These overlays are similar to mathematical
Venn diagram A Venn diagram is a widely used diagram style that shows the logical relation between set (mathematics), sets, popularized by John Venn (1834–1923) in the 1880s. The diagrams are used to teach elementary set theory, and to illustrate simple ...
overlays. A union overlay combines the geographic features and attribute tables of both inputs into a single new output. An intersect overlay defines the area where both inputs overlap and retains a set of attribute fields for each. A
symmetric difference In mathematics, the symmetric difference of two sets, also known as the disjunctive union and set sum, is the set of elements which are in either of the sets, but not in their intersection. For example, the symmetric difference of the sets \ and ...
overlay defines an output area that includes the total area of both inputs except for the overlapping area. Data extraction is a GIS process similar to vector overlay, though it can be used in either vector or raster data analysis. Rather than combining the properties and features of both datasets, data extraction involves using a "clip" or "mask" to extract the features of one data set that fall within the spatial extent of another dataset. In raster data analysis, the overlay of datasets is accomplished through a process known as "local operation on multiple rasters" or " map algebra", through a function that combines the values of each raster's
matrix Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the m ...
. This function may weigh some inputs more than others through use of an "index model" that reflects the influence of various factors upon a geographic phenomenon.


Geostatistics

Geostatistics Geostatistics is a branch of statistics focusing on spatial or spatiotemporal datasets. Developed originally to predict probability distributions of ore grades for mining operations, it is currently applied in diverse disciplines including pet ...
is a branch of statistics that deals with field data, spatial data with a continuous index. It provides methods to model spatial correlation, and predict values at arbitrary locations (interpolation). When phenomena are measured, the observation methods dictate the accuracy of any subsequent analysis. Due to the nature of the data (e.g. traffic patterns in an urban environment; weather patterns over the
Pacific Ocean The Pacific Ocean is the largest and deepest of Earth's five Borders of the oceans, oceanic divisions. It extends from the Arctic Ocean in the north to the Southern Ocean, or, depending on the definition, to Antarctica in the south, and is ...
), a constant or dynamic degree of precision is always lost in the measurement. This loss of precision is determined from the scale and distribution of the data collection. To determine the statistical relevance of the analysis, an average is determined so that points (gradients) outside of any immediate measurement can be included to determine their predicted behavior. This is due to the limitations of the applied statistic and data collection methods, and interpolation is required to predict the behavior of particles, points, and locations that are not directly measurable.
Interpolation In the mathematics, mathematical field of numerical analysis, interpolation is a type of estimation, a method of constructing (finding) new data points based on the range of a discrete set of known data points. In engineering and science, one ...
is the process by which a surface is created, usually a raster dataset, through the input of data collected at a number of sample points. There are several forms of interpolation, each which treats the data differently, depending on the properties of the data set. In comparing interpolation methods, the first consideration should be whether or not the source data will change (exact or approximate). Next is whether the method is subjective, a human interpretation, or objective. Then there is the nature of transitions between points: are they abrupt or gradual. Finally, there is whether a method is global (it uses the entire data set to form the model), or local where an algorithm is repeated for a small section of terrain. Interpolation is a justified measurement because of a spatial autocorrelation principle that recognizes that data collected at any position will have a great similarity to, or influence of those locations within its immediate vicinity.
Digital elevation model A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, Natural satellite, moon, or asteroid. A "global DEM" refer ...
s, triangulated irregular networks, edge-finding algorithms, Thiessen polygons,
Fourier analysis In mathematics, Fourier analysis () is the study of the way general functions may be represented or approximated by sums of simpler trigonometric functions. Fourier analysis grew from the study of Fourier series, and is named after Joseph Fo ...
, (weighted) moving averages,
inverse distance weighting Inverse distance weighting (IDW) is a type of Deterministic algorithm, deterministic method for multivariate interpolation with a known homogeneously scattered set of points. The assigned values to unknown points are calculated with a Weighted m ...
,
kriging In statistics, originally in geostatistics, kriging or Kriging (), also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging g ...
, spline, and trend surface analysis are all mathematical methods to produce interpolative data.


Address geocoding

Geocoding is interpolating spatial locations (X,Y coordinates) from street addresses or any other spatially referenced data such as ZIP Codes, parcel lots and address locations. A reference theme is required to
geocode A geocode is a code that represents a geographic entity (location or Geographical feature, object). It is a unique identifier of the entity, to distinguish it from others in a finite set of geographic entities. In general the ''geocode'' is a ...
individual addresses, such as a road centerline file with address ranges. The individual address locations have historically been interpolated, or estimated, by examining address ranges along a road segment. These are usually provided in the form of a table or database. The software will then place a dot approximately where that address belongs along the segment of centerline. For example, an address point of 500 will be at the midpoint of a line segment that starts with address 1 and ends with address 1,000. Geocoding can also be applied against actual parcel data, typically from municipal tax maps. In this case, the result of the geocoding will be an actually positioned space as opposed to an interpolated point. This approach is being increasingly used to provide more precise location information.


Reverse geocoding

Reverse geocoding is the process of returning an estimated
street address An address is a collection of information, presented in a mostly fixed format, used to give the location of a building, apartment, or other structure or a plot of land, generally using political boundaries and street names as references, along ...
number as it relates to a given coordinate. For example, a user can click on a road centerline theme (thus providing a coordinate) and have information returned that reflects the estimated house number. This house number is interpolated from a range assigned to that road segment. If the user clicks at the
midpoint In geometry, the midpoint is the middle point of a line segment. It is equidistant from both endpoints, and it is the centroid both of the segment and of the endpoints. It bisects the segment. Formula The midpoint of a segment in ''n''-dim ...
of a segment that starts with address 1 and ends with 100, the returned value will be somewhere near 50. Note that reverse geocoding does not return actual addresses, only estimates of what should be there based on the predetermined range.


Multi-criteria decision analysis

Coupled with GIS,
multi-criteria decision analysis Multiple-criteria decision-making (MCDM) or multiple-criteria decision analysis (MCDA) is a sub-discipline of operations research that explicitly evaluates multiple conflicting criteria in decision making (both in daily life and in settings ...
methods support decision-makers in analysing a set of alternative spatial solutions, such as the most likely ecological habitat for restoration, against multiple criteria, such as vegetation cover or roads. MCDA uses decision rules to aggregate the criteria, which allows the alternative solutions to be ranked or prioritised. GIS MCDA may reduce costs and time involved in identifying potential restoration sites.


GIS data mining

GIS or spatial
data mining Data mining is the process of extracting and finding patterns in massive data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and ...
is the application of data mining methods to spatial data. Data mining, which is the partially automated search for hidden patterns in large databases, offers great potential benefits for applied GIS-based decision making. Typical applications include
environmental monitoring Environmental monitoring is the processes and activities that are done to characterize and describe the state of the environment. It is used in the preparation of environmental impact assessments, and in many circumstances in which human activit ...
. A characteristic of such applications is that spatial correlation between data measurements require the use of specialized algorithms for more efficient data analysis.


Data output and cartography

Cartography Cartography (; from , 'papyrus, sheet of paper, map'; and , 'write') is the study and practice of making and using maps. Combining science, aesthetics and technique, cartography builds on the premise that reality (or an imagined reality) can ...
is the design and production of maps, or visual representations of spatial data. The vast majority of modern cartography is done with the help of computers, usually using GIS but production of quality cartography is also achieved by importing layers into a design program to refine it. Most GIS software gives the user substantial control over the appearance of the data. Cartographic work serves two major functions: First, it produces graphics on the screen or on paper that convey the results of analysis to the people who make decisions about resources. Wall maps and other graphics can be generated, allowing the viewer to visualize and thereby understand the results of analyses or simulations of potential events. Web Map Servers facilitate distribution of generated maps through web browsers using various implementations of web-based application programming interfaces (
AJAX Ajax may refer to: Greek mythology and tragedy * Ajax the Great, a Greek mythological hero, son of King Telamon and Periboea * Ajax the Lesser, a Greek mythological hero, son of Oileus, the king of Locris * Ajax (play), ''Ajax'' (play), by the an ...
,
Java Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
,
Flash Flash, flashes, or FLASH may refer to: Arts, entertainment, and media Fictional aliases * The Flash, several DC Comics superheroes with super speed: ** Flash (Jay Garrick) ** Barry Allen ** Wally West, the first Kid Flash and third adult Flash ...
, etc.). Second, other database information can be generated for further analysis or use. An example would be a list of all addresses within one mile (1.6 km) of a toxic spill. An archeochrome is a new way of displaying spatial data. It is a thematic on a 3D map that is applied to a specific building or a part of a building. It is suited to the visual display of heat-loss data.


Terrain depiction

Traditional maps are abstractions of the real world, a sampling of important elements portrayed on a sheet of paper with symbols to represent physical objects. People who use maps must interpret these symbols.
Topographic map In modern mapping, a topographic map or topographic sheet is a type of map characterized by large- scale detail and quantitative representation of relief features, usually using contour lines (connecting points of equal elevation), but histori ...
s show the shape of land surface with
contour line A contour line (also isoline, isopleth, isoquant or isarithm) of a Function of several real variables, function of two variables is a curve along which the function has a constant value, so that the curve joins points of equal value. It is a ...
s or with
shaded relief Terrain cartography or relief mapping is the depiction of the shape of the surface of the Earth on a map, using one or more of several techniques that have been developed. Terrain or relief is an essential aspect of physical geography, and as su ...
. Today, graphic display techniques such as
shading Shading refers to the depiction of depth perception in 3D models (within the field of 3D computer graphics) or illustrations (in visual art) by varying the level of darkness. Shading tries to approximate local behavior of light on the object's ...
based on
altitude Altitude is a distance measurement, usually in the vertical or "up" direction, between a reference datum (geodesy), datum and a point or object. The exact definition and reference datum varies according to the context (e.g., aviation, geometr ...
in a GIS can make relationships among map elements visible, heightening one's ability to extract and analyze information. For example, two types of data were combined in a GIS to produce a perspective view of a portion of San Mateo County,
California California () is a U.S. state, state in the Western United States that lies on the West Coast of the United States, Pacific Coast. It borders Oregon to the north, Nevada and Arizona to the east, and shares Mexico–United States border, an ...
. *The
digital elevation model A digital elevation model (DEM) or digital surface model (DSM) is a 3D computer graphics representation of elevation data to represent terrain or overlaying objects, commonly of a planet, Natural satellite, moon, or asteroid. A "global DEM" refer ...
, consisting of surface elevations recorded on a 30-meter horizontal grid, shows high elevations as white and low elevation as black. *The accompanying
Landsat The Landsat program is the longest-running enterprise for acquisition of satellite imagery of Earth. It is a joint NASA / USGS program. On 23 July 1972, the Earth Resources Technology Satellite was launched. This was eventually renamed to Lan ...
Thematic Mapper image shows a false-color infrared image looking down at the same area in 30-meter pixels, or picture elements, for the same coordinate points, pixel by pixel, as the elevation information. A GIS was used to register and combine the two images to render the three-dimensional perspective view looking down the
San Andreas Fault The San Andreas Fault is a continental Fault (geology)#Strike-slip faults, right-lateral strike-slip transform fault that extends roughly through the U.S. state of California. It forms part of the tectonics, tectonic boundary between the Paci ...
, using the Thematic Mapper image pixels, but shaded using the elevation of the
landform A landform is a land feature on the solid surface of the Earth or other planetary body. They may be natural or may be anthropogenic (caused or influenced by human activity). Landforms together make up a given terrain, and their arrangement ...
s. The GIS display depends on the viewing point of the
observer An observer is one who engages in observation or in watching an experiment. Observer may also refer to: Fiction * ''Observer'' (novel), a 2023 science fiction novel by Robert Lanza and Nancy Kress * ''Observer'' (video game), a cyberpunk horr ...
and time of day of the display, to properly render the shadows created by the sun's rays at that latitude, longitude, and time of day.


Web mapping

In recent years there has been a proliferation of free-to-use and easily accessible mapping software such as the
proprietary {{Short pages monitor * {{DEFAULTSORT:Geographic Information System