The Geographic Names Information System (GNIS) is a database of name and locative information about more than two million physical and cultural features throughout the United States and its territories,
Antarctica
Antarctica () is Earth's southernmost and least-populated continent. Situated almost entirely south of the Antarctic Circle and surrounded by the Southern Ocean, it contains the geographic South Pole. Antarctica is the fifth-largest co ...
United States Board on Geographic Names
The United States Board on Geographic Names (BGN) is a federal body operating under the United States Secretary of the Interior. The purpose of the board is to establish and maintain uniform usage of geographic names throughout the federal governm ...
(BGN) to promote the standardization of feature names.
Data were collected in two phases.
Although a third phase was considered, which would have handled name changes where local usages differed from maps, it was never begun.
The database is part of a system that includes topographic map names and bibliographic references. The names of books and historic maps that confirm the feature or place name are cited. Variant names, alternatives to official federal names for a feature, are also recorded. Each feature receives a permanent, unique feature record identifier, sometimes called the GNIS identifier. The database never removes an entry, "except in cases of obvious duplication."
Original purposes
The GNIS was originally designed for four major purposes: to eliminate duplication of effort at various other levels of government that were already compiling geographic data, to provide standardized datasets of geographic data for the government and others, to index all of the names found on official U.S. government federal and state maps, and to ensure uniform geographic names for the federal government.
Phase 1
Phase 1 lasted from 1978 to 1981, with a precursor pilot project run over the states of Kansas and Colorado in 1976, and produced 5 databases.
It excluded several classes of feature because they were better documented in non-USGS maps, including airports, the broadcasting masts for radio and television stations, civil divisions, regional and historic names, individual buildings, roads, and triangulation station names.
The databases were initially available on paper (2 to 3 spiral-bound volumes per state), on microfiche, and on
magnetic tape
Magnetic tape is a medium for magnetic storage made of a thin, magnetizable coating on a long, narrow strip of plastic film. It was developed in Germany in 1928, based on the earlier magnetic wire recording from Denmark. Devices that use magne ...
encoded (unless otherwise requested) in EBCDIC with 248-byte fixed-length
record
A record, recording or records may refer to:
An item or collection of data Computing
* Record (computer science), a data structure
** Record, or row (database), a set of fields in a database related to one entity
** Boot sector or boot record, ...
s in 4960-byte blocks.
The feature classes for association with each name included (for examples) "locale" (a "place at which there is or was human activity" not covered by a more specific feature class), "populated place" (a "place or area with clustered or scattered buildings"), "spring" (a spring), "lava" (a
lava flow
Lava is molten or partially molten rock (magma) that has been expelled from the interior of a terrestrial planet (such as Earth) or a moon onto its surface. Lava may be erupted at a volcano or through a fracture in the crust, on land or und ...
, kepula, or other such feature), and "well" (a well).
Mountain features would fall into "ridge", "range", or "summit" classes.
A feature class "tank" was sometimes used for lakes, which was problematic in several ways.
This feature class was undocumented, and it was (in the words of a 1986 report from the Engineer Topographic Laboratories of the United States Army Corps of Engineers) "an unreasonable determination", with the likes of Cayuga Lake being labelled a "tank".
The USACE report assumed that "tank" meant "reservoir", and observed that often the coördinates of "tanks" were outside of their boundaries and were "possibly at the point where a dam is thought to be".
National Geographic Names database
The National Geographic Names database (NGNDB hereafter) was originally 57 computer files, one for each state and territory of the United States (except Alaska which got two) plus one for the District of Columbia.
The second Alaska file was an earlier database, the Dictionary of Alaska Place Names that had been compiled by the USGS in 1967.
A further two files were later added, covering the entire United States and that were abridged versions of the data in the other 57: one for the 50,000 most well known populated places and features, and one for most of the populated places.
The files were compiled from all of the names to be found on USGS topographic maps, plus data from various state map sources.
In phase 1, elevations were recorded in feet only, with no conversion to metric, and only if there was an actual elevation recorded for the map feature.
They were of either the lowest or highest point of the feature, as appropriate.
Interpolated elevations, calculated by
interpolation
In the mathematical field of numerical analysis, interpolation is a type of estimation, a method of constructing (finding) new data points based on the range of a discrete set of known data points.
In engineering and science, one often has a n ...
between contour lines, were added in phase 2.
Names were the official name, except where the name contained
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
characters that the computer file encodings of the time could not handle (which were in phase 1 marked with an asterisk for update in a later phase).
Generic designations were given after specific names, so (for examples) Mount Saint Helens was recorded as "Saint Helens, Mount", although cities named Mount Olive, not actually being mountains, would not take "Mount" to be a generic part and would retain their order "Mount Olive".
The primary geographic coördinates of features which occupy an area, rather than being a single point feature, were the location of the feature's mouth, or of the approximate centre of the area of the feature.
Such approximate centres were "eye-balled" estimates by the people performing the digitization, subject to the constraint that centres of areal features were not placed within other features that are inside them.
alluvial fans and river deltas counted as mouths for this purpose.
For cities and other large populated places, the coördinates were taken to be those of a primary civic feature such as the
city hall
In local government, a city hall, town hall, civic centre (in the UK or Australia), guildhall, or a municipal building (in the Philippines), is the chief administrative building of a city, town, or other municipality. It usually houses ...
or
town hall
In local government, a city hall, town hall, civic centre (in the UK or Australia), guildhall, or a municipal building (in the Philippines), is the chief administrative building of a city, town, or other municipality. It usually houses ...
, main
public library
A public library is a library that is accessible by the general public and is usually funded from public sources, such as taxes. It is operated by librarians and library paraprofessionals, who are also Civil service, civil servants.
There are ...
, main highway intersection, main post office, or
central business district
A central business district (CBD) is the commercial and business centre of a city. It contains commercial space and offices, and in larger cities will often be described as a financial district. Geographically, it often coincides with the "city ...
.
Secondary coördinates were only an aid to locating which topographic map(s) the feature extended across, and were "simply anywhere on the feature and on the topographic map with which it is associated".
River sources were determined by the shortest drain, subject to the proxmities of other features that were clearly related to the river by their names.
USGS Topographic Map Names database
The USGS Topographic Map Names database (TMNDB hereafter) was also 57 computer files containing the names of maps: 56 for 1:24000 scale USGS maps as with the NGNDB, the 57th being (rather than a second Alaska file) data from the 1:100000 and 1:250000 scale USGS maps.
Map names were recorded exactly as on the maps themselves, with the exceptions for diacritics as with the NGNDB.
Unlike the NGNDB, locations were the geographic coördinates of the south-east corner of the given map, except for American Samoa and Guam maps where they were of the north-east cornder.
The TMNDB was later renamed the Geographic Cell Names database (GCNDB hereafter) in the 1990s.
Generic database
The Generic database was in essence a machine-readable glossary of terms and abbreviations taken from the map sources, with their definitions, grouped into collections of related terms.
National Atlas database
The National Atlas database was an abridged version of the NGNDB that contained only those entries that were in the index to the USGS National Atlas of the United States, with the coördinates published in the latter substituted for the coördinates from the former.
Board on Geographic Names database
The Board on Geographic Names database was a record of investigative work of the USGS Board on Geographic Names' Domestic Names Committee, and decisions that it had made from 1890 onwards, as well as names that were enshrined by Acts of Congress.
Elevation and location data followed the same rules as for the NGNDB.
So too did names with diacritic characters.
Phase 2
Phase 2 was broader in scope than phase 1, extending the scope to a much larger set of data sources.
It ran from the end of phase 1 and had manged to completely process data from 42 states by 2003, with 4 still underway and the remaining 4 (Alaska, Kentucky, Michigan, and New York) awaiting the initial systematic compilation of the sources to use.
Many more feature classes were included, including abandoned Native American settlements, ghost towns, railway stations on railway lines that no longer existed, housing developments, shopping centres, and highway rest areas.
The actual compilation was outsourced by the U.S. government, state by state, to private entities such as university researchers.
Antarctica Geographic Names database
The Antarctica Geographic Names database (AGNDB hereafter) was added in the 1990s and comprised records for BGN-approved names in Antarctica and various off-lying islands such as the
South Orkney Islands
The South Orkney Islands are a group of islands in the Southern Ocean, about north-east of the tip of the Antarctic PeninsulaSouth Shetland Islands, the
Balleny Islands
The Balleny Islands () are a series of uninhabited islands in the Southern Ocean extending from 66°15' to 67°35'S and 162°30' to 165°00'E. The group extends for about in a northwest-southeast direction. The islands are heavily glaciated an ...
South Georgia
South Georgia ( es, Isla San Pedro) is an island in the South Atlantic Ocean that is part of the British Overseas Territory of South Georgia and the South Sandwich Islands. It lies around east of the Falkland Islands. Stretching in the east� ...
, and the
South Sandwich Islands
)
, anthem = "God Save the King"
, song_type =
, song =
, image_map = South Georgia and the South Sandwich Islands in United Kingdom.svg
, map_caption = Location of South Georgia and the South Sandwich Islands in the southern Atlantic Oce ...
.
It only contained records for natural features, not for scientific outposts.
Additional media
The media on which one could obtain the databases were extended in the 1990s (still including tape and paper) to floppy disc, over FTP, and on
CD-ROM
A CD-ROM (, compact disc read-only memory) is a type of read-only memory consisting of a pre-pressed optical compact disc that contains data. Computers can read—but not write or erase—CD-ROMs. Some CDs, called enhanced CDs, hold both compu ...
.
The CD-ROM edition only included the NGNDB, the AGNDB, the GCNDB, and a bibliographic reference database (RDB); but came with database search software that ran on PC DOS (or compatible) version 3.0 or later.
The FTP site included extra topical databases: a subset of the NGNDB that only included the records with feature classes for populated places, a "Concise" subset of the NGNDB that listed "major features", and a "Historical" subset that included the features that no longer exist.
Populated places
There is no differentiation amongst different types of populated places.
In the words of the aforementioned 1986 USACE report, "
subdivision
Subdivision may refer to:
Arts and entertainment
* Subdivision (metre), in music
* ''Subdivision'' (film), 2009
* "Subdivision", an episode of ''Prison Break'' (season 2)
* ''Subdivisions'' (EP), by Sinch, 2005
* "Subdivisions" (song), by Rus ...
having one inhabitant is as significant as a major metropolitan center such as New York City".
In comparing GNIS populated place records with data from the Thematic Mapper of the Landsat program, researchers from the University of Connecticut in 2001 discovered that "a significant number" of populated places in Connecticut had no identifiable human settlement in the land use data and were at road intersections.
They found that such populated places with no actual settlement often had "Corner" in their names, and hypothesized that either these were historical records or were "cartographic locators".
In
surveying
Surveying or land surveying is the technique, profession, art, and science of determining the terrestrial two-dimensional or three-dimensional positions of points and the distances and angles between them. A land surveying professional is ca ...
in the United States, a "Corner" is a corner of the surveyed polygon enclosing an area of land, whose location is, or was (since corners can become "lost" or "obliterated"), marked in various ways including with trees known as "bearing trees" ("witness trees" in older terminology) or "corner monuments".
From analysing Native American names in the database in order to compile a dictionary, professor William Bright of UCLA observed in 2004 that some GNIS entries are "erroneous; or refer to long-vanished railroad sidings where no one ever lived". Such false classifications have propagated to other geographical information sources, such as incorrectly classified train stations appearing as towns or neighborhoods on Google Maps.
Name changes
The usual sources of name change requests are an individual state's board on geographic names, or a county board of governors.
This does not always succeed, the State Library of Montana having submitted three large sets of name changes that have not been incorporated into the GNIS database.
Conversely, a group of middle school students in Alaska succeeded, with the help of their teachers, a professor of linguistics, and a man who had been conducting a years-long project to collect Native American placenames in the area, in changing the names of several places that they had spotted in class one day and challenged for being racist, including renaming "Negrohead Creek" to an Athabascan name Lochenyatth Creek and "Negrohead Mountain" to Tl'oo Khanishyah Mountain, both of which translate to "grassy tussocks" in
Lower Tanana
Lower Tanana (also Tanana and/or Middle Tanana) is an endangered language spoken in Interior Alaska in the lower Tanana River villages of Minto and Nenana. Of about 380 Tanana people in the two villages, about 30 still speak the language. As of ...
and
Gwichʼin
The Gwichʼin (or Kutchin) are an Athabaskan-speaking First Nations people of Canada and an Alaska Native people. They live in the northwestern part of North America, mostly above the Arctic Circle.
Gwichʼin are well-known for their crafting ...
respectively.
Likewise, in researching a 2008 book on ethnic slurs in U.S. placenames Mark Monmonier of
Syracuse University
Syracuse University (informally 'Cuse or SU) is a Private university, private research university in Syracuse, New York. Established in 1870 with roots in the Methodist Episcopal Church, the university has been nonsectarian since 1920. Locate ...
discovered "Niger Hill" in Potter County, Pennsylvania, an erroneous transcription of "Nigger Hill" from a 1938 map, and persuaded the USBGN to change it to "Negro Hill".
Removal of racial and ethnic slurs
In November 2021, the United States Secretary of the Interior issued an order instructing that "Squaw" be removed from usage by the U.S. federal government.
Prior efforts had included a 1962 replacement of the "Nigger" racial pejorative for African Americans with "Negro" and a 1974 replacement of the "Jap" racial pejorative for
Japanese American
are Americans of Japanese ancestry. Japanese Americans were among the three largest Asian American ethnic communities during the 20th century; but, according to the 2000 census, they have declined in number to constitute the sixth largest Asi ...
s with "Japanese".
In 2015, a cross-reference of the GNIS database against the Racial Slur Database had found 1441 racial slur placenames, every state of the United States having them, with California having 159 and the state with the most such names being Arizona.
One of the two standard reference works for placenames in Arizona is Byrd Howell Granger's 1983 book ''Arizona's Names: X Marks the Place'', which contains many additional names with racial slurs not in the GNIS database.
Despite "Nigger" having been removed from federal government use by
Stewart Udall
Stewart Lee Udall (January 31, 1920 – March 20, 2010) was an American politician and later, a federal government official. After serving three terms as a congressman from Arizona, he served as Secretary of the Interior from 1961 to 1969, unde ...
, its replacement "Negro" still remained in GNIS names in 2015, as did " Pickaninny", " Uncle Tom", and "
Jim Crow
The Jim Crow laws were state and local laws enforcing racial segregation in the Southern United States. Other areas of the United States were affected by formal and informal policies of segregation as well, but many states outside the Sout ...
" and 33 places named "Niggerhead".
There were 828 names containing "squaw", including 11 variations on "Squaw Tit" and "Squaw Teat", contrasting with the use of "Nipple" in names with non- Native American allusions such as "Susies Nipple".
Other authorities
* The United States Census Bureau (USCB) defines ''Census Designated Places'' as a subset of locations in the National Geographic Names Database.
* United States Postal Service (USPS) Publication 28 gives standards for addressing mail. In this publication, the postal service defines two-letter state abbreviations, street identifiers such as boulevard (BLVD) and street (ST), and secondary identifiers such as suite (STE).
United Nations Conference on the Standardization of Geographical Names
The United Nations Group of Experts on Geographical Names (UNGEGN) is one of the nine expert groups of the United Nations Economic and Social Council (ECOSOC) and deals with the national and international standardization of geographical names. Ev ...
References
Bibliography
*
*
*
*
*
*
*
*
*
*
*
*
*
*
Further reading
*
* United States Department of the Interior (DOI), ''Digital Gazeteer: Users Manual'', (Reston, Virginia: United States Geological Survey (USGS), 1994).
* Least Heat Moon, William, ''Blue Highways: A Journey Into America'', (Boston: Little Brown and Company, 1982).
* Jouris, David, ''All Over The Map'', (Berkeley, California: Ten Speed Press, 1994.)
* Report: "Countries, Dependencies, Areas of Special Sovereignty and Their Principal Administrative Divisions, Federal Information Processing Standards (FIPS) FIPS 10-4. Standard was withdrawn in September 2008, Se Federal Register (FR) Notice: Vol. 73, No. 170, page 51276 (September 2, 2008)
* Report: Principles, Policies and Procedures: Domestic Geographic Names