National Hydrography Dataset
February 2000ContentsOverview Features Feature types and characteristics Delineation rules Common identifier Special feature types: Artificial Path, Connector, and Underpass Reaches Reach types and delineation Transport reach Delineation Underlying feature rule Confluence-to-confluence rule Branched path rule Application of, and deviation from, the rules Coastline Reach Waterbody reach Shoreline reach Reach code Common identifier Reach summary Encoding flow relations among transport and coastline reaches Direction of flow using flow relations Sequencing flow relations along a reach Identifying level paths through the drainage network Stream level Tracing stream levels among reach flow relations Metadata and digital update units Digital update units Geographic names Characteristics of domestic geographic names Entry conventions for geographic names Common identifier Coordinates and related measures Horizontal coordinate referencing system Lengths and areas Elevations of water surfaces Data Quality Conterminous United States (excluding the Pacific Northwest), Hawaii, Puerto Rico, and the Virgin Islands Lineage Attribute accuracy Logical consistency Completeness Horizontal positional accuracy Vertical positional accuracy Pacific Northwest and Alaska Glossary References Appendix A. Overview of features, reaches, and related items. Appendix B. Feature code and description field structures and definitions Feature code structure Description field structure Appendix C. Encoding characteristics using field names Appendix D. Development of Reach Files and related concepts Purpose and approach Reach file development Appendix E. Transport reach delineation rules and examples Underlying feature rule Confluence-to-confluence rule Like feature types (and their surrogates) Unlike feature types: stream/river and canal/ditch No confluence: underpasses and pipelines Branched path rule Appendix F. Organization and examples of hydrologic units Appendix G. Peculiarities Quad Edge Effects DLG-3 coding inconsistencies Inland oceans Disguised aqueducts Coastal features - foreshore versus sand Stream/rivers controlled by dams that become a series of slackwater pools Reservoir versus lake/pond NHD linework doesn’t match the published USGS 1:100,000-scale map Names Subbasin (formerly known as cataloging unit) boundaries and the features that touch and cross them Squared-off coastal subbasin boundaries Empty subbasins Coastline reaches that bound stream/rivers Waterbody reaches Flow/coordinate direction/measure direction Artificial Paths that fall outside of the 2-D features they represent
OverviewThe National Hydrography Dataset (NHD) is a comprehensive set of digital spatial data that encodes information about naturally occurring and constructed bodies of water, paths through which water flows, and related entities. The information encoded about these features includes classification and other characteristics, delineation, geographic name, position and related measures, a "reach code" through which other information can be related to the NHD, and the direction of water flow. In addition to this geographic information, the dataset contains metadata and information that supports the exchange of future updates and improvements to the data. The data support many applications, such as:
In 1999, coverage was made available for the conterminous United States and Hawaii. Data for Puerto Rico, the Virgin Islands, and parts of Alaska will follow. The production of data for the remainder of Alaska will be ongoing for several years. Efforts to maintain and improve the data will occur continually. The NHD is the culmination of cooperative efforts of the U.S. Environmental Protection Agency (USEPA) and the U.S. Geological Survey (USGS). Other organizations also contributed to the effort. This volume describes the concepts and information content of the NHD, including features, reaches, metadata, geographic names, coordinate systems and related measures, and data quality.
Features![]() A feature is a defined entity and its representation. In the NHD, features include naturally occurring and constructed bodies of water, paths through which water flows, and related entities. Features are classified by type, may be described by additional characteristics, and are delineated using standard methods. Feature types and characteristics Features are classified by type. These feature types, such as "stream/river", "canal/ditch", and "lake/pond", provide the basic description of the features. Each type has a name and a definition. For example, the three most frequently encountered feature types and corresponding definitions are as follows:
Characteristics, which are traits, qualities, or properties of features, are provided for many feature types. Each characteristic has a name, a definition, and a list of values and corresponding definitions. For example, the features lake/pond and stream/river have the characteristic Hydrographic Category:
Appendix A lists the names and characteristics associated with each feature type. The "Standards for National Hydrography Dataset" (USGS, 1999) contains the names and definitions of all feature types, characteristics, and values. The document is available online through http://mapping.usgs.gov/standards/.
Encoding feature types and characteristics A five-digit feature code encodes the feature type and combinations of characteristics and values that can be assigned to a type. The first three digits encode the feature type, and the last two digits encode a set of characteristics and values. For example, the feature type "dam/weir" has the code "343". There are five combinations of characteristics and values that can be assigned to features of this type. These combinations are assigned the values of "00" through "04". The resulting possible feature codes are listed below:
Feature codes are stored in a data element named "FCODE". For those who prefer to use text instead of the numeric code, words also are used to encode the feature type, characteristic, and value information in the feature code:
For example, a feature classified as a "Dam/Weir" may have the characteristic of "Construction Material" with a value of "earthen", and the characteristic of "Operational Status" with a value of "operational". This information is encoded as follows:
Appendix B lists each feature code and its corresponding description. Appendix C lists the name of the field for each characteristic and the list of values for each characteristic.
A feature type and feature code are assigned to each feature. Description fields, fields for each characteristic, and feature codes are encoded in a lookup table. Associating the lookup table with features by matching the feature codes allows words denoting characteristics and values to be substituted for the numeric feature code. The shape and extent of features are delineated using points (including nodes), lines, or areas (see figure 1). ![]() Figure 1. Features are delineated using points, lines, or areas. The delineation of each feature follows three rules:
Delineations of linear and areal features of different types may overlap. Where they overlap, they use the same lines or areas for their delineations. For example (see figure 2) the delineation of linear features of the types canal/ditch and bridge use the same lines where they overlap. Similarly, the delineations of areal features of the types lake/pond and swamp/marsh share the same areas where they overlap. ![]() Figure 2. Examples of overlapping delineations of features. Features delineated with lines have two additional rules: they may not have branches, and they must start and stop at decision or merge points along a network. These points exist where a path represented by a network can branch among two or more choices. For example, a decision point exists at the confluence of two stream/rivers; at the confluence, one can choose among two or more paths. Conversely, a decision point does not exist where features at different elevations cross; travel along the path of each feature is independent from that of the other. For example, a decision point does not exist where a canal/ditch passes over a stream/river. Lines always have a direction; that is, lines trace a path between places where they start and stop. This characteristic provides a means to encode the direction of the flow of water. For features delineated with lines, for which the direction of flow is a prominent characteristic (the feature types artificial path, canal/ditch, connector, pipeline, and stream/river), and for which the direction of flow is known, the lines are oriented in the direction of the flow of water. Note that the direction of flow is not always known (for example, where source materials are ambiguous) or uniform (for example, in tidal areas), and so the lines are not always oriented in the direction of flow. In addition, along the coastline of the United States, the lines are oriented so that the water is to the right of the direction of the line. The delineation of features stops at the borders of the United States. The common identifier is a 10-digit integer value that uniquely identifies the occurrence of each feature. Each value occurs only once throughout the Nation. Once assigned, the value is associated permanently with its feature. When a feature is deleted, the value for its identifier is retired. The common identifier is stored in a data element named "COM_ID". Special feature types: Artificial Path, Connector, and Underpass The feature types artificial path, connector, and underpass serve special functions. The artificial path and connector ensure that the hydrographic network is complete. The artificial path represents the flow of water into, through, and out of features1 delineated using areas (that is, it serves as a centerline) and also delineates the coastline. The connector fills gaps in the delineation of other features. An underpass and two relations ("above" and "below"), represent places where features cross at different elevations. An example is where a feature of the type canal/ditch passes over a feature of the type stream/river (see figure 3.) The canal/ditch is encoded as being above an underpass, and the stream/river is encoded as being below the (same) underpass. ![]() ![]() Figure 3. Encoding an underpass. Appendix A identifies the feature types that may be related to an underpass. Note that underpasses are encoded only where they can be observed from source materials. Reaches![]() A reach is a continuous, unbroken stretch or expanse of surface water. In the NHD, this idea has been expanded to define a reach as a significant segment of surface water that has similar hydrologic characteristics, such as a stretch of stream/river between two confluences, or a lake/pond. Reaches also are defined for unconnected (isolated) features, such as an isolated lake/pond. Once a reach is defined for a segment of water and assigned a reach code, the reach will rarely be changed, if at all. Many activities for improving and updating the data, such as the integration of more accurate coordinate data, the replacement of linear feature delineations with areas, or the addition of smaller features, change only the alignment of existing reaches and do not require that they be redefined. The stability of reach definition and reach code assignment makes reaches a useful foundation for geocoding observations and statistics. Changes to the surface waters (for example, the creation of a new reservoir) and corrections to erroneous delineations of reaches, of course, would change reaches and reach codes. The NHD is the latest refinement of reaches and reach codes. Information about earlier implementations is in Appendix D Three types of reaches are in use: transport, coastline, and waterbody reaches. A fourth type, shoreline reach, has not been implemented. A transport reach represents the pathway for the movement of water through a drainage network. These reaches also are used to encode the direction in which water flows along the reach when the direction is known. They provide a basis on which locations of observations can be geocoded and linked to the drainage network. Lines delineate transport reaches. Only lines that delineate features of the types canal/ditch, pipeline, stream/river, artificial path, and connector delineate transport reaches. For transport reaches for which the direction of flow is known, the lines are oriented in the direction of the flow of water. Note that the direction of flow is not always known (for example, where source materials are ambiguous) or uniform (for example, in tidal areas), and so some lines are not oriented in the direction of flow. Three general rules determine the location of the ends of transport reaches: the underlying feature rule, the confluence-to-confluence rule, and the branched path rule. The delineation of a transport reach follows that of one or more features. Where two or more features are followed, a transport reach follows delineations of :
A transport reach always follows the entire delineation of the underlying feature or features; the delineation of a feature is not split among reaches. Transport reaches abut and do not overlap. In the confluence-to-confluence rule, a transport reach is a stretch of water between:
In the application of this rule, divergences serve the same role as confluences. Reaches defined by this rule must be contiguous and may not branch (see figure 4). ![]() Figure 4. Confluence-to-confluence reach delineation. Note that some confluences are not considered to be significant enough to break the delineation of a reach. Thus, although transport reaches start and end at confluences, not every confluence causes a transport reach to start or end (see Appendix E for more information). A branched path transport reach connects reaches that enter and exit an areal feature (see figure 5). Reaches that follow this rule occur most often in large2 features of type lake/pond and swamp/marsh, and they also may occur in other areal features. Artificial paths delineated within the areal feature provide the lines needed to delineate this special transport reach. The reach may branch and at times be discontiguous. ![]() Figure 5. Branched path reach delineation. The branched path transport reach avoids the need to define flow channels, confluences and divergences, and confluence-based transport reaches in areal features. It is used where the information needed to delineate these items reliably is not available, and at other places. Application of, and deviation from, the rules The underlying feature, confluence-to-confluence, and branched path rules govern the delineation of most transport reaches. These rules, however, have exceptions. Unusual configurations of features require modification of the rules, as do the variable condition and ambiguities of information sources. In places where unusual configurations of features or ambiguities in sources occur more often, a larger percentage of reaches delineated using modified rules will be found. Appendix E lists more examples of reach delineation rules and exceptions to the rules. A coastline reach represents a section of coastline along the Atlantic, Pacific, or Arctic Oceans, the Great Lakes, the Gulf of Mexico, or the Caribbean Sea. These reaches provide a basis for geocoding locations of observations along the coastline. Artificial paths that follow the coastline provide the lines used for coastline reach delineation. The delineation of a coastline reach may follow one or more artificial paths. A coastline reach always follows the entire delineation of the underlying artificial path or paths; the delineation of an artificial path is not split among reaches. Coastline reaches abut and do not overlap. The lines are usually oriented so that the water is to the right of the line. This results in a general orientation of coastline reaches northward along the Atlantic Ocean, southward along the Pacific Ocean, eastward along the Gulf of Mexico, westward along the Arctic Ocean and the U.S. side of the Great Lakes, and counterclockwise around islands. A coastline reach is delineated for coastal islands that have a drainage network or more than 5 miles (approximately 8.06 kilometers) of isolated drainage. Other coastal islands may or may not be delineated with a coastline reach. The ends of coastline reaches occur where transport reaches discharge into the oceans, Gulf of Mexico, Caribbean Sea, or Great Lakes (although not every point of discharge is the end of a coastline reach). At the mouth of an areal stream/river, coastline reaches end where the artificial path used to delineate the transport reach intersects the coastline. A waterbody is a hydrographic feature delineated using areas. Reaches assigned to waterbodies are termed waterbody reaches. These reaches provide a means to geocode observations for areas of water. (In contrast, transport reaches represent the path of a flow of water and provide a means of geocoding observations along the path.) Areal delineations of features provide the areas used to delineate waterbody reaches3. The shoreline reach has not been implemented and is discussed for information purposes only. A shoreline reach would represent all or part of the shoreline of an inland waterbody. Analogous to coastline reaches, they would provide a basis for geocoding locations of observations along the shoreline. Lines would delineate shoreline reaches. These reaches would abut and not overlap. The types of features that would provide these lines are being investigated, as are questions of the direction, relative to the water, in which these reaches should be ordered, means of handling islands, and rules for deciding where these reaches would start and stop. A reach code is a numeric code that uniquely labels each reach. This 14-digit code has 2 parts: the first 8 digits are the hydrologic unit4 code for the subbasin in which the reach exists; the last 6 digits are assigned in sequential order, and arbitrarily among the reaches. Each reach code occurs only once throughout the Nation. Once assigned, a reach code is associated with its reach permanently. If a reach is deleted, its reach code is retired. A reach code should not be altered. Reach codes can serve to geocode an observation to a reach or a position along a reach. Observations can be geocoded to an entire reach by associating the reach code with the observation data, or to sections of a transport, coastline, or (the planned) shoreline reach by using the reach and reach code as the basis of a linear referencing system. Reach codes are stored in data elements named "RCH_CODE". In addition to the reach code, the date on which the code was assigned in the NHD is encoded. The date is stored in data elements named "RCH_DATE". In addition to identifying each feature, the common identifier uniquely identifies each reach. Each 10-digit integer value occurs only once throughout the Nation. Once assigned, the value is associated permanently with its reach. When a reach is deleted, the value for its identifier is retired. The common identifier is stored in a data element named "COM_ID". Table 1 summarizes the types, related items, delineation, and underlying features of reaches. Table 1. Summary of reach organization by type.
Encoding flow relations among transport and coastline reaches![]() Flow relations among transport and coastline reaches encode drainage network connectivity among reaches independently from their delineations. Relations among transport reaches define a connected hydrographic network and encode the direction of water flow among reaches. This connectivity enables hydrologic sequencing of reaches (what is upstream and downstream of a given point in the hydrographic network) and navigating the network in an upstream or downstream direction. Similarly, relations among coastline reaches allow the traversal of the coastline using reaches. Relations among transport and coastline reaches connect hydrographic networks to the coastline and enable the ordering of the mouths of these networks along the coastline. Flow relations are especially useful in applications that require information about connectivity among, but not about, the underlying delineations and coordinate positions of reaches. Flow relations are encoded by identifying a pair of reaches that touch and describing the flow of water between the reaches. Note that the direction of flow is not always known (for example, where source materials are ambiguous); in these cases, flow relations are not created.
Direction of flow using flow relations The common identifiers for a pair of reaches and a description of the direction of water flow between the reaches are used to encode flow relations. Five values describe the direction of water flow between transport reaches; a sixth value describes connections between pairs of touching coastline reaches or between touching transport and coastline reaches:
![]() Figure 6. Flow relations illustrating in, out, network start, and network end directions. (A common identifier value of "0" represents a null entry.) ![]() Figure 7. Flow relations illustrating out and in directions. ![]() Figure 8. Flow relations for a branched path reach. ![]() Figure 9. Flow relations illustrating non-flowing connections. (A common identifier value of "0" represents a null entry.) Flow relations are stored as four data elements. The common identifier for the first reach is stored in a data element named "COM_ID_1". The common identifier for the second reach is stored in a data element named "COM_ID_2". The direction description is stored in two data elements: as a character string in a data element named "DIR_TEXT" and as a numeric alias5 in a data element named "DIRECTION". Sequencing flow relations along a reach Sequence numbers order flow relations of transport reaches that flow into or out of the interior of another transport reach. The reach that flows in or out is always the first reach in the flow relation; the reach along whose interior the inflowing and outflowing reach is being sequenced is the second reach in the flow relation. The flow relations are numbered sequentially, starting with 1, from upstream to downstream along the interior of the second reach. Sequence numbers also order flow relations of transport reaches that have a non-flowing connection to the interior of a coastline reach. The non-flowing connection flow relations of intersecting transport reaches are numbered sequentially, starting with 1, from start to end along the interior of the coastline reach. A sequence number of 0 means that the first reach touches the second reach at its end and not along its interior. The sequence number is stored with the flow relations in a data element named "SEQUENCE". Figure 10 illustrates the sequencing of flow relations. ![]() Figure 10. Sequencing flow relations along transport and coastline reaches. (A common identifier value of "0" represents a null entry.) Identifying level paths through the drainage network![]() Transport reaches and flow relations provide the components of the drainage network. The level path builds on these elements to create a sequence of transport reaches that trace the main stem for a given flow of water. The level path traces the flow between a head and the next largest flow of water or between a head and the ocean. Geographic names often follow these paths. Level paths are encoded by associating stream levels to transport reaches and by associating a delta level with flow relations. The stream level is a numeric code that identifies each main path of water flow through a drainage network. Stream level is assigned by identifying the terminus of a drainage network (see Figure 11). The lowest value6 for stream level is assigned to a transport reach at the end of a flow and to upstream transport reaches that trace the main path of flow back to the head. The stream level value is incremented by one and is assigned to all transport reaches that terminate at this path (that is, all tributaries to the path) and to all transport reaches that trace the main path of the flow along each tributary back to its head. The stream level value is incremented again and is assigned to transport reaches that trace the main path of the tributaries to to their heads. This process is continued until all transport reaches for which flow is encoded are assigned a stream level. ![]() Figure 11. Stream level assignment for a simple drainage network. For example (see Figure 12), the Mississippi River terminates at the Gulf of Mexico. The transport reaches that trace the main flow of the river, from the head to the mouth on the Gulf, are assigned a stream level of 1. The transport reaches that trace the main flow of each tributary to the Mississippi River (such as the Ohio/Monongahela Rivers) from their heads to their termini on the Mississippi River, are assigned a stream level of 2. The transport reaches that trace the main flow of each tributary to the level 2 tributaries (such as the Tennessee River, which is a tributary to the Ohio/Monongahela Rivers), from each head to each mouth on their level 2 tributary, are assigned a stream level of 3. ![]() Figure 12. Stream level assignments along the Mississippi River. Ideally, each main path would trace the flow of the largest volume of water. Stream levels encoded in two predecessors7 to the NHD were based on flow volume data, and these paths were retained wherever possible. Data about flow volumes were not available for other reaches. Instead, the path with the same geographic name was used to determine the level path. Where a geographic name was not available or known, the longest and straightest path was used to determine a main path. Where this rule did not adequately discriminate among the choices, the rightmost (looking upstream) transport reach was assumed to be the continuation of a main path. Stream level is stored in a data element named "LEVEL". The special value "-9998" means that a value can be applied to the transport reach but has not been specified. This value usually occurs where flow relations cannot be determined or have not been encoded. Without this information, main paths cannot be identified and stream level cannot be assigned. This value is also assigned to coastline reaches. Tracing stream levels among reach flow relations To help identify main paths in the flow relations table, the difference, or delta level, between the stream levels of inflowing and outflowing reaches is encoded. To calculate this value, the stream level of the second reach in the relation is subtracted from the stream level of the first reach in the relation (see Figure 13). Delta levels of "0" mean that a relation links two reaches that are on the same main path. In most other cases, the delta level will have a value of "1". Areas of complex divergence and braided stream configurations can yield other values. A special value of "-9999" means that a value for delta level is not applicable to the relation, and this occurs when the direction is bidirectional, network start, network end, or non-flowing connection. A special value of "-9998" means that a value could not be specified because the stream level was not specified for one or both reaches that are associated by the flow relation. The values are stored with the flow relations in a data element named "DELTA_LVL". ![]() Figure 13. Difference in stream levels encoded with flow relations. Metadata and digital update units![]() Metadata, or data about data, are data that describe the content, quality, condition, and other characteristics of data. Metadata answer questions such as "How current are these data?"; "How accurate are they?"; "Are there any restrictions on their use?"; "What is their coordinate system?"; and many others. Metadata help organizations manage data, advertise and share data, and make informed use of data. Metadata for the NHD use data elements from the "Content Standards for Digital Geospatial Metadata" (Federal Geographic Data Committee, 1994). The standard allows the identity, quality, spatial data organization and reference, entity and attribute definitions, distribution sources and forms, and metadata of the data to be documented. The metadata are provided as text files. For the NHD, a general set of metadata accompanies each set of data. This NHD (for National Hydrography Dataset) metadata provides general information that applies to all data. Within the NHD, values differ for metadata elements such as currentness and quality. For example, features originate from more than 1,800 datasets of different vintages, and reaches from more than 2,100 datasets. In the future, many organizations will collaborate to maintain and improve the data. These new data will differ in currentness, source, accuracy, and other characteristics. A single document that provides metadata for these dynamic and varied data would be either very general or very unwieldy. Digital update units associate specific metadata information with selected features and reaches. A digital update unit is a collection of one or more features and (or) reaches to which a set of metadata values applies. These values include only those needed to describe unique aspects of the associated features or reaches. A feature or reach may be a member of one or more digital update units. Metadata associated with digital update units supplement information provided in the NHD metadata. These digital update unit metadata amplify, and in some cases replace, the more general information provided in the NHD metadata. In the initial release of the NHD, there are two types of digital update units; subbasin and quadrangle digital update units (see Figure 14):
![]() Figure 14. Digital update units associate metadata with a set of features and (or) reaches. Metadata associated with each digital update unit amplify or replace information provided as NHD metadata. As development and improvement of the data continue, additional digital update units will be created. These may have collections of features, reaches, and shapes different from the digital update units in the initial release. These new digital update units also may have metadata elements different from those included in the digital update units in the initial release. Digital update units are assigned unique identifiers and stored in data elements named "DUU_ID". Geographic names![]() A geographic name is "the proper name, specific term, or expression by which a particular geographical entity is, or was, known" (Orth and Payne, 1997, p. 43). Geographic names designated as being official for Federal use are encoded for many reaches and features. These names were taken from the National Geographic Names Database of the Geographic Names Information System8 (USGS, 1995), the Federal Government's primary source for identifying official names. Reaches and features can carry geographic names (see Figure 15 and Table 2). Reaches carry geographic names most often. If there is no reach to carry a name, then the feature carries the name. Appendix A lists the types of features that may carry names. Geographic names are stored in data elements named "NAME". In addition, the eight-digit identifier for a name in the Geographic Names Information System is stored in data elements named "GNIS_ID".
![]() Figure 15. Geographic names applied to transport and waterbody reaches. Note that geographic names have not been applied to all reaches and features. Names are applied to transport and coastline reaches that were named in the USEPA's Reach File version 3 (see Appendix D). Names are also applied to waterbody reaches and features delineated using points or areas in cases where batch processes can reliably associate the name with the reach or feature. Table 2. Reaches and features that can carry geographic names (Notes for table: (1) Type: The table lists all reach and feature types defined for the NHD. Note that not all types have been collected in the initial release. (2) Geographic name: An "X" in the column indicates that a geographic name may be associated with reaches or features of this type. Note that geographic names have not been encoded for many reaches and features. (3) An asterisk (*) means that the feature or names are not in the initial release of the NHD. (4) Coastline reaches carry the name of the water feature that they bound. (5) For the State of Washington, waterbody reaches carry the names of the features types ice mass, lake/pond, reservoir, and swamp/marsh. Elsewhere, waterbody reaches carry the names of the features type lake/pond only.)
Characteristics of domestic geographic names (Adapted from "Principles, policies, and procedures: domestic geographic names" (Orth and Payne, 1997, p. 3); "The national gazetteer of the United States of America: concise: (USGS, 1990, p. xi); and "Geographic Names Information System" (USGS, 1995, p. 7-8).) Geographic names normally originate in and are influenced by spoken language. It is important to remember this fact because many people are concerned with written forms of names, including matters of spelling, capitalization, word form, and writing marks, that may have little to do with the way names are spoken. Geographic names in the United States most often reflect English, French, and Spanish naming traditions. Most geographic names are binomial in that they have two parts, denoting specific and generic information. The generic part tells the kind of place, feature, or area to which the name refers and usually is a single topographic term, such as brook, creek, lake, spring, or river. The specific part uniquely identifies the particular place, feature, or area, and it may consist of one or more words. For example:
The binomial (two-part) form of specific and generic information is strong and in written usage often leads to combining words in the specific part of the name, such as Threemile Run and Redhill Gulch. The names of some features can be long, especially if the specific part is a prepositional phrase: Cliffs of the Seven Double Pillars, Foot of the Mountain Run, and Cañon del Rajadero de los Negros. Some geographic names have a "false" generic that does not describe the feature correctly. For features of the type stream/river, common false generics are terms for depressions in the Earth's surface (such as "draw", "gully" or "gulch") that usually are not defined as applying to water flowing through the depression. Some names have rare generic forms. Examples include colorful American names such as Bowl of Tears (lake), Butlers Toothpick (pinnacle rock), Titans Piazza (hill), and Devils Racepath (ridge). Among variations of the binomial form are one-word names that require a capitalized article: The Canal, The Lagoon, The Lakes, and La Laguna. Entry conventions for geographic names The entries for geographic names include uppercase and lowercase alphabetic, numeric, and punctuation and other special characters. Most names are entered in the way they are commonly written; for example "Adams Creek" and "Green Lake". Exceptions include the following:
Common identifier![]() The common identifier is a 10-digit integer value that uniquely identifies each feature or reach. Each value occurs only once throughout the Nation. Each feature and reach is assigned an identifier, stored in the data element named "COM_ID". Once assigned, the identifier is associated permanently with its feature or reach and should not be altered. When a feature or reach is deleted, its identifier is retired. Common identifiers serve several purposes. They are the basis for relating features and reaches for several purposes, including the following:
Common identifiers are also the basis for tracking and sharing deletions, additions, and modifications of features, reaches, and relations. Coordinates and related measures![]() Positions are encoded using a common coordinate referencing system. Other measurements, including lengths, areas, and selected surface elevations, also are encoded. Horizontal coordinate referencing system The locations of points, lines, and the boundaries of areas are encoded using geographic (longitude-latitude) coordinates. The coordinates are encoded in decimal degrees, with west longitude and south latitude represented by negative values. The horizontal datum is the North American Datum of 1983 (NAD83). Lengths of linear features and areas of areal features are supplied for convenience in applications that require these measures. The coordinate data were projected from longitude-latitude coordinates to the Albers Equal Area projection, and lengths and areas were calculated and saved. The parameters for the projected coordinate systems are shown below:
The North American Datum of 1983 was specified for the projected coordinate system. The distance units of the projected system were meters. Lengths are computed in meters and stored in data elements named "METERS". Areas are computed in square kilometers and stored in data elements named "SQ_KM". The elevation of water surfaces where water pools is encoded for a few features. Water surface elevations may be encoded for features of the type area to be submerged, canal/ditch, inundation area, lake/pond, reservoir, and stream/river. Note that these elevations have not been entered for most features. The vertical datum for the conterminous United States and Alaska is the National Geodetic Vertical Datum of 1929. For Puerto Rico and the Virgin Islands, the datum is Local Mean Sea Level. Elevations are recorded in meters and stored in data elements named "ELEV". Two encoded values have special meanings. A value of -9999 means that an elevation value will not be applied to the feature. A value of -9998 means that a value can be applied to the feature but has not been specified. In addition to the elevation, the prevailing condition (for example, "average water elevation") to which the elevation applies is provided. The condition is stored as a text string in data elements named "STAGE". Where the "ELEV" field has a value of -9999 or -9998, the "STAGE" field is blank. Data Quality![]() The NHD resulted from a multiyear effort to process and integrate datasets containing delineations and classification of features, reaches, hydrologic units, and geographic names. This section provides an overview of the production processes, sources of information used in these processes, and other statements related to data quality. This section describes what is "generally true" about the initial release of the data for the conterminous United States, Hawaii, Puerto Rico, and the Virgin Islands. The NHD, even in its initial release, has variations in currentness, processes, and other characteristics. It is important to note that, even before the initial release of the data, collaborative efforts are under way to correct and improve the data. Completion of these efforts will contradict even these "generally true" statements for places where the improvements are made. It is important to review the metadata to understand the condition of data. The discussion below applies to the conterminous United States (excluding the Pacific Northwest10), Hawaii, Puerto Rico, and the Virgin Islands. Figure 16 illustrates the information sources and processes used to develop the data. ![]() Figure 16. Information sources and processes used to create the NHD. Sources of information used to construct the initial release of the NHD include:
Efforts are under way to improve the NHD; most of these involve the collecting data for the conterminous United States from 1:24,000-scale maps, and digital images with positional accuracies commensurate with 1:12,000 and larger scales. Currentness varies by individual maps; see digital update units for more information.
This process converted DLG data to features and associated characteristics and converted the coordinate system to geographic (longitude-latitude) coordinates in the NAD83 in five steps:
This process generated the "features" data. The basic steps for building reaches are as follows:
The accuracy of the attributes of the DLG data is estimated to be 98.5 percent. One or more of the following methods was used to test attribute accuracy:
In addition, software was used to validate feature types and characteristics against a master set of types and characteristics, to check that combinations of types and characteristics were valid, and to check that types and characteristics were valid for the delineation of the feature. Feature types, characteristics, and other attributes conform to the "Standards for National Hydrography Dataset" (USGS, 1999) as of the date they were loaded into the database. The entry and identifier for geographic names match those in the Geographic Names Information System as of March 1999. The association of each name to reaches has not been methodically checked, and so a name may be applied to the wrong reaches. Anecdotal reviews indicate that 80 percent or more of the named reaches have the correct name. Reaches were delineated with a batch procedure and were checked extensively during the "visual pass" steps of processing. According to automated quality assurance/quality control checks performed at various intervals during the processing, it is estimated that approximately 99 percent of the reaches are delineated according to standards. Points, nodes, lines, and areas conform to topological rules. Lines intersect only at nodes, and all nodes anchor the ends of lines. Lines do not "overshoot" or "undershoot" other lines where they are supposed to meet. There are no duplicate lines within a dataset. Lines bound areas, and lines identify the areas to the left and right of the lines. Gaps and overlaps among areas do not exist. All areas close. The completeness of the data reflects the content of the sources, which in the initial release of the NHD, are most often USGS topographic maps. Features found on the ground may have been eliminated or generalized on the source graphic because of scale and legibility constraints. In general, streams longer than 1mile (approximately 1.6 kilometers) were collected. Most streams that flow from a lake were collected regardless of their length. Only definite channels were collected, so not all swamp/marsh features have stream/rivers delineated through them. Lake/ponds having an area greater than 6 acres (approximately 2.4 hectares) were collected. Note, however, that these general rules were applied unevenly among maps during compilation. Some map quadrangles have a much sparser pattern of hydrography than do adjoining maps, and these differences continue in the digital rendition of these features. Rectification of these differences is a priority for maintenance of the NHD. Transport reaches are defined on almost all of the features of types stream/river, canal/ditch, pipeline, artificial path, and connector. Waterbody reaches are defined on the subset of lake/pond features that were identified as waterbodies during the development of Reach File Version 3. Most attention in applying geographic names was given to transport reaches that follow stream/rivers and to waterbody reaches. Near the international boundaries with Canada and Mexico, only the parts of features within the United States are delineated. Detailed capture conditions are provided for every feature type in the "Standards for National Hydrography Dataset" (USGS, 1999), available online through http://mapping.usgs.gov/standards/. Horizontal positional accuracy Statements of horizontal positional accuracy are based on accuracy statements made for USGS topographic quadrangle maps. These maps were compiled to meet National Map Accuracy Standards. For horizontal accuracy, this standard is met if at least 90 percent of points tested are within 0.02 inch (at map scale) of their true positions12. Additional offsets to positions may have been introduced where there are many features to improve the legibility of map symbols. In addition, the digitizing of maps is estimated to contain a horizontal positional error of less than or equal to 0.003-inch standard error (at map scale) in the two component directions relative to the source maps. Visual comparison between the map graphic (including digital scans of the graphic) and plots or digital displays of points, lines, and areas is used to assess the positional accuracy of digital data. Linear features of the same type along the adjoining edges of datasets are aligned if they are within a 0.02-inch tolerance (at map scale). To align the features, the midpoint between the end of the corresponding features is computed, and the ends of features are moved to this point. Features outside the tolerance are not moved; instead, a feature of the type connector was added to join the features. Statements of vertical positional accuracy for elevation of water surfaces are based on accuracy statements made for USGS topographic quadrangle maps. These maps were compiled to meet National Map Accuracy Standards. For vertical accuracy, this standard is met if at least 90 percent of well-defined points tested are within one-half contour interval of the correct value. Elevations of water surface printed on the published map meet this standard; the contour intervals of the maps vary. These elevations were transcribed into the digital data; the accuracy of this transcription was checked by visual comparison between the data and the map. Similar approaches are being used to develop data for the Pacific Northwest and Alaska. Specific statements will be provided when these data are released. Glossary(The main sources for the definitions include Darcy and Boston (1988); Federal Geographic Data Committee (1994, 1997); Merriam-Webster (1998); Padmanabhan, Yoon, and Leipnik (1992); and Stamp and Clark (1979).) ARC/INFO ® 13 - Geographic information systems software developed by the Environmental Systems Research Institute of Redlands, California. area - A generic term for a bounded, continuous, two-dimensional object that may or may not include its boundary. See line, node, and point. cataloging unit - Currently known as subbasin. characteristic - A distinguishing trait, quality, or property. coastline - A line that follows the main outline of the land, including bays, but crosses rivers at their mouths. In the NHD, the outlines of selected coastal islands are included as part of the coastline. See shoreline. coastline reach - A reach that represents a section of coastline. common identifier - A 10-digit integer value that uniquely identifies each feature or reach in the NHD. conflation - A process by which two sets of map data for the same region may be aligned and merged on the basis of matches of corresponding features portrayed in both sets. confluence - Flowing together; the junction and union of two or more streams or moving fluids. In the NHD, this idea has been generalized to allow the feature types of artificial path, canal/ditch, connector, pipeline, and stream/river to meet at a confluence. See divergence. conterminous - Having a common boundary; enclosed within a common boundary. conterminous United States - the lower 48 States and the District of Columbia. contiguous - Touching along a boundary or at a point; touching or connected throughout in an unbroken sequence. contour - An imaginary line on the ground, all points of which are at the same elevation. contour interval - The difference in elevation between two adjacent contours. coordinate reference system - A set of points, lines, and (or) surfaces and a set of rules that creates a reference frame whereby each point in a given surface can be identified uniquely by a set of numbers. coordinates - Linear and (or) angular quantities that describe the location of a point in relation to a given coordinate reference system. coverage - A digital version of a map forming the basic unit of vector data storage in ARC/INFO ® . datum - Any quantity, or set of quantities, that may serve as a reference or basis for the calculation of other quantities. In relation to mapping and geographic information systems, datum usually refers to a set of quantities that serve as a reference for the calculation of positions. See horizontal datum, linear referencing system, and vertical datum. decimal degree - Representation of the measure of an angle using whole and decimal fractions of a degree. decimal degree = degrees + (minutes/60) + (seconds/3600). See latitude, longitude. degree - An angular unit of measure equal to 1/360th of the circumference of a circle. See decimal degree, latitude, and longitude. delineate - To mark the outline of; to describe, portray, or set forth with accuracy or in detail. delineation - The act of delineating; something made by delineating. diacritical mark - A mark near or through a character or combination of characters that indicates a phonetic value (a spoken sound) different from that given the unmarked or otherwise marked character. digital update unit - A collection of one or more features and (or) reaches to which a set of metadata elements apply. divergence - Flowing apart; the junction and separation of a stream or moving fluids into two or more paths. In the NHD, this idea has been generalized to allow the feature types of artificial path, canal/ditch, connector, pipeline, and stream/river to meet at a divergence. See confluence. elevation - The vertical distance above or below a vertical datum to a point or object on the Earth's surface. entity - Generally, something that has separate and distinct existence and objective or conceptual reality. In a database, an object of interest about which data can be collected. feature - A defined entity and its representation. In the NHD, features include naturally occurring and constructed bodies of water, paths through which water flows, and related entities. These spatial phenomena are classified into defined feature types, are described by additional characteristics, and are delineated in standard ways. feature code - A numeric value that encodes the type and values for a set of characteristics of a feature. This five-digit code has two parts: the first three digits encode the feature type; the last two digits encode values for a set of characteristics associated with the feature. feature type - A member of a classification scheme for features. For example, in the NHD, features can be classified by types such as canal/ditch, lake/pond, and stream/river. field - A single piece of information, the smallest unit normally manipulated by a database management system. geocode - A location identifier. More specifically, a geocode is a data value assigned to a spatial object that encodes the location of the object (or a place along the object). A geocode can be used to associate other data with the object (or a place along the object). The term geocode also denotes the process of assigning a location identifier to an object. In the NHD, reach codes provide a starting point for geocoding observations to or along hydrographic features. geographic name - The proper name, specific term, or expression by which a particular geographical entity is, or was, known. head - The source of a stream. horizontal datum - A set of constants to which horizontal coordinates are referred; a reference for position. hydrography - (1) The science comprising the description, study, and mapping of the waters of the Earth's surface (the seas, lakes, rivers, and so on), including their forms and physical features. (2) The subject matter of this science, the hydrographic features of the globe or part of it; the distribution of water on the Earth's surface. hydrologic unit - A member of the hierarchical system for identifying and subdividing river-basin units of the United States. Hydrologic units are used for the collection and organization of hydrologic data. The levels of the hierarchy, listed in order of largest to smallest in area, are region, subregion, accounting unit, and subbasin. For example: region - New England region: the drainage within the United States that ultimately discharges into: (a) the Bay of Fundy, (b) the Atlantic Ocean within and between the States of Maine and Connecticut, (c) Long Island Sound north of the New York-Connecticut state line, and (d) the Riviere St. Francois, a tributary of the St. Lawrence River. subbasin - The fourth level of subdivision of hydrologic units. A subbasin represents the geographic area of part or all of a surface drainage basin, a combination of drainage basins, or a distinct hydrologic feature. Subbasins are uniquely identified with an eight-digit hydrologic unit code. -subregion - Connecticut: the Connecticut River basin. accounting unit - Lower Connecticut: the Connecticut River basin below Vernon Dam. subbasin - Chicopee. Each hydrologic unit is identified uniquely with a hydrologic unit code. hydrologic unit code - A hierarchical, numeric code that uniquely identifies hydrologic units. The first two digits identify the region, the first four digits identify subregions, the first six digits identify accounting units, and the full eight digits identify subbasins. For example, from the example provided with the definition of hydrologic unit, the hydrologic unit codes are: 01 - the region (New England) 0108 - the subregion (Connecticut) 010802 - the accounting unit (Lower Connecticut) 01080204 - the subbasin (Chicopee) Zeroes in the two-digit accounting unit field indicate that the accounting unit and the subregion are the same. Zeroes in the two-digit subbasin field indicate that the subbasin and the accounting unit are the same. latitude - Distance north or south of the Equator, measured as an angle with the center of the Earth. In the NHD, latitude values are encoded in decimal degrees. See longitude. level path - A sequence of transport reaches that trace the main stem for a given flow of water. line - A generic term for a one-dimensional object having a length and direction. See area, node, and point. linear referencing system - A set of datums, networks, and linear referencing methods, whereby each point along a network can be identified uniquely by specifying the direction and distance from any known point on the network. longitude - Distance east or west on the Earth's surface, measured by the angle that the meridian of a particular place makes with the Prime (Greenwich) Meridian. In the NHD, longitude values are encoded in decimal degrees. See latitude. lookup table - A table that provides the ability to use a known value to locate an unknown value. In the NHD, the feature code links features to the textual descriptions of their associated characteristics. map scale - The relationship between a distance on a map, chart, or photograph and the corresponding distance on the Earth. For map scales commonly associated with the NHD, see the following table:
metadata - Data about data; data that describe the content, quality, condition, and other characteristics of data. mouth - The place where a stream enters a larger body of water. node - A zero-dimensional object that is the topological junction of two or more lines, or an end point of a line. See area, line, and point. point - A zero-dimensional object that specifies geometric location. See area, line, and node. quadrangle - A four-sided area, bounded by parallels of latitude and meridians of longitude, used as an areal unit in mapping. reach - A continuous unbroken stretch or expanse of surface water. In the NHD, this idea has been expanded to define reach as a significant segment of surface water that has similar hydrologic characteristics. Reaches have standard types and delineations. See coastline reach, shoreline reach, transport reach, and waterbody reach. reach code - A numeric code that uniquely identifies a reach. This 14-digit code has 2 parts: the first 8 digits are the hydrologic unit code for the subbasin in which the reach is located; the last 6 digits are a sequentially ordered, arbitrarily assigned number. relation - In a database, a named association among sets of entities. shoreline - The line where water and land meet. In the NHD, shoreline applies to inland waters only. See coastline. shoreline reach - A reach that represents all or part of a shoreline. This type of reach has not been implemented. stage - The elevation of the surface of a body of water measured relative to a vertical datum. subbasin - The fourth level of subdivision of hydrologic units. A subbasin represents the geographic area of part or all of a surface drainage basin, a combination of drainage basins, or a distinct hydrologic feature. Subbasins are uniquely identified with an eight-digit hydrologic unit code. terminus (plural termini) - A finishing point; a part that forms the end. topology - A branch of geometrical mathematics concerned with order, contiguity, and relative position rather than actual linear dimensions. Topology is used to establish and describe spatial relationships among features. transport reach - A reach that represents the pathway for the flow of water through a drainage network. traverse - To move or pass along. traversal - An instance of traversing. tributary - A stream or river that flows into a larger one. In the NHD, this idea has been generalized to allow transport reaches with underlying feature types of artificial path, canal/ditch, connector, pipeline, and stream/river to have or be tributaries. vertical datum - A set of constants specifying a coordinate system to which elevations are referred. waterbody - A hydrographic feature that is delineated using areas. waterbody reach - A reach that represents a waterbody. (In contrast, a transport reach represents the flows of water through such areas.) workspace - A directory containing geographic datasets for use with ARC/INFO ® . ReferencesAllord, G.J., 1992, 1 to 2,000,000 hydrologic unit map of the conterminous United States (digital dataset): Reston, VA, U.S. Geological Survey, http://water.usgs.gov/GIS/huc.html. Darcy, L., and Boston, L., comps., 1988, Webster's new world dictionary of computer terms (3d ed.): New York, Simon & Schuster, Inc. Federal Geographic Data Committee, 1994, Content standards for digital geospatial metadata: Washington, Federal Geographic Data Committee, 66 p. [http://www.fgdc.gov/metadata/metadata.html] Federal Geographic Data Committee, 1997, Framework introduction and guide: Washington, Federal Geographic Data Committee, 105 p. [http://www.fgdc.gov/framework/frameworkintroguide/] Merriam-Webster, 1998, WWWebster: Merriam-Webster's collegiate dictionary (10th ed.) [Online]: Springfield, MA, Merriam-Webster, Inc., http://www.m-w.com/netdict.htm. National Geodetic Survey, [n.d.], NADCON version 2.1 [Software]: Silver Spring, MD, National Geodetic Survey. [Information about the current version of the software is available through http://www.ngs.noaa.gov/PC_PROD/pc_prod.shtml.] Orth, D., and Payne, R., 1997, Principles, policies, and procedures: domestic geographic names: Reston, VA, U.S. Geological Survey, 51 p. [http://geonames.usgs.gov/bgn.html] Padmanabhan, G., Yoon, J., and Leipnik, M., comps., 1992, A glossary of GIS terminology: National Center for Geographic Information and Analysis Technical Report 92-13, 79 p. Seaber, P., Kapinos, F.P., and Knapp, G., 1987, Hydrologic unit maps: U.S. Geological Survey Water-Supply Paper 2294, 63 p., 1 pl. (Reprinted 1994.) Stamp, L.D., and Clark, A., eds., 1979 [1981], A glossary of geographical terms (3d ed.): New York, Longman, 571 p. U.S. Environmental Protection Agency, 1994, The U.S. EPA reach file version 3.0 Alpha release (RF3-Alpha) technical reference [Online], http://www.epa.gov/waters/doc/techref.html U.S. Geological Survey, 1990, The national gazetteer of the United States of America: concise: U.S. Geological Survey Professional Paper 1200-US, 526 p. U.S. Geological Survey, 1995, Geographic names information system: U.S. Geological Survey data users guide 6, 29 p. [http://geonames.usgs.gov/gnis_users_guide_toc.html] U.S. Geological Survey, 1998, Hydrologic unit maps [Online], http://water.usgs.gov/lookup/get?huc. U.S. Geological Survey, 1999, Standards for National Hydrography Dataset: Reston, VA, U.S. Geological Survey, http://mapping.usgs.gov/standards/ U.S. Geological Survey, 1999, National Atlas, Hydrologic unit boundaries [Online], http://www.nationalatlas.gov/atlasftp.html .
Appendix AOverview of features, reaches, and related items.Table 1A. Feature types and related items Notes on table: (1) Feature type: The table lists all feature types and characteristics defined for the NHD. Note that not all feature types have been collected. (2) Geo(graphic) name: An X in the column indicates that a geographic name may be associated with features of this type. Note that geographic names have not been encoded for many features. (3) For the State of Washington, waterbody reaches carry the geographic names of lake/pond, reservoir, ice mass, and swamp/marsh. (4) Surface elevation: An X in the column indicates that the elevation of the surface of the water may be associated with the features of this feature type. Note that the surface elevations have not been encoded for most features. (5) An asterisk (*) means that the item is not in the initial release of the NHD.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||