USGS - science for a changing world
National Park Service Home Page.

Biology - Biological Informatics Program - Vegetation Characterization Program

Accuracy Assessment FAQ Metadata

Frequently-anticipated questions:


What does this data set describe?

Title: Delaware Water Gap National Recreation Area Vegetation Mapping Project - Accuracy Assessment Points Modified
Abstract:
Location of thematic accuracy assessment sampling points used in the vegetation and fire fuel model mapping in Delaware Water Gap National Recreation Area. In this data set, the X, Y coordinates for five plots have been altered to protect a rare species that could be vulnerable to collection.
  1. How should this data set be cited?

    North Carolina State University, Center for Earth Observation, 200803, dewa_veg_aa_2006 - Modified.

    Online Links:

    This is part of the following larger work.

    Pennsylvania Natural Heritage Program, 20060831, Vegetation Classification and Mapping of Vegetation and Fire Fuel Models at Delaware Water Gap National Recreation Area.

  2. What geographic area does the data set cover?

    West_Bounding_Coordinate: -77.280351
    East_Bounding_Coordinate: -77.155395
    North_Bounding_Coordinate: 39.855883
    South_Bounding_Coordinate: 39.774649
    Description_of_Geographic_Extent: Delaware Water Gap National Recreation Area and environs

  3. What does it look like?

  4. Does the data set describe conditions during a particular time period?

    Calendar_Date: 31-Aug-2006
    Currentness_Reference: 20060831

  5. What is the general form of this data set?

    Geospatial_Data_Presentation_Form: vector digital data

  6. How does the data set represent geographic features?

    1. How are geographic features stored in the data set?

      This is a Vector data set. It contains the following vector data types (SDTS terminology):

      • Entity point (1130)

    2. What coordinate system is used to represent geographic features?

      Grid_Coordinate_System_Name: Universal Transverse Mercator
      Universal_Transverse_Mercator:
      UTM_Zone_Number: 18
      Transverse_Mercator:
      Scale_Factor_at_Central_Meridian: 0.999600
      Longitude_of_Central_Meridian: -75.000000
      Latitude_of_Projection_Origin: 0.000000
      False_Easting: 500000.000000
      False_Northing: 0.000000

      Planar coordinates are encoded using coordinate pair
      Abscissae (x-coordinates) are specified to the nearest 0.000016
      Ordinates (y-coordinates) are specified to the nearest 0.000016
      Planar coordinates are specified in meters

      The horizontal datum used is North American Datum of 1983.
      The ellipsoid used is Geodetic Reference System 80.
      The semi-major axis of the ellipsoid used is 6378137.000000.
      The flattening of the ellipsoid used is 1/298.257222.

  7. How does the data set describe geographic features?

    dewa_veg_aa_2006
    attributes of shapefile table (Source: Pennsylvania Natural Heritage Program)

    FID
    Internal feature number. (Source: ESRI)

    Sequential unique whole numbers that are automatically generated.

    Shape
    Feature geometry. (Source: ESRI)

    Coordinates defining the features.

    AA_OBS_ID
    accuracy assessment point number (Source: NPS four letter park code plus unique number)

    NPS four letter park code plus unique number

    crd_alt
    Field used to indicate whether X, Y coordinates for the plot have been altered to protect a rare species that could be vulnerable to collection. (Source: North Carolina State University)

    ValueDefinition
    NoThe X, Y coordinates for the plot have not been altered.
    YesThe X, Y coordinates for the plot have been altered.

  8. What biological taxa does this data set concern?

    Taxonomy:
    Keywords/Taxon:
    Taxonomic_Keyword_Thesaurus: None
    Taxonomic_Keywords: Standardized National Vegetation Classification System
    Taxonomic_Keywords: vegetation classification
    Taxonomic_Keywords: alliance
    Taxonomic_Keywords: community association
    Taxonomic_Classification:
    Taxon_Rank_Name: Kingdom
    Taxon_Rank_Value: Plantae


Who produced the data set?

  1. Who are the originators of the data set? (may include formal authors, digital compilers, and editors)

    • North Carolina State University, Center for Earth Observation

  2. Who also contributed to the data set?

    Pennsylvania Natural Heritage Program, Western Pennsylvania Conservancy

  3. To whom should users address questions about the data?

    Pennsylvania Natural Heritage Program
    Ecologist
    208 Airport Drive
    Middletown, Pennsylvania 17055
    USA

    (717) 948-3962 (voice)
    (717) 948-3957 (FAX)


Why was the data set created?

To identify the locations of sampling points used to assess the thematic accuracy of the vegetation association, fire behavior fuel model, and canopy cover class maps in Delaware Water Gap National Recreation Area


How was the data set created?

  1. From what previous works were the data drawn?

  2. How were the data generated, processed, and modified?

    Date: 31-Aug-2006 (process 1 of 2)
    The number and distribution of sampling points was determined according to the USGS/NPS Vegetation Mapping protocols. A stratified random sampling approach was used, distributing the sampling effort across the preliminary vegetation association map. Since the vegetation association map contained many more types to be assessed than the fire fuel model or the canopy cover class map, one set of random sampling points could be used to assess the accuracy of all three maps. This combined sampling strategy was inherently more efficient and resulted in adequate sampling of the vegetation map and over sampling of the fire fuel model and canopy cover class map.

    In this stratified random sampling design, the sampling effort was distributed across the preliminary vegetation associations. For the thematic accuracy assessment sampling strategy, polygons labeled as a mosaic of two vegetation associations were lumped with the least abundant of the component associations. Since mosaics are not unique associations themselves, they should not be sampled separately in the accuracy assessment. Lumping the mosaics with the least abundant association favored additional sampling in less abundant associations, thus increasing the sampling power.

    In the stratified random sampling approach, the number of samples per association varied according to the rarity of the vegetation association, both in terms of number of polygons and polygon size. The following rules were used to determine the number of points assigned to each association:

    Scenario A: The class is abundant. It covers more than 50 hectares in total area and consists of at least 30 polygons. In this case, it is recommended that 30 polygons be selected at random from the set of the association's polygons. One sampling point will be assigned to each of the 30 selected polygons.

    Scenario B: The association is relatively abundant. It covers more than 50 hectares in total area but consists of fewer than 30 polygons. In this case, it is recommended that 20 polygons be selected at random from the set of the association's polygons, and that one sampling point be assigned to each of the 20 selected polygons. If the association contains less than 20 polygons, some polygons will contain multiple sampling points. The number of sampling points assigned to each polygon is determined by the relative area of that polygon compared with the other polygon in that association.

    Scenario C: The association is relatively rare. It covers less than 50 hectares in total area but consists of more than 30 polygons. In this case, it is recommended that 20 polygons be selected at random from the set of the association's polygons. One sampling point will be assigned to each of the 20 selected polygons.

    Scenario D: The class is rare. It has 5 - 30 polygons and covers less than 50 hectares. In this case, it is recommended that five polygons be selected at random from the set of the association's polygons. One sampling point will be assigned to each of the five selected polygons.

    Scenario E: The association is very rare. It has fewer than five polygons and occupies less than 50 hectares of the total area. In this case, it is recommended that one sampling point be assigned to each polygon.

    In order to randomly select the polygons in Scenarios A, B, C, and D, the Create Random Selection tool in Hawth's Analysis Tools was used in ArcGIS. For all of the scenarios, the Generate Random Points tool in Hawth's Analysis Tools was used to randomly determine the location of the sampling points in the polygon. If the randomly selected polygon or point fell on inaccessible privately-owned lands, the point was reassigned to a randomly selected polygon of the same association that fell on publicly-owned land. This resulted in the creation of 1,130 thematic accuracy assessment sampling points.

    These sampling points were also used to assess the thematic accuracy of the fire behavior fuel model map and the canopy cover class map. Because the vegetation association map contained many more types than either the fuel model or the cover class maps, the sampling design for the vegetation association map was also sufficient for the fire behavior fuel model map and the canopy cover class map. Even though the sampling design was created for the vegetation association map, the design distributed the sampling points across the fuel model and cover class map units proportionally to the number of polygons and the total mapped hectares of the units. For the fire behavior fuel models, five of the 12 models were oversampled while seven of the 12 models were undersampled according to the USGS/NPS Vegetation Mapping Program protocol. For the canopy cover class map, all 4 classes were oversampled, yielding approximately 3 to 15 times the number of points recommended by the USGS/NPS Vegetation Mapping Program protocol.

    From July through September 2005 and April through July 2006, each accuracy assessment point was located in the field using one of the following GPS units: Trimble Pocket Pathfinder attached to a Beacon-on-a-Belt with a Compaq Ipaq Pocket PC interface, or Garmin GPS map 76 WAAS enabled.

    Date: 05-Mar-2008 (process 2 of 2)
    The accuracy assessment plots shapefile was altered to create a shapefile suitable for public distribution. The X, Y coordinates for each plot that contains a rare plant species that could be vulnerable to collection were changed to the coordinates for the park centroid.

    Coordinates were altered in this manner for five plots. As a result, it is possible that some or all of the points now fall outside a vegetation polygon or outside the park boundary. A field named "crd_alt" was added to the shapefil attribute table; the value for this field is "no" if the plot coordinates have not been altered and "yes" if the plot coordinates were altered.

  3. What similar or related data should the user be aware of?


How reliable are the data; what problems remain in the data set?

  1. How well have the observations been checked?

  2. How accurate are the geographic locations?

  3. How accurate are the heights or depths?

  4. Where are the gaps in the data? What is missing?

    Each accuracy assessment point is representaed by one point in the shapefile. All points fall within vegetation association polygons.

  5. How consistent are the relationships among the observations, including topology?

    ESRI point geometry.


How can someone get a copy of the data set?

Are there legal restrictions on access or use of the data?

Access_Constraints: none
Use_Constraints: none

  1. Who distributes the data set? (Distributor 1 of 1)

    USGS/CSS/Core Science Analytics and Synthesis
    c/o USGS-NPS Vegetation Mapping Program Coordinator
    Denver Federal Center, Building 810, Room 8000, MS 302
    Denver, Colorado 80225

    (303) 202-4220 (voice)
    303-202-4219 (FAX)
    gs-b-npsveg@usgs.gov

  2. What's the catalog number I need to order this data set?

    Downloadable Data

  3. What legal disclaimers am I supposed to read?

    The U.S. Geological Survey and the National Park Service shall not be held liable for improper or incorrect use of the data described and/or contained herein. These data and related graphics (if available) are not legal documents and are not intended to be used as such.

    The information contained in these data is dynamic and may change over time. The data are not better than the original sources from which they were derived. It is the responsibility of the data user to use the data appropriately and consistent within the limitations of geospatial data in general and these data in particular. Any related graphics (if available) are intended to aid the data user in acquiring relevant data; it is not approriate to use the related graphics as data.

    The U.S. Geological Survey and the National Park Service gives no warranty, expressed or implied, as to the accuracy, reliability, or completeness of these data. It is strongly recommended that these data are directly acquired from an U.S. Geological Survey and National Park Service servers and not indirectly through other sources which may have changed the data in some way. Although these data have been processed successfully on a computer system at the U.S. Geological Survey, no warranty expressed or implied is made regarding the utility of the data on another system or for general or scientific purposes, nor shall the act of distribution constitute any such warranty. This disclaimer applies both to individual use of the data and aggregate use with other data. It is also strongly recommended that careful attention be paid to the contents of the metadata file associated with these data. Mention of trade names or commercial products in this metadata report does not constitute endorsement or recommendation for use by the U. S. Department of the Interior, U. S. Geological Survey and National Park Service.

  4. How can I download or order the data?


Who wrote the metadata?

Dates:
Last modified: 21-Mar-2008
Last Reviewed: 19-May-2010
Metadata author:
USGS/CSS/Core Science Analytics and Synthesis
c/o USGS-NPS Vegetation Mapping Program Coordinator
Denver Federal Center, Building 810, Room 8000, MS 302
Denver, Colorado 80225

(303) 202-4220 (voice)
303-202-4219 (FAX)
gs-b-npsveg@usgs.gov

Metadata standard:
FGDC Biological Data Profile of the Content Standard for Digital Geospatial Metadata (FGDC-STD-001.1-1999)
Metadata extensions used:


Generated by mp version 2.9.12 on Wed May 19 14:15:25 2010

Return to USGS Vegetation Characterization Program at Delaware Water Gap National Recreation Area
USGS Home :: Geology :: Geography :: Water
Accessibility FOIA Privacy Policies and Notices
Take Pride in America logo USA.gov logo U.S. Department of the Interior | U.S. Geological Survey
URL: biology.usgs.gov/npsveg/dewa/metadewaaa.faq.html
Page Contact Information:gs-b-npsveg@usgs.gov
Page Last Modified: Thursday, 12-Jan-2012 07:22:41 MST