<?xml version="1.0" encoding="UTF-8"?>
<metadata>
  <idinfo>
    <citation>
      <citeinfo>
        <origin>Phillipe A. Wernette</origin>
        <origin>Daniel D. Buscombe</origin>
        <origin>Sharon Fitzpatrick</origin>
        <origin>Jaycee Favela</origin>
        <origin>Nicholas Enwright</origin>
        <origin>Evan Goldstein</origin>
        <origin>Erin Dunand</origin>
        <pubdate>20220319</pubdate>
        <title>Coast Train--Labeled imagery for training and evaluation of data-driven models for image segmentation</title>
        <geoform>list of images and details in csv format; imagery in NumPy binary file format</geoform>
        <serinfo>
          <sername>data release</sername>
          <issue>DOI:10.5066/P91NP87I</issue>
        </serinfo>
        <pubinfo>
          <pubplace>Pacific Coastal and Marine Science Center, Santa Cruz, California</pubplace>
          <publish>U.S. Geological Survey</publish>
        </pubinfo>
        <onlink>https://doi.org/10.5066/P91NP87I</onlink>
      </citeinfo>
    </citation>
    <descript>
      <abstract>Coast Train is a library of images of coastal environments, annotations, and corresponding thematic label masks (or ‘label images’) collated for the purposes of training and evaluating machine learning (ML), deep learning, and other models for image segmentation. It includes image sets from both geospatial satellite, aerial, and UAV imagery and orthomosaics, as well as non-geospatial oblique and nadir imagery. Images include a diverse range of coastal environments from the U.S. Pacific, Gulf of Mexico, Atlantic, and Great Lakes coastlines, consisting of time-series of high-resolution (≤1m) orthomosaics and satellite image tiles (10–30m). Each image, image annotation, and labelled image is available as a single NPZ zipped file. NPZ files follow the following naming convention: {datasource}_{numberofclasses}_{threedigitdatasetversion}.zip, where {datasource} is the source of the original images (for example, NAIP, Landsat 8, Sentinel 2), {numberofclasses} is the number of classes used to annotate the images, and {threedigitdatasetversion} is the three-digit code corresponding to the dataset version (in other words, 001 is version 1). Each zipped folder contains a collection of NPZ format files, each of which corresponds to an individual image. An individual NPZ file is named after the image that it represents and contains (1) a CSV file with detail information for every image in the zip folder and (2) a collection of the following NPY files: orig_image.npy (original input image unedited), image.npy (original input image after color balancing and normalization), classes.npy (list of classes annotated and present in the labelled image), doodles.npy (integer image of all image annotations), color_doodles.npy (color image of doodles.npy), label.npy (labelled image created from the classes present in the annotations), and settings.npy (annotation and machine learning settings used to generate the labelled image from annotations). All NPZ files can be extracted using the utilities available in Doodler (Buscombe, 2022). A merged CSV file containing detail information on the complete imagery collection is available at the top level of this data release, details of which are available in the Entity and Attribute section of this metadata file.</abstract>
      <purpose>Training machine learning (ML) and other models for segmentation will greatly facilitate the creation of land cover maps from geospatial imagery with greater specificity, as well as mapping coastal sediments, transient waterbodies, landforms, and other features of interest, in both geospatial and non-geospatial imagery. Coast Train adheres to the principle of ‘Map Once, Use Many Times’ and is well positioned to transfer learning across a wide range of coastal environments.</purpose>
      <supplinf>Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.</supplinf>
    </descript>
    <timeperd>
      <timeinfo>
        <rngdates>
          <begdate>20080101</begdate>
          <enddate>20201231</enddate>
        </rngdates>
      </timeinfo>
      <current>date range of imagery in library</current>
    </timeperd>
    <status>
      <progress>Planned</progress>
      <update>As needed</update>
    </status>
    <spdom>
      <bounding>
        <westbc>-180.0</westbc>
        <eastbc>180.0</eastbc>
        <northbc>90.0</northbc>
        <southbc>-90.0</southbc>
      </bounding>
    </spdom>
    <keywords>
      <theme>
        <themekt>USGS Metadata Identifier</themekt>
        <themekey>USGS:9cdb71c1-cc5a-4786-9232-93d7e7a340cf</themekey>
      </theme>
      <theme>
        <themekt>ISO 19115 Topic Category</themekt>
        <themekey>environment</themekey>
        <themekey>geoscientificinformation</themekey>
      </theme>
      <theme>
        <themekt>Data Categories for Marine Planning</themekt>
        <themekey>distributions</themekey>
        <themekey>Physical Habitats and Geomorphology</themekey>
        <themekey>habitat</themekey>
        <themekey>infrastructure</themekey>
        <themekey>structures</themekey>
      </theme>
      <theme>
        <themekt>USGS Thesaurus</themekt>
        <themekey>image analysis</themekey>
        <themekey>mitigation of coastal hazards</themekey>
        <themekey>remote sensing</themekey>
        <themekey>aerial photography</themekey>
        <themekey>multispectral imaging</themekey>
        <themekey>visible light imaging</themekey>
        <themekey>datasets</themekey>
        <themekey>geospatial datasets</themekey>
        <themekey>image collections</themekey>
        <themekey>earth sciences</themekey>
        <themekey>geography</themekey>
        <themekey>geology</themekey>
        <themekey>life sciences</themekey>
        <themekey>botany</themekey>
        <themekey>ecology</themekey>
        <themekey>social sciences</themekey>
        <themekey>biological and physical processes</themekey>
        <themekey>hazards</themekey>
        <themekey>human impacts</themekey>
        <themekey>land use change</themekey>
        <themekey>land use and land cover</themekey>
        <themekey>coastal ecosystems</themekey>
      </theme>
      <theme>
        <themekt>Marine Realms Information Bank (MRIB) keywords</themekt>
        <themekey>agents of coastal change</themekey>
        <themekey>coastal processes</themekey>
        <themekey>anthropogenic agents of coastal change</themekey>
        <themekey>coastal development</themekey>
        <themekey>coastal protection structures</themekey>
        <themekey>effects of coastal change</themekey>
        <themekey>human responses to coastal change</themekey>
        <themekey>earth system</themekey>
        <themekey>bay</themekey>
        <themekey>beach</themekey>
        <themekey>cape</themekey>
        <themekey>cliff</themekey>
        <themekey>coast</themekey>
        <themekey>coastal barrier</themekey>
        <themekey>coastal plain</themekey>
        <themekey>cove</themekey>
        <themekey>dune</themekey>
        <themekey>island</themekey>
        <themekey>lagoon</themekey>
        <themekey>lake</themekey>
        <themekey>marsh</themekey>
        <themekey>mudflat</themekey>
        <themekey>ocean</themekey>
        <themekey>shore</themekey>
        <themekey>swamp</themekey>
        <themekey>tidal flat</themekey>
        <themekey>tidal inlet</themekey>
        <themekey>beach zone communities</themekey>
        <themekey>breakwater/shoreline stabilization structure</themekey>
        <themekey>bridge</themekey>
        <themekey>canal</themekey>
        <themekey>jetty</themekey>
        <themekey>environment</themekey>
        <themekey>hazards and disasters</themekey>
        <themekey>erosion</themekey>
        <themekey>floods</themekey>
        <themekey>remote sensing</themekey>
        <themekey>aerial and satellite photography</themekey>
        <themekey>biology</themekey>
        <themekey>ecology</themekey>
        <themekey>computer science</themekey>
        <themekey>geography</themekey>
        <themekey>environmental geography</themekey>
        <themekey>physical geography</themekey>
        <themekey>geology</themekey>
        <themekey>information science</themekey>
      </theme>
      <theme>
        <themekt>None</themekt>
        <themekey>U.S. Geological Survey</themekey>
        <themekey>USGS</themekey>
        <themekey>Coastal and Marine Hazards and Resources Program</themekey>
        <themekey>CMHRP</themekey>
        <themekey>Pacific Coastal and Marine Science Center</themekey>
        <themekey>PCMSC</themekey>
        <themekey>St. Petersburg Coastal and Marine Science Center</themekey>
        <themekey>SPCMSC</themekey>
        <themekey>Wetland and Aquatic Research Center</themekey>
        <themekey>WARC</themekey>
        <themekey>Woods Hole Coastal and Marine Science Center</themekey>
        <themekey>WHCMSC</themekey>
      </theme>
      <place>
        <placekt>Geographic Names Information System (GNIS)</placekt>
        <placekey>United States</placekey>
      </place>
    </keywords>
    <accconst>None</accconst>
    <useconst>USGS-authored or produced data and information are in the public domain from the U.S. Government and are freely redistributable with proper metadata and source attribution. Please recognize and acknowledge the U.S. Geological Survey as the originator(s) of the dataset and in products derived from these data.</useconst>
    <ptcontac>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey, Pacific Coastal and Marine Science Center</cntorg>
          <cntper>PCMSC Science Data Coordinator</cntper>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>2885 Mission Street</address>
          <city>Santa Cruz</city>
          <state>CA</state>
          <postal>95060</postal>
        </cntaddr>
        <cntvoice>831-427-4747</cntvoice>
        <cntemail>pcmsc_data@usgs.gov</cntemail>
      </cntinfo>
    </ptcontac>
    <browse>
      <browsen>coast_train_thumbnail.png</browsen>
      <browsed>Split graphic with the original image on the left and the segmented right half of the image on the right.</browsed>
      <browset>PNG</browset>
    </browse>
    <native>Images were processed using Doodler software (Buscombe, 2022) in Windows operating systems.</native>
    <crossref>
      <citeinfo>
        <origin>Daniel D. Buscombe</origin>
        <pubdate>2022</pubdate>
        <title>Doodler--A web application built with plotly/dash for image segmentation with minimal supervision</title>
        <geoform>software</geoform>
        <serinfo>
          <sername>software release</sername>
          <issue>DOI:10.5066/P9YVHL23</issue>
        </serinfo>
        <pubinfo>
          <pubplace>Pacific Coastal and Marine Science Center, Santa Cruz, California</pubplace>
          <publish>U.S. Geological Survey</publish>
        </pubinfo>
        <othercit>Buscombe, D.D., 2022, Doodler--A web application built with plotly/dash for image segmentation with minimal supervision: U.S. Geological Survey software release, https://doi.org/10.5066/P9YVHL23</othercit>
        <onlink>https://doi.org/10.5066/P9YVHL23</onlink>
      </citeinfo>
    </crossref>
  </idinfo>
  <dataqual>
    <attracc>
      <attraccr>Mean Intersection over Union (IoU) scores for quantifying inter-labeler agreement were computed using 120 images across two datasets, namely NAIP (70 image pairs) and Sentinel-2 (50 image pairs), that have been labeled independently by experienced labelers. Mean IoU is the standard way to report agreement between two realizations of the same label image. Further, because IoU quantifies spatial overlap and is prone to class imbalance, Kullback-Leibler divergence scores were also computed to quantify agreement between class-frequency distributions. When comparing IoU and Dice scores, it is preferable to examine agreement using multiple independent metrics. The mean of mean IoU scores was 0.88, which we recommend using as an expected irreducible error. Previous research suggests that mean IoU scores tend to be inversely correlated with number of classes; therefore, this error is a conservative estimate.</attraccr>
    </attracc>
    <logic>All annotation values are integer based, with each integer corresponding to a unique class. The program used to generate the final classified/labelled images ensured that every pixel in the original image is classified into one of the annotated classes. There is no possibility that the actual values are outside of the reported ranges of values.</logic>
    <complete>Dataset is considered complete for the information presented, as described in the abstract. Users are advised to read the rest of the metadata record carefully for additional details.</complete>
    <posacc>
      <horizpa>
        <horizpar>A formal accuracy assessment of the horizontal positional information in the dataset has not been conducted. Each of the input data sources has its own horizontal accuracy available in their source metadata.</horizpar>
      </horizpa>
    </posacc>
    <lineage>
      <procstep>
        <procdesc>Image Annotation/Doodling--Each image was opened using Doodler with the class list provided in the metadata sheet. The user would then select a single class from the options on the right and click and hold on the image to begin drawing a line (annotate/doodle) where the selected class exists on the image. Annotations can be quick and as simple as a point or single line or as complex as a meandering or looping back series of lines. This annotation process was repeated one or more times for every class present in the image.</procdesc>
        <procdate>20211231</procdate>
      </procstep>
      <procstep>
        <procdesc>Image Classification/Segmentation--Once image annotations were complete for all classes present in the image, the program will segment the image and classify every pixel in it by checking the “Compute/Show segmentation” box on the right. If the final image is not accurate to the classes present and their distribution, then the user can uncheck the “Compute/Show segmentation” box and repeat the annotation and classification/segmentation steps until they are satisfied with the final segmented image.</procdesc>
        <procdate>20211231</procdate>
      </procstep>
      <procstep>
        <procdesc>Edited to correct spelling of author name. No data were changed. (scochran@usgs.gov)</procdesc>
        <procdate>20230504</procdate>
      </procstep>
    </lineage>
  </dataqual>
  <spdoinfo>
    <indspref>Original images were downloaded at sites along the conterminous U.S. coastline, including sites along the U.S. Atlantic, Gulf of Mexico, Pacific, and Great Lakes coasts. Sites were selected to provide a representative sample of an array of coastal types (for example, sandy, cliff, marsh, wetland, developed). Refer to the self-contained NPZ files for more information on locations of original images.</indspref>
  </spdoinfo>
  <eainfo>
    <detailed>
      <enttyp>
        <enttypl>CoastTrain_imagery_details.csv</enttypl>
        <enttypd>Table containing detailed information about the imagery in this dataset.</enttypd>
        <enttypds>Producer defined</enttypds>
      </enttyp>
      <attr>
        <attrlabl>name</attrlabl>
        <attrdef>Name of image source</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Unique identifier of image name.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>publisher</attrlabl>
        <attrdef>Original publisher of the image source</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Unique identifier of image publisher.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>labels</attrlabl>
        <attrdef>The image label file. One-hot-encoded label image (2D raster) in 8-bit unsigned integer. Each integer encodes a class label, incrementing through 'classes' starting at zero.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Unique identifier of label image file.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>images</attrlabl>
        <attrdef>The original image file used in classification.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Unique identifier of image filename.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>annotation_image_filename</attrlabl>
        <attrdef>Image filename with annotations.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Unique identifier of annotation image filename.</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>classes_array</attrlabl>
        <attrdef>An array of classification classes in the image.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <edom>
            <edomv>water</edomv>
            <edomvd>Classified as water</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>whitewater</edomv>
            <edomvd>Classified as whitewater</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>surf</edomv>
            <edomvd>Classified as surf</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>mud_silt</edomv>
            <edomvd>Classified as mud or fine sediment</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>sand</edomv>
            <edomvd>Classified as bare sand</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>gravel</edomv>
            <edomvd>Classified as gravel</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>gravel_shell</edomv>
            <edomvd>Classified as mixture of gravel and shells</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>cobble_gravel</edomv>
            <edomvd>Classified as gravel with cobbles</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>bedrock</edomv>
            <edomvd>Classified as exposed bedrock</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>ice_snow</edomv>
            <edomvd>Classified as ice or snow</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>bare_ground</edomv>
            <edomvd>Classified as bare ground</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>sediment</edomv>
            <edomvd>Classified as bare sediment</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>other_natural_terrain</edomv>
            <edomvd>Classified as other natural terrain</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>other_bare_natural_terrain</edomv>
            <edomvd>Classified as other bare natural terrain</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>vegetated</edomv>
            <edomvd>Classified as vegetation</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>vegetated_ground</edomv>
            <edomvd>Classified as ground with vegetation</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>vegetated_surface</edomv>
            <edomvd>Classified as vegetation</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>marsh_vegetation</edomv>
            <edomvd>Classified as marsh vegetation</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>terrestrial_vegetation</edomv>
            <edomvd>Classified as terrestrial vegetation</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>agricultural</edomv>
            <edomvd>Classified as agricultural</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>cloud</edomv>
            <edomvd>Classified as cloud</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>development</edomv>
            <edomvd>Classified as consisting of human development</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>dev</edomv>
            <edomvd>Classified as consisting of human development</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>coastal_defense</edomv>
            <edomvd>Classified as coastal defense</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>buildings</edomv>
            <edomvd>Classified as building</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>pavement_road</edomv>
            <edomvd>Classified as pavement</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>vehicles</edomv>
            <edomvd>Classified as vehicles</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>people</edomv>
            <edomvd>Classified as person</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>other_anthro</edomv>
            <edomvd>Classified as other anthropogenic object</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>unusual</edomv>
            <edomvd>Classified as unusal object or land cover</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>unknown</edomv>
            <edomvd>Classified as unknown land cover</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>no_data</edomv>
            <edomvd>No data contained in pixels</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
        <attrdomv>
          <edom>
            <edomv>nodata</edomv>
            <edomvd>No data contained in pixels</edomvd>
            <edomvds>Producer defined</edomvds>
          </edom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>num_classes</attrlabl>
        <attrdef>Number of classification classes in the image.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>12</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>classes_integer</attrlabl>
        <attrdef>One integer per class in num_classes.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>12</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>classes_present_integer</attrlabl>
        <attrdef>An array of integer classes present in the image.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>12</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>classes_present_array</attrlabl>
        <attrdef>An array of classes present in the image.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Values present in image from classes</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>pen_width</attrlabl>
        <attrdef>Final width in pixels of pen used to annotate in the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>10</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CRF_theta</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>1</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CRF_mu</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>99</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CRF_downsample_factor</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>5</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>Classifier_downsample_factor</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>8</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>prob_of_unary_potential</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.1</rdommin>
            <rdommax>3.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>doodle_spatial_density</attrlabl>
        <attrdef>Proportion of the image annotated.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.000526323</rdommin>
            <rdommax>0.999627422</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>num_of_scales</attrlabl>
        <attrdef>Internal classifier hyperparameter used by the Doodler program.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>3</rdommin>
            <rdommax>3</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>acc_georef</attrlabl>
        <attrdef>Accuracy, in meters, of the specification of 'XMin', 'XMax' and 'YMin', 'YMax'.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>11.248</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>epsg</attrlabl>
        <attrdef>EPSG code for the projected coordinate system. See 'CRS' attribute for a complete description of codes used.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>26910</rdommin>
            <rdommax>32618</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>year</attrlabl>
        <attrdef>Acquisition year of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>2008</rdommin>
            <rdommax>2021</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>month</attrlabl>
        <attrdef>Acquisition month of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>12</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>day</attrlabl>
        <attrdef>Acquisition day of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>1</rdommin>
            <rdommax>31</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>hour</attrlabl>
        <attrdef>Acquisition hour of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>23</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>minute</attrlabl>
        <attrdef>Acquisition minute of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>59</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>second</attrlabl>
        <attrdef>Acquisition second of the image source.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0</rdommin>
            <rdommax>59</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>XMin</attrlabl>
        <attrdef>Minimum easting of the image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>233870.0</rdommin>
            <rdommax>787860.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>XMax</attrlabl>
        <attrdef>Maximum easting of the image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>235750.0</rdommin>
            <rdommax>790530.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>YMin</attrlabl>
        <attrdef>Minimum northing of the image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>2875253.0</rdommin>
            <rdommax>5332914.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>YMax</attrlabl>
        <attrdef>Maximum northing of the image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>2884030.0</rdommin>
            <rdommax>5333378.0</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>LonMin</attrlabl>
        <attrdef>Minimum longitude (WGS84) of image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>-124.0922272</rdommin>
            <rdommax>-69.95201111</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>LonMax</attrlabl>
        <attrdef>Maximum longitude (WGS84) of image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>-124.0478924</rdommin>
            <rdommax>-69.9405098</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>LatMin</attrlabl>
        <attrdef>Minimum longitude (WGS84) of image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>25.98761753</rdommin>
            <rdommax>48.14810677</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>LatMax</attrlabl>
        <attrdef>Maximum latitude (WGS84) of image footprint.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>26.06287667</rdommin>
            <rdommax>48.15232107</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>CRS</attrlabl>
        <attrdef>The projected coordinate system description relating to XMin, XMax, YMin, YMax.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <udom>Projected coordinate system definition</udom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>px_m</attrlabl>
        <attrdef>Horizontal size of pixel in meters.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>0.15</rdommin>
            <rdommax>15</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ImageHeightPx</attrlabl>
        <attrdef>Number of pixels in horizontal dimension of height.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>31</rdommin>
            <rdommax>2481</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ImageWidthPx</attrlabl>
        <attrdef>Number of pixels in horizontal dimension of width.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>32</rdommin>
            <rdommax>2209</rdommax>
          </rdom>
        </attrdomv>
      </attr>
      <attr>
        <attrlabl>ImageBands</attrlabl>
        <attrdef>Number of bands in the image.</attrdef>
        <attrdefs>Producer defined</attrdefs>
        <attrdomv>
          <rdom>
            <rdommin>3</rdommin>
            <rdommax>3</rdommax>
          </rdom>
        </attrdomv>
      </attr>
    </detailed>
    <overview>
      <eaover>Each image, image annotation, and labelled image is available as a single NPZ zipped file. NPZ files follow the following naming convention: {datasource}_{numberofclasses}_{threedigitdatasetversion}.zip, where {datasource} is the source of the original images (for example, NAIP, Landsat 8, Sentinel 2), {numberofclasses} is the number of classes used to annotate the images, and {threedigitdatasetversion} is the three digit code corresponding to the dataset version (in other words, 001 is version 1). Each zipped folder contains a collection of NPZ format files, each of which corresponds to an induvial image. An individual NPZ file is named after the image that it represents and contains (1) a CSV file with metadata information for every image and (2) a collection of the following NPY files: orig_image.npy (original input image unedited), image.npy (original input image after color balancing and normalization), classes.npy (list of classes annotated and present in the labelled image), doodles.npy (integer image of all image annotations), color_doodles.npy (color image of doodles.npy), label.npy (labelled image created from the classes present in the annotations), and settings.npy (annotation and machine learning settings used to generate the labelled image from annotations). All NPZ files can be extracted using the utilities available in Doodler (Buscombe, 2022; https://doi.org/10.5066/P9YVHL23).</eaover>
      <eadetcit>The entity and attribute information was generated by the individual and/or agency identified as the originator of the data set. Please review the rest of the metadata record for additional details and information.</eadetcit>
    </overview>
  </eainfo>
  <distinfo>
    <distrib>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey - CMGDS</cntorg>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>2885 Mission Street</address>
          <city>Santa Cruz</city>
          <state>CA</state>
          <postal>95060</postal>
        </cntaddr>
        <cntvoice>831-427-4747</cntvoice>
        <cntemail>pcmsc_data@usgs.gov</cntemail>
      </cntinfo>
    </distrib>
    <resdesc>Images are provided in NPZ format. Each NPZ file corresponds to a single image that has been annotated/labelled and classified/segmented. The NPZ file names consist of the {image_source}_{number_of_classes}_{data_release_version_number}, delimited by underscores. The first element {image_source} represents the original source of the image (for example, Landsat 8 would be “L8”, Sentinel 2 would be “S2”), the second element {number_of_classes} represents the number of classes used during labelling (for example, “6 classes”, “11classes”, and The third element {data_release_version_number} represents the data release version that the image is part of (for example, all datasets for version 1 will have “001” as the third part of the NPZ filename). Each NPZ file contains at least seven (7) different NPY files: (1) orig_image.npy (original input image unedited), (2) image.npy (original input image after color balancing and normalization), (3) classes.npy (list of classes annotated and present in the labelled image), (4) doodles.npy (integer image of all image annotations), (5) color_doodles.npy (color image of doodles.npy), (6) label.npy (labelled image created from the classes present in the annotations), and (7) settings.npy (annotation and machine learning settings used to generate the labelled image from annotations). Some NPZ files may contain one or more additional sets of seven files with one or more zeros appended to the beginning of the NPY file names. These additional files are grouped by the number of zeros preceding the regular files described above and represent previous attempts at annotation and classification/segmentation for that image. For example, all NPY files with one zero appended to the beginning of the NPY file names represent the first attempt, all NPY files with two zeros appended to the beginning of the NPY file names represent the second attempt, etc. A merged CSV file (CoastTrain_imagery_details.csv) contains detailed information on the complete imagery collection.</resdesc>
    <distliab>Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data on any other system or for general or scientific purposes, nor shall the act of distribution constitute any such warranty.</distliab>
    <stdorder>
      <digform>
        <digtinfo>
          <formname>comma-delimited text</formname>
          <filedec>none</filedec>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P91NP87I</networkr>
              </networka>
            </computer>
            <accinstr>CSV files can be downloaded using the Network_Resource_Name link then scrolling down to the Land Cover Data section.</accinstr>
          </onlinopt>
        </digtopt>
      </digform>
      <digform>
        <digtinfo>
          <formname>NPY</formname>
          <filedec>Each ZIP file contains multiple NPZ files. Individual NPZ files can be unzipped and all NPY files converted to readable raster formats by using utility scripts associated with Doodler (Buscombe, 2022; https://doi.org/10.5066/P9YVHL23). NPZ files were compressed using deflate compression.</filedec>
        </digtinfo>
        <digtopt>
          <onlinopt>
            <computer>
              <networka>
                <networkr>https://doi.org/10.5066/P91NP87I</networkr>
              </networka>
            </computer>
            <accinstr>Imagery data can be downloaded using the Network_Resource_Name link then scrolling down to the Land Cover Data section.</accinstr>
          </onlinopt>
        </digtopt>
      </digform>
      <fees>None.</fees>
    </stdorder>
    <techpreq>These data can be viewed with Doodler software (Buscombe, 2022; https://doi.org/10.5066/P9YVHL23).</techpreq>
  </distinfo>
  <metainfo>
    <metd>20220504</metd>
    <metc>
      <cntinfo>
        <cntorgp>
          <cntorg>U.S. Geological Survey, Pacific Coastal and Marine Science Center</cntorg>
          <cntper>PCMSC Science Data Coordinator</cntper>
        </cntorgp>
        <cntaddr>
          <addrtype>mailing and physical</addrtype>
          <address>2885 Mission Street</address>
          <city>Santa Cruz</city>
          <state>CA</state>
          <postal>95060</postal>
        </cntaddr>
        <cntvoice>831-427-4747</cntvoice>
        <cntemail>pcmsc_data@usgs.gov</cntemail>
      </cntinfo>
    </metc>
    <metstdn>Content Standard for Digital Geospatial Metadata</metstdn>
    <metstdv>FGDC-STD-001-1998</metstdv>
  </metainfo>
</metadata>
