Seabed still images in JPEG format with EXIF location information collected by the U.S. Geological Survey on Stellwagen Bank during three surveys aboard the R/V Auk, September 2020 to August 2021

Metadata also available as - [Outline] - [Parseable text] - [XML]

Frequently anticipated questions:


What does this data set describe?

Title:
Seabed still images in JPEG format with EXIF location information collected by the U.S. Geological Survey on Stellwagen Bank during three surveys aboard the R/V Auk, September 2020 to August 2021
Abstract:
These data are a part of the effort to map geologic substrates of the Stellwagen Bank National Marine Sanctuary region off Boston, Massachusetts. The overall goal is to develop high-resolution (1:25,000) interpretive maps, based on multibeam sonar data and seabed sampling, showing surficial geology and seabed sediment dynamics. The work was conducted in collaboration with the Stellwagen Bank National Marine Sanctuary, and the data collected will aid research on the ecology of fish and invertebrate species that inhabit the region. The Sanctuary's research vessel, R/V Auk, visited a total of 169 locations on Stellwagen Bank during three separate surveys from September 2020 to August 2021. During these surveys a customized Van Veen grab sampler (SEABOSS) equipped with a video camera and a CTD was deployed in drift mode to collect sediment for grain-size analysis, video imagery of the seabed, and measurements of water column properties including temperature, salinity, density, and fluorescence.
Supplemental_Information:
The locations of image can be extracted from the EXIF header of the images (described in the process steps). Alternatively, the image locations are available from the 2020-2021_station_data_summary dataset available from the Larger Work citation.
Additional information regarding the field activities associated with these data can be obtained from the following locations https://cmgds.marine.usgs.gov/fan_info.php?fan=2020-012-FA, https://cmgds.marine.usgs.gov/fan_info.php?fan=2021-016-FA, and https://cmgds.marine.usgs.gov/fan_info.php?fan=2021-017-FA .
  1. How might this data set be cited?
    Valentine, Page C., 20230321, Seabed still images in JPEG format with EXIF location information collected by the U.S. Geological Survey on Stellwagen Bank during three surveys aboard the R/V Auk, September 2020 to August 2021: data release DOI:10.5066/P9G7QARQ, U.S. Geological Survey, Coastal and Marine Hazards and Resources Program, Woods Hole Coastal and Marine Science Center, Woods Hole, MA.

    Online Links:

    This is part of the following larger work.

    Valentine, Page C., and Cross, VeeAnn A., 2023, Sea-floor Sediment Samples, Seabed and Sample Imagery, and CTD Instrument Data Collected on Stellwagen Bank from September 2020 to August 2021: data release DOI:10.5066/P9G7QARQ, U.S. Geological Survey, Reston, VA.

    Online Links:

    Other_Citation_Details:
    Suggested citation: Valentine, P.C., and Cross, V.A., 2023, Sea-floor sediment samples, seabed and sample imagery, and CTD instrument data collected on Stellwagen Bank from September 2020 to August 2021: U.S. Geological Survey data release, https://doi.org/10.5066/P9G7QARQ .
  2. What geographic area does the data set cover?
    West_Bounding_Coordinate: -70.56265
    East_Bounding_Coordinate: -70.11801
    North_Bounding_Coordinate: 42.43998
    South_Bounding_Coordinate: 42.09603
  3. What does it look like?
    https://www.sciencebase.gov/catalog/file/get/633c93b5d34e900e86cf1aa8?name=seabed_image_browse.jpg&allowOpen=true (JPEG)
    Browse graphic of an example seabed still image on Stellwagen Bank.
  4. Does the data set describe conditions during a particular time period?
    Beginning_Date: 24-Sep-2020
    Ending_Date: 04-Aug-2021
    Currentness_Reference:
    Ground condition of when data were collected. These data were collected during three separate surveys, with the following ranges of dates for each survey: field activity 2020-012-FA: 20200924 to 20200925; field activity 2021-016-FA: 20210721 to 20210723; and field activity 2021-017-FA: 20210802 to 20210804.
  5. What is the general form of this data set?
    Geospatial_Data_Presentation_Form: raster
  6. How does the data set represent geographic features?
    1. How are geographic features stored in the data set?
      Indirect_Spatial_Reference:
      The location information for the images is available in the EXIF headers of the images and is also indicated by the station locations in the station data summary dataset (available from the Larger Work citation).
    2. What coordinate system is used to represent geographic features?
      Horizontal positions are specified in geographic coordinates, that is, latitude and longitude. Latitudes are given to the nearest 0.00001. Longitudes are given to the nearest 0.00001. Latitude and longitude values are specified in Decimal degrees. The horizontal datum used is D_WGS_1984.
      The ellipsoid used is WGS_1984.
      The semi-major axis of the ellipsoid used is 6378137.000000.
      The flattening of the ellipsoid used is 1/298.257224.
  7. How does the data set describe geographic features?
    Entity_and_Attribute_Overview:
    Although the images do not have attributes per se, the EXIF header information can be extracted or viewed to give additional information about each image. The last process step explains how to extract the information from the image header, and is replicated here.
    To extract the information from the image headers using ExifTool, the following command can be used (tested with ExifTool version 11.54):
    
    
    exiftool.exe -csv -f -filename -GPSTimeStamp -GPSDateStamp -GPSLongitude -GPSLatitude -n -GPSAreaInformation -GPSMapDatum -ImageDescription -Artist -Credit -comment -contact -keywords -Caption -Copyright -CopyrightNotice -Caption-Abstract -UserComment -AttributionURL -UsageTerms -Author -ImageUniqueID -ExternalMetadataLink *.jpg > out.csv
    
    
    The -csv option writes the information out in a comma-delimited format. The -n option formats the latitude and longitude as signed decimal degrees. Whether you use exiftool or exiftool.exe will depend on the operating system on which the command is being run.
    Entity_and_Attribute_Detail_Citation: U.S. Geological Survey

Who produced the data set?

  1. Who are the originators of the data set? (may include formal authors, digital compilers, and editors)
    • Page C. Valentine
  2. Who also contributed to the data set?
  3. To whom should users address questions about the data?
    U.S. Geological Survey
    Attn: Page C. Valentine
    Research Geologist
    384 Woods Hole Rd.
    Woods Hole, MA
    USA

    (508) 548-8700 x2239 (voice)
    (508) 457-2310 (FAX)
    whsc_data_contact@usgs.gov
    Contact_Instructions:
    The contact email address is a generic address in the event the person is no longer with the USGS.

Why was the data set created?

To provide images of the sea floor acquired with the SEABed Observation and Sampling System (SEABOSS). These data provide ground-truth information for geophysical datasets and are used in seabed interpretations. The images in this dataset are frame-grabs from GoPro video.

How was the data set created?

  1. From what previous works were the data drawn?
  2. How were the data generated, processed, and modified?
    Date: 2021 (process 1 of 4)
    Seabed video imagery was collected using two battery-operated, high-definition (1920 x 1080 pixels) GoPro digital video cameras and a SeaViewer high-definition (1920 x 720 pixels) digital camera mounted on the SEABOSS. The SeaViewer camera had a live feed to the surface and was used only to observe obstacles, like gill nets and lobster gear, in order to avoid entanglement. In the case of camera failure, the additional cameras provided backup video. All images in this dataset are frame-grabs from the the GoPro videos. As the SEABOSS drifted over the seabed, the cameras imaged the seabed at a rate of 60 frames per second from an altitude of ~1 meter. Lighting was provided by two battery-operated Keldan LED dive lights. Each light provided a light flux of up to 9000 lumens at a correlated color temperature of 5000 kelvins. Two parallel red lasers mounted 20 centimeters apart provided a scale in the imagery.
    Video imagery includes the launch of the SEABOSS, transit through the water column, and the drift over the seabed. The duration of 168 individual video files ranges from 2 to 13.5 minutes. Video files are archived in .MP4 format and range in size from 0.2 to 3.0 gigabytes.
    These video files are not part of this data release. Video files are archived at the USGS Woods Hole Coastal and Marine Science Center, 384 Woods Hole Road, Woods Hole, MA 02543.
    Data were collected from 2020 to 2021 with the process date representing the most recent acquisition year. Person who carried out this activity:
    U.S. Geological Survey
    Attn: Page C. Valentine
    Research Geologist
    384 Woods Hole Rd.
    Woods Hole, MA
    USA

    (508) 548-8700 x2239 (voice)
    (508) 457-2310 (FAX)
    pvalentine@usgs.gov
    Date: 2022 (process 2 of 4)
    Video frame-grab images were extracted from high-definition (1920 x 1080 pixels) GoPro digital video using open source VLC Media Player software (version 2.0.0) developed by the VideoLAN project ( http://www.videolan.org/vlc/index.html ).
    Two video frame-grab images are provided for each station. One image (staXXXX_1) shows a broad view of the landscape as the SEABOSS drifted over the seabed. The second image (staXXXX_2) shows a close-up of the seabed as the SEABOSS was stationary during the collection of the sediment sample. Where visible, the two parallel red lasers mounted 20 cm apart provide a scale.
    Data were collected from 2020 to 2021 with the process date representing the most recent acquisition year. Person who carried out this activity:
    U.S. Geological Survey
    Attn: Page C. Valentine
    Research Geologist
    384 Woods Hole Rd.
    Woods Hole, MA
    USA

    (508) 548-8700 x2239 (voice)
    (508) 457-2310 (FAX)
    pvalentine@usgs.gov
    Date: 2022 (process 3 of 4)
    Using the station data summary CSV file for each field activity, the approximate time and location of each photo during that field activity was derived. The location (latitude and longitude), date, and time for each station was extracted from the CSV file. This information was reformatted using an AWK script so that the following columns of information were available: image name (based on the station number), latitude, longitude, GPS time (in the format HH:MM:SS), and GPS date (in the format YYYYMMDD). This information was written to a new CSV file. This process was repeated for each of the three field activities. Person who carried out this activity:
    U.S. Geological Survey
    Attn: VeeAnn A. Cross
    Marine Geologist
    384 Woods Hole Road
    Woods Hole, MA

    508-548-8700 x2251 (voice)
    508-457-2310 (FAX)
    vatnipp@usgs.gov
    Data sources used in this process:
    • *-FA_station_data_summary.csv
    Data sources produced in this process:
    • *-FA_seabedimage_locations.csv
    Date: 2022 (process 4 of 4)
    A Python notebook (EXIF_headers.ipynb) was run that incorporated the information from an individual field activity image locations file *-FA_seabedimage_locations.csv where the * represents the field activity number) along with additional information into the appropriate tags in the EXIF header of each seabed image. Although the two images from each station are not from the exact same place, the same location information was populated in each station's photos. Then the IPython Notebook (using Python version 3.8.8) was executed from a Jupyter Notebook (version 6.3.0) running under Anaconda version 3 on a Windows 10 machine. The Python script uses ExifTool (version 11.54) to write survey information to the image headers of the full-resolution images. The following tags were populated in the JPEG image headers. Information is duplicated in some tags because different software packages access different tags.
    GPS tags: The values populated are unique for each station image and based on the information exported from *-FA_station_data_summary.csv to the *-FA_seabedimage_location.csv.
    GPSLatitudeRef
    GPSLatitude
    GPSLongitudeRef
    GPSLongitude
    GPSTimeStamp
    GPSDateStamp
    
    
    Two other elements that are unique for each image (and will actually have the same value) are the XMP tags for the original filename.
    OriginalFileName
    PreservedFileName
    
    
    JPEG tag: The tag is listed along with the information used to populate it, which is the same for every image of a given field activity. The example below is from field activity 2020-012-FA. The DOI number is the same for all the field activities and represents this data release.
    
    Comment: Video frame grab image from a GoPro digital video camera mounted on the USGS SEABOSS deployed from the R/V Auk during survey 2020-012-FA (https://cmgds.marine.usgs.gov/fan_info.php?fan=2020-012-FA) .
    
    
    EXIF tags: The tag is listed along with the information used to populate it, which is the same for every image.
    
    EXIF:ImageDescription: https://cmgds.marine.usgs.gov/fan_info.php?fan=2020-012-FA; Video frame grab image from a GoPro digital video camera on Stellwagen Bank, MA from survey 2020-012-FA 
    EXIF:Artist: Page C. Valentine
    EXIF:Copyright: Public Domain - please credit U.S. Geological Survey
    EXIF:UserComment: Image part of USGS data release https://doi.org/10.5066/P9G7QARQ
    EXIF:Make: GoPro
    EXIF:GPSAreaInformation: Position post-processed from GPS on ship
    EXIF:GPSMapDatum: EPSG:4326 (WGS 84)
    
    
    IPTC tags: The tag is listed along with the information used to populate it, which is the same for every image.
    
    IPTC:Credit: U.S. Geological Survey
    IPTC:Contact: whsc_data_contact@usgs.gov
    IPTC:Keywords: Stellwagen Bank, Massachusetts, 2020-012-FA, SEABOSS, sea floor, USGS, ground-truth, https://doi.org/10.5066/P9G7QARQ
    IPTC:CopyrightNotice: Public Domain - please credit U.S. Geological Survey
    IPTC:Caption-Abstract: Video frame grab image from a GoPro digital video camera of the sea floor on Stellwagen Bank, MA from survey 2020-012-FA
    
    
    XMP tags: The tag is listed along with the information used to populate it, which is the same for every image of a given field activity. Three tags described elsewhere (ImageUniqueID, OriginalFileName, and PreservedFileName), are unique for each image.
    
    XMP:Caption: Video frame grab image from a GoPro digital video camera of the sea floor on Stellwagen Bank, MA from survey 2020-012-FA
    XMP:AttributionURL: https://doi.org/10.5066/P9G7QARQ
    XMP:Author: Page C. Valentine
    XMP:UsageTerms: Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.
    
    
    Another script run in Python Notebooks (Image_PIDwork.ipnb) was used to mint a unique persistent identifier (image PID) for each image and placed in the header. Additionally, the persistent URL for the metadata record associated with this dataset was also inserted into the image header.
    
    
    XMP tag:
    
    XMP:ImageUniqueID: unique for each image
    ExternalMetadataLink: https://www1.usgs.gov/pir/api/identifiers/USGS:633c93b5d34e900e86cf1aa8
    
    
    From the command-line in Cygwin, the image name was also added to the header with the following command: exiftool -P "-originalfilename<Filename" *.jpg
    
    
    From the command-line in Cygwin, the image name was also added to another element in the header with the following command: exiftool.exe -P "-XMP:PreservedFileName<Filename" *.jpg
    
    
    Finally, another command run from the command line in order to duplicate information to appropriate XMP headers:
    exiftool -P "-XMP-photoshop:Credit<IPTC:Credit" "-XMP-iptcCore:CreatorWorkEmail<IPTC:Contact" "-XMP-dc:Rights<EXIF:Copyright" "-XMP-dc:Description<EXIF:ImageDescription" "-XMP-exif:all<GPS:all" "-XMP-exif:GPSLatitude<Composite:GPSLatitude" "-XMP-exif:GPSLongitude<Composite:GPSLongitude" "-XMP-exif:GPSDateTime<Composite:GPSDateTime" "-XMP-photoshop:DateCreated<EXIF:DateTimeOriginal" "-XMP-xmp:ModifyDate<EXIF:ModifyDate" "-XMP-dc:Creator<EXIF:Artist" "-XMP-tiff:Make<EXIF:Make" "-XMP-tiff:Model<EXIF:Model" *.jpg
    
    
    To extract the information (except for OriginalFileName, PreservedFileName, and the duplicated tags) from the image headers using ExifTool, the following example command can be used (tested with ExifTool version 11.54, slight modifications of the command might be necessary depending on the operating system of the user):
    
    
    exiftool.exe -csv -f -filename -GPSTimeStamp -GPSDateStamp -GPSLongitude -GPSLatitude -n -GPSAreaInformation -GPSMapDatum -ImageDescription -Artist -Credit -comment -contact -keywords -Caption -Copyright -CopyrightNotice -Caption-Abstract -UserComment -AttributionURL -UsageTerms -Author -ImageUniqueID -ExternalMetadataLink *.jpg > out.csv
    
    
    The -csv option writes the information out in a comma-delimited format. The -n option formats the latitude and longitude as signed decimal degrees. Whether you use exiftool or exiftool.exe will depend on the system on which the command is being run. Person who carried out this activity:
    U.S. Geological Survey
    Attn: VeeAnn A. Cross
    Marine Geologist
    384 Woods Hole Road
    Woods Hole, MA

    508-548-8700 x2251 (voice)
    508-457-2310 (FAX)
    vatnipp@usgs.gov
    Data sources used in this process:
    • *.jpg
    Data sources produced in this process:
    • *.jpg
  3. What similar or related data should the user be aware of?
    Blackwood, D.S., and Parolski, K.F., 2001, Seabed observation and sampling system: Sea Technology v. 42, no. 2, p. 39-43.

    Valentine, P., Blackwood, D., and Parolski, K., 2000, Seabed observation and sampling system: U.S. Geological Survey Fact Sheet FS-142-00, U.S. Geological Survey, Reston, VA.

    Online Links:

    Valentine, P.C., and Cross, V.A., 2019, Location and analyses of sediment samples collected on Stellwagen Bank off Boston, Massachusetts from November 5, 2013 to April 30, 2019 on U.S. Geological Survey field activities: data release DOI:10.5066/P9FWFLPD, U.S. Geological Survey, Reston, VA.

    Online Links:

    Other_Citation_Details:
    This publication includes the grain-size analyses of the sediment samples collected on 20 other field activities.

How reliable are the data; what problems remain in the data set?

  1. How well have the observations been checked?
  2. How accurate are the geographic locations?
    Although the images do not represent spatial data, the spatial information is available in the EXIF header of each image, which is derived from the station location in the station data summary dataset (available from the Larger Work citation). The location information for stations was acquired from a satellite navigation receiver (Furuno Satellite Compass SC-30, GPS receiver) in geographic coordinates (World Geodetic System of 1984 [WGS 84]) with a horizontal accuracy of 10 m. Cruise 2020-012-FA acquired navigation at two-second intervals on September 24 and five-second intervals on September 25. Cruise 2021-016-FA acquired navigation at a two-second interval. Cruise 2021-017-FA acquired navigation at a two-second interval. The navigation antenna was located 5.3 m forward of the SEABOSS deployment location. This offset was not accounted for in the navigation acquisition, so navigation fixes represent the antenna location. The SEABOSS may also drift away from the survey vessel when deployed to the sea floor.
  3. How accurate are the heights or depths?
  4. Where are the gaps in the data? What is missing?
    The images are frame-grab images from video imagery. Each station that has video data has two images - one showing a broader view of the landscape and the second a close-up of the seabed that was recorded when sampling was attempted. In some images from shallow depths, ambient daylight is the dominant light source and causes the images to have a “greenish” coloration.
    During cruise 2020-012-FA, station 4973 had no video collected therefore there are no seabed images. The 32 stations with video imagery results in 64 seabed images.
    Cruise 2021-016-FA has video imagery from all 43 stations yielding a total of 86 seabed images.
    Cruise 2021-017-FA has video imagery from all 93 stations yielding a total of 186 seabed images.
  5. How consistent are the relationships among the observations, including topology?
    The image names are in the format staXXXX_Y.jpg where XXXX is the station number and Y is the image number at that station. Each station has two images - one showing a broader view of the landscape (Y=1) and the second a close-up of the seabed (Y=2). The difference in view is related to the closeness of the SEABOSS to the seafloor. See the cross-references for a description of the SEABOSS (Blackwood and Parolski, 2001; Valentine and others, 2000). The two images from a station are not from the same geographic location on the video drift, but the coordinates stored in the EXIF header information in each station's images are the same. This is because the source video imagery did not display time. The latitude and longitude of the two images are based on the station location time recorded in the cruise logbook (the location of a sediment sample, or if no sample was collected, the video drift end location). The latitude and longitude of stations recorded in the log book were later adjusted to match the latitude and longitude recorded at that time in the GPS navigation file. The 2020-2021_station_data_summary dataset (available from the Larger Work citation) station locations are the same as the image locations at each station.
    All the field activities took place on the Stellwagen Bank National Marine Sanctuary's research vessel, R/V Auk. Although these data represent three separate field activities, the set-up and acquisition on all the cruises is consistent with the exception of the navigation - which was sometimes acquired at 2-second intervals and sometimes at 5-second intervals.

How can someone get a copy of the data set?

Are there legal restrictions on access or use of the data?
Access_Constraints None.
Use_Constraints None.
  1. Who distributes the data set? (Distributor 1 of 1)
    U.S. Geological Survey - ScienceBase
    Denver Federal Center, Building 810, Mail Stop 302
    Denver, CO
    USA

    1-888-275-8747 (voice)
    sciencebase@usgs.gov
  2. What's the catalog number I need to order this data set? The dataset contains the following three zip files of video frame grab images of the seabed from three field activities, two images at each station: 2020-012-FA_seabedimages.zip contains 64 JPEG images; 2021-016-FA_seabedimages.zip contains 86 JPEG images; and 2021-017-FA_seabedimages.zip contains 186 JPEG images. The dataset contains a total of 336 JPEG images. The dataset also contains the browse graphic (seabed_image_browse.jpg) and the FGDC CSDGM metadata in XML format.
  3. What legal disclaimers am I supposed to read?
    Unless otherwise stated, all data, metadata and related materials are considered to satisfy the quality standards relative to the purpose for which the data were collected. Although these data and associated metadata have been reviewed for accuracy and completeness and approved for release by the U.S. Geological Survey (USGS), no warranty expressed or implied is made regarding the display or utility of the data for other purposes, nor on all computer systems, nor shall the act of distribution constitute any such warranty.
  4. How can I download or order the data?
  5. What hardware or software do I need in order to use the data set?
    The zip file contains images in JPEG format. The user must have software capable of uncompressing the zip file and viewing the JPEG images.

Who wrote the metadata?

Dates:
Last modified: 21-Mar-2023
Metadata author:
U.S. Geological Survey
Attn: VeeAnn A. Cross
Marine Geologist
384 Woods Hole Rd.
Woods Hole, MA

508-548-8700 x2251 (voice)
508-457-2310 (FAX)
whsc_data_contact@usgs.gov
Contact_Instructions:
The metadata contact email address is a generic address in the event the person is no longer with the USGS.
Metadata standard:
Content Standard for Digital Geospatial Metadata (FGDC-STD-001-1998)

This page is <https://cmgds.marine.usgs.gov/catalog/whcmsc/SB_data_release/DR_P9G7QARQ/SeabedImages_2020-2021_meta.faq.html>
Generated by mp version 2.9.51 on Tue Mar 21 11:09:10 2023