Skip to main content

ViTexOCR; A script to extract text overlays from digital video

Dates

Publication Date
Time Period
2017

Citation

Dailey, E.T., 2017, ViTexOCR; a script to extract text overlays from digital video: U.S. Geological Survey software release, https://doi.org/10.5066/F7833Q56.

Summary

The ViTexOCR script presents a new method for extracting navigation data from videos with text overlays using optical character recognition (OCR) software. Over the past few decades, it was common for videos recorded during surveys to be overlaid with real-time geographic positioning satellite chyrons including latitude, longitude, date and time, as well as other ancillary data (such as speed, heading, or user input identifying fields). Embedding these data into videos provides them with utility and accuracy, but using the location data for other purposes, such as analysis in a geographic information system, is not possible when only available on the video display. Extracting the text data from imagery using software allows these videos [...]

Contacts

Point of Contact :
Evan T. Dailey
Originator :
Evan T. Dailey
Metadata Contact :
Evan T. Dailey
Publisher :
U.S. Geological Survey
Distributor :
U.S. Geological Survey - ScienceBase
USGS Mission Area :
Natural Hazards
SDC Data Owner :
Pacific Coastal and Marine Science Center

Attached Files

Click on title to download individual files attached to this item.

DisplayImage.png thumbnail 517.81 KB image/png
ViTexOCR_Documentation.pdf
“README”
3.99 MB application/pdf
ViTexOCR.py 33.56 KB text/x-python
TesseractTraining.py 14.23 KB text/x-python
TestVideos.zip 4.89 MB application/zip
tessdata.zip 53.92 KB application/zip

Purpose

The ViTexOCR script was developed to geospatially locate videos, primarily for the purpose of including videos collected through the USGS Coastal and Marine Geology Program in the USGS Video and Photograph Portal.
Preview Image

Map

Communities

  • Pacific Coastal and Marine Science Center
  • USGS Data Release Products

Tags

Provenance

Data source
Input directly

Additional Information

Identifiers

Type Scheme Key
DOI https://www.sciencebase.gov/vocab/category/item/identifier doi:10.5066/F7833Q56

Item Actions

View Item as ...

Save Item as ...

View Item...