Skip to main content

Climate impacts to inland fishes: Topic modeling of literature trends analysis script

Originator
Andrew DiSanto

Dates

Publication Date
Start Date
1985
End Date
2021

Citation

DiSanto, A., 2023, Climate impacts to inland fishes: Topic modeling of literature trends analysis script: National and Regional Climate Adaptation Science Centers data release, https://doi.org/10.21429/n7z0-b813.

Summary

This script applies topic modeling to analyze literature trends of climate impacts to inland fish based on the papers within the Fish and Climate Change Database (FiCli, DOI: 10.5066/P9973SMC). Sections 1-8 loaded the .bib file with all of the papers in the database and cleaned the text. This included combining the title/abstract/keywords, removing non-informative words, stemming words, removing punctuation, and forming phrases (ie. climate change to climate_change). Sections 9-10 divided the papers into discrete topics by identifying the ideal number of topics and then using Latent Dirichlet Allocation (LDA) modeling and Gibbs sampling to assign topics to each paper. Sections 11-17 analyzed the topic modeling results and generated [...]

Contacts

Attached Files

Click on title to download individual files attached to this item.

CumulativeDocProj.csv
“Count and ratio of projected and documented papers by year in the FiCli database”
1.47 KB text/csv
FiCli_Papers_2021_bib.bib
“Papers comprising the FiCli database from 1985-2021 (.bib)”
726.72 KB application/x-bibtex-text-file
FiCli_Papers_2021_bib.csv
“Papers comprising the FiCli database from 1985-2021 (.csv)”
654.46 KB text/csv
PaperVariables.csv
“Paper details extracted from the FiCli database from 1985-2021, including title ”
605.83 KB text/csv
Published_USGS_README.Rmd
“Script ReadMe ”
5.37 KB text/plain
Published_USGS_Topic_Modeling_Script.Rmd
“R script for conducting topic modeling analysis of papers in the FiCli database ”
31.96 KB text/plain
TopicModelingXML.xml
“R script for conducting topic modeling analysis of papers in the FiCli database ”
Original FGDC Metadata

View
24.14 KB application/fgdc+xml

Purpose

This code was made for a paper conducting topic modeling of the FiCli database, but it can also be applied to more general topic modeling analysis. Users can upload their own papers, common phrases, and paper variables to perform topic modeling on papers related to any theme, not just climate impacts on inland fish.

Map

Communities

  • National CASC
  • National and Regional Climate Adaptation Science Centers

Associated Items

Tags

Provenance

Data source
Input directly

Additional Information

Identifiers

Type Scheme Key
DOI https://www.sciencebase.gov/vocab/category/item/identifier 10.21429/n7z0-b813

Citation Extension

citationTypeData Release
parts
typeDOI
valuedoi.org/10.21429/n7z0-b813

Item Actions

View Item as ...

Save Item as ...

View Item...