Filters: Tags: Random Forest (X)
10 results (69ms)
Filters
Date Range
Extensions Types
Contacts
Categories Tag Types
|
The datasets are to accompany a manuscript describing the prediction of submersed aquatic vegetation presence and its potential vulnerability and recovery potential. The data and accompanying analysis scripts allow users to run the final random forests predictive model and reproduce the figures reported in the manuscript. Files from several data sources (aqa_2010_lvl3_pct_oute_joined_VEG_BARCODE.csv, eco_states_near_SAV.csv, ltrm_vegsrs_thru2019_GEOMORPHIC_METRICS_final.csv, vegetation_data.csv, and water_full.csv) were combined into a single .csv file (analysis_data_for_SAV_RandomForest.csv) used as the input for the random forest model. When intersecting points with geomorphic metrics some sites were moved slightly...
This data release documents the data used for the associated publication "Evaluating hydrologic region assignment techniques for ungaged watersheds in Alaska, USA" (Barnhart and others, 2022) The data sets within this release are stored in 14 files: (1) Streamflow observations and sites used. (2) Statistically estimated streamflow values computed for each site. (3) Streamflow statistics computed from observed and estimated streamflow values at each site, basin characteristics for each site, and hydrologic regions (clusters) for each site. (4) A dataset describing the optimal number of hydrologic regions into which the considered sites were grouped. (5) P-values from a multiple comparisons analysis testing for statistical...
Categories: Data,
Data Release - Revised;
Tags: Alaska,
USGS Science Data Catalog (SDC),
hydrologic region,
inlandWaters,
random forest,
To identify the degree of hydrologic alteration of streams in the Mississippi Alluvial Plain (MAP), we used random forest (RF) regression methods (Breiman, 2001) to model the relation between six selected streamflow characteristics and explanatory variables (such as drainage area, precipitation, soils, and other watershed characteristics). RFs were chosen for this study because they have been proven to be more robust and accurate than traditional linear regression methods (Carlisle and others, 2010; Lawler and others, 2006; Prasad and others, 2006; Cutler and others, 2007). Estimated expected monthly mean streamflow from the RF models were compared to observed monthly mean streamflow at 68 sites located within the...
Categories: Data;
Tags: Arkansas,
Ecology,
Historical streamflow,
Hydrogeology,
Hydrologic alteration,
As more hydrocarbon production from hydraulic fracturing and other methods produce large volumes of water, innovative methods must be explored for treatment and reuse of these waters. However, understanding the general water chemistry of these fluids is essential to providing the best treatment options optimized for each producing area. Machine learning algorithms can often be applied to datasets to solve complex problems. In this study, we used the U.S. Geological Survey’s National Produced Waters Geochemical Database (USGS PWGD) in an exploratory exercise to determine if systematic variations exist between produced waters and geologic environment that could be used to accurately classify a water sample to a given...
This data release contains predictions of stream biological condition as defined by the Chesapeake basin-wide index of biotic integrity for stream macroinvertebrates (Chessie BIBI) using Random Forest models with landscape data for small streams (≤ 200 km2 in upstream drainage) across the Chesapeake Bay Watershed (CBW). Predictions were made at eight time periods (2001, 2004, 2006, 2008, 2011, 2013, 2016, and 2019) according to changes in landcover using the National Land Cover Database (NLCD). The Chessie BIBI data used were provided by the Interstate Commission on the Potomac River Basin. Uncertainty was calculated using model prediction intervals. For complete data descriptions and data interpretation see associated...
Categories: Data;
Tags: Aquatic Biology,
Chesapeake Bay watershed,
Delaware (state),
District of Columbia (national district),
Ecology,
This dataset represents the cumulative result of multi-season classification of land cover in the GCPO LCC geography to NatureServe Ecological Systems based on 2011 seasonal Landsat Satellite Imagery. The approach used a Random Forest algorithm and several dozen input data layers to classify land cover at a 30 m pixel resolution. The description below is taken directly from the report titled “Update of the Eastern GCPO Land Cover Database to 2011 Using a LS2SRC Approach”, by Dr. Qingmin Meng, Department of Geosciences, Mississippi State University.Random Forest classifier is based on the general decision tree approach, which has been a popular approach to multilevel and multistage decision making. Its basic idea...
Categories: Data;
Types: ArcGIS REST Map Service,
ArcGIS Service Definition,
Downloadable,
Map Service,
OGC WFS Layer,
OGC WMS Layer,
OGC WMS Service;
Tags: Data,
Data Acquisition and Development,
Data Management and Integration,
GAP,
GCPO,
Many aspects of recurring plant developmental events – vegetation phenology – are measured by remote sensing. By consistently measuring the timing and magnitude of the growing season, it is possible to study the complex relationships among drivers of the seasonal cycle of vegetation, including legacy conditions. We studied the role of current and legacy climate, and contextual factors on the land surface phenology of the U.S. Northern Great Plains. Specifically, we used annual and seasonal climate variables (e.g., temperature, precipitation, growing degree days, and vapor pressure deficit) covering the current year and the past four years derived from the PRISM climate dataset. We also included soils, disturbance,...
Modeling streamflow is an important approach for understanding landscape-scale drivers of flow and estimating flows where there are no streamgage records. In this study conducted by the U.S. Geological Survey in cooperation with Colorado State University, the objectives were to model streamflow metrics on small, ungaged streams in the Upper Colorado River Basin and identify streams that are potentially threatened with becoming intermittent under drier climate conditions. The Upper Colorado River Basin is a region that is critical for water resources and also projected to experience large future climate shifts toward a drying climate. A random forest modeling approach was used to model the relationship between streamflow...
Yellow sweetclover (Melilotus officinalis; YSC), an invasive biennial legume, bloomed throughout the Northern Great Plains (NGP) following greater-than-average precipitation during 2018-2019. YSC can increase nitrogen (N) levels and potentially cause broad changes in the composition of native plant species communities. There is little knowledge of the drivers behind its spatiotemporal variability, including conditions causing significant widespread blooms across western South Dakota (SD). We aimed to develop a generalized prediction model to map the relative abundance of YSC in suitable habitats across rangelands of western SD for the recent sweet clover year 2019. The following research questions were asked: 1....
This data release contains predictions of selected fish community metrics and fish species occurrence using Random Forest models with landscape data for inland reaches across the Chesapeake Bay Watershed (CBW). Predictions were made at four time intervals (2001, 2006, 2011, and 2016) according to changes in landcover using the National Land Cover Database (NLCD). The fish sampling data used to compute these metrics were compiled from various fish sampling programs conducted by state and federal agencies, county governments, universities, and river basin commissions across the watershed. Community metrics describe composition, tolerances, habitat preferences, and functional traits of fish communities (and were derived...
Categories: Data;
Tags: Aquatic Biology,
District of Columbia (national district),
Ecology,
Ecology,
Environmental Health,
|
|