Skip to main content
Advanced Search

Filters: Tags: k-Nearest Neighbors (X)

2 results (57ms)   

View Results as: JSON ATOM CSV
thumbnail
This data release contains one dataset and one model archive in support of the journal article, "Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies," by Jennifer C. Murphy and Jeffrey G. Chanat. The model archive contains scripts (run in R) to reproduce the four machine learning models (logistic regression, linear and quadratic discriminant analysis, and k-nearest neighbors) trained and tested as part of the journal article. The dataset contains the estimated probabilities for each of these models when applied to a training and test dataset.
As more hydrocarbon production from hydraulic fracturing and other methods produce large volumes of water, innovative methods must be explored for treatment and reuse of these waters. However, understanding the general water chemistry of these fluids is essential to providing the best treatment options optimized for each producing area. Machine learning algorithms can often be applied to datasets to solve complex problems. In this study, we used the U.S. Geological Survey’s National Produced Waters Geochemical Database (USGS PWGD) in an exploratory exercise to determine if systematic variations exist between produced waters and geologic environment that could be used to accurately classify a water sample to a given...
Categories: Data; Tags: Alabama, Alaska, Alaska Region, Arizona, Arkansas, All tags...


    map background search result map search result map Input Files and Code for: Machine learning can accurately assign geologic basin to produced water samples using major geochemical parameters Data to support Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies Input Files and Code for: Machine learning can accurately assign geologic basin to produced water samples using major geochemical parameters Data to support Leveraging machine learning to automate regression model evaluations for large multi-site water-quality trend studies