Skip to main content

Data for machine learning predictions of pH in the glacial aquifer system, northern USA

Dates

Publication Date
Start Date
1988-01-15
End Date
2018-09-04

Citation

Brown, C.J., Belitz, K., Erickson, M.L., Elliott, S.M., Kauffman, L.J., Ransom, K.M., Reddy, J.E., and Stackelberg, P.E., 2020, Data for machine learning predictions of pH in the glacial aquifer system, northern USA: U.S. Geological Survey data release, https://doi.org/10.5066/P9RF0R6E.

Summary

A boosted regression tree (BRT) model was developed to predict pH conditions in three-dimensions throughout the glacial aquifer system (GLAC) of the contiguous United States using pH measurements in samples from 18,258 wells and predictor variables that represent aspects of the hydrogeologic setting. Model results indicate that the carbonate content of soils and aquifer materials strongly controls pH and when coupled with long flow paths, results in the most alkaline conditions. Conversely, in areas where glacial sediments are thin and carbonate-poor, pH conditions remain acidic. At depths typical of drinking-water supplies, predicted pH > 7.5 – which is associated with arsenic mobilization – occurs more frequently than predicted pH [...]

Contacts

Attached Files

Click on title to download individual files attached to this item.

rstack_dom.7z
“Domestic supply explanatory model variables”
586.53 MB application/x-7z-compressed
rstack_pub.7z
“Public supply explanatory model variables”
586.23 MB application/x-7z-compressed
pH_Predictions_GLAC_GeochMod_Dataset.csv
“pH and geochemical data”
1.34 MB text/csv
pH_Predictions_GLAC_Variable_Descriptions.txt
“BRT model variable descriptions”
13.94 KB text/plain
model_archive.7z
“BRT model archive”
59.22 MB application/x-7z-compressed

Purpose

Data on pH measurements were compiled for wells withdrawing water from the glacial aquifer system. These pH data were combined with measured and calculated data from 97 predictor variables that integrate information on geology, hydrology, climate and other factors that might affect pH conditions. A machine learning model (boosted regression trees) was developed to predict pH conditions throughout the glacial aquifer system.

Additional Information

Identifiers

Type Scheme Key
DOI https://www.sciencebase.gov/vocab/category/item/identifier doi:10.5066/P9RF0R6E

Item Actions

View Item as ...

Save Item as ...

View Item...