The dataset described here includes estimates of historical (1980–2020) daily surface water temperature, lake metadata, and daily weather conditions for lakes bigger than 4 ha in the conterminous United States (n = 185,549), and also in situ temperature observations for a subset of lakes (n = 12,227). Estimates were generated using a long short-term memory deep learning model and compared to existing process-based and linear regression models. Model training was optimized for prediction on unmonitored lakes through cross-validation that held out lakes to assess generalizability and estimate error. On the held-out lakes with in situ observations, median lake-specific error was 1.24°C, and the overall root mean squared error was 1.61°C. [...]