Curriculum Vitae
Lewis Blake
Computational Scientist
Education
- Ph.D. in Statistics, 2017-December 2021
Department of Applied Mathematics and Statistics, Colorado School of Mines M.Sc. in Statistics, 2017-2018
Department of Applied Mathematics and Statistics, Colorado School of Mines- B.A. in Mathematics, 2012-2016
Hampshire College- Thesis: Assume a Spherical Cow: Mathematical Modeling of Bovine Dynamical Systems
- Supervisors: Dr. Sarah Hews, Dr. Geremías Polanco Encarnación
- Minor Concentrations: Computer Science, Agriculture, Entrepreneurship
Experience
Computational Scientist, Norwegian Meterological Institute, March 2022 - Present
- Researcher and software developer in the Department of Research and Development, Division For Climate Modeling and Air Pollution
- Managed and implemented contracts for the Copernicus Atmosphere Monitoring Service (CAMS) for regional and global air quality model evaluation from software package development through production
- Achieved significant advancements in pollution deposition modeling for Norway, utilizing cutting-edge machine learning-based data fusion techniques
- Authored a comprehensive technical report outlining the methodology, findings, and environmental and economic impacts for the Norwegian Environment Agency
- Graduate Research Assistant, NCAR - National Center for Atmospheric Research, Summers 2018, 2019, and 2021
- Contributed to the Computational and Informational Systems Laboratory, Analytics and Integrative Machine Learning Group during summer appointments in 2018, 2019, and 2021
- Developed and implemented highly efficient parallel machine learning models (Multi-Resolution Approximation for Gaussian Processes), enabling processing of large datasets (hundreds-of-millions of observations) with reduced computation times by 75%
- Analyzed extensive geospatial satellite data on high-performance computing systems (Cheyenne and Capser), uncovering global patterns and trends
- Facilitated development of streamlined sea-surface temperature models through parameter reduction techniques
- Data Scientist, Lumen Technologies (formerly CenturyLink), June 2020 - August 2020
- Contributed to the Artificial Intelligence and Machine Learning Center of Excellence during a summer internship
- Developed and implemented advanced algorithms using timeseries LSTMs and CNNs to predict IT application health and detect anomalies, resulting in improved efficiency and cost savings
- Automated IT analytics exploratory data analysis (EDA), uncovering previously unknown application issues and driving proactive troubleshooting efforts
- Generated significant monetary savings through effective identification and resolution of IT application issues
- Research and Teaching Assistant, Colorado School of Mines, 2018-2020
- Developed and released practical and theoretical tools to analyze highly nonstationary environmental data sets on the order of hundreds-of-millions of observations
- A primary component of my PhD work was as a Research Assistant at the National Center for Atmospheric Research in Boulder, Colorado
- AmeriCorps Math Fellow, Denver Public Schools, August 2016 - June 2017
- Enhanced middle school math education by providing personalized and small-group instruction to accelerate students’ learning progress
- Orchestrated after-school STEM programs, effectively managing resources and activities to promote student engagement and achievement
- Research Fellow, Four-College Biomathematics Consortium, May 2015 - May 2016
- Conducted innovative research on bovine water-intake in pasture, resulting in valuable insights for the industry
- Developed cost-effective and efficient data collection tools using Arduino and C++, enhancing data accuracy and streamlining processes
- Utilized advanced Matlab modeling techniques to analyze ice phenology of Lake Linné, Svalbard, contributing to the understanding of Arctic ecosystems
- Actively shared research findings at consortium conferences, fostering collaboration and knowledge exchange within the scientific community
Computer Skills
- Programming: Python, R, Matlab, C++, Bash/Shell script, LaTeX, PostgreSQL
- Packages: Numpy, Scipy, Pandas, scikit-learn, Keras, Tensorflow, ggplot2, dplyr, cartopy, netCDF4, dask, xarray, iris, geopandas, pytest
- Other: Git, MPI Programming, CI/CD
Software Publications and Contributions
Matlab: DeepTreeMRA, MRA-Parallel, MRA-Serial (Implementations of the Multi-resolution Approximation spatial model for various computational infrastructures)
Python: pyaerocom (Python tools for the AeroCom project), optimparallel (Parallel computing interface to the L-BFGS-B optimizer)
Professional Services
- Referee: Electronic Journal of Statistics, The Annals of Applied Statistics, Environmetrics
- Judge: Colorado School of Mines Undergraduate Research Symposium 2020
Peer-Reviewed Publications
Diabetic Retinopathy Screening in the Yucatan Peninsula Using Smartphone-based Fundus Photography and Deep-learning Artificial Intelligence: A Field Study
Wroblewski, J.J., Sanchez-Buenfil, E., Inciarte, M., Berdia, J., Blake, L.R., Wroblewski, S., Patti, A., Suter, G., Sanborn, G.E.: "Diabetic Retinopathy Screening in the Yucatan Peninsula Using Smartphone-based Fundus Photography and Deep-learning Artificial Intelligence: A Field Study", Journal of Diabetes Science and Technology. 2023;0(0).DOI: 10.1177/19322968231194644
Parametric Nonstationary Covariance Functions On Spheres
Blake, L.R., Porcu, E., Hammerling, D.M.: "Parametric Nonstationary Covariance Functions On Spheres", Stat, DOI: https://doi.org/10.1002/sta4.468, 2022.
Discussion on Competition for Spatial Statistics for Large Datasets
Blake, L.R., Khaliukova, O., Pinard, A., Nychka, D., Hammerling, D.M., Bandyopadhyay, S.: "Discussion on Competition for Spatial Statistics for Large Datasets", Journal of Agricultural, Biological and Environmental Statistics, DOI: https://doi.org/10.1007/s13253-021-00460-4, 2021.
Underground Mine Scheduling Under Uncertainty
Nesbitt, P., Blake, L.R., Lamas, P., Goycoolea, M., Pagnoncelli, B.K., Newman, A., Brickey, A.: "Underground Mine Scheduling Under Uncertainty", European Journal of Operational Research, Volume 294, Issue 1, DOI: https://doi.org/10.1016/j.ejor.2021.01.011, Pages 340-352, ISSN 0377-2217, 2021.
Technical Reports
Deposition of sulfur and nitrogen in Norway 2017-2021
Blake, L.R., Aas, W., Denby, B., Hjellbrekke, A., Mu, Q., Simpson, D., Ytre-Eide, M., Fagerli, H. : "Deposition of sulfur and nitrogen in Norway 2017-2021", MET Report ISSN 2387-4201, 2023.
Upgrade verification note for the CAMS near-real time global atmospheric composition service: Evaluation of the e-suite for the CAMS CY48R1 upgrade of 27 June 2023
H.J. Eskes, A. Tsikerdekis, A. Benedictow, Y. Bennouna, L. Blake, I. Bouarar, Q. Errera, J. Griesfeller, S. Basart, J. Kapsomenakis, B. Langerock, A. Mortier, I. Pison, M.R.A. Pitkänen, A. Richter, A. Schoenhardt, M. Schulz, V. Thouret, T. Warneke, C. Zerefos: "Upgrade verification note for the CAMS near-real time global atmospheric composition service: Evaluation of the e-suite for the CAMS CY48R1 upgrade of 27 June 2023", Copernicus Atmosphere Monitoring Service DOI:10.24380/rzg1-8f3l, 2023.
Validation report of the CAMS global reanalysis of aerosols and reactive trace gases, period 2003-2022
Y. Bennouna, A. Arola, A. Benedictow, L. Blake, I. Bouarar, E. Cuevas, Q. Errera, H.J. Eskes, J. Griesfeller, L. Ilic, J. Kapsomenakis, B. Langerock, A. Mortier, I. Pison, M.R.A. Pitkänen, A. Richter, A. Schoenhardt, M. Schulz, V. Thouret, A. Tsikerdekis, T. Warneke, C. Zerefos : "Validation report of the CAMS global reanalysis of aerosols and reactive trace gases, period 2003-2022", Copernicus Atmosphere Monitoring Service DOI:10.24380/1rx8-mwi7, 2023.
The Deep-Tree Approach: An Improved Parallel Matlab Implementation of the Multi-resolution Approximation for Massive Spatial Data on High-Performance Computing Systems
Blake, L.R., Huang, H., Vanderwende, B., and Hammerling, D.M.: "The Deep-Tree Approach: An Improved Parallel Matlab Implementation of the Multi-resolution Approximation for Massive Spatial Data on High-Performance Computing Systems", NCAR Technical Note NCAR/TN-565+STR, DOI: 10.5065/pzzt-wj18, 2021.
A Shallow-Tree Multi-Resolution Approximation for Distributed and High-Performance Computing Systems
Blake, L.R., Huang, H., Vanderwende, B., and Hammerling, D.M.: "A Shallow-Tree Multi-Resolution Approximation for Distributed and High-Performance Computing Systems", NCAR Technical Note NCAR/TN-559+STR, DOI: 10.5065/hvvq-j471, 2019.
Pushing the Limit: A Hybrid Parallel Implementation of the Multi-Resolution Approximation for Massive Data
Huang, H., Blake, L.R., and Hammerling, D.M.: "Pushing the Limit: A Hybrid Parallel Implementation of the Multi-Resolution Approximation for Massive Data", NCAR Technical Note NCAR/TN-551+STR, DOI: 10.5065/nnt6-q689H, 2019.
Parallel implementation and computational analysis of the multi-resolution approximation
Blake, L.R., Simonson, P., and Hammerling, D.M.: "Parallel implementation and computational analysis of the multi-resolution approximation", NCAR Technical Note NCAR/TN-551+STR, DOI: 10.5065/D6XW4HNH, 2018.
Poster Publications
Nonstationary spatial modeling of massive global satellite data
Hammerling, D., Huang, H., Blake, L.R., Katzfuss, M.: "Nonstationary spatial modeling of massive global satellite data", Poster, Computational Statistics 2022
Student-led investigation of TROPOMI data for the US.
Hammerling, D.M., Blake, L.R., Daniels, W., Dykstal, A., and Crowell, S.: "Student-led investigation of TROPOMI data for the US.", Poster, European Geosciences Union General Assembly 2020, DOI: https://doi.org/10.5194/egusphere-egu2020-22133, 2020.
Manuscripts in Preparation and Preprints
Nonstationary Spatial Modeling of Massive Global Satellite Data
Huang, H., Blake, L.R., Katzfuss, M., Hammerling, D.M.: "Nonstationary Spatial Modeling of Massive Global Satellite Data", arXiv https://arxiv.org/abs/2111.13428, 2021
Conference Contributions & Talks
Computational Developments and Applications of the Multi-Resolution Approximation for Massive Spatial Data
Talk at Computational and Methodological Statistics, King's College London
Computational Developments and Applications of the Multi-Resolution Approximation for Massive Spatial Data
Talk at Colorado Wyoming Chapter of the American Statistical Association Fall 2021 Meeting
Computational Developments and Applications of the Multi-Resolution Approximation for Massive Spatial Data
Talk at Joint Statistical Meeting
Computational Developments and Applications of the Multi-Resolution Approximation for Massive Spatial Data
Talk at Spatial and Temporal Statistics Symposium, University of Wollongong
A Nonstationary Multi-Resolution Approximation for Massive Spatial Data
Poster at Graduate Research and Discovery Symposium (GRADS), Colorado School of Mines
Implementing Spatial Statistical Methods for Massive Data
Talk at Applied Mathematics & Statistics Graduate Colloquium, Colorado School of Mines
Scaling the Multi-Resolution Approximation (MRA) to Massive Spatial Data Sets
Poster at Graduate Research and Discovery Symposium (GRADS), Colorado School of Mines
Scaling the Multi-Resolution Approximation to Massive Spatial Data Sets
Talk at SIAM Front Range Applied Mathematics Student Conference, University of Colorado Denver
Assume a Spherical Cow: Mathematical Modeling of Bovine Dynamical Systems
Talk at School of Natural Science Symposium, Hampshire College
Exploring Water Intake Modeling: A Collaborative Division III
Talk at School of Natural Science Fantastic Friday Talk, Hampshire College
Exploring Water Intake Modeling for Cattle in Pasture
Poster at Four College Biomath Consortium 2016 Graduating Fellows Symposium, Smith College
Analyzing Sensitivities in an Arctic Lake Ice Model
Talk at Four College Biomath Consortium Seminar, Hampshire College
Teaching
Statistical Learning II (MATH-561)
Graduate course, Teaching assistant
Spatial Statistics (MATH-532)
Graduate course, Grader
Statistical Learning I (MATH-560)
Graduate course, Teaching assistant
Statistics Practicum (MATH-482)
Undergraduate course, Teaching assistant
Survival Analysis (MATH-539)
Graduate course, Grader
Statistical Methods I (MATH-530)
Graduate course, Teaching assistant
Differential Equations (MATH-225)
Undergraduate course, Teaching assistant