Kaleb Coberly
Bellingham, WA, USADownload Résumé
Download this résumé as: PDF | Word (.docx)Summary
Data engineer, backend software engineer with a background in research and analysis. Builds and deploys tools, analytics, and pipelines that help others reliably use data better. Supports users and developers with documentation, dev tools, issue triage, hands-on training, and debugging. Improves systems with a focus on clarity and maintainability. Prioritizes data quality, reproducibility, validity, and meaningful results.
Skills
Programming & Scripting: Python, R, Bash, SQL, NoSQL, C++, HTML.
DevOps & Tooling: Git, Make, Docker, GitHub Actions, Jenkins, CI/CD, Python packaging, debugging, QC and testing frameworks.
Data Engineering: ETL pipelines, RDBMS, database design, data modeling, data cleaning, data validation, methodological implementation, data versioning, MySQL, MariaDB, MongoDB, ElasticSearch.
ML Engineering: ML pipelines, feature engineering, performance evaluation, model validation.
Analysis & Visualization: EDA, Jupyter Notebook, RStudio, R Markdown, mermaid, Excel, Kibana, Tableau.
Documentation & Support: Sphinx, reStructuredText, user training, contributor onboarding, issue triage.
Platforms & Environments: MacOS, Windows, Linux, HPC (Slurm, UGE), Kubernetes (k8s), conda, GitHub, Bitbucket/Stash, VS Code, JIRA.
Experience
Advisory Board Member and Citizen Science Volunteer Engineer
Cascade STEAM, April 2025 – Present
- cascadesteam.org
- Leading the Citizen Science group’s development of computer-vision ETL tool for stormwater monitoring field observation datasheets:
- Integrating DevOps tools and workflows with Cascade STEAM infrastructure.
- Consulting on selection of collaborative and project-management platforms.
Founder and Lead Engineer
Crickets and Comb, November 2024 – Present
- Implemented and maintain reusable DevOps tools and workflows for local + Docker development and GitHub CI/CD.
- Onboard developers with documentation, presentations, and hands-on support.
-
Automated part of Bellingham Food Bank’s delivery operations, reducing several staff-hours with each weekly use:
- cricketsandcomb.org/#projects
- Designed and implemented an ETL CLI integrated with staff workflows and 3rd-party API.
- Achieved high test coverage, by line count, problem space, and abstraction level.
- Led incremental design, rollout, and user training.
- Wrote and deployed technical and non-technical documentation.
- Maintain ongoing user support and dependency management.
- Released as open-source project; lead collaborative development at all skill levels.
Research Engineer
IHME — Central Computation (GBD), Nov 2022 – Oct 2024
- www.healthdata.org/research-analysis/gbd
- Developed and maintained internal Python and dev tools for data modeling, versioning, access, and dev workflows.
- Contributed to a large in-house Python ecosystem: features, bugfixes, major test coverage increases, and documentation.
- Maintained Jenkins builds across ~100 packages; debugged issues across DBs, Python APIs/CLIs, Docker, k8s, and HPC.
- Directly supported research teams: Investigated failed model runs, data access issues, and tooling gaps.
- Analyzed system usage and tool performance via ElasticSearch; ran ad-hoc reports and built automated report.
Data Analyst
IHME — Pandemics Team, Nov 2021 – Dec 2022
- Owned multiple data pipelines for COVID-19 production; reduced key runtime by 65%.
- Refactored vaccine and booster models for maintainability and extensibility.
- Built diagnostic tools and plots, with automated reporting for data QC; inspected and cleaned data.
- Led early UGE-to-Slurm transition for production jobs.
- Built ETL pipeline in R of national data into crosswalked, versioned output.
- Produced executive reports for world leaders, running ETL into R Markdown for PDFs; adapted to changing model outputs and feature requests:
Education
B.S., Database Management and Data Analytics – Western Governors University
B.A., Critical Studies and Pedagogy – The Evergreen State College
Certifications
- Quantum Computing (Qiskit & Python) – Udemy, Apr 2025
- Mathematics for ML & Data Science – DeepLearning.AI, Jan 2025
- Oracle Database SQL Certified Associate – Oracle, Jul 2020
- CompTIA A+, Network+, Project+ – CompTIA, 2019 / 2020
References
Available upon request.