Kaleb Coberly

Bellingham, WA, USA kaleb.coberly@gmail.com Visit my website: kalebcoberly.com

Download Résumé

Download this résumé as: PDF | Word (.docx)

Summary

Data engineer, backend software engineer with a background in research and analysis. Builds and deploys tools, analytics, and pipelines that help others reliably use data better. Supports users and developers with documentation, dev tools, issue triage, hands-on training, and debugging. Improves systems with a focus on clarity and maintainability. Prioritizes data quality, reproducibility, validity, and meaningful results.

Skills

Programming & Scripting: Python, R, Bash, SQL, NoSQL, C++, HTML.
DevOps & Tooling: Git, Make, Docker, GitHub Actions, Jenkins, CI/CD, Python packaging, debugging, QC and testing frameworks.
Data Engineering: ETL pipelines, RDBMS, database design, data modeling, data cleaning, data validation, methodological implementation, data versioning, MySQL, MariaDB, MongoDB, ElasticSearch.
ML Engineering: ML pipelines, feature engineering, performance evaluation, model validation.
Analysis & Visualization: EDA, Jupyter Notebook, RStudio, R Markdown, mermaid, Excel, Kibana, Tableau.
Documentation & Support: Sphinx, reStructuredText, user training, contributor onboarding, issue triage.
Platforms & Environments: MacOS, Windows, Linux, HPC (Slurm, UGE), Kubernetes (k8s), conda, GitHub, Bitbucket/Stash, VS Code, JIRA.

Experience

Advisory Board Member and Citizen Science Volunteer Engineer

Cascade STEAM, April 2025 – Present

Founder and Lead Engineer

Crickets and Comb, November 2024 – Present

  • Implemented and maintain reusable DevOps tools and workflows for local + Docker development and GitHub CI/CD.
  • Onboard developers with documentation, presentations, and hands-on support.
  • Automated part of Bellingham Food Bank’s delivery operations, reducing several staff-hours with each weekly use:
    • cricketsandcomb.org/#projects
    • Designed and implemented an ETL CLI integrated with staff workflows and 3rd-party API.
    • Achieved high test coverage, by line count, problem space, and abstraction level.
    • Led incremental design, rollout, and user training.
    • Wrote and deployed technical and non-technical documentation.
    • Maintain ongoing user support and dependency management.
    • Released as open-source project; lead collaborative development at all skill levels.

Research Engineer

IHME — Central Computation (GBD), Nov 2022 – Oct 2024

  • www.healthdata.org/research-analysis/gbd
  • Developed and maintained internal Python and dev tools for data modeling, versioning, access, and dev workflows.
  • Contributed to a large in-house Python ecosystem: features, bugfixes, major test coverage increases, and documentation.
  • Maintained Jenkins builds across ~100 packages; debugged issues across DBs, Python APIs/CLIs, Docker, k8s, and HPC.
  • Directly supported research teams: Investigated failed model runs, data access issues, and tooling gaps.
  • Analyzed system usage and tool performance via ElasticSearch; ran ad-hoc reports and built automated report.

Data Analyst

IHME — Pandemics Team, Nov 2021 – Dec 2022

  • Owned multiple data pipelines for COVID-19 production; reduced key runtime by 65%.
  • Refactored vaccine and booster models for maintainability and extensibility.
  • Built diagnostic tools and plots, with automated reporting for data QC; inspected and cleaned data.
  • Led early UGE-to-Slurm transition for production jobs.
  • Built ETL pipeline in R of national data into crosswalked, versioned output.
  • Produced executive reports for world leaders, running ETL into R Markdown for PDFs; adapted to changing model outputs and feature requests:

Education

B.S., Database Management and Data Analytics – Western Governors University
B.A., Critical Studies and Pedagogy – The Evergreen State College

Certifications

References

Available upon request.