Systems · Reliability · Human-centered tooling
Research Lab
We prototype resilient platforms, study how teams adopt new practices, and translate research into measurable outcomes through experiments, mentoring, and open collaboration.
Lab overview
The lab exists to close the gap between strong ideas and systems that stay reliable in production. We combine rigorous experimentation with adoption support: clear metrics, safe rollout patterns, and documentation that helps the next engineer succeed.
Current themes include distributed systems resilience, observability for complex workflows, and interfaces that make operational data legible for both experts and newcomers.
Mission
Ship evidence-backed improvements with transparent trade-offs and reproducible methods.
Focus
Reliability engineering, performance characterization, and developer experience.
Goals
Fewer incidents, faster recovery, and stronger knowledge transfer across partner teams.
People
Mehedi Hasan
Principal Investigator
Systems research, platform reliability, and mentoring engineers through complex migrations.
Aisha Rahman
PhD Student
Latency-aware scheduling and workload modeling under bursty traffic.
Daniel Kim
Postdoctoral Researcher
Observability pipelines, SLO design, and incident archaeology.
Sofia Nguyen
MSc Researcher
Human factors in on-call tooling and alert fatigue reduction.
James Park
Lab Coordinator
Operations, procurement, and keeping experiments reproducible week to week.
Lina Torres
Undergraduate Researcher
Benchmark harnesses and visualization for comparative load tests.
Open positions
Summer reliability intern
Build measurement tooling and dashboards for controlled chaos experiments. You’ll pair with senior researchers and leave with a public write-up of findings.
- Coursework in systems or networking; comfort with Python or Go
- Interest in SRE practices and clear technical writing
RA — observability & tracing
Help extend our tracing study across services: ingestion hygiene, sampling strategies, and operator interviews.
- Experience with OpenTelemetry or similar
- Ability to work 15–20 hours/week for at least two terms
Visiting researcher
Short-term collaboration slots for work on resilience patterns, benchmarks, or education tooling that aligns with lab themes. Remote-friendly, with periodic syncs.
- Short proposal (1–2 pages) and expected outcomes
- Letter of support from your home institution (if applicable)
Independent study / thesis
If you are already enrolled and want to align coursework with lab themes, we can scope a project with concrete deliverables and a grading rubric.
- Prerequisites agreed with your program advisor
- Weekly standups and a mid-term demo