Hi, I'm Sibasis — Staff Site Reliability Engineer & Platform Engineer
Staff Site Reliability Engineer with 12+ years of experience designing, operating, and evolving production infrastructure for distributed systems at global scale. My work spans large-scale Kubernetes platform engineering, cloud architecture, high-stakes incident command, and reliability practice across hypergrowth environments.

At Delivery Hero's Tech Foundations Domain, I own the reliability, scalability, and operational posture of a Kubernetes platform processing 5M+ API requests per hour across 100+ production clusters. I define SLOs and error budgets with engineering teams, lead production readiness reviews, and drive incident management from detection through post-mortem. I have contained some of the highest revenue-impact failures in the company's history, including AWS region outages and cluster-wide IP exhaustion.

I actively integrate agentic AI into operational workflows, building LLM-powered tooling for alert correlation, incident summarization, and autonomous runbook execution. I build platforms that survive production, not just pass a load test.
postgres:// DSN pgview │ <Esc> back <g> top <G> bottom │ Data public.routes
admin@mydb · local │ <d> describe <f> row view/edit │ 42 rows ~1.2K est · PK: id
─────────────────────────────────────────────────────────────────────────────────────────
    id  name           status  created_at           tags
▶    1  Alice Johnson  active  2024-01-15 09:23:11  {platform,growth}
     3  Carol White    active  2024-03-19 11:02:44  {platform,api}
     7  Eve Martinez   active  2024-05-01 16:14:09  {growth}
WHERE "status"::text ILIKE 'active'
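
The filter shown in the mockup uses `ILIKE` with a pattern that has no wildcards, so in PostgreSQL it behaves like a case-insensitive equality check on the text form of `status`. The snippet below is a rough Python sketch of that matching rule, not pgview's code: the `ilike` helper and the sample status values are hypothetical, and backslash escapes and locale-specific case folding are deliberately ignored.

```python
import re


def ilike(value: str, pattern: str) -> bool:
    """Approximate PostgreSQL ILIKE (hypothetical helper, illustration only).

    '%' matches any run of characters, '_' matches exactly one character,
    and everything else is compared case-insensitively. Backslash escapes
    are not handled.
    """
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    regex = "".join(parts)
    # fullmatch: the pattern must cover the whole value, as LIKE/ILIKE require.
    return re.fullmatch(regex, value, flags=re.IGNORECASE | re.DOTALL) is not None


# Sample values are made up for the example; they are not rows from the table above.
statuses = ["active", "Active", "inactive", "ACTIVE "]
matches = [s for s in statuses if ilike(s, "active")]
print(matches)
```

Without a `%` in the pattern, `inactive` and values with trailing whitespace are not matched. A substring filter would need a pattern such as `'%active%'`.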