
Results-driven software engineer with over 12 years of experience in designing, building, and operating large-scale distributed systems. Expertise in Java and Spring Boot backend development, microservices, and cloud-native architectures, complemented by hands-on experience in Site Reliability Engineering (SRE) practices. Proficient in implementing SLIs/SLOs, creating observability dashboards using tools such as New Relic, Dynatrace, and DataDog, and developing automation tools in Python to enhance reliability and minimize downtime. Strong commitment to bridging development and operations fosters the creation of resilient, scalable, and business-critical systems.
Programming & Automation: Java, Python, Spring Boot, gRPC, REST, Bash
Cloud & Infrastructure: Google Cloud Platform (GCP), Azure, Terraform, Kubernetes, Docker, Helm
Observability & Monitoring: New Relic, Dynatrace, DataDog, SLI/SLO design
CI/CD & DevOps: GitHub Actions, Gradle
Practices: SRE best practices, Blameless Postmortems, Service Maturity Models, TDD, BDD, Agile/Kanban
Misc : Capacity planning, ITIL framework, System monitoring, Incident management, Log analysis
GenAI : Python, Pandas, ML Fundamentals, Scikit-learn, PyTorch, TensorFlow Vibe Coding, Gihub Copilot, Prompt Engineering