Site Reliability Engineer
Amicon Hub Services
2 - 5 years
Bengaluru
Posted: 15/04/2026
Job Description
Site Reliability Engineer (Cloud & Infrastructure)
About the Role
We are seeking a smart and proactive Site Reliability Engineer (SRE) to manage and scale distributed systems and cloud infrastructure for a leading E-Commerce company. The role involves end-to-end ownership of Kafka and Redis clusters, and the broader cloud infrastructure supporting high-scale production systems. While GCP experience is preferred, strong engineers from AWS or Azure backgrounds who can quickly ramp up on GCP are equally encouraged to apply.
The ideal candidate is not only technically strong but also a champion of SRE principles: someone who continuously drives reliability, performance, automation, and cost optimization across teams.
Key Responsibilities
Infrastructure Management: Manage, scale, and tune Kafka, Redis, and related tech stacks (e.g., Druid, Hadoop, Spark, ClickHouse). Handle cluster upgrades, packaging (e.g., Debian), and optimizations for performance and reliability.
Automation & IaC: Automate infrastructure provisioning, scaling, and operations using Terraform, Ansible, and scripting (Python/Go/Shell). Continuously improve reliability through automation.
Monitoring, Logging & Observability: Implement and optimize observability stacks. Monitor system health and performance, set up alerting, and leverage common monitoring platforms and logging frameworks.
CI/CD & Deployment: Design, maintain, and improve CI/CD pipelines for build, test, and deployment automation. Ensure secure, rapid, and repeatable deployments.
Security & Compliance: Implement and maintain security best practices to protect systems from vulnerabilities and attacks. Understand and act on the business impact of system reliability.
Incident Management: Lead incident response, root cause analysis, and post-mortems. Handle on-call responsibilities, troubleshoot issues, and resolve incidents with minimal guidance.
Cost Optimization: Identify and implement cloud cost optimization strategies, including automation of quota and usage reports and monitoring for unused resources.
Cross-Team Collaboration: Work closely with multiple engineering and operations teams, taking ownership of key initiatives and driving them to successful completion.
