Staff Engineer Network - Observability [T500-19766]
Albertsons Companies India
2 - 5 years
Bengaluru
Posted: 10/01/2026
Job Description
ANSR is hiring for one of its clients.
About Albertsons Companies Inc.:
As a leading food and drug retailer in the United States, Albertsons Companies, Inc. operates over 2,200 stores across 35 states and the District of Columbia. Our well-known banners across the United States, including Albertsons, Safeway, Vons, Jewel-Osco and others, serve more than 36 million U.S customers each week.
We build and shape technology solutions that solve customers problems every day, making things easier for them when they shop with us online or in a store. We have made bold, strategic moves to migrate and modernize our core foundational capabilities, positioning ourselves as the first fully cloud-based grocery tech company in the industry.
Our success is built on a one-team approach, driven by the desire to understand and enhance the customer experience. By constantly pushing the boundaries of retail, we are transforming shopping into an experience that is easy, efficient, fun and engaging.
About Albertsons Companies India:
At Albertsons Companies India, we're not just pushing the boundaries of technology and retail innovation, we're cultivating a space where ideas flourish and careers thrive. Our workplace in India is a vital extension of the Albertsons Companies Inc. workforce and important to the next phase in the companys technology journey to support millions of customers lives every day.
At the Albertsons Companies India, we are raising the bar to grow across Technology & Engineering, AI, Digital and other company functions, and transform a 165-year-old American retailer. At Albertsons Companies India associates collaborate directly with international teams, enhancing decision-making processes and organizational agility through exciting and pivotal projects. Your work will make history and help millions of lives each day come together around the joys of food and inspire their well-being.
Position Title: Staff Engineer Network - Observability
Job Description:
Experience Required:
- Bachelors degree in Computer Science, Engineering, or a related field
- 9+ years of experience in network engineering or SRE roles, with a strong focus on cloud observability.
- Proven hands-on experience in Azure, GCP, and OCI networking and monitoring services.
- Expert-level experience with Grafana, including dashboard creation, alerting, and plugin integration.
- Proficiency in Prometheus, Loki, Tempo, or similar observability stack components.
- Strong knowledge of networking fundamentals: BGP, DNS, VNETs / VPCs, subnets, routing, firewalls, load balancing.
- Experience with cloud-native logging and metrics services (e.g., Azure Log Analytics, GCP Cloud Logging, OCI Logging).
- Scripting and automation skills in Python, Bash, or PowerShell.
Core Technical Skills:
Networking Expertise
- Deep understanding of protocols: TCP/IP, BGP, OSPF, MPLS, DNS, etc.
- Familiarity with network architectures (LAN / WAN, SD-WAN, cloud networking)
Monitoring & Telemetry Tools
- Tools like ThousandEyes, Kentik, AppNeta, SolarWinds, Nagios, Zabbix
- SNMP, NetFlow/sFlow/IPFIX, syslog, packet capture tools (Wireshark, tcpdump)
Observability Platforms Must
- Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana)
- Splunk, Datadog, New Relic, OpenTelemetry
Data Analysis & Visualization
- Ability to interpret metrics, logs, traces
- Experience with time-series databases (InfluxDB, Prometheus TSDB)
Scripting & Automation - Must
- Python, Bash, or PowerShell for custom data collection and automation
- REST APIs for integrating observability tools
Cloud & Hybrid Infrastructure Monitoring
- AWS CloudWatch, Azure Monitor, GCP Operations Suite
- Understanding of cloud-native networking and service meshes (e.g., Istio)
Analytical & Soft Skills:
Troubleshooting & Root Cause Analysis
- Ability to correlate data across layers (network, application, infrastructure)
Performance Tuning
- Identifying bottlenecks and optimizing network paths
Security Awareness
- Detecting anomalies, DDoS patterns, and unauthorized access
Collaboration
- Working with SREs, DevOps, and network teams to improve visibility
Documentation & Reporting
- Creating dashboards, reports, and incident postmortems
Bonus Skills:
- Experience with AI/ML for anomaly detection
- Familiarity with SIEM tools (Security Information and Event Management)
- Knowledge of SLAs, SLOs, and SLIs in service reliability
Must have Skills:
- Observability Platforms (Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana) or any other platform)
- Scripting & Automation (Python, Bash, PowerShell etc. for custom data collection and automation)
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
