Staff Site Reliability Engineer

SolarWinds Corporation

8 - 10 years

Bangalore

Posted: 7/13/2023

Job Description

Your Impact:


  • Work collaboratively with software engineering to define infrastructure and deployment requirements
  • Be the driving force behind our automation and observability initiatives
  • Build and maintain operational tools for deployment, monitoring, and analysis of cloud (AWS and Azure) infrastructure and systems
  • Lead the response to production incidents, conduct postmortems, drive continuous improvement, and take part in the on-call rotation
  • Establish and drive operations performance through SLOs
  • Provide project management, sprint planning, and road-mapping support to the SRE team
  • Bring expert-level technical skills and mentor team members
  • Follow the practices our team uses to maximize development velocity, including but not limited to continuous integration/deployment and code review via GitHub pull requests


Ideal Attributes:


  • Strong customer orientation
  • Excellent interpersonal and organizational skills
  • Attention to detail and focus on quality
  • Strong communication skills to effectively liaise with both technical and non-technical staff
  • Ability to act decisively and work well under pressure
  • Must be a collaborative problem solver
  • Strong bias for ownership and action


Your Experience:


  • 8+ years of experience designing, building, and maintaining SaaS environments
  • 5+ years of experience designing, building, and maintaining AWS/Azure infrastructure with Terraform
  • Experience building and running Kubernetes clusters
  • Experience with observability and monitoring (logging, tracing, metrics)
  • Experience with GitOps CI/CD processes
  • Experience scripting with Python, Go (Golang), Bash, or PowerShell, and with AWS CLI tools
  • Experience with security operations: security policies, infrastructure, key management, and setup of encryption at rest and in transit


About Company

SolarWinds Corporation is an American company that develops software for businesses to help manage their networks, systems, and information technology infrastructure. It is headquartered in Austin, Texas, with sales and product development offices in a number of locations in the United States and several other countries. The company was publicly traded from May 2009 until the end of 2015, and again from October 2018. It has also acquired a number of other companies, some of which it still operates under their original names, including Pingdom, Papertrail, and Loggly. It had about 300,000 customers as of December 2020, including nearly all Fortune 500 companies and numerous agencies of the US federal government. A SolarWinds product, Orion, used by about 33,000 public and private sector customers, was the focus of a large-scale attack disclosed in December 2020. The attack persisted undetected for months in 2020, and additional details about the breadth and depth of compromised systems continued to surface after the initial disclosure. In February 2021, Microsoft President Brad Smith said that it was "the largest and most sophisticated attack the world has ever seen".
