Senior Site Reliability Engineer

Description and Requirements

From core to cloud to edge, BMC delivers the software and services that enable over 10,000 global customers, including 84% of the Forbes Global 100, to thrive in their ongoing evolution to an Autonomous Digital Enterprise.

Thanks to our ongoing expansion, we have the opportunity to grow our Site Reliability Engineering team. We’re looking for people who are just as passionate about solving issues with distributed systems as they are about automating, coding, and collaborating to tackle problems.

Primary Role & Responsibilities:

  • You are either a DevOps engineer or an SRE with real interest and experience in Linux systems, networking, monitoring and automation, containerization, and cloud technologies, and a proven record of using software engineering to solve operational problems.
  • You are comfortable writing software to automate API-driven tasks at scale. Python preferred.
  • You will participate in SRE software engineering, writing code that steadily reduces human intervention in operational tasks and automates processes.
  • You will manage cloud provider infrastructure, system deployments, and product release operations.
  • You will monitor the application ecosystem, respond to incidents, correct and improve systems to prevent incidents, and plan capacity.
  • You will own the resolution of Elasticsearch-related customer issues.
  • You will participate in 24x7x365 on-call schedules.

Qualifications:

  • 2+ years of overall experience with monitoring and observability to proactively predict failure, and a thorough understanding of logging and monitoring tools such as the ELK Stack, Prometheus, and Grafana.
  • Solid experience with CI/CD pipelines and DevOps tools such as Jenkins, Docker, Git, Kubernetes, Terraform, and Ansible.
  • You have a passion for collaborating cross-functionally and cross-product to identify and own the root cause analysis (RCA) and mitigation plan, and to reduce mean time to recovery (MTTR).
  • You have experience automating the build and deployment of software products, and understand the related challenges in distributed systems.
  • You have experience using a Public Cloud: AWS, GCP, Azure, SoftLayer or OpenStack.
  • Experience with Linux system administration in both colocation (COLO) and cloud environments.
  • Experience working remotely with a fully distributed team, with the communication and adaptability it requires.
  • Experience mentoring others and helping them grow their ability to use and contribute to the tooling you help build.
  • Ability to explain technical concepts to multiple audiences.
  • Experience building public-cloud-agnostic software.

BMC helps customers run and reinvent their businesses in the digital age by tackling their IT management challenges, championing their innovation, and celebrating their success.
Every BMC employee has the potential to have a tremendous impact on customer success—and when customers thrive, we all do.

BMC offers bold and fearless career-seekers like you the opportunity to expand your skills, your network, and your horizons as you work to enable customer growth and innovation every day. You will be surrounded by peers who inspire you, drive you, support you, and make you laugh out loud, in an environment that fosters individuality, respect, and personal ambition.

It is the policy of BMC Software to afford equal opportunity for employment to all individuals regardless of race, color, creed, sex, age, sexual orientation, national origin, disability, ancestry, citizenship status, political affiliation, religion, gender, transgender, gender identity, gender expression, marital status, status as a parent, disabled veteran or status as a protected veteran, genetic information or other factors prohibited by law, and to prohibit harassment or retaliation based on any of these factors.

If you need a reasonable accommodation for any part of the application and hiring process, visit the accommodation request page.