DevOps Engineer - Monitoring & Observability
Basic Information
Country:
State:
City:
Date Published:
Job ID:
Travel:
Secondary locations:
Description and Requirements
CareerArc Code
"At BMC trust is not just a word - it's a way of life!"
Job Summary:
We are seeking a highly skilled and motivated DevOps Engineer with experience in BMC Helix Operations Management, TrueSight Operations Management, Prometheus, or related monitoring tools. The ideal candidate will be responsible for developing, implementing, and maintaining robust monitoring solutions to ensure the health, performance, and reliability of our applications and infrastructure. This role requires a blend of software development, system administration, and operational support skills.
Key Responsibilities:
Monitoring and Alerting:
Design, implement, and maintain monitoring systems using BMC Helix Operations Management, TrueSight, Prometheus, or similar tools.
Develop custom dashboards and alerts to provide real-time insights into application performance and system health.
Ensure comprehensive coverage of all critical systems and applications with appropriate alerting thresholds and escalation processes.
Collaboration and Support:
Work closely with development teams to integrate monitoring solutions into the software development lifecycle.
Provide support for incident management and root cause analysis.
Participate in on-call rotation to ensure 24/7 support for critical systems.
Documentation and Training:
Create and maintain comprehensive documentation for monitoring configurations, procedures, and best practices.
Train and mentor team members on the effective use of monitoring tools and techniques.
Experience:
3+ years of experience in a DevOps or related role with a strong focus on monitoring and observability.
Hands-on experience with BMC Helix Operations Management, TrueSight Operations Management, Prometheus, or other monitoring tools.
Proficiency in scripting languages such as Python, Bash, or similar is an add-on
Technical Skills:
BMC Helix Operations Management (BHOM)
Knowledge of cloud platforms (OKE, GKE, AWS) and container orchestration tools (Kubernetes, Docker).
Experience with configuration management and automation tools (Ansible, git).
Familiarity with log management and analysis tools (ELK stack).
Understanding of networking, security, and system administration.
Soft Skills:
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work in a fast-paced, dynamic environment.Our commitment to you!
BMC’s culture is built around its people. We have 6000+ brilliant minds working together across the globe. You won’t be known just by your employee number, but for your true authentic self. BMC lets you be YOU!
If after reading the above, You’re unsure if you meet the qualifications of this role but are deeply excited about BMC and this team, we still encourage you to apply! We want to attract talents from diverse backgrounds and experience to ensure we face the world together with the best ideas!
BMC is committed to equal opportunity employment regardless of race, age, sex, creed, color, religion, citizenship status, sexual orientation, gender, gender expression, gender identity, national origin, disability, marital status, pregnancy, disabled veteran or status as a protected veteran. If you need a reasonable accommodation for any part of the application and hiring process, visit the accommodation request page.
(Returnship@BMC)
Had a break in your career? No worries. This role is eligible for candidates who have taken a break in their career and want to re-enter the workforce. If your expertise matches the above job, visit to https://bmcrecruit.avature.net/returnship know more and how to apply.
Min salary
Mid point salary
Max salary