Happy got a job!
Seela got a call!
Tanisha applied for a job!

Search Jobs

Back to Jobs

Site Reliability Engineer

HuntingCube Recruitment Solutions

Mumbai

Not Disclosed

5 - 9 Years

Full Time - Permanent

Views:91

Applicants:0

Posted on 22 Jul, 2025

In Office

Job Description | Responsibilities

  1. Monitor & alert with Prometheus, Grafana, Datadog

  2. Tune alert rules to reduce noise & detect anomalies

  3. Lead alerts triage, incidents & RCA

  4. Track SLA/SLO/SLI, MTTA/MTTD, report

  5. Debug production issues across stacks

  6. Automate ops via Python & Google Apps Script

  7. Collaborate with devs to boost reliability & refine on-call processes

Overview

  • Industry - HR, HUMAN RESOURCES
  • Functional Area - IT Software Programming / Analysis / Quality / Testing / Training, IT Software Maintenance / Operations / Support Services
  • Job Role - Software Engineer / Programmer
  • Employment type - Full Time - Permanent
  • Work Mode - In Office

Qualifications

  • Any Graduate - Any Specialization
  • Any Post Graduate - Any Specialization
  • Any Doctorate - Any Specialization

Job Related Keywords

pub/sub Troubleshooting / Trouble Shooting Kubernetes Cloud Engineer Software Engineer Google Cloud Site Reliability Engineer