Site Reliability Engineer

 

Description:

Primary Responsibilities

 

- **Handling Major Incidents:**

- Manage Critical Issue Response System (CIRS) for major incidents.

- Provide frequent updates on CES-based CIRS until the issue is stabilized.

- Perform deep dive troubleshooting on applications.

 

- **Preventive Actions and Requests:**

- Identify and create preventive action items for CIRS.

- Handle CIRS-based requests, including DFs, feature toggles, and deployments.

- Follow up on major production incidents to ensure resolution.

 

- **Monitoring and Planned Activities:**

- Utilize monitoring tools such as Dynatrace, Kibana, etc.

- Drive and monitor planned activities.

- Write new monitoring scripts and enhance the existing monitoring scope.

 

- **Customer Escalations and Application Issues:**

- Handle customer escalations efficiently.

- Deep dive into application issues to identify root causes.

- Create Splunk alerts based on CIRS learnings.

- Troubleshoot and coordinate customer escalations raised by Support and Engineering teams.

 

- **Ad-hoc Requests:**

- Address ad-hoc requests from CES teams.

Organization JSS ASSOCIATES
Industry Engineering
Occupational Category Site Reliability Engineer
Job Location Dublin,Ireland
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Experienced Professional
Experience 5 Years
Posted at 2024-07-09 5:32 pm
Expires on 2024-11-17