Description:
Primary Responsibilities
- **Handling Major Incidents:**
  - Manage Critical Issue Response System (CIRS) for major incidents.
  - Provide frequent updates on CES-based CIRS until the issue is stabilized.
  - Perform deep dive troubleshooting on applications.
- **Preventive Actions and Requests:**
  - Identify and create preventive action items for CIRS.
  - Handle CIRS-based requests, including DFs, feature toggles, and deployments.
  - Follow up on major production incidents to ensure resolution.
- **Monitoring and Planned Activities:**
  - Use monitoring tools such as Dynatrace and Kibana.
  - Drive and monitor planned activities.
  - Write new monitoring scripts and enhance the existing monitoring scope.
- **Customer Escalations and Application Issues:**
  - Handle customer escalations efficiently.
  - Deep dive into application issues to identify root causes.
  - Create Splunk alerts based on CIRS learnings.
  - Troubleshoot and coordinate customer escalations raised by Support and Engineering teams.
- **Ad-hoc Requests:**
  - Address ad-hoc requests from CES teams.
Organization | JSS ASSOCIATES |
Industry | Engineering |
Occupational Category | Site Reliability Engineer |
Job Location | Dublin, Ireland |
Shift Type | Morning |
Job Type | Full Time |
Gender | No Preference |
Career Level | Experienced Professional |
Experience | 5 Years |
Posted at | 2024-07-09 5:32 pm |
Expires on | 2024-12-23 |