Resilience and Reliability Architect

May 18, 2026
Urgent

Job Description

Site Reliability Engineering (SRE) Architect / Consultant

Company

Ernst & Young

Location

Hybrid / Client-facing role (Global opportunities)

Experience Required

12+ Years

Employment Type

Full-Time

About the Role

EY is seeking an experienced Site Reliability Engineering (SRE) Architect / Consultant to help enterprise clients improve reliability, scalability, automation, and operational excellence across modern IT environments.

The role involves designing SRE transformation roadmaps, implementing observability and automation solutions, and driving resilient cloud-native architectures using DevOps and SRE best practices.

Key Responsibilities

Define and implement SLA, SLO, and SLI frameworks for enterprise applications and platforms.
Design resilient, scalable, and highly available architectures across cloud and on-premises environments.
Implement observability and monitoring solutions using APM, logging, and analytics platforms.
Reduce operational toil through automation, scripting, and CI/CD improvements.
Drive Infrastructure as Code (IaC) and DevOps best practices.
Optimize infrastructure and operational costs using FinOps principles.
Troubleshoot and resolve performance, scalability, and availability issues.
Conduct JVM tuning, thread dump analysis, heap dump analysis, and performance optimization.
Collaborate with development, infrastructure, and operations teams to improve service reliability.
Define and implement Non-Functional Requirements (NFRs) for enterprise systems.
Support cloud-native modernization initiatives and microservices architectures.

Required Skills & Technologies

Core Technologies

Java / J2EE
Spring Boot
Microservices Architecture
RESTful Services
Web Services / SOA / ESB

Application & Web Servers

Apache Tomcat
IBM HTTP Server
WebSphere Application Server

Databases

Oracle
SQL Query Tuning
Database Architecture

DevOps & Automation

Jenkins
GitLab CI/CD
Azure DevOps
Terraform
Ansible
AWS CloudFormation

Cloud & Containerization

AWS / Azure / GCP
Kubernetes
OpenShift
Docker

Monitoring & Observability

Dynatrace
AppDynamics
Splunk
ELK Stack
CloudWatch
Azure Monitor
Azure Application Insights

Operating Systems & Scripting

Linux (RHEL)
Python
Git
Jira
Confluence

Preferred Skills

AI/ML and Data Analytics exposure
Experience with FinOps strategies
Knowledge of queuing models and thread pool optimization
Expertise in cloud-native architectures and enterprise scalability

Qualifications

Bachelor’s degree in Computer Science, Engineering, IT, or related field
12+ years of experience in software engineering, IT operations, or SRE environments
Strong hands-on expertise in enterprise architecture and reliability engineering

What EY Offers

Flexible hybrid work model
Global exposure and collaborative work culture
Learning and development opportunities
Competitive compensation and benefits
Inclusive and diverse workplace environment

How to Apply

Apply through the official EY Careers portal.

You can also learn more about the company on the official EY website.
