Project Role Description : Supports the technology systems performance and reliability to meet service level targets. Assists with the creation and deploys continuous performance and capacity models using performance and availability monitoring tools, processes, and techniques. Collaborates with the Technology and Enterprise Architects for the selection and design of run-time and DevOps technologies.
Must have skills : Site Reliability Engineering
Good to have skills : NA
Minimum 7.5 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary A Site Reliability Engineer (SRE) ensures systems are stable, scalable, and highly available, bridging the gap between Business Application development and IT operations. This role combines automation, observability, incident response, and performance engineering to maintain continuous service reliability while accelerating delivery velocity. The Site Reliability Engineer designs and maintains production systems that meet defined Service Level Objectives (SLOs) and error budgets. Using software engineering principles, an SRE prevents downtime, automates operations, and improves platform performance through observability, fault tolerance, and system resilience. Key Responsibilities: Reliability and Performance: Monitor and optimize system uptime, latency, and throughput to meet SLOs and SLIs. Incident Management: Lead incident response, manage escalations, perform root cause analysis (RCA), and drive postmortem reviews. Automation and Tooling: Develop CI/CD pipelines, automate infrastructure management, and eliminate manual toil through scripting and orchestration. Monitoring and Observability: Implement metrics, logging, and tracing frameworks (Prometheus, Grafana, ELK, Datadog) to gain real-time visibility into distributed systems. Capacity Planning: Conduct resource forecasting, design scalable infrastructure, and handle performance under surge conditions. Change & Release Management: Partner with developers to ensure safe, reliable rollout of new features with automated testing and rollback mechanisms. Disaster Recovery & Resilience Engineering: Implement multi-region resilience strategies, chaos tests, and failover automation for business continuity. Process Improvement: Use post-incident analytics to refine operational practices and improve reliability with data-driven improvements. Collaborate with product, design, ML, and DevOps teams to build intelligent workflows and user experiences Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, AZURE DEV OPS or Pulumi. Expert in Cloud IaaS and PaaS services. Required Skills: Expertise in Python, Go, Bash, or JavaScript for automation and tooling. Hands-on with cloud environments AWS, Azure, GCP and orchestration tools like Kubernetes and Terraform. Deep understanding of Linux systems, networking, and distributed architectures. Experience with observability solutions Prometheus, Grafana, Datadog, CloudWatch, or New Relic. Familiarity with incident management and alerting platforms (PagerDuty, xmatters) Proficiency in CI/CD frameworks such as Jenkins, GitHub Actions, or GitLab CI. Working knowledge of security, compliance, and performance optimization for highly available systems. Certifications (Required / Preferred): AWS Certified Solutions Architect Professional Microsoft Certified: Azure Solutions Architect Expert Google Professional Cloud Architect Certified Kubernetes Administrator (CKA) HashiCorp Certified: Terraform Associate Certified DevOps Engineer certifications (AWS, Azure, or Google) Resource needs to be AI Ready.15 years full time education
About Accenture
Accenture is a leading global professional services company that helps the world’s leading businesses, governments and other organizations build their digital core, optimize their operations, accelerate revenue growth and enhance citizen services—creating tangible value at speed and scale. We are a talent- and innovation-led company with approximately 791,000 people serving clients in more than 120 countries. Technology is at the core of change today, and we are one of the world’s leaders in helping drive that change, with strong ecosystem relationships. We combine our strength in technology and leadership in cloud, data and AI with unmatched industry experience, functional expertise and global delivery capability. Our broad range of services, solutions and assets across Strategy & Consulting, Technology, Operations, Industry X and Song, together with our culture of shared success and commitment to creating 360° value, enable us to help our clients reinvent and build trusted, lasting relationships. We measure our success by the 360° value we create for our clients, each other, our shareholders, partners and communities.Visit us at www.accenture.com
Equal Employment Opportunity Statement
We believe that no one should be discriminated against because of their differences. All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, military veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by applicable law. Our rich diversity makes us more innovative, more competitive, and more creative, which helps us better serve our clients and our communities.
Top Skills
Accenture Bengaluru, Karnataka, IND Office
SEZ, Divyashree Techno Park 36/2, K R Puram, Hobli, Kundalahalli, Whitefield, Bengaluru, Karnataka, India, 560066


