Telesign

Site Reliability Engineer (SRE) III

Posted 2 Days Ago
Bangalore, Bengaluru, Karnataka
Senior level
The Site Reliability Engineer (SRE) will design and implement automated systems to improve reliability and performance of cloud services, develop tools for deployment and management, engage in incident response, and advocate for cloud-agnostic architecture principles.
The summary above was generated by AI

Site Reliability Engineer (SRE) - Automation & Tooling 

Position Overview: 

We are looking for a talented and motivated Site Reliability Engineer (SRE) with a strong focus on system administration, automation and tooling. As part of our dynamic engineering team, you will play a crucial role in building and maintaining reliable, scalable, and efficient cloud infrastructure. You will work closely with development, operations, and product teams to enhance our systems and services while championing the best practices of SRE. 

Key Responsibilities: 

  • Design, develop, and implement automated systems to improve the reliability, performance, and scalability of our services.
  • Create and maintain tooling that facilitates rapid deployment, monitoring, and management of our infrastructure.
  • Collaborate with cross-functional teams to integrate automation solutions with existing workflows and pipelines.
  • Identify and resolve performance bottlenecks and ensure high availability of critical services.
  • Develop and follow SRE best practices to enhance system reliability and operational efficiency.
  • Contribute to incident response and postmortem analysis to continuously improve our systems.
  • Participate in on-call rotations to support continuous 24/7 operations.
  • Foster a culture of continuous improvement through proactive monitoring, performance tuning, and capacity planning.
  • Advocate for cloud-agnostic architecture principles and assist in the integration and management of multi-cloud environments. 

Qualifications:

  • B.S./M.S. in Computer Science, Engineering, or a related field, or equivalent industry experience.
  • 5+ years of experience as a Site Reliability Engineer, Systems Engineer, or in a similar role.
  • Familiarity with CI/CD pipelines and relevant tools (e.g., Jenkins, Bitbucket).
  • 5+ years of hands-on experience with VMware (or a similar virtualization solution) and Linux (Red Hat).
  • Solid understanding of and experience with cloud platforms (e.g., AWS, Azure) and cloud-agnostic architectural principles.
  • Strong proficiency in configuration automation tools and frameworks (e.g., Terraform, Puppet).

Nice to have:

  • Demonstrated knowledge of incident management and post-incident analysis processes (e.g., SLIs, SLOs, SLAs).
  • Solid understanding of and experience with message queuing (RabbitMQ), caching (Redis Cluster), and web-layer tools (NGINX, Apache, Gunicorn).
  • Extensive experience with scripting and programming languages (e.g., Python, PowerShell, Bash, Cloud CLIs).
  • Solid understanding of and experience with containers and container orchestration tools (e.g., Docker, Kubernetes, EKS).
  • Expertise in monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack).

Top Skills

Bash
PowerShell
Python
