Elanco

Lead Engineer – Site Reliability Engineer

Posted 9 Days Ago
Bangalore, Bengaluru Urban, Karnataka
Senior level
The Lead Engineer – Site Reliability Engineer will oversee application reliability, define SLOs/SLIs, lead telemetry implementation, and promote reliability across the software lifecycle while mentoring junior engineers.
The summary above was generated by AI

At Elanco (NYSE: ELAN) – it all starts with animals!

As a global leader in animal health, we are dedicated to innovation and delivering products and services to prevent and treat disease in farm animals and pets. We’re driven by our vision of ‘Food and Companionship Enriching Life’ and our approach to sustainability – the Elanco Healthy Purpose™ – to advance the health of animals, people, the planet and our enterprise.

At Elanco, we pride ourselves on fostering a diverse and inclusive work environment. We believe that diversity is the driving force behind innovation, creativity, and overall business success. Here, you’ll be part of a company that values and champions new ways of thinking, work with dynamic individuals, and acquire new skills and experiences that will propel your career to new heights.

Making animals’ lives better makes life better – join our team today!

Solution Ops Engineer Job Description

Role Title: Lead Engineer – Site Reliability Engineer

Location: India

Team: Software Engineering and Platforms

Supervisor: Software Engineering Director

Career Progression: Engineering, Architecture

Position Description:

Historically, the role of IT has been to provide a reliable ecosystem to run the business, drive efficiencies and reduce costs. These areas remain integral; however, driven by the quickening pace of innovation, IT must evolve, proactively partnering with the business to enable new digital business models that power new types of customer engagement.

At Elanco, our engineer roles bring an adaptive set of skills covering Software-as-a-Service (SaaS), Commercial-off-the-Shelf (COTS) and/or Custom Developed applications. The role is part of our software engineering team, established to deliver engineering expertise to business-facing products and services. As an Engineer, you will be deployed into a multi-disciplined product team, applying your software engineering talent to Elanco’s biggest opportunities.

Success in an engineering role at Elanco requires a highly motivated individual with an innovative mindset and a willingness to drive tangible outcomes. The individual must be able to articulate complex technical topics and collaborate with the internal engineering organisation to improve engineering across the enterprise.

The Role

We are seeking a skilled and motivated engineer, passionate about improving application reliability across our enterprise. As part of our Platform Engineering organization, you will join a product team focused on a suite of capabilities designed to enhance all aspects of our engineering portfolio.

In this role, you will be primarily accountable for configuring and operating our observability toolset. You will also lead the charge across the enterprise, driving the transition from reactive to proactive application support.

This is a fantastic opportunity to join a growing engineering team with the scope to partner across our entire enterprise of products. Your contributions will help ensure that everything we deliver to our customers comes with top-notch reliability as standard.

Typical responsibilities:

Help define Elanco’s approach to application reliability, partnering with the product manager for our portfolio health products.

Collaborate with stakeholders, such as product and platform owners, to define service-level objectives (SLOs) and service-level indicators (SLIs) for system operations, focused on the critical features of the customer’s journey and experience.

Assist and coach product teams in implementing telemetry against SLIs/SLOs to ensure adequate traceability is in place.

Track and manage reliability performance against agreed SLOs, in partnership with product teams or other stakeholders, and ensure systems continue to meet SLOs over time.

Ensure key stakeholders, product owners, and platform owners are informed of reliability concerns and their potential impact on the customer’s experience.

Provide expert knowledge on reliability approaches to ensure our organization achieves its goals and roadmap for reliability.

Champion reliability being treated as a feature in products and platforms, and promote the concept across all phases of the software development life cycle.

Create dashboards and reports to communicate key metrics to product teams and key stakeholders.

Beyond observability, engage in initiatives across the product line, including cost, security, and adoption, helping the team drive towards a healthy portfolio throughout an application’s lifecycle.

Participate in problem management activities, including post-mortem incident analysis and the provision of technical insight, documented findings, outcomes and recommendations as part of a root cause analysis to troubleshoot priority incidents.

Implement automation to reduce the probability and/or impact of problems recurring, and target ‘self-healing’ through automation of recurring incidents.

For critical applications, utilize practices such as chaos engineering and performance engineering to test in preproduction environments. This includes disaster recovery (DR) testing, performance testing, and tabletop planning exercises.

Participate and exert influence in organizational learning initiatives, such as communities of practice, to share knowledge and foster a continuous learning and improvement mindset.

Support architects working on new solutions, including analyzing requirements, supporting technical architecture activities, prototyping, designing and developing reusable infrastructure artifacts, testing, implementing, and preparing for ongoing support.

Train and mentor junior engineers to ensure SRE best practices evolve and scale successfully in the organization.

Partner with the product manager of portfolio health to build out golden paths, education and services to ‘package’ the capability in a consumable way on our developer portal.

Be a product team champion, extending into product teams and helping to deliver foundational platform engineering capabilities where applicable.

Partner with compliance teams to ensure the data we bring into observability platforms meets privacy and compliance standards.

Maintain consistent standards and set out a taxonomy of telemetry to enable future opportunities, including leveraging AI capability.

Basic Qualifications:

Experience in some of the following areas is essential.

o 10-15 years of hands-on engineering experience.

o 5 years’ experience in Platform Engineering, SRE or a similar role.

o 5-10 years of experience working with modern application architecture methodologies (Service-Oriented Architecture, API-Centric Design, Twelve-Factor App, FAIR, etc.).

o 5+ years of experience working with Cloud Native design patterns, with a preference towards Microsoft Azure / Google Cloud.

o 5+ years of experience designing and delivering digital solutions following a product mindset and a variety of delivery methodologies (e.g. Agile, CCPM, etc.).

o 5+ years of experience working within a “DevSecOps” culture, including modern software development practices, covering Continuous Integration and Continuous Delivery (CI/CD), Test-Driven Development (TDD), etc.

Experience with enterprise observability platforms, e.g. Datadog, New Relic.

Experience with monitoring third-party and SaaS applications.

Experience establishing standards around MELT (Metrics, Events, Logging and Tracing) and implementing them at an enterprise level.

Experience with OpenTelemetry is advantageous.

Experience supporting digital platforms, including Integrations, Release Management, Regression Testing, Data Obfuscation, etc.

Experience scaling an “API-Ecosystem”, and designing and implementing “API-First” integration patterns.

Experience working with authentication and authorisation protocols/patterns.

Experience defining and implementing large-scale, transformative digital solutions.

Demonstrated influence and communication skills across all levels of IT and third parties.

Experience working in complex, diverse landscapes (business, technology, regulatory, partners, providers, geographies, etc.).

Strong organizational and communication skills, with multiple examples of conveying complex technical topics in a way that resulted in a definitive direction.

Education Requirements: Bachelor’s Degree in Information Technology.

Other Information: Occasional travel may be required.

Elanco is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status

Top Skills

CI/CD
Cloud Native
Datadog
GCP
Azure
New Relic
OpenTelemetry
TDD
