First Advantage Logo

First Advantage

Lead - SRE (Site Reliability Engineering)

Posted 7 Hours Ago
Be an Early Applicant
560042, Shivaji Nagar, Karnataka
Senior level
560042, Shivaji Nagar, Karnataka
Senior level
The Lead SRE will manage site reliability engineering to ensure high availability and performance of platforms. Responsibilities include monitoring system health, automating processes, optimizing system performance, and leading incident response. The role involves cross-functional collaboration to enhance user experiences and implementing effective recovery strategies.
The summary above was generated by AI

At First Advantage (Nasdaq: FA), people are at the heart of everything we do. From our customers and partners to our greatest advantage — our team members. Operating with empathy and compassion, First Advantage fosters a global inclusive workforce devoted to the diverse voices that make up our talent and products. Our team members empower each other to be their authentic selves and treat all with respect, integrity, and fairness.
Say hello to a rewarding career and come join a leading provider of mission-critical background screening solutions to some of the most recognized Fortune 100 and Global 500 brands.
We are seeking a Tech Lead SRE to empower our platforms with high availability, and stellar performance level.
What We Do:
We are on the frontline of recruitment enabling organizations to Hire Smarter. Onboard Faster™ First Advantage is an HR Tech company delivering innovative solutions and insights to enable our clients to manage risk and hire the best talent. Leveraging an advanced technology platform, First Advantage builds fully scalable, configurable screening programs that meet the unique needs of over 33,000 clients. Headquartered in Atlanta, GA and with an internationally distributed workforce spanning 19 countries with about 5,500 employees, First Advantage performs over 93 million screens in over 200 countries and territories annually.
Who You Are:
You are self-motivated and ready to “roll up your sleeves." While you are an independent contributor, you are also collaborative. You can spearhead a project and see it through from start to completion.
As a team player, you navigate cross-functional teams and work well with team members in other business units and departments toward a common goal.
An Innovator — you see gaps in current processes or workflows as an opportunity to improve and try something new.
A lifelong learner and always seeking out opportunities to learn and upskill, you understand the importance of thorough and secure screenings and are interested in the Human Capital sector and the confluence of people, process, and technology.

What You'll Do 

A successful Technical Lead of site reliability engineers (SREs) to empower our platforms with high availability, and stellar performance level. As we expand our SRE team, we are currently seeking an experienced SRE to deliver reliable and scalable Technology solutions to our clients that enable best in class customer experience. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Responsibilities:

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Drive implementation of automation and monitoring to promote early detection, self-healing, improved availability, and decreased number of outages
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Reduce operational inefficiencies in the incident management process to ensure the fastest path to recovery through automation and continuous process improvement. Identify when escalation is required and trigger such escalation accordingly.
  • This role will be strategic in nature implementing best in class Incident response and communications through modern solutions such as Teams, SharePoint, etc. This will ensure our internal stakeholders and customers have accurate communications of any ongoing outages and what we are doing to restore as well as prevent it from occurring again. This includes driving Incident bridges to resolution with highest sense of urgency
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Create and maintain recovery playbooks for commonly occurring customer patterns and issues. Drive down resolution times by improving alert coverage and accuracy.
  • Create sustainable systems and services through automation and uplifts
  • Implement Automated Recovery Scripts and other monitoring enhancements
  • Participate in system design consulting, platform management, and capacity planning
  • Provide primary operational support and engineering for multiple large, distributed software applications
  • Lead after action reviews and root cause analysis on a timely basis that identify repair items preventing future customer impact. Ensure resolution of product/service defects, process improvements and documentation enhancement to address live site or customer reported incidents

What You May Need to be Successful

4-year College minimum in related technology field (Computer, Engineering, Science, etc.) or comparable job experience. SRE (Site Reliability Engineering) related Certification.

  • 7+ years of experience in information technology preferably managing large-scale environments
  • Recent work experience in an SRE role implementing best in class Reliability solutions in a Large Product development organization
  • 3+ years of work experience with public cloud platform Azure
  • Experience with Azure Monitor & AppInsights is preferred
  • Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C/C++, and JavaScript
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
  • Outstanding communication and presentation skills, written and verbal. Excellent listening skills and a high degree of empathy.
  • Proficient in quick problem-solving skills with attention to detail.
  • You must be able to work outside of normal business hours (weekend shifts, holidays, & evenings)
  • Excellent managerial skills and ability to collaborate with team members.
  • Strong analytical, and time management skills.
  • Incorporate various software engineering aspects to develop and implement services that improve IT and support teams. Services can range from production code changes to alerting and monitoring adjustments

Why First Advantage is Your Next Big Career Move

First Advantage is going through a technology transformation! We are looking for experts who are excited to work with advanced technologies and provide best-in-class user experiences, drive the development and deployment of scalable solutions, and smoothly guide our agile teams and clients through meaningful changes as we continue to expand our impact.

More About Our Values Code

  • Honor Honesty, Consistency, Responsibility: Do the right thing
  • Cultivate an environment of dignity: Show respect for the individual
  • Take an Outside-In approach: Put the client first
  • Think out-of-the-box: Innovate and create
  • Stay Team-Oriented: Collaborate and appreciate each other

What Are You Waiting For? Apply Today!

You have learned a little about us today – we want to learn about you! If you think this position and our company are a great fit for your areas of interest and expertise, tell us about you by applying now!

EMPLOYEE BENEFITS – India Region:

  • Most of the roles are enabled with the ability to work remotely with occasional business travel. Hybrid working model
  • Comprehensive employee Leave policy
  • Career progressions through Internal job opportunities and Global Talent mobility programs
  • Career Development: Mentoring Program, People Management Program, cross-functions training, soft skills training.
  • Continuous learning and development opportunities. Upskilling and reskilling opportunities mobilized through e-learning platforms
  • Training and Certification reimbursement programs
  • Medical Insurance coverage for employees and parental insurance benefits available. Calendarized Employee Wellness programs
  • Quarterly Rewards and Recognition program to recognize exemplary performance
  • Other attractive allowances – Weekend working, Holiday pay, Relocation assistance, Maternity bonus, Creche allowance, Shift allowance etc.

Top Skills

C/C++
Java
JavaScript
Python

Similar Jobs

Be an Early Applicant
2 Days Ago
Bengaluru, Karnataka, IND
Hybrid
289,097 Employees
Mid level
289,097 Employees
Mid level
Financial Services
As a Site Reliability Engineer III, you will optimize applications and infrastructure, solve complex problems, and collaborate on deployment solutions. Responsibilities include maintaining reliability, implementing best practices, and utilizing monitoring tools.
Be an Early Applicant
2 Days Ago
Bengaluru, Karnataka, IND
Hybrid
1,810 Employees
Senior level
1,810 Employees
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
The Sr. Site Reliability Engineer at BlackLine will manage the performance and operational aspects of applications and services, implement DevOps practices, and facilitate multi-release technical projects. Responsibilities include developing effective tools and processes, ensuring service availability, and collaborating across teams. The role emphasizes problem-solving, documentation, and strategic planning to support the company's cloud-based services.
Be an Early Applicant
20 Hours Ago
Bangalore, Bengaluru, Karnataka, IND
2,194 Employees
Senior level
2,194 Employees
Senior level
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
As a Lead DevOps/SRE Engineer, you will enhance production and development environments, develop tools and processes, automate deployment, and lead monitoring strategies. Your responsibilities include managing production systems, CI/CD pipelines, and providing on-call support while optimizing overall reliability and performance.

What you need to know about the Bengaluru Tech Scene

Dubbed the "Silicon Valley of India," Bengaluru has emerged as the nation's leading hub for information technology and a go-to destination for startups. Home to tech giants like ISRO, Infosys, Wipro and HAL, the city attracts and cultivates a rich pool of tech talent, supported by numerous educational and research institutions including the Indian Institute of Science, Bangalore Institute of Technology, and the International Institute of Information Technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account