Coupang

Senior Staff Engineer - SRE

Job Posted 17 Days Ago Reposted 17 Days Ago
Bengaluru, Karnataka
Expert/Leader
As a Senior Staff Engineer - SRE at Coupang, you will ensure customer-facing services' reliability, guide large-scale technical initiatives, and influence best practices across teams while mentoring junior engineers.
The summary above was generated by AI

About the Company:

At Coupang, we are building the future of e-commerce. Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” We are one of the fastest-growing e-commerce companies and have established an unparalleled reputation as a dominant and reliable force in South Korean commerce.

We are proud to have the best of both worlds: a startup culture with the resources of a large global public company. This fuels us to continue our growth and keep launching new services at the pace we have maintained since our inception. We are all entrepreneurial and surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people who like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.

Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.

About the Role:

Site Reliability Engineer (SRE) at Coupang is a mission-critical role that combines software and systems engineering to build, run, and scale our complex, large-scale e-commerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated, and designed to scale.

As an SRE organization, we take pride in treating operations as an engineering problem, with an automation-first approach. You will use your background to build best-in-class infrastructure automation for areas such as observability, incident management, disaster recovery, load testing, capacity engineering, and many more. In this role you will work very closely with our product development teams, from the early stages of design all the way to helping resolve production incidents, maintaining the SLI/SLA bar for production services, and influencing those teams with SRE principles and best practices.

If you take pride in complete ownership, have a passion for solving complex technical challenges in large-scale distributed systems, and have the demeanour to work and communicate effectively across team boundaries, this is the role for you!

Key Responsibilities:

  • Serve as a hands-on Senior Staff Engineer dedicated to reviewing the design and architecture of critical services, re-architecting where needed, setting performance, reliability, and availability benchmarks, and tuning systems. Own the work with specific system teams on fundamental design improvements, track incidents to architectural closure, and work with domain teams to close them functionally.
  • Serve as a primary point of responsibility for the reliability, health, and performance of all Coupang customer-facing services.
  • Gain deep knowledge of Coupang's application workflows and dependencies.
  • Lead and drive large-scale technical initiatives across multiple engineering teams.
  • Drive collaboration effectively across organisational boundaries and build strong stakeholder relationships to achieve broad organisational objectives.
  • Identify and implement scalable solutions for complex technical problems. Be the change driver.
  • Be self-motivated and able to navigate the ambiguity of large initiatives and find solutions to accomplish the goal.
  • Be the SRE champion/lead, working with the rest of the technical leaders across Coupang to define and drive the engineering roadmap.
  • Contribute towards hiring and building a world-class team. Mentor and coach junior engineers on the team.

Essential Qualifications:

  • At least 15 years of industry experience building and operating large-scale distributed systems.
  • Deep UNIX/Linux systems knowledge and administration background.
  • Strong programming skills in one or more of: Python, Java, Golang, C++.
  • Strong problem-solving and analytical skills spanning systems, network (TCP/IP) and code, with a focus on data-driven decision-making.
  • Proficient with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform.
  • Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC).
  • Proficient with containerisation and orchestration technologies, such as Docker and Kubernetes.
  • Knowledge of the observability ecosystem, including metrics, logging, and tracing, and tools such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic.
  • Excellent communication and collaboration skills, with the ability to work with teams across distinct functions and technical domains.

Preferred Qualifications:

  • Master’s degree in Computer Science, Engineering, or a related technical field.
  • Prior experience working with large-scale web-based Java architectures and JVM configuration.
  • Professional certifications in cloud platforms, monitoring tools, or related technologies.
  • Previous experience working on a large-scale e-commerce platform.

Equal Opportunities for All

Coupang is an equal opportunity employer. Our unprecedented success would not be possible without the valuable input of our globally diverse team.

Top Skills

AWS
Azure
C++
Datadog
Docker
Elastic Stack
Go
Google Cloud Platform
Grafana
Java
Kubernetes
New Relic
Prometheus
Python
