Cognite

Senior Observability Engineer (SRE)

Posted Yesterday
Hybrid
Bengaluru, Bengaluru Urban, Karnataka
Senior level
The Senior Observability Engineer will design and optimize observability solutions, analyze telemetry data, mentor junior engineers, and collaborate on CI/CD integration and incident management.
The summary above was generated by AI
About Cognite
Embark on a transformative journey with Cognite, a global SaaS forerunner in leveraging AI and data to unravel complex business challenges through our cutting-edge offerings, including Cognite Atlas AI, an industrial agent workbench, and the Cognite Data Fusion (CDF) platform. We were named the 2022 Technology Innovation Leader for Global Digital Industrial Platforms, and we were recognized as the 2024 Microsoft Energy and Resources Partner of the Year. In the realm of industrial digital transformation, we stand at the forefront, reshaping the future of Oil & Gas, Chemicals, Pharma, and other Manufacturing and Energy sectors. Join us in this venture where AI and data meet ingenuity, and together, we forge the path to a smarter, more connected industrial future.

Role Overview
We are seeking a Senior Observability Engineer with strong expertise in designing, implementing, and optimizing observability solutions. In this role, you will be key to shaping the future of observability at Cognite, assessing existing observability frameworks, identifying gaps, and building robust capabilities encompassing log aggregation, event correlation, noise reduction, and comprehensive telemetry analysis to enable proactive operational excellence and reliability for our services.

Key Responsibilities

  • Conduct assessments of existing observability architectures to identify gaps and improvement opportunities.
  • Design and implement scalable log aggregation pipelines for centralized and efficient data collection.
  • Apply noise-reduction techniques to filter irrelevant or false-positive alerts, enhancing focus on actionable issues.
  • Develop and maintain monitoring dashboards that deliver actionable insights across applications and infrastructure.
  • Lead the migration from Lightstep to Honeycomb, ensuring seamless data pipeline transitions, OpenTelemetry alignment, and stakeholder adoption.
  • Collaborate with infrastructure and product teams to integrate observability tooling into CI/CD workflows and cloud environments.
  • Analyze telemetry data (metrics, logs, traces) to troubleshoot complex system behaviors and recommend improvements.
  • Participate in production debugging and incident troubleshooting using telemetry data.
  • Mentor junior engineers on log management, event correlation, distributed tracing, and alert management.
  • Stay current on observability innovations and recommend adoption strategies aligned with organizational goals.
  • Support post-incident reviews and continuous improvement through data-driven root cause analysis.
  • Drive continuous improvement in reliability and operational excellence through proactive observability initiatives.

Key Skills

  • 6+ years of experience in software or systems engineering, with at least 3 years focused on observability or SRE practices.
  • Hands-on experience with observability tools such as Honeycomb, VictoriaMetrics, Lightstep, Prometheus, Grafana, OpenTelemetry, Splunk, Datadog, or New Relic.
  • Strong knowledge of OpenTelemetry instrumentation (metrics, traces, logs) and SLIs/SLOs for reliability tracking.
  • Experience with distributed tracing, event correlation, and noise-reduction frameworks.
  • Proficiency in one or more programming/scripting languages such as Python, Java, Kotlin, Go, or Shell.
  • Working knowledge of Infrastructure as Code (Terraform) and CI/CD pipelines (Jenkins, GitHub Actions, ...).
  • Familiarity with cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes).
  • Strong analytical, troubleshooting, and communication skills with the ability to work effectively across teams.
  • Experience conducting observability gap assessments and defining improvement plans.
  • Experience working in complex or multi-cloud environments is preferred.

Join the Global Cognite Community
Be part of a diverse, global team of 70+ nationalities, building technology that transforms how the world’s industries operate.
Work from our modern Bengaluru hub in a hybrid, high-trust environment with a flat structure and direct access to decision-makers.
At Cognite, you’ll learn fast, make an impact, and grow your career alongside exceptional talent.
Why Cognite
Recognized by CNBC and Frost & Sullivan as a global innovation leader, Cognite is driving the next wave of industrial AI and digital transformation.
Join us to shape the future of data and industry.
Apply today, and follow us on LinkedIn (@Cognite) to discover more opportunities.


Top Skills

AWS
Azure
Datadog
GCP
GitHub Actions
Go
Grafana
Honeycomb
Java
Jenkins
Kotlin
Kubernetes
Lightstep
New Relic
OpenTelemetry
Prometheus
Python
Shell
Splunk
Terraform
VictoriaMetrics
