We are seeking a driven and technically skilled DevOps Engineer with strong Microsoft Azure experience to support, troubleshoot, and improve cloud infrastructure, CI/CD pipelines, automation, monitoring, and operational reliability across production environments. This role is highly operational and troubleshooting focused, requiring someone who is comfortable diagnosing production issues, responding to alerts and outages, managing escalated support tickets, and serving as a key escalation point for infrastructure and application support.
The ideal candidate enjoys problem solving, identifying root causes, stabilizing environments, and partnering cross functionally to resolve complex operational issues quickly and effectively. This position operates within Agile/Scrum environments while balancing real time operational support priorities.
LocationRemote — India
Responsibilities- Build, deploy, maintain, and troubleshoot scalable Azure cloud infrastructure
- Develop and maintain Infrastructure as Code (IaC) solutions
- Create, manage, troubleshoot, and improve CI/CD pipelines and deployment automation
- Monitor production systems and actively respond to operational alerts, incidents, outages, and performance degradation
- Own and manage escalated support tickets and serve as a technical escalation point for operational issues
- Investigate and troubleshoot infrastructure, deployment, networking, database, and application related problems
- Perform root cause analysis and implement corrective actions to improve long term system stability
- Support highly available environments aligned with SLA/SLO objectives
- Participate in on call rotations and support critical production incidents as needed
- Perform application maintenance, patching, upgrades, and environment support activities
- Collaborate with development, security, infrastructure, and support teams to resolve operational issues quickly
- Work within Agile/Scrum processes while also handling ad hoc operational and troubleshooting priorities
- Implement operational best practices for reliability, security, monitoring, and performance optimization
- Maintain operational documentation, deployment standards, troubleshooting guides, and support procedures
RequirementsMinimum Requirements
- 5+ years of experience working in technology, infrastructure, cloud engineering, DevOps, or IT operations roles
- 3+ years of hands on experience with Microsoft Azure cloud services
- Experience supporting and troubleshooting production environments with SLA/SLO requirements
- Strong experience responding to operational alerts, incidents, outages, escalations, and infrastructure troubleshooting activities
- Experience diagnosing and resolving deployment, networking, application connectivity, and system performance issues
- Experience working in fast paced Agile/Scrum and ad hoc operational support environments
- Experience acting as a ticket owner or escalation resource for infrastructure and application related support cases
- 3+ years of Infrastructure as Code (IaC) experience using Terraform preferred; ARM templates and/or Bicep acceptable
- 2+ years of experience working with SQL databases and Active Directory environments
- Experience designing, managing, and troubleshooting CI/CD pipelines using GitHub Actions, Bitbucket Pipelines, and/or Azure DevOps
- Strong experience with Git based version control systems, primarily GitHub
- Experience with automation and scripting using PowerShell, Bash, or Python
- Hands on experience with monitoring and observability platforms such as Azure Monitor, Grafana, Uptrends, and Application Insights
- Experience troubleshooting Azure networking components including VNets, NSGs, Private Endpoints, peering, load balancing, and application connectivity
- Understanding of cloud security, operational reliability, and infrastructure best practices
- Microsoft Certified: Azure Administrator Associate (AZ-104)
- Experience with containerization and orchestration technologies such as Docker or Kubernetes
- Experience supporting or integrating AI/ML related Azure services such as Azure OpenAI, Azure AI Foundry, or Azure AI Search
- Familiarity with GitOps or platform engineering concepts
- Strong troubleshooting, analytical thinking, and root cause analysis skills
- Strong communication and cross team collaboration skills
Benefits
- 20 Days Annual Leave
- 45 Days Annual Leave Maximum
- 4 Festival Days Named
- 8 Festival Days Select
- 12 Days Sick Leave
- 100% Paid Medical Insurance & GPA
- Wellness Programme
- 3x CTC Group Life Insurance
- Pension
- Employee Referral Scheme


.png)
