We are looking for a seasoned Staff DevOps and Platform Engineer to own and evolve the infrastructure that powers Liberate’s real-time AI voice and workflow automation systems. This is a critical technical leadership role. You will inherit and advance a modern AWS-based platform that spans PBX telephony, canary routing, MTLS-based integrations with carriers, secure production environments, CI/CD, and compliance posture.
You will drive reliability, scalability, and operational rigor across our multi-agent runtime. You will also mentor engineers, design forward-looking system improvements, and create the platform foundations that enable Liberate’s rapid product expansion.
Key Responsibilities
- Lead architecture and operation of core AWS infrastructure including PBX systems, EKS, networking, IAM, VPC design, and secure environment isolation.
- Own and improve Canary routing infrastructure for LRA (LLM REST API) via Traefik and GitOps patterns.
- Maintain and optimize CI/CD flows including GitHub-based CodeBuild jobs, artifact pipelines, and environment promotion workflows.
- Manage and evolve MTLS proxy infrastructure used to integrate with carrier systems like Frontline.
- Own HAProxy-based proxy fleet, certificate lifecycle, root CA management, and IP-restricted ingress patterns.
- Ensure secure, audited access to production systems, tokens, and root-level accounts.
- Lead incident response, on-call rotations, and postmortems. Improve reliability metrics (SLA/SLO/SLI) for voice, agent runtime, and workflow systems.
- Maintain and improve non-obvious production infra details including external service dependencies, version pinning, and update cadences.
- Partner with AWS Support to optimize pricing, scaling configs, and resource utilization.
- Modernize developer workflows: streamlined builds, repeatable environments, safe deployment strategies (blue/green, canary, feature flags).
- Build internal tools and abstractions to make engineers productive while enforcing safety, configuration hygiene, and compliance requirements.
- Lead infrastructure-related components of SOC2, pen-testing, and Vanta-driven controls.
- Ensure auditability, traceability, secure storage of credentials, and alignment with enterprise customer expectations.
- Work closely with AI Platform, Forward Deployed Engineering, and Product teams to translate business goals into scalable infrastructure decisions.
- Mentor engineers across DevOps, platform, and backend areas. Help set engineering standards and raise operational maturity across the org.
Required Qualifications
- 8+ years of DevOps, SRE, or platform engineering experience operating production systems at scale.
- Deep hands-on AWS expertise (EKS, IAM, VPC, ALB/NLB, CloudWatch, KMS).
- Strong experience with Kubernetes, container orchestration, and multi-environment management.
- Proficiency with Terraform or other IaC tools and GitOps workflows.
- High proficiency in Python, Go, or Typescript for tooling, automation, and internal platform services.
- Experience with Traefik, HAProxy, or similar load-balancing and routing systems.
- Familiarity with secure network architectures, MTLS, certificate hierarchies, and service-to-service authentication.
- Strong background managing CI/CD systems such as GitHub Actions and CodeBuild.
- Ability to lead incidents, design SLOs, and drive reliability across mission-critical systems.
- Excellent communication and leadership skills in distributed teams.
Preferred Qualifications
- Experience with PBX or telephony systems in AWS, SIP routing, or real-time communication pipelines.
- Experience with voice agents, WebRTC, or low-latency streaming services.
- Prior work in regulated or enterprise environments where compliance is a first-class requirement.
- Experience scaling infra in fast-growth startups.
- Contributions to open source, infrastructure design talks, or technical publications.
Why This Role Matters
Our platform supports real-time, multi-agent reasoning and voice workflows that depend on low latency, reliability, and airtight security. This role is the backbone of that capability. If you're excited to own mission-critical infrastructure in a company where infrastructure is product, we’d love to talk.
Strong preference for Boston or San Francisco based, but open to remote within the U.S.



