DevOps

Remote

As a DevOps Engineer, you will own and scale the infrastructure for a global, real-time meeting platform. You’ll design resilient AWS architectures, lead automation with Terraform, and establish world-class observability, security, and scaling strategies. You will mentor junior engineers and drive DevOps excellence across the team.

Key Accountabilities & Activities
  • Cloud Architecture: Design and operate secure, multi-AZ AWS infrastructure (EKS, EC2, Lambda,
    RDS/Aurora, ElastiCache) or other cloud experiences. (Azure or GCP). Architect networking (VPC, subnets, NACLs, Security Groups, Transit Gateways). Implement firewall/WAF rules, Shield/GuardDuty, and advanced IAM policies.

  • Lead Terraform module design (remote state, workspaces, CI plan/apply), and enforce Infrastructure best practices (linting, policy-as-code, drift detection).

  • Kubernetes & Container Ops: Manage large-scale EKS clusters: autoscaling, NetworkPolicies, Ingress, PodSecurity. Implement service mesh (Istio/Linkerd) for traffic management and observability. Optimize container images, resource requests/limits, and cost efficiency.

  • CI/CD & Automation: Architect multi-stage CI/CD pipelines with automated testing, security scanning, canary/blue-green deployments. Implement GitOps workflows (ArgoCD/Flux).

  • Observability & Reliability: Own monitoring and alerting: Prometheus, Grafana, Loki/ELK, CloudWatch, OpenTelemetry. Define SLOs/SLIs, error budgets, and lead on-call rotations. Conduct chaos testing and resilience drills.

  • Security & Compliance: Enforce secrets management (AWS Secrets Manager/HashiCorp Vault). Apply encryption (KMS, TLS everywhere), vulnerability scanning, and audit trails. Ensure compliance with GDPR/PII retention and secure handling of meeting data/recordings.

  • Scaling & Performance: Lead capacity planning, autoscaling policies, and global failover strategies. Optimize cost via spot instances, reserved instances, lifecycle policies. Design global distribution strategies (CloudFront, Route53 latency-based routing)Background, Skills & Qualifications


Background, Skills & Qualifications
  • Bachelor’s/Master’s in CS/IT/Engineering (or equivalent).

  • 5+ years in DevOps/SRE, with large-scale cloud infra.

  • Expert in AWS (EKS, EC2, Lambda, RDS, ElastiCache, S3 lifecycle policies, WAF, GuardDuty).

  • Advanced Terraform usage and IaC best practices.

  • Strong Kubernetes expertise (scaling, security, multi-cluster ops).

  • Proven CI/CD pipeline architecture (GitHub Actions, Jenkins, ArgoCD).

  • Mastery in observability stacks (Prometheus, Grafana, ELK/Loki, OpenTelemetry).

  • Deep understanding of networking, security, compliance, and cost optimization.

  • Track record of scaling infrastructure for real-time, low-latency applications (RTC a plus).

  • [Bonus] familiarity with WebRTC infra (TURN/STUN/SFU like Coturn, Janus, mediasoup)

Apply Now