← Back to All Jobs

DevOps Engineer (Senior, GCP/SRE, LATAM) #26381

... create world-changing products using God-given talents . . .

PROJECT DESCRIPTION:

We are seeking a motivated DevOps / Site Reliability Engineer to partner with our backend and middleware development teams. You will own reliability, performance, and operational efficiency across our services, pipelines, and infrastructure, with a primary focus on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE). This role blends software engineering with systems operations to enable rapid, safe delivery of high-quality software.

PROJECT STACK and TEAM:

Stack: GCP and GKE

Reports to Director of Engineering
Interview Process:
1. HR Interview
2. Technical interview

MAIN REQUIREMENTS:

  • 8+ years of hands-on experience in DevOps, SRE, or related roles supporting backend/middleware services

  • Proven expertise in CI/CD: Git, pipelines, artifact management, feature flags, blue/green or canary releases.

  • GCP proficiency with strong experience in GKE, GCE, Cloud Run, and related services; comfortable with multi-project, multi-region architectures.

  • Infrastructure as Code experience with Terraform (preferred for GCP), and configuration management

  • Containerization and orchestration: Docker, Kubernetes, with deep hands-on GKE administration

  • Observability stack: Prometheus, Grafana, Cloud Monitoring, Cloud Logging/Logging, Jaeger, Open Telemetry, or similar.

  • Scripting/programming: Python, Go, Bash, or similar for automation and tooling

  • Networking and security fundamentals: load balancing (Cloud Load Balancing), DNS, TLS, Private Service Connect, IAM, secrets management.

  • Excellent communication skills and ability to translate technical concepts for non-technical audiences

GOOD TO HAVE:

  • Experience with middleware platforms on GCP (e.g., Pub/Sub, API Gateway, Cloud Tasks).

  • Knowledge of distributed tracing and microservices patterns.

  • Familiarity with release engineering and feature flags strategies.

  • Prior experience with data-intensive backends or streaming platforms.

  • Ability to optimize cloud spend without compromising reliability.

  • Experience with GKE Autopilot, node auto-scaling, and cluster lifecycle management.

  • Experience in leveraging Prometheus/Grafana to improve observability, traceability and monitoring of deployed services.

  • Experience using tools like Rapid 7, Sonar Cube, K6/Locust, OWASP ZAP.

JOB RESPONSIBILITIES:

Collaborate with Backend/Middleware Teams to enable reliable delivery and operation of services (GCP/GKE). Design and implement CI/CD pipelines, build scripts, and release processes that support fast and safe deployments on GKE Instrument services for reliability, performance, and security; Collaborate with developers to optimize containerized workloads, resource requests/limits, and cluster configurations.
Maintain scalable, secure, and observable GCP-based infrastructure. Architect and operate cloud-native infrastructure on GCP with a focus on reliability and cost efficiency Manage GKE clusters, node pools, and the associated networking (VPCs, subnets, firewall rules). Implement monitoring, logging, tracing, and alerting using GCP-native and open-source observability tools; create dashboards and runbooks. 
Automation, tooling, and developer experience. Build internal tools to reduce toil and accelerate developer workflows in GCP/GKE environments. Standardize and automate provisioning, configuration management, and secrets management (e.g., GKE Config Sync, Secret Manager, IAM roles)
Performance and reliability engineering. Conduct capacity planning, load testing, and chaos engineering for GKE-based services. Perform incident response, on-call duties, post-incident reviews, and continuous improvement.
Security and compliance. Enforce security controls (IAM roles, VPC service controls, IAM bindings, network segmentation, secret management). Ensure compliance with applicable standards and best practices (e.g., PCI, GDPR, SOC2 where relevant). Help Implementing new tools for load testing (i.e.. K6/Locust). Security Policy Check (OPA/GateKeeper), Web application Security testing (OWASP ZAP), Middleware vulnerabilities check tools like Rapid 7.

SUMMARY:

  • Work your way – Enjoy the freedom to work from anywhere, with flexible hours that match your natural rhythm.

  • Work with global clients – Collaborate directly with international teams to create real impact.

  • Make extra cash – Earn bonuses for referring great people or bringing in new business opportunities.

  • Great people, no micromanagement – Join a supportive, results-focused team where you’re trusted to do your best work.

This flexibility allows developers…

  • A better work-life balance

  • Increased productivity

  • The ability to work any time around the clock

  • Reduction in commute time

  • Design your ideal daily schedule.

  • Build a career, not just a job.

  • Work smarter, not longer.

  • More time with family and friends

Apply To

No file chosen
A T G 7