DevOps Engineer

May 26, 2026
Application ends: August 25, 2026

Job Description

WHAT YOU’LL DO:

– Manage and operate production-grade Kubernetes clusters (EKS preferred), ensuring high availability and scalability

– Troubleshoot real-time production issues across distributed systems and microservices

– Diagnose and resolve issues such as:

i. Pod failures (CrashLoopBackOff, Pending, OOMKilled)

ii. Node failures, autoscaling, and resource constraints

iii. Networking, ingress, and service connectivity issues

– Build, maintain, and debug infrastructure using Terraform (modules, remote state, locking, drift handling)

– Implement and enhance monitoring & alerting systems using Prometheus, Grafana, and related tools

– Perform root cause analysis (RCA) for incidents and drive permanent fixes to improve system reliability

– Participate in a 24/7 on-call rotation, owning incidents and resolving them independently

– Collaborate with engineering teams to improve system performance, resilience, and deployment processes

– Automate deployments, infrastructure provisioning, and operational workflows to reduce manual effort

– Ensure adherence to security best practices across infrastructure and deployments

Requirements:

WHAT YOU’LL NEED:

– 8+ years of experience as a DevOps Engineer, owning and operating production Kubernetes clusters (EKS preferred), including cluster health, scaling, and availability

– Troubleshoot real-time production issues independently across microservices and distributed systems

– Debug and resolve critical issues such as:

i. Pods stuck in CrashLoopBackOff, Pending, or OOMKilled states

ii. Node failures, node pressure, and autoscaling issues

iii. Service connectivity, ingress, and networking issues

– Investigate and fix cluster-level issues including scheduling, resource constraints, and misconfigurations

– Build and maintain infrastructure using Terraform, including:

a. Writing and modifying modules

b. Managing remote state and locking

c. Handling drift and failed deployments

– Design and implement reusable Terraform modules for scalable infrastructure

Are you interested in this position?

Apply by clicking on the “Apply Now” button below!
