Job Description
What we are looking for:
– Strong experience with high-availability, fault-tolerant, scalable, resilient, distributed systems.
– Expert knowledge of AWS cloud computing infrastructure and its components.
– Hands-on experience with containerization and with container and cluster management: Kubernetes, Docker, EKS, etc.
– Hands-on experience with configuration management and infrastructure-as-code tools (Ansible, Terraform, etc.).
– Proven, hands-on experience handling large-scale infrastructure (package management, EC2, SQS, S3, MongoDB) and distributed systems such as Kafka, YARN, and Elasticsearch.
– Familiarity with container orchestration tools (Kubernetes, ECS, Docker Swarm) and with build, artifact, packaging, and service discovery tools.
– Proficiency in at least one of the following languages: Python, Java, Go.
– Experience with source code management and with implementing security best practices.
– Skilled at analysing application bottlenecks and performance degradation, and at implementing automated processes and tools to detect such anomalies.
Responsibilities:
– Design, architect, and implement best-in-class CI/CD pipelines.
– Be accountable for infrastructure design, automation, stability, resilience, performance, monitoring, and security, and for the implementation of sound practices.
– Build and manage infrastructure as code with Terraform.
– Collaborate with engineering teams to improve the development/production environment.
– Containerize and orchestrate services with Kubernetes, and drive microservices adoption across multiple engineering functions.
– Own and build functional KPIs for service, incident, and infrastructure metrics.
– Identify and track metrics such as MTTR (mean time to recovery, repair, respond, or resolve) in order to exceed SLA expectations.
– Build services and maintain them once they are online by measuring and monitoring availability, latency, and overall system reliability.
– Build solutions and monitor at scale with Prometheus and the TICK stack.
– Participate in defining the cloud data strategy, including designing multi-phased implementation roadmaps.