Job Description
What You’ll Do
- Manage and optimize cloud infrastructure: oversee the deployment, management, and scaling of AWS cloud infrastructure to support a robust data platform.
- Implement and maintain monitoring solutions to ensure optimal performance, availability, and reliability of cloud-hosted services.
- Develop and maintain automation scripts using tools like Terraform, CloudFormation, and Ansible to streamline operations and reduce manual intervention.
- Lead the response to operational incidents, perform root cause analysis, and implement corrective actions to prevent recurrence.
- Ensure the cloud environment adheres to security best practices and compliance requirements, including regular audits and vulnerability assessments.
- Work closely with development, data engineering, and security teams to integrate new features and improvements into the cloud infrastructure.
- Monitor and optimize cloud spending, implementing cost-saving measures without compromising performance or reliability.
- Maintain comprehensive documentation of the cloud architecture, processes, and procedures.
- Plan and execute backup, archive, and recovery procedures for the cloud-based data platform.
Who You’ll Work With
- Our Cloud Operations team is a global force, managing end-to-end cloud infrastructure across major CSPs. We work in a 24×7 environment, ensuring availability, data protection, and timely upgrades. Collaborating with the product development, IT, and Architecture teams, we’re passionate about excellence and stay ahead of the curve in this ever-evolving cloud landscape. If you’re ready to shape the future of cloud operations, come be part of our dynamic team!
- At the heart of our team lies a commitment to reliability and innovation. We thrive on challenges and adapt swiftly to change. Whether it’s optimizing resource allocation or troubleshooting complex incidents, we’re in it together.
- The position will report to the Delivery Manager, Cloud Operations.
What Makes You a Qualified Candidate
- Bachelor’s degree in Engineering, IT, or a related field
- Minimum of 2 years of experience in cloud operations, with a strong focus on AWS services
- Proficiency in AWS services such as EC2, S3, RDS, Lambda, VPC, IAM, CloudWatch, and more
- Hands-on experience with infrastructure as code (IaC) tools like Terraform and CloudFormation, and with configuration management tools like Ansible or Chef
- Strong scripting skills in languages such as Python, Shell, or PowerShell
- Experience with monitoring tools like Prometheus, Grafana, ELK Stack, or AWS CloudWatch
- Understanding of cloud security principles, including IAM policies, security groups, and encryption
- Excellent troubleshooting skills and the ability to perform root cause analysis
- AWS Certified Solutions Architect (Associate), AWS Certified SysOps Administrator (Associate), AWS Certified DevOps Engineer, or similar certifications are highly desirable
- Experience with Datadog for monitoring and performance optimization will be a significant advantage
- Excellent verbal and written communication skills to effectively collaborate with team members and stakeholders
- Good understanding of ITIL (certification will be an advantage)
- Hands-on experience with an industry-standard ITIL tool (ServiceNow experience preferred)
- Flexibility and willingness to work in a 24×7 environment and adapt to shift schedules as per the roster
What You’ll Bring
- A proactive approach to identifying and implementing improvements in cloud operations.
- Strong interpersonal skills to work effectively with cross-functional teams.
- Meticulous attention to detail to ensure the reliability and security of the cloud infrastructure.
- Ability to adapt to rapidly changing environments and new technologies.
- A commitment to delivering high-quality solutions that meet the needs of internal and external stakeholders.
- A passion for staying up-to-date with the latest trends and advancements in cloud technology.
- Extensive experience in managing and optimizing large and complex AWS environments, including EC2, S3, RDS, and VPC configurations.
- Proficiency in Linux system administration, including installation, configuration, and maintenance of Linux servers.
- Expertise in managing patching and upgrades of Linux-hosted applications to ensure security and performance.
- Knowledge of or experience with Teradata VantageCloud (will be an advantage).
Are you interested in this position?
Apply by clicking on the “Apply Now” button below!