Job Description
Cloud Infrastructure & DevOps :
– Work closely with the DevOps team to manage and optimize cloud infrastructure (AWS/GCP), container orchestration (Kubernetes/ECS), and infrastructure-as-code.
– Drive tech innovation projects focused on cost optimisation: right-sizing, spot instances, reserved capacity, and architectural changes that reduce cloud spend.
– Architect monitoring, observability, and alerting systems using tools such as Grafana and Prometheus.
Common Services & Platform Ownership :
– Own the lifecycle of shared/common services consumed across multiple product verticals: auth services, notification services, config servers, and more.
– Maintain service health, SLAs, and documentation for all platform-owned services.
– Lead the effort to clear existing technical debt and backlogs in the platform layer.
AI Infrastructure & AI SDLC :
– Design and build the org-wide AI/ML platform harness including model serving infrastructure, prompt management, vector stores, evaluation pipelines, and LLM gateway/proxy layers.
– Establish AI SDLC best practices covering prompt versioning, model evaluation, A/B testing of AI features, cost tracking per model/call, and guardrails for responsible AI usage.
– Build shared tooling and abstractions that allow product teams to integrate LLMs, embeddings, and AI agents into their workflows without each team reinventing the stack.
– Set up observability for AI workloads: latency tracking, token usage monitoring, hallucination detection, and cost dashboards across model providers.
– Stay current with the rapidly evolving AI tooling ecosystem and evaluate new frameworks, models, and infrastructure patterns for org-wide adoption.
What We Are Looking For :
Must-Have :
– 4 to 8 years of professional software engineering experience with strong programming skills (Java, Python, or Node.js).
– Hands-on experience with cloud platforms (AWS preferred, GCP/Azure acceptable) including networking, IAM, compute, and storage services.
– Proven experience designing and managing CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, ArgoCD, or equivalent).
– Solid understanding of containerisation and orchestration (Docker, Kubernetes, ECS/EKS).
– Experience with infrastructure-as-code tools (Terraform, CloudFormation, or Pulumi).
– Working knowledge of secrets management (HashiCorp Vault or AWS Secrets Manager), security scanning, and compliance controls.
– Familiarity with code quality tooling such as SonarQube, linting frameworks, and static analysis.
– Exposure to AI workflows: an understanding of how LLM-based applications are built, evaluated, and deployed (prompt engineering, model APIs, RAG patterns, vector databases).
– Strong communication skills: you will work across product teams and need to influence without authority.