Job Description
Your key responsibilities
- Incident Management & Post Mortems: Rapidly respond to production incidents affecting the trading platform with data-driven decision-making, minimising downtime and financial impact. This includes identifying, investigating, and resolving application/infrastructure issues, ensuring timely and concise communication updates, and leading root cause analysis with blameless post-mortems. Drive strategic follow-ups and automation opportunities to reduce repeat occurrences of issues.
- Availability, Monitoring & Automation: Maintain the overall availability of the trading platform. Enhance application health monitoring, implement automation to reduce manual intervention, and improve system resilience, with a key focus on conducting and observing Start of Week/Day checks and acting on monitoring alerts. Set up new and maintain existing monitoring environments, and maintain oversight of high-profile technology processes, particularly those relating to exchange obligations.
- Change & Release Management: Implement application releases and maintain oversight of infrastructure patching schedules (may include some Out of Hours activities). Enforce and adhere to change management policies.
- Disaster Recovery & Business Continuity: Actively participate in Disaster Recovery & Business Continuity Plan exercises (may include some Out of Hours activities).
- Collaboration & Project Support: Partner with development teams, Project Managers, and Functional Business Analysts to design, deploy, and introduce new functionality and enhancements to the trading platform, ensuring fault-tolerant, scalable solutions aligned with business goals. Respond to user queries and requests in a timely manner.
- Governance & Compliance: Enforce and adhere to incident and problem management policies, as well as bank-specific non-financial risk frameworks.
- Mentorship & Continuous Improvement: Support the growth of junior team members and promote a culture of engineering excellence and continuous improvement. Actively participate in local, regional, and global process improvement initiatives.
- Wider Technology & Bank Initiatives: Engage in wider Technology & Bank-wide initiatives such such as Engineering Days, Hackathons, Volunteering, Lunch and Learns, Data Science challenges, and Employee Resource Group events.
Your skills and experience
- A bachelor’s degree or higher, preferably in the field of Science, Technology, Engineering or Mathematics (STEM).
- 5yrs+ proven industry experience in a Production Support, Site Reliability Engineer, or DevOps role within a Trading or Financial Services Environment.
- Familiarity with Agile software development concepts and wider processes within the Software Development Life Cycle
- Strong technical skills in Linux/Unix systems, SQL, shell scripting, and programming languages such as Python, Java, etc.
- Strong experience with monitoring and observability tools (Prometheus, Grafana, Splunk, Geneos, OpenTelemetry).
- Familiarity with cloud platforms, containerisation (e.g., Kubernetes, Docker), and CI/CD pipelines.
- Have a strong understanding of the trade lifecycle and fundamental trading systems. Knowledge of Fixed Income and related products such as Rates, Credit and Repo.
- Self-motivated, proactive in nature, and ability to work independently; as well as collaboratively in a global organisation.
Are you interested in this position?
Apply by clicking on the “Apply Now” button below!
#GraphicDesignJobsOnline
#WebDesignRemoteJobs
#FreelanceGraphicDesigner
#WorkFromHomeDesignJobs
#OnlineWebDesignWork
#RemoteDesignOpportunities
#HireGraphicDesigners
#DigitalDesignCareers
# Dynamicbrand guru