Description
A career as a System Reliability Specialist in the team at National Bank means participating in the transformation to have a direct impact on the client. This job allows you to have a positive impact on our organization, thanks to your skills in resilience and availability of IT services.
Your Job
- Build and maintain common CI/CD pipelines used by the larger team.
- Promote best practices for resilience and stability among application and infrastructure teams.
- Understand the main flows of our critical environments and identify single points of failure.
- Support IT teams to improve their documentation and architecture diagrams to include resilience and stability information.
- Promote and increase the automation of IT tasks to reduce human errors.
- Analyze system stability and recommend performance and resilience improvements.
- Promote best monitoring practices and support IT teams in implementing key resilience and stability indicators.
- Support IT teams following major events impacting the resilience of their systems.
- Participate in the redesign of the cross-functional architecture of the credit card domain.
- Challenge your colleagues, architects, developers, and designers to develop the team as a whole.
- Participate in a multitude of large-scale projects.
Your Team
IT is more than 2,300 experts who work in an agile, proactive, and collaborative manner to seize opportunities, stay at the cutting edge of technology, and continuously improve processes.
You are part of the Card Assets team. Our team stands out for its collaboration, agility, and continuous improvement mindset.
You report to the Director of Asset Management.
The Bank values continuous development and internal mobility. Our personalized training programs, based on learning through action, allow you to master your profession and develop new areas of expertise. Tools such as the Data Academy, language training, and coaching and mentoring support are available to you at all times.
Prerequisites
- Bachelor's degree or other related diploma in the field and several years of relevant experience.
- Expertise in software design of complex systems supporting thousands of concurrent clients.
- At least 10 years of experience in a complex environment composed of new and legacy technologies.
- Excellent understanding of DevSecOps principles, monitoring, and observability.
- Experience with AWS cloud technology (service development, deployment, automation, and operations).
- Proficiency with monitoring tools (Datadog, Cloud Watch, Splunk).
- Experience working with APIs.
- Experience in a technological leadership role.
- 24/7 operational experience.
- Experience in load testing and analysis.
- Strong ability to solve complex multi-system problems.
Required Languages
Bilingualism (French and English).
Soft Skills
- Collaboration
- Agility
- Continuous improvement mindset