VP / AVP, Application Resiliency Engineer, Enterprise Architecture - SRE, Technology and Operations
Group Technology and Operations (T&O) enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group T&O, we manage the majority of the Bank's operational processes and inspire to delight our business partners through our multiple banking delivery channels.
Site Reliability Engineering (SRE) at DBS combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring our customer and colleagues experience best of DBS systems.
As a Site Reliability Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Site Reliability Engineers utilize automation, continuous monitoring, tools and solid engineering principles around infrastructure and applications to optimize existing systems, build infrastructure and eliminate operational work.
DBS Technology and Operations is looking for passionate, creative and detail oriented engineers who excel on solving operational problems and improving efficiency. Responsibilities
Drive the Resiliency/Chaos Engineering agenda forward at an Enterprise Level Support services before they go live through activities such as Chaos testing (failure injection), system design inputs, developing software platforms and frameworks, capacity planning and launch reviews. Engage with product engineering teams to test against relevant Chaos Engineering tool kit. Sounds understanding of CI/CD pipelines and SDLC (application delivery) Understanding of SLI, SLO and Error budget for the system/s Programming and scripting to automate failure scenarios, integration with pipelines and developing self-service portals Work with teams located across locations in Asia Pacific Requirements
Very good analytical and problem solving skills with good understanding of technical risks emerging out of architecture decisions. Experience with developing applications in a Linux environment, with sound knowledge of algorithms, data structures, complexity analysis and software design. Understands complex architectures and well versed with design patterns. Development skills with experience in real time, distributed and highly secured environments. Experience with developing test cases and ensuring appropriate test coverage through unit and automated testing. Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive. Sound understanding of systems and application monitoring to establish SLI and Golden signals to improve detection rate and time. Bachelor's or Master's degree in Computer Science, a related technical field that involves programming, or equivalent practical experience. Minimum of 10 years technology experience. Highly motivated, pro-active and capable of working under pressure without compromising development processes and productivity. Strong, committed and reliable team player and strong communicator, able to take direction but also willing to contribute to discussions on design and strategy. Possess client-facing skills to be able to deal with and form good relationships with the business and other technology groups through day to day support and project work. Interest in financial technologies, new technology tools and the ability to learn. Apply Now
We offer a competitive salary and benefits package and the professional advantages of a dynamic environment that supports your development and recognises your achievements.