Platform Software Delivery
- Design and implement business-critical, cloud-based Platform solutions with an automation-first mindset, observability, container design patterns, and best-of-breed cloud tools and architecture practices
- Collaboratively solve business and technology problems in partnership with key stakeholders from the Digital Platform team, security, enterprise architecture, and product owners
- Contribute to container, microservices application code base and architecture with a focus on optimization for performance, reliability, scalability, security, observability and cost
- Develop and implement solutions for non-functional requirements with a focus on automation, whitebox monitoring, and modularity for broad re-use across system components
- Design and implement CI/CD deployment pipelines and test automation frameworks based on best practices to enable frequent, high quality releases
- Define and implement application deployment strategies based on application type
Operations
- Assist with guiding, growing and training agile engineering teams to optimize service quality and ensure adoption of container, microservices, and operational best practices
- Ensure the effective capture of application telemetry, logging and monitoring of all aspects of system and application behavior to facilitate fast detection and issue resolution
- Design and develop operational tools and services needed to effectively operate system components at scale
- Understand and adhere to operational processes that ensure auditability, risk management, and compliance with ISO and industry standards (including Incident, Problem, and Change Management)
- Continually evaluate service and infrastructure usage to effectively manage performance, capacity, and cost, automating solutions and removing toil wherever possible
- Participate as a member of the broader SRE community to develop tools and services that enable automated operations
Support
- Contribute to the technical documentation required to guide on-call engineers and onboard team members
- Maintain system-wide health, proactively seek out potential issues, and address them with component teams
- Proactively and continuously drive system-wide quality improvements by undertaking thorough root cause analysis for major incidents with component engineering teams
- Provide training and coaching to other engineers as a Subject Matter Expert
Qualifications:
Required Skills/Experience:
- 5-7 years of experience with AWS, Kubernetes, and container-based architectures, designs, and solutions
- Experience in a similar role with a continuous delivery model at an organization that has adopted the SRE model
- Experience with AWS CloudFormation, HashiCorp Terraform, and Ansible for automated infrastructure and platform provisioning
- Experience with source code management systems (e.g., Git/Bitbucket)