Manager, Site Reliability Engineering
Job Overview
Job title: Manager, Site Reliability Engineering
Job description: About Our Team:
The Site Reliability Engineering (SRE) team designs, implements, and maintains a wide variety of cloud-based services and applications within KAR Global. We also transition and modernize KAR’s legacy technologies from private data center/cloud platforms to public cloud for a more agile and modern architecture.
About Our Candidate:
The heart of the SRE’s team success is to provide automation, self-service and first-class support to our software engineering teams, with continuous face-to-face involvement and knowledge-sharing sessions. The successful candidate for this role will continue to drive these high standards forward while collaborating closely with the software engineering organization to support the ongoing innovation, scaling and availability of KAR’s production platforms.
What You Will Be Doing:
This is a technical role and will require proven hands-on experience alongside the ability to communicate, deliver, and support the solutions being created.
· As an SRE Manager you will be responsible for one of KAR’s SRE teams. Your team will focus on delivering to the requirements of our stakeholders and the overall vision of the SRE department leadership.
· Utilize your technical expertise and solid leadership skills to guide Site Reliability Engineers to create large-scale, resilient, distributed systems and tools focused on improving developer velocity.
· Daily hands-on development with technologies such as Terraform, Jenkins, and many AWS services means you get to continually expand your knowledge.
· Contributing to architectural and design decisions within the engineering organization.
· Support and professional development of the team members in an independent contributor role.
· Delivery and execution are critical, so you will be launching and iterating regularly in an Agile environment.
· A wiliness to be a hands-on contributor when needed.
· Other duties and responsibilities as assigned.
What You Need to Be Successful:
· BA/BS degree in Computer Science or related technical field, or equivalent practical experience.
· 2+ years of experience as a manager in Operations, DevOps, SRE, or Software Engineering.
· 8+ years of experience in an SRE, DevOps, or similar operational role.
· You can write code in any language and have deployed your work in production.
· Experience with deploying and managing cloud infrastructure on AWS.
· Experience with Configuration management and infrastructure automation tools, i.e. Terraform, CloudFormation, Ansible, Salt, etc.
· Experience working in large scale distributed systems in a 24/7 SaaS focused environment.
· Experience with operational aspects of software systems such as monitoring, centralized logging, and alerting.
· Understanding of Continuous Integration/Deployment best practices and tools i.e Jenkins, Gitlab, CodeDeploy, etc.
· You have exceptional communication skills and an aptitude for developing relationships at the executive, engineering, and business levels throughout organizations.
#techjobscanada
Company: KAR Auction Services
Expected salary:
Location: Toronto, ON
Job date: Sun, 23 Oct 2022 07:06:10 GMT
Job Source: Careerjet.ca