Purpose of the position
Reporting to the Manager, Software Engineering you will design, document, build, configure, verify, monitor, and support evolving system and cloud infrastructure for our Web applications, with a focus on uptime, application scalability and rapid code deployment.
Duties and Responsibilities
- Maintain and create automation approaches to reduce errors and improve performance
- Maintain and improve CI/CD pipelines
- Provision new servers and projects in the cloud.
- Identify architecture / infrastructure problems and architect solutions involving both development and IT groups.
- Lead new cloud projects to completion with the development and quality teams, keeping in mind system security, stability, scalability, measurability, and flexibility.
- Manage in-house systems that helps the development and quality teams be more productive in their day-to-day activities of building, deploying and testing new code.
- Manage monitoring and alerting infrastructure, to resolve and prevent problems.
- Define and maintain production persistence infrastructure, including data backups/restores.
- Provision systems according to the evolving needs of the team, which today include, but are not limited to, Tomcat, Java, Apache httpd, nginX, MySQL, Redis, MongoDB, etc.
- Provide on-call support for Linux and cloud infrastructure.
Education and work experience
- Degree in Computer Science, Computer Engineering, or related field.
- At least 2 years experience in Linux/Unix system administration or system operations engineering for web-based application platform.
- Demonstrated troubleshooting and problem-solving skills.
- Strong written and verbal communication skills with a focus on collaboration between teams.
- Strong scripting and automation skills using bash, python, etc.
- Experience protecting production systems, including access controls, auditing, and compliance.
- Experience managing large-scale, highly available production environments in a cloud environment.
- Experience with logging, monitoring, and alerting tools, such as Nagios, PagerDuty, Splunk, etc.
- Experience operating on virtualization platforms either locally or in the cloud, such as VMware, Google Cloud Platform, Amazon Web Services, Microsoft Azure, etc.
- Experience with CI/CD tools and platforms, such as Teamcity, Jenkins, git, Azure DevOps.
- Experience with DevOps principles and tools, such as Docker, Kubernetes, Terraform, Packer, etc.
Supervision
- This position does not have any direct reports
Additional Details
- Office-based role
- There is a requirement to be part of an ‘on call’ rotation during which time you would be expected to respond to hi-priority system outages on a 24/7 basis – this primary on call period responsibility will usually last for seven days and will include a weekend (Saturday / Sunday)
- You would be expected to perform certain system maintenance (planned maintenance) tasks during after hour’s periods where practical since our cloud-based applications are used in all time zones.
- The team is based in two offices in Calgary AB CA and Seattle WA USA. Occasional travel between these offices may be required.
At SMART Technologies, privacy is not an afterthought to us. Privacy is at the core of our software design. The result is that privacy is an essential component at the core of our business. To read more about our specific policies, please follow the links below. For more information please review: https://home.smarttech.com/legal/privacy-policy
#LI-SM1 #LI-HYBRID