Site Reliability Engineer
On-site- Vancouver, British Columbia, Canada
Technology
Job description
TrustFlight is at the forefront of digitizing the aviation industry with the creation of intelligent workflow applications that automate operating and maintenance processes, enabling our customers to focus on the data and insights that matter. TrustFlight has bases in both England (London & Leamington Spa) and Canada (Vancouver). Our business is rapidly expanding, and we’re proud to share that we’re entirely self-funded and consistently profitable.
Not only are we disrupting the sector, we are creating a great place to work that gives our people the freedom to create, innovate and influence how we do this. We continue to build an amazing group of people who are all here to make our products, services and culture the most envied in the industry!
We are seeking a talented Site Reliability Engineer (SRE) to join our Operations team. In this role, you will focus on ensuring the reliability, scalability, and performance of our systems and services across multiple cloud platforms. You’ll work with a variety of technologies, including cloud environments, containerized applications, and automation tools to continuously improve our infrastructure.
Responsibilities
- Maintain high reliability, availability, and scalability across cloud platforms (Azure, GCP) and resilient shared services (e.g., CI/CD)
- Automate infrastructure and resource provisioning, improving system efficiency and streamlining workflows
- Monitor system performance and capacity, implementing optimizations to ensure smooth operations
- Respond to incidents, investigate and resolve issues with operational environments
- Manage database backups and recovery processes, ensuring data integrity and availability
- Implement disaster recovery plans and regularly perform failover testing to ensure operational readiness
- Ensure security is a first-class priority across all areas of the platform, adhering to industry best practices and compliance requirements
- Contribute to blameless post-mortems for production incidents
- Provide support to our UK teams (and in emergencies) outside of regular business hours on a rota basis or similar
In this role, you will be energized and guided by our experienced Operations team, fostering a dynamic environment for continuous learning and improvement. At TrustFlight, we deeply value teamwork and are committed to the personal and professional growth of each team member. We are looking for professionals who are confident in their ability to acquire new skills and grow their expertise through dedicated mentorship and a supportive work culture.
Job requirements
You Ideally need the following to qualify:
- A passion for delivering modern scalable, performant, efficient, and resilient cloud-hosted systems
- Demonstrable experience of building and supporting internal platforms (pipelines, tooling, etc) that allow engineering teams to build, deploy, and operate software systems efficiently and effectively
- Experience or knowledge as an SQL Database Administrator (DBA), including database backup and recovery processes
- Hands-on experience with the Azure platform, including resource provisioning and cost management
- Excellent troubleshooting skills, with a focus on diagnosing and resolving complex issues across distributed systems
- A focus on building security and quality into development processes (i.e. DevSecOps)
- Demonstrable understanding of the following:
- Virtualization, Kubernetes, and containerisation (Docker, containerd)
- Azure App Service, SQL Database, Front Door/CDN/WAF, Cognitive Search
- CI/CD pipelines (preferably GitLab CI and Azure DevOps)
- Infrastructure-as-Code (preferably Terraform)
- Release processes and configuration management
- Web application and Microservice architecture
- Networking, including DNS, NGINX, firewalls, routing, load balancing, and VPNs
- Virtualization, Kubernetes, and containerisation (Docker, containerd)
- Strong scripting skills for automation using PowerShell, Azure CLI, and Bash
- Master-level organisation and detailed documentation skills
- Excellent collaboration and team-working skills
- Familiarity with the capabilities and practical applications of current AI technologies, and experience leveraging AI tools in operational processes
The following will be considered as a significant plus and would enhance your candidacy:
- Experience working in a high-paced SaaS startup
- Experience working across multiple timezones and with remote teams
- Knowledge of ArgoCD, Skaffold or Kustomize
- Familiarity with .NET
- Hands-on experience with Google Cloud Platform (GCP)
- Experience working with SIEMs and monitoring tools
Job location
This role is primarily based out of our Vancouver office. With our hybrid working policy, we prioritize a harmonious balance between working from home and being present in the office. This empowers an agile and flexible environment that supports your needs. However, it’s important to note that for you to fully harness the benefits of collaborating closely with our exceptional team, this role demands a heightened level of flexibility.
Benefits
- We offer a generous holiday allowance that increases the longer you are here. We are keen for birthdays to be celebrated and so we offer an additional day off to everyone.
- It is important to us that we all work in an environment that is supportive of health and wellbeing; healthcare cover for all our people covers your health, dental and ophthalmic requirements to support you physically and mentally.
- Our generous company contribution to your pension is greater than the local requirements and over time you can plan effectively for your future with our matching contribution scheme.
- We place huge importance on the contribution and experience you bring to the team, the salary will be based on the value you will bring to the role with a range spanning from 90-125K CAD.
How to apply
Tell us about you in a cover letter, outlining what you will bring to the role and how you can contribute to creating best in class tools and services throughout the aviation industry. Please also include your resume.
TrustFlight is an equal opportunity employer. We work together to create the most talented team that celebrates inclusivity, diversity and equality in a serious way. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. All candidates will receive consideration for this role without regard for gender, gender identity, race, national origin, colour, religion, disability or age. Our inclusive culture empowers all of us to inspire, enlighten and thrive.
or
All done!
Your application has been successfully submitted!