EdTech Jobs
Hack The Box

Site Reliability Engineer

Hack The Box
🇺🇸Hybrid - Palaio Faliro€28K–€42K/yri6h ago
Prep for this Role

Role Snapshot

Site Reliability Engineer at Hack The Box focused on empowering the Content Engineering team by building reliable, scalable, and automated cloud infrastructure for hands-on learning experiences. The role involves infrastructure automation, observability, and operational excellence over a 6-month initial focus period.

Key Responsibilities: Design and maintain Terraform-based infrastructure across multiple cloud providers (GCP, Azure, AWS), build observability capabilities using monitoring tools, and partner with Content Engineers to improve workflows and remove operational friction. Support production environments, drive automation initiatives, train engineers on infrastructure best practices, and collaborate with the broader SRE team on platform reliability.
Skills & Tools: Hands-on expertise with Terraform, cloud platforms (GCP, Azure, AWS), observability tools (Prometheus, Grafana, Mimir, Loki, Tempo), and CI/CD automation. Strong collaboration and enablement skills with ability to communicate infrastructure concepts to non-SRE teams.
Qualifications: Demonstrated hands-on experience with Terraform and cloud infrastructure management across multiple providers. Experience with observability, monitoring, and production environment support required.
Location: Hybrid - Palaio Faliro
Compensation: €28K–€42K/yr (estimated)

Job Description

Welcome! Super excited you dropped by 🥳
Let's redefine cyber security expertise standards and connect business - community through highly engaging hacking experiences. (Find out more insights about Hack The Box culture in our career site).

✨ The Core Mission of the Site Reliability Engineer (SRE):

As a Site Reliability Engineer at Hack The Box, your paramount mission is to empower our Content Engineering team by providing reliable, scalable, and automated cloud infrastructure for our hands-on learning experiences.

Over the next 6 months, you will participate in enhancing and simplifying the systems, services, and tools that enable Content Engineers to build and operate cloud labs efficiently. You will focus on infrastructure automation, observability, operational excellence, and continuous improvement of the workflows that support content creation and delivery.

In parallel, you will contribute to the broader Site Reliability Engineering practice by helping maintain production services, improving platform reliability, supporting observability initiatives, and collaborating with fellow SREs on operational excellence efforts. While your primary focus will be the Content Engineering domain, you will remain closely aligned with the team’s reliability, automation, and infrastructure standards.

⚔️ Technology Tools & Weapons You’ll Be Using:

  • Infrastructure as Code (Terraform): Automate the provisioning and management of cloud resources.
  • Cloud Platforms (Google Cloud Platform, Microsoft Azure, AWS): Design, deploy, and operate infrastructure powering our cloud labs.
  • Observability & Monitoring (Prometheus, Grafana, Mimir, Loki, Tempo): Maintain visibility into platform health and reliability.
  • CI/CD & Automation: Improve and automate existing workflows and deployment processes.
  • Collaboration & Enablement: Work closely with Content Engineers to improve developer experience and platform adoption.

🚀 The Adventures That Await Your Life Becoming a Site Reliability Engineer at Hack The Box:

  • Heavily contribute to the reliability and scalability of the infrastructure powering Hack The Box cloud labs.
  • Partner with Content Engineers to improve workflows, remove operational friction, and enable faster content delivery.
  • Train and facilitate engineers on infrastructure best practices, Infrastructure as Code, and cloud-native technologies.
  • Design, implement, and maintain Terraform-based infrastructure across multiple cloud providers.
  • Build and enhance observability capabilities that improve operational visibility and incident response.
  • Support production environments through maintenance, troubleshooting, and continuous improvement initiatives.
  • Collaborate with the broader SRE team, contributing to shared platform reliability efforts when needed.
  • Drive automation initiatives that reduce manual effort and improve consistency across systems and processes.

🏆 Skills, Knowledge, and Experience Points Required to Unlock the Role of SRE at Hack The Box:

  • Hands-on experience with Terraform and Infrastructure as Code practices.
  • Experience operating and supporting workloads in Microsoft Azure and/or Google Cloud Platform (GCP).
  • Strong scripting and automation skills, ideally in Go but open for Python, Bash, or similar.
  • Experience with monitoring, observability, and operational troubleshooting in production environments.
  • Familiarity with CI/CD pipelines and developer enablement practices.
  • Excellent communication and collaboration skills, with the ability to work closely with cross-functional engineering teams.

Bonus Points:

  • Previous participation in on-call rotations or incident response processes.
  • Software development experience and familiarity with application development workflows.
  • Background in cybersecurity, penetration testing, or security-focused environments.
  • Experience contributing to realistic cloud architectures and operational workflows that support cybersecurity training scenarios.
  • Experience with Kubernetes, containers, and cloud-native technologies.

What your Hack The Box adventure will have in store:

  • 🎯You'll have the exhilarating opportunity to contribute to a product that is highly appreciated by users and the cybersecurity community at large
  • 🎯 You'll experience a highly supportive and caring environment, fostering growth, flexibility, and autonomy
  • 🎯 You'll embark on an exciting journey of continuous learning and problem-solving, leveling up as our organization grows
  • 🎯 Most importantly, you'll have a blast at HTB 🥳 because fun is an essential ingredient in our recipe for success! Just wait until you see our global meet-ups!

💰 The gems you’ll be enjoying as a Site Reliability Engineer:

  • Private health care
  • Paid paternity leave
  • 25 annual leave days
  • Free lunch & snacks at the office
  • 120€ Ticket Restaurant by Edenred
  • Dedicated budget for training and professional development, participation in conferences
  • Full access to the Hack The Box lab offerings; so you can learn how to hack 😉
  • State-of-the-art equipment (mac, iPhone, and mobile plan)
  • Flexible WFH (Hybrid Model) - Fully Remote is also an option if you're not an Attica resident

Our benefits package is designed to provide strong support to our team, but it may vary depending on location and type of employment (e.g., UK, Greece, or engagement through an Employer of Record).

🗺️ The Quest of Becoming Hack The Box’s Site Reliability Engineer:

  • Level 1: To complete level one’s objective, submit your application.
  • Level 2: Meet the Talent Acquisition team. Level’s objective: highlight your past achievements, ambitions, and values.
  • Level 3: Meet the hiring team. Level’s objective: connect with the hiring team and share with them your achievements.
  • Level 4: Complete an assignment that aligns with day-to-day job-related tasks and responsibilities. Part of the assignment is discussing it with the hiring team in a debriefing session, in order to walk the team through your thinking process.
  • Level 5: Congratulations! Not many reach this level 💪. Level’s objective: have a constructive, final conversation with senior leadership to explore the role and your future at HTB.
  • Level 6: You've officially received an offer from HTB! To complete the last level and the Quest, all you need to do is accept the offer.
  • Quest complete. Congratulations, you’re officially one of us 🥳🎉🎇Your next quest: complete the onboarding.

Hack Your Career, Today. Join us in this epic adventure of cybersecurity at Hack The Box! 🚀🔒💻

At Hack The Box, we are on a quest to find the most exceptional and enthusiastic talent to join our team. Whether or not you consider yourself a gamer, we value what makes you unique and want to know more about you. This job post provides just a glimpse of the incredible gamified experience our business and consumer customers enjoy through our platforms. So, if you're ready to embark on a journey of growth and adventure, we can't wait to meet you!

ABOUT HACK THE BOX

Hack The Box is the Cyber Performance Center with the mission to provide a human-first platform to create and maintain high-performing cybersecurity individuals and organizations.

Hack The Box is the only platform that unites upskilling, workforce development, and the human focus in the cybersecurity industry, and it’s trusted by organizations worldwide for driving their teams to peak performance. Offering an all-in-one environment for continuous growth, assessment, and recruitment, Hack The Box provides solutions for all cybersecurity domains.

Launched in 2017, Hack The Box brings together the largest global cybersecurity community of more than 3 million platform members. Rapidly growing its international footprint and reach, Hack The Box is headquartered in the UK, with additional offices in the US, Australia, and Greece.

🚨 Exciting News:

At Hack The Box, we are committed to fostering a diverse, inclusive, and equitable workplace. We believe that diversity enriches our performance, services, and the communities we serve. As such, we ensure that all job applications are considered solely based on merit, skills, and qualifications. We do not discriminate on grounds of race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. We are dedicated to providing a fair and respectful work environment that reflects our values.