Senior Site Reliability Engineer

  • Full-Time
  • Remote
  • Posted on April 24, 2025

Quantitative Trading Firm #015

As one of Enterprise Technology’s first SREs, you will help to establish and grow our site reliability engineering practice in addition to ensuring the availability and reliability of systems within our stack.

This role requires a deep Linux operating system and application administration skill set, proficiency in Python, and solid experience with configuration management/IaC. Successful candidates should also have exceptional organizational, communication, and project management skills, as well as the ability to troubleshoot complex technical issues.

Responsibilities

  • Manage on-premises containerized web services and a multitude of bridge services, integrations, and batch processes that interconnect the elements of our productivity ecosystem
  • Proactively eliminate sources of operational work: engineering, not firefighting
  • Automate and troubleshoot a broad range of technical infrastructure both on-prem and in the cloud
  • Develop and implement monitoring solutions to ensure high system uptime and reliability
  • Enable transparency and high development velocity within the firm while maintaining a high bar for security. Find ways to reduce user friction, and make sure users have access to the tools and data they need when they need them
  • Break down complexity, iterate, and communicate progress to a wide variety of leads and stakeholders

Qualifications

  • 5+ years of experience in site reliability engineering or related disciplines
  • Proficiency with Python
  • Experience managing and monitoring containerized infrastructure
  • Experience working with CI/CD tools such as Jenkins, GitHub Actions, or ArgoCD
  • Expert-level experience with IaC and configuration management tools such as Terraform, SaltStack, Chef, Puppet, or Ansible

To apply for this job email your details to Graham.Gates@TechExecOnline.com
