Site Reliability Engineer

Capital Management Firm #017

We are seeking a highly technical site reliability engineer to join our growing team. This IT operations role applies an engineering approach to automating the reliability of the network.

We count on our site reliability engineers to deliver high availability and stellar performance that aligns with service-level objectives and goals of the IT organization and business.

You will help in the evolution of our system/network processes and author automation to help scale the business.

Candidates should have excellent client facing skills and fluency translating business requirements into technical architectures, data models and designs.

Experience with clients in the financial services domain is a plus.

Primary Functions and Responsibilities:

· Run the production environment by monitoring availability and system health.

· Continually and programmatically measure and optimize system performance.

· Provide primary operational support and engineering for enterprise applications and systems.

· Participate in on-call rotation.

· Develop and implement code for automation to provision enterprise infrastructure

· Use error budgets to balance speed to market and maintaining service-level objectives

· Define process and automation workflows

· Interact with other engineering teams to help them improve availability, reliability, observability and resilience of our infrastructure and systems.

· Use a proactive approach to spotting problems, areas for improvement, and performance bottlenecks

Communicate timelines, network dependencies, resource constraints, and progress with stakeholders



Experienced: BS degree (or equivalent professional experience) in Computer Science or related engineering field with at least 8+ years of experience.
A system engineer: have at least 8+ years of experience building, maintaining, operating, and deploying distributed services on-premise and in Azure.
A programmer: experience with 2 or more programming/scripting languages (Python, Javascript, Bash, Go, Powershell…) and love solving problems by writing code.

Experience with Linux and Windows System Administration
Experience with scripting languages – Powershell, Python, and Bash
Experience with Continuous Delivery/Integration pipelines using Jenkins
Have an ‘automate-first’ attitude
Ability to configure and maintain containers, Kubernetes, Ansible, and Terraform
Dependable, great attitude, highly motivated and a team player
Experience with Software Development Life Cycle (SDLC) practices
Excellent interpersonal and organizational skills
Ability to be flexible in terms of hours to coordinate with team members across various time zones
Great judgment in terms of escalating issues versus solving problems independently

To apply for this job email your details to

Job Location