Manager, IT Infrastructure Operations

Hedge Fund #72

Location: Mid-town, NY In office: 3 days a week

Comp: (Base $ 350k – Total Comp up to $ 650k)

A Career with Hedge Fund technology team

As Hedge Fund’s reimagines the future of investing, our Technology team is constantly improving our company’s IT infrastructure, positioning us at the forefront of a rapidly evolving technology landscape. We’re a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. We encourage professional development to ensure you bring innovative ideas to our products while satisfying your own intellectual curiosity.

What you’ll do

· Own and manage the end-to-end Major Incident Management (MIM) process to ensure swift resolution of critical infrastructure incidents.

· Act as the Incident Commander during high-severity incidents, coordinating cross-functional teams and ensuring clear communication with stakeholders.

· Drive root cause analysis (RCA) for all major incidents and ensure timely implementation of corrective and preventive actions.

· Lead a Change Management process to ensure all infrastructure changes are planned, reviewed, approved, and executed with minimal risk to production systems.

· Chair the Change Advisory Board (CAB), ensuring alignment between technical and business stakeholders.

· Define and implement operational playbooks and standard operating procedures (SOPs) for incident and change management.

· Develop and maintain dashboards and reports for tracking stability performance metrics, including MTTR (Mean Time to Resolve), incident volumes, change success rates, and infrastructure reliability trends.

· Act as a trusted partner to infrastructure, application, and business teams, ensuring alignment of operational priorities and objectives.

What’s REQUIRED

· 10+ years of relevant experience in IT operations, with a focus on incident and change management.

· Bachelor’s degree in computer science, information technology, or a related engineering field.

· Deep understanding of enterprise IT infrastructure (e.g., networks, servers, storage, cloud, databases) and operational processes.

· Proven experience in managing major incidents and running mature change management processes in large, complex environments.

· Strong knowledge best practices, with ITIL v4 Foundation (or higher) certification required.

· Hands-on experience with ServiceNow or similar ITSM tools for incident, problem, and change management.

· Experience creating executive-level reports and dashboards to communicate performance metrics and operational insights.

· Prior experience working with globally distributed teams and 24/7 operational environments.

· Ability effectively lead teams during high-severity incidents.

· Demonstrated success in driving process adoption and operational rigor.

· Strong interpersonal and communication skills, with the ability to engage and influence technical and business stakeholders at all levels.

· Commitment to the highest ethical standards.

We take care of our people

We invest in our people, their careers, their health, and their well-being. When you work here, we provide:

· Fully-paid health care benefits

· Generous parental and family leave policies

· Volunteer opportunities

· Support for employee-led affinity groups representing women, people of color and the LGBT+ community

· Mental and physical wellness programs

· Tuition assistance

· A 401(k) savings program with an employer match and more

To apply for this job email your details to Graham.Gates@TechExecOnline.com

Job Overview
Job Location