Site Reliability Engineer

HF #002

Fixed Income Technology is responsible for providing real-time, firm-wide risk and P&L for Fixed Income, Commodities, Credit and FX.

We run a service mesh on a Kubernetes cluster. If you join our team, you will help deploy and support the growing number of cloud and container services we run.

Principal Responsibilities:

Help team members deploy new services, including Kustomize manifests and build pipelines
Advise service developers on best practices for observability, metrics, logging and tracing
Configure Istio for a microservices environment, including routing, mirroring, A/B deployments and circuit breakers
Set up alerts with Prometheus and help troubleshoot in a multi-service environment
Help developers set up Skaffold environments and Docker images

Desired Qualifications/Skills:

Self-starter able to execute independently, on a deadline, and under pressure
At least 5 years of experience supporting production environments
Experience with Kubernetes and Docker
Experience with Istio
Experience with Prometheus and Jaeger
Experience with Kustomize
Experience with Skaffold
Experience with Jenkins
Experience with Argo CD and Argo Workflows
Excellent troubleshooting and analytical skills
Excellent written and verbal communication skills
Experience with Python and bash scripting

To apply for this job email your details to

Job Location