Site Reliability Engineer
Posted on Dec 3, 2019 by Michael Bailey Associates - Amsterdam
For one of the biggest e-commerce companies worldwide, we are looking for a Site Reliability/DevOps/SiteOps engineer.
This is a contract role of 6-12 months with an option to extend.
You have experience managing large websites or services within the context of a large-scale web environment. You are able to execute and deliver projects in a high-pressure environment without sacrificing quality. You show personal initiative in identifying what needs to be done. You enjoy automating everything so that you can sit back and relax! Google search is your best friend, and open source repos and Stack Overflow are your frequent stops.
The candidate we are looking for is an engineer with a passion for open source products and systems automation.
We are looking for expertise in one or several of the following key technologies:
HashiCorp stack: Terraform/Nomad/Consul
Scala or Go knowledge is a plus.
Dive deep into system and application performance and reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes and improvements.
Troubleshoot issues across the entire stack. Solve problems relating to mission critical services and build automation and monitoring to help prevent problem recurrence.
Engage in service capacity planning and demand forecasting, software performance analysis and system tuning.
Represent the Site Operations team in design reviews and operational readiness exercises for new and existing services.
Engineering degree or equivalent work experience
Hands-on experience with infrastructure automation tools and orchestrators (the whole HashiCorp stack, SaltStack, etc.)
Experience with monitoring frameworks (Grafana, Kibana, Prometheus)
You feel ownership over everything you ship.
You pride yourself on efficient monitoring and strong documentation.
Working knowledge of the TCP/IP stack, Internet routing and load balancing.
Experience troubleshooting Java-based server applications.
You are good on your own, but shine in a team.
Able to prioritize tasks and work independently.
Expertise in designing, analysing and troubleshooting large-scale distributed systems.
Experience with Cassandra, Kafka and Elasticsearch.
Deep network analysis experience is a plus.
Strong Linux system-level analysis capabilities.
Systematic problem-solving approach, coupled with a strong sense of ownership and drive.
Demonstrated skill and passion for operational excellence.
Are you ready for your next big challenge?
Michael Bailey International is acting as an Employment Business in relation to this vacancy.