Service Monitoring and Maintenance Engineer

Posted on Apr 25, 2024 by WNTD
Not Specified, United Kingdom
IT
Immediate Start
Annual Salary
Contract/Project

Job Title: Service Monitoring and Maintenance Engineer

Contract Type: Contract (Inside IR35), 12 Months

Job Description:

We are seeking a highly skilled Service Monitoring and Maintenance Engineer to join our team on a 12-month contract basis. The successful candidate will be responsible for monitoring and maintaining the operational health of various services within our technology ecosystem. This role is crucial in ensuring the reliability and performance of our services, making use of a variety of tools and platforms.

Key Responsibilities:

  • Service Monitoring: Continuously monitor service metrics through various platforms such as BES, ECP Platform Health Dashboard, and CloudWatch metrics. Identify and respond to anomalies and performance issues promptly.

  • Application Maintenance: Regularly update and maintain application code across services. This includes managing:

    • Python runtime and dependencies
    • Terraform configurations
    • GitHub Actions workflows
  • Incident Management

  • Develop and execute runbooks/playbooks for efficient response to incidents and service requests. Ensure swift resolution and minimal downtime.
  • Testing and Quality Assurance: Create, maintain, and enhance testing frameworks and infrastructure.

Responsibilities include:

  • Developing and executing unit tests and synthetic tests
  • Integrating and maintaining BES Monitoring
  • Ensuring proper functioning with the ECP Platform Health Dashboard
  • Deployment and Configuration Management: Manage GitHub deployment workflows to ensure smooth and reliable deployment processes.

Responsibilities include:

  • Performing tests on deployments
  • Reverting configurations that compromise operational availability, such as erroneous Firewall rules
  • Service Review and Stakeholder Engagement: Regularly review service performance and incident reports.
  • Provide constructive feedback and recommendations to ECP stakeholders and incorporate feedback from customers to enhance service delivery.

Required Skills and Qualifications:

  • Technical Expertise: Proficiency in Python, Terraform, and GitHub. Experience with AWS CloudWatch or similar monitoring tools is highly desired.
  • Problem Solving: Strong analytical and problem-solving skills with the ability to handle multiple incidents and emergencies.
  • Communication: Excellent communication skills, capable of effectively articulating technical challenges and solutions to stakeholders and team members.
  • Experience: Proven experience in managing IT service delivery, monitoring, and incident response.

Additional Requirements:

  • Ability to work in a fast-paced, dynamic environment.
  • Demonstrated experience in handling large-scale services and deployments.
  • A proactive approach to service health and improvements.

Reference: 2751420646

https://jobs.careeraddict.com/post/90072476

This Job Vacancy has Expired!

Service Monitoring and Maintenance Engineer

Posted on Apr 25, 2024 by WNTD

Not Specified, United Kingdom
IT
Immediate Start
Annual Salary
Contract/Project

Job Title: Service Monitoring and Maintenance Engineer

Contract Type: Contract (Inside IR35), 12 Months

Job Description:

We are seeking a highly skilled Service Monitoring and Maintenance Engineer to join our team on a 12-month contract basis. The successful candidate will be responsible for monitoring and maintaining the operational health of various services within our technology ecosystem. This role is crucial in ensuring the reliability and performance of our services, making use of a variety of tools and platforms.

Key Responsibilities:

  • Service Monitoring: Continuously monitor service metrics through various platforms such as BES, ECP Platform Health Dashboard, and CloudWatch metrics. Identify and respond to anomalies and performance issues promptly.

  • Application Maintenance: Regularly update and maintain application code across services. This includes managing:

    • Python runtime and dependencies
    • Terraform configurations
    • GitHub Actions workflows
  • Incident Management

  • Develop and execute runbooks/playbooks for efficient response to incidents and service requests. Ensure swift resolution and minimal downtime.
  • Testing and Quality Assurance: Create, maintain, and enhance testing frameworks and infrastructure.

Responsibilities include:

  • Developing and executing unit tests and synthetic tests
  • Integrating and maintaining BES Monitoring
  • Ensuring proper functioning with the ECP Platform Health Dashboard
  • Deployment and Configuration Management: Manage GitHub deployment workflows to ensure smooth and reliable deployment processes.

Responsibilities include:

  • Performing tests on deployments
  • Reverting configurations that compromise operational availability, such as erroneous Firewall rules
  • Service Review and Stakeholder Engagement: Regularly review service performance and incident reports.
  • Provide constructive feedback and recommendations to ECP stakeholders and incorporate feedback from customers to enhance service delivery.

Required Skills and Qualifications:

  • Technical Expertise: Proficiency in Python, Terraform, and GitHub. Experience with AWS CloudWatch or similar monitoring tools is highly desired.
  • Problem Solving: Strong analytical and problem-solving skills with the ability to handle multiple incidents and emergencies.
  • Communication: Excellent communication skills, capable of effectively articulating technical challenges and solutions to stakeholders and team members.
  • Experience: Proven experience in managing IT service delivery, monitoring, and incident response.

Additional Requirements:

  • Ability to work in a fast-paced, dynamic environment.
  • Demonstrated experience in handling large-scale services and deployments.
  • A proactive approach to service health and improvements.

Reference: 2751420646

CareerAddict

Alert me to jobs like this:

Amplify your job search:

CV/résumé help

Increase interview chances with our downloads and specialist services.

CV Help

Expert career advice

Increase interview chances with our downloads and specialist services.

Visit Blog

Job compatibility

Increase interview chances with our downloads and specialist services.

Start Test