Site Reliability Engineer (Remote)


  2026-05-21
  Remote, Nigeria
  Not specified
  Information Technology

Moniepoint Incorporated is a global business payments and banking platform and recently became QED Investors’ first investment in Africa. We are the partner of choice for over 600,000 businesses of all sizes, powering the dreams of SMBs and providing them with equal access to the tools they need to grow and scale.

We are recruiting to fill the position below:

Job Title: Site Reliability Engineer

Location: Nigeria (Remote)

Job Summary

  • The ideal candidate will balance real-time on-call responsibilities with strategic engineering work to achieve sustainable and scalable service reliability.
  • We are seeking a Site Reliability Engineer (SRE) responsible for ensuring our systems run smoothly and efficiently while engineering solutions to improve visibility, eliminate repetitive tasks, and increase system resilience.

Responsibilities

  • Implement and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) defined by the engineering leadership.
  • Create and maintain meaningful dashboards and alerts. Work with development teams to instrument their code to ensure visibility.
  • Participate in on-call rotations to detect and triage service and reliability issues across all environments. Act as the Incident Commander during major incidents: initiating war room or bridge calls, coordinating cross-functional teams, providing timely and clear status updates to all stakeholders.
  • Investigate and resolve customer complaints escalated beyond L1 and L2 support, especially those involving performance, reliability, or complex system behavior.
  • Develop automation to eliminate manual and repetitive operational tasks (toil) related to reliability across both applications and infrastructure.

Requirements

  • Experience setting up dashboards in Grafana and using APM tools like Datadog, New Relic, Signoz.You have a Solid understanding of metrics, logs, and traces.
  • Hands-on experience with Kubernetes. You have managed applications on a major cloud provider (GCP, AWS, or Azure), and can troubleshoot common container issues.
  • Proficiency in SQL (e.g., PostgreSQL, MySQL). Ability to write complex queries to debug data issues and a basic understanding of database performance.
  • Minimum of 3 years of experience supporting enterprise applications as an SRE or similar role with proficiency in writing code in Java, Go or Python
  • Good understanding of distributed systems concepts, microservices architecture and software design patterns.

Click link to Apply





Get the Latest Jobs Delivered to Your Inbox