View Jobs at Interswitch Group |
Full Time Jobs |
Lagos |
Posted 3 hours ago |
Job Title: Site Reliability Engineer
Job Summary
- The role involves managing the availability and capacity of core applications to ensure optimal performance.
- Responsibilities include providing technical support for these applications, troubleshooting issues, and implementing the setup and integration of new applications within the company’s environment.
- The successful candidate will play a key role in maintaining the stability and efficiency of the organisation’s application infrastructure.
Responsibilities
- Design, implement, and maintain highly available and scalable infrastructure.
- Collaborate with development teams to ensure applications are built with reliability and performance in mind.
- Monitor system performance, identify bottlenecks, and proactively implement optimizations to improve system efficiency.
- Develop and maintain automation tools for deployment, configuration, and monitoring of systems and services.
- Conduct system capacity planning and provide recommendations for scaling resources to meet growing demands.
- Identify and resolve complex technical issues related to infrastructure, networking, and application performance.
- Implement and improve monitoring, alerting, and logging systems to ensure timely detection and resolution of incidents.
- Collaborate with cross-functional teams to define and implement best practices for infrastructure, deployment, and operational processes.
- Participate in on-call rotation and provide timely response and resolution to production incidents.
- Stay up-to-date with industry trends and emerging technologies in cloud computing and infrastructure automation.
- Strong experience with on-prem infrastructure and cloud platforms such as AWS and Azure.
- Proficiency in infrastructure-as-code (IaC) tools like Terraform or CloudFormation.
- Solid understanding of containerization technologies such as Docker and orchestration tools like Kubernetes.
- Experience with configuration management tools like Ansible, Puppet, or Chef.
- Strong scripting skills in languages such as Python, Bash, or PowerShell.
- Deep knowledge of Linux systems administration and networking concepts.
- Familiarity with monitoring and logging tools like Prometheus, Grafana, and ELK Stack
Requirements
Academic Qualification(s):
- Bachelor’s Degree in Computer Science, Engineering, or a related field (or equivalent work experience).
Professional Qualification(s):
- Service Management Certifications (e.g. ITIL) are an advantage
Experience (Number of relevant years):
- 3-4 years of experience in a similar SRE or infrastructure engineering role.
Application Instructions:
The application deadline is Not Specified. Therefore, qualified and interested candidates can “CLICK HERE TO SUBMIT APPLICATION.” It is important to visit the official website (link found below) for detailed information on how to apply successfully for this vacancy.
Official Job Website: https://interswitchgroup.com/
Job Features
Job Category | Site Reliability Engineer |