System Reliability Engineer

Kuala Lumpur, Permanent
The System Reliability Engineer will focus on ensuring the availability, performance, and reliability of technology systems within the insurance industry. This role requires a keen eye for system performance and a proactive approach to problem-solving in a high-demand environment.
  • Great remuneration package
  • Great career enhancement

About Our Client

This is an exciting opportunity to join a large organization within the insurance industry. The company is committed to leveraging innovative technology to deliver exceptional services and maintain a strong market presence.

Job Description

  • Monitor and improve the reliability and performance of critical technology systems.
  • Develop and implement automated solutions for system operations and monitoring.
  • Collaborate with cross-functional teams to resolve system issues effectively.
  • Analyze system performance metrics to identify and address potential bottlenecks.
  • Ensure system scalability to meet business demands and growth.
  • Maintain documentation of system configurations, processes, and troubleshooting steps.
  • Participate in incident management and root cause analysis to prevent future issues.
  • Support a culture of continuous improvement within the technology department.

The Successful Applicant

A successful System Reliability Engineer should have:

  • Bachelor's degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • 3-5 years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
  • Prior experience supporting front-end applications in production environments, preferably in financial services or regulated industries.
  • Front-end performance monitoring, including the ability to instrument front-end code for custom metrics and traces.
  • Experience with Real User Monitoring (RUM), Synthetic Monitoring, and Application Performance Monitoring (APM) tools (e.g., New Relic, Dynatrace, Datadog).
  • Proficiency in setting up dashboards and alerts using tools like Dynatrace, Grafana, Prometheus, Elastic Stack, or Splunk.
  • Familiarity with OpenTelemetry standards for distributed tracing.
  • Scripting skills in Python, Bash, or JavaScript for automation and tooling.
  • Experience with CI/CD pipelines and branch-based workflows (e.g., GitHub Flow).
  • Hands-on experience with cloud platforms (AWS, Azure).
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Understanding of secure coding practices for front-end applications.
  • Awareness of financial compliance standards (e.g., PCI-DSS).

What's on Offer

  • Opportunities to work with cutting-edge technology in the insurance industry.
  • Professional growth within a large organization in Kuala Lumpur.
  • Supportive and collaborative work environment.

This is a fantastic opportunity for a motivated System Reliability Engineer to contribute to a leading company in the insurance industry. If you are ready to take the next step in your career, apply today!

Contact

Consultant: Khatijah Mohamed Ismail
Quote job ref: JN-022026-6958299
Phone number: +60323024014

Job summary

Function: IT
Specialisation: Systems Administration
Area of specialisation: Insurance
Location: Kuala Lumpur
Contract Type: Permanent

Diversity & Inclusion at Michael Page

We don't just accept difference - we celebrate it. We encourage applicants from all backgrounds to apply for this role and are committed to building inclusive, diverse workplaces where everyone can thrive. If you require any support or reasonable adjustments during the recruitment process, please let us know.