Home/
Search jobs/
Lead Site Reliability Engineer ELK | Kuala Lumpur, W.P. Kuala Lumpur
Lead Site Reliability Engineer ELK
Back to job search
Back to job search

Kuala Lumpur , W.P. Kuala Lumpur

|

Hybrid

|

Direct hire

Kuala Lumpur, W.P. Kuala Lumpur

|

Hybrid

|

Direct hire

Job ID 8160|Posted Mar 25, 2026
JOB DESCRIPTION
Senior Test Automation Engineer
Kuala Lumpur, Malaysia


 About Horizontal: Established since 2003 in the US, Horizontal solves complex challenges across two distinct businesses: Horizontal Digital and Horizontal Talent. We are consistently recognized for being a top workplace and one of the fastest-growing private companies. Horizontal Talent specializes in staffing for IT, Digital & Creative, and Business & Strategy markets. We have global offices in US, UAE, India, Australia and Malaysia.

Job Summary:

Join our central DevOps Engineering Services organization, committed to reshaping the developer experience. As a Site Reliability Engineer, you'll be pivotal in crafting end-to-end delivery pipelines, ensuring seamless integration, deployment of infrastructure and software, and providing essential maintenance and support to our developer community. With a strong focus on real zero-trust strategies, problem-solving capabilities, and customer-oriented approaches, join us on our transformative journey.


Responsibilities:
Requirement:

What to expect:

  • Work through all phases of the system administration life cycle, including capacity planning, architecture design, compliance, deployment & configuration, monitoring, and incident management.
  • Develop automation scripts, infrastructure as code, and tooling using industry best practices to improve system reliability, reduce manual effort, and enable self-service.
  • Review system architectures design, deployment strategies, observability setups, and operational documentation to ensure reliability and operational excellence.
  • Analyze production issues, identify root causes, and implement long-term reliability improvements through automation, monitoring, and architectural enhancements.
  • Work collaboratively with other team members and provide guidance to more junior team members.
  • Organize an efficient handover through high quality documentation and training.
  • Automate the deployment and operation of multi-tenant infrastructure, handling tasks that ensure system resilience and availability.
  • Develop and maintain monitoring tools, dashboards, and self-healing mechanisms.
  • Participate in on-call rotations, conduct blameless postmortems, and drive continuous learning.
  • Work closely with developers, product teams, and engineering stakeholders to troubleshoot issues, improve systems, and integrate reliability improvements
  • Capable of providing accurate project estimates and strategically adapting plans throughout the project lifecycle.

What will make you successful?

  • Bachelor’s/master’s degree in engineering, Computer Science, IT, or equivalent experience.
  • Minimum 10 years of experience in Site Reliability Engineering or software development within an international company.
  • Minimum 2 years of experience leading project.
  • Familiarity or experience with data ingestion with big data technologies (Elastic Search, Logstash, Kibana and kafka).
  • Experience with CICD development & deployment tools such as Maven, Jenkins, Nexus, Git, and Docker.
  • Proficiency in Linux OS
  • Proficiency in scripting and automation (e.g. Python, PowerShell, YAML) with the ability to develop tools and infrastructure as code (Preferably Ansible, Terraform, Kubernetes, OpenShift).
  • Understanding of distributed systems and microservices architectures, including REST and SOAP APIs.
  • Hands-on experience with ITIL processes, including Incident, Problem, and Continual Improvement, is an advantage.
  • Experience working within an Agile-driven environment.
  • Practical experience in building metrics for data-driven reporting.
  • Strong interpersonal skills with a customer-centric mindset and ability to work effectively across diverse cultures.
  • Proven ability to collaborate with both local and remote teams across different time zones.
  • Familiarity with or experience in managing VM hosts using vCenter is an advantage.

Additional knowledge and experience in the following area will be an advantage:
The above description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.
Horizontal is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, veteran status, or any other protected characteristic under applicable law.

Horizontal is committed to taking affirmative action to employ and advance in employment qualified individuals with disabilities and protected veterans. If you are an individual with a disability and require a reasonable accommodation to complete any part of the application process or participate in the interview process, click here to request accommodation assistance.

All applicants applying must be legally authorized to work in the country of employment.

We're sorry!

Applications from candidates outside of our global target markets are currently not accepted through Job Board - but that could change in the future. We will update Job Board if we expand to your market. Do you have other candidate related questions? Contact us today.
Contact us
Return to home

We're sorry!

Applications outside of the assigned country are currently not accepted for this job - but that doesn't mean we can't work with you. Visit our Job Board to view the jobs available within your market.
View available jobs
Contact us

Access denied.

Users logged in to client accounts are unable to access Job Board. If you are interested in viewing and applying for open jobs, please log out of your current account. If you are a hiring manager interested in viewing candidates available to hire, click below to go to Talent Board.
Go to Talent Board
Discover Horizontal
Our expertise and practice areas
Our solutions
Our company
View our locations
Need a digital marketing solution? Visit our partner company Horizontal Digital. (Opens in a new tab)
Privacy Policy | Terms And Conditions | Accessibility | CA Privacy Notice |
©Horizontal Talent 2026
Horizontal Integration is an affirmative action and equal opportunity employer.
| Protect yourself from job scams
Badge
Horizontal is a minority-owned business that is certified by the National Minority Supplier Development Council (NMSDC).