Lead Observability & Platform Engineer
Bangsar South, W.P. Kuala Lumpur | Hybrid | Direct hire
Job ID 8044|Posted Mar 25, 2026
JOB DESCRIPTION
Lead Observability & Platform Engineer
Location: Kuala Lumpur, Malaysia
About Horizontal: Established in 2003 in the US, Horizontal solves complex challenges across two distinct businesses: Horizontal Digital and Horizontal Talent. We are consistently recognized as a top workplace and one of the fastest-growing private companies. Horizontal Talent specializes in staffing for the IT, Digital & Creative, and Business & Strategy markets. We have offices in the US, UAE, India, Malaysia, and Australia.
About the Role: In line with testing standards and procedures, under the guidance of management, and within the operating plan and allocated budget, you will lead, plan, and participate in highly complex qualification and acceptance testing of the hardware and software components of the Company’s networks to ensure product quality. You will also act as a technical lead on qualification projects.
We are seeking a highly skilled and motivated Lead Engineer to drive the evolution of our Observability Platform, focusing on ELK stack products. In this pivotal role, you will be responsible for designing, implementing, and continuously improving our observability and monitoring capabilities, ensuring systems remain scalable and meet global standards. You will lead the vision and strategy for CI/CD processes across development, release, and deployment lifecycles, taking ownership of building and optimizing automated CI/CD pipelines and release orchestration. Your expertise will enable the delivery of high-quality, reliable software at speed while maintaining performance at scale.
Key Responsibilities:
Co-lead the ELK platform
- Architect, deploy, and scale Elasticsearch clusters (hot-warm-cold tiers, ILM, snapshot/restore).
- Design ingestion with Logstash/Beats/Elastic Agent; standardize parsing (grok processors), mappings, and templates.
- Implement APM/tracing (e.g., OpenTelemetry) and align logs/metrics/traces for end-to-end visibility.
- Build dashboards (Kibana) and analyst-friendly views; automate alerting and anomaly detection.
- Govern access (RBAC), encryption, masking; manage capacity & cost efficiency (index strategies, routing).
- Own CI/CD pipelines end-to-end (build, test, package, scan, deploy, rollback).
- Create pipelines for infra and content (Ansible for infra; version dashboards, index templates, alert rules).
- Apply a sound understanding of Git branching strategies and how they relate to CI/CD.
- Implement automated release/deployment orchestration.
- Mentor engineers; lead technical design reviews and roadmap planning.
- Collaborate with Security/Compliance for security, auditability and data governance.
- Advocate observability standards across engineering (structured logging, trace context propagation).
Key Requirements:
- 12+ years in software development, including data/platform/DevOps engineering; 3+ years with ELK at scale (hot-warm-cold, ILM, index templates).
- University degree in Computer Science, Software Engineering, or equivalent.
- Experience with data ingestion using big data technologies (Elasticsearch, Logstash, Kibana, Kafka, or any other message queue system).
- Experience with Elasticsearch internals (shards/replicas, ILM, index patterns, snapshots, query tuning).
- Knowledge of observability concepts (APM, traces, metrics, logs), alerting strategies, and on-call operations.
- Experience in at least one programming language (Java or Python) and a good understanding of OOP.
- Experience with relational databases (e.g., Oracle, MySQL) or NoSQL databases.
- Experience with DevOps CI/CD tools (CloudBees / Jenkins / OpenShift / Ansible / Docker).
- Knowledge of DevOps principles and operating model, including support and maintenance of delivered products such as Kibana and Kafka.
- Proficiency in both Windows and Linux OS.
- Experience working within an Agile-driven environment.
- Strong interpersonal skills with a customer-centric mindset and ability to work effectively across diverse cultures.
- Proven ability to collaborate with both local and remote teams across different time zones.
- Able to build strong partnerships with internal customers and other delivery organizations.
- Familiarity with ITIL processes (Incident Management, Problem Management, and Continual Improvement) for managing issues and improving service delivery in a structured way.
- Experience in API benchmarking with tools such as JMeter or Postman is a plus.
- Experience with SOA technologies such as microservices, REST, and SOAP is essential.
- Knowledge of Atlassian products (Jira/Confluence/…) is an advantage.
- Knowledge of ServiceNow is an advantage.
The above description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.
Horizontal is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, veteran status, or any other protected characteristic under applicable law.
Horizontal is committed to taking affirmative action to employ and advance in employment qualified individuals with disabilities and protected veterans. If you are an individual with a disability and require a reasonable accommodation to complete any part of the application process or participate in the interview process, click here to request accommodation assistance.
All applicants applying must be legally authorized to work in the country of employment.