Site Reliability Engineer

Atlanta, Georgia | Onsite | Contract | Job ID 67705 | Posted Aug 7, 2024
JOB DESCRIPTION
Our client is seeking a dedicated Site Reliability Engineer to join their dynamic team. This role is ideal for a professional with a deep understanding of AWS services, infrastructure as code, and a knack for troubleshooting complex infrastructure and application issues.

Key Responsibilities:
- Implement and improve monitoring, alerting, and logging solutions to detect and respond to incidents
- Collaborate closely with the development team to deploy applications and services, ensuring they meet reliability and performance standards
- Automate deployment, configuration management, and troubleshooting processes to streamline operations
- Participate in the on-call rotation, triage production incidents, lead root cause analyses (RCAs), and implement preventive actions

Qualifications:
- Proficiency in AWS services (Lambda, S3, SQS, IAM, Route 53, etc.) and infrastructure as code (e.g., Terraform, CloudFormation)
- Hands-on experience with monitoring tools such as CloudWatch, Sumo Logic, Dynatrace, Grafana, or similar
- Proficiency in scripting and automation (e.g., Python, Bash)
- Experience with containerization (Docker, Kubernetes) and serverless architecture (AWS Lambda)
- Strong analytical and troubleshooting skills

Desired Skillset:
- Deep understanding of the operations of AWS cloud platforms
- Experience with CI/CD tools (GitLab, GitHub, Jenkins, Maven, Gradle, Nexus)
- Working experience with Software Release Management
- BS degree in Computer Science or a related technical field or equivalent practical experience
- 3+ years of related DevOps or SysOps engineering experience with a focus on major cloud platforms (AWS preferred)
- 2+ years of application development experience, including data streaming and deploying/monitoring high-availability, critical application components
- 1+ years in a Site Reliability Engineering organization preferred

As a Site Reliability Engineer with our client's team, you will be at the forefront of Cloud and Big Data technology. This role will serve as the escalation point for complex and hard-to-define issues in both on-premises and AWS environments. We are seeking talented engineers who are well versed in DevOps technologies, automation, infrastructure orchestration, configuration management, and continuous integration.

Horizontal is proud to be an Equal Opportunity and Affirmative Action Employer. We seek to provide employment opportunities to talented, qualified candidates regardless of race, color, sex/gender including gender identity and/or expression, national origin, religion, sexual orientation, disability, marital status, citizenship status, veteran status, or any other protected classification under federal, state, or local law.

In addition, Horizontal will provide reasonable accommodations for qualified individuals with disabilities. If you need to request a reasonable accommodation in order to complete the application or interview process, please contact us.

All applicants must be legally authorized to work in the country of employment.