Senior Site Reliability Engineer (SRE) / DevOps Engineer

Overview

Seeking a Senior SRE / DevOps Engineer with deep AWS expertise and full-stack infrastructure troubleshooting experience. This role requires hands-on ownership of large-scale production environments, strong automation skills, and the ability to mentor junior engineers while maintaining high system reliability and performance.

Must Haves

10+ years of experience in Site Reliability Engineering / DevOps
Deep AWS expertise supporting large-scale production environments

Strong hands-on experience with: VPC, EC2, S3, Fargate, Lambda, CloudFront, ALB/ELB, IAM, RDS

Full-stack infrastructure troubleshooting across network, server, OS, application, database, storage, and IAM

Expert-level Linux server administration and web application support
Intermediate or higher Windows Server support experience
Strong Infrastructure-as-Code and scripting experience:

Terraform (required)
Proficiency in Python, Go, Perl, or JavaScript

CI/CD pipeline design, implementation, and support (GitLab CI, Jenkins, or similar)
Containerization experience (Docker required)
Monitoring and observability experience (Datadog, New Relic, CloudWatch, or similar)

Experience working with Git-based repositories (GitHub, GitLab)
Experience working with JSON and XML data formats

Nice to Have

Experience with Azure and/or GCP
Configuration management experience (Chef or similar tools)
Experience designing highly available, fault-tolerant architectures

Interested candidates may submit their resume online or call 310-906-4780 for further information about the position.

NS-SRED-NS_1771901776