To Apply for this Job Click Here
Senior Site Reliability Engineer (SRE) / DevOps Engineer
Overview
Seeking a Senior SRE / DevOps Engineer with deep AWS expertise and full-stack infrastructure troubleshooting experience. This role requires hands-on ownership of large-scale production environments, strong automation skills, and the ability to mentor junior engineers while maintaining high system reliability and performance.
Must Haves
- 10+ years of experience in Site Reliability Engineering / DevOps
- Deep AWS expertise supporting large-scale, production environments
- Strong hands-on experience with: VPC, EC2, S3, Fargate, Lambda, CloudFront, ALB/ELB, IAM, RDS
- Full-stack infrastructure troubleshooting across:
- Network, Server, OS, Application, Database, Storage, IAM
- Expert-level Linux server administration and web application support
- Intermediate or higher Windows Server support experience
- Strong Infrastructure-as-Code and scripting experience:
- Terraform (required)
- Proficiency in Python, Go, Perl, or JavaScript
- CI/CD pipeline design, implementation, and support (GitLab CI, Jenkins, or similar)
- Containerization experience (Docker required)
- Monitoring and observability experience:
- Datadog, New Relic, CloudWatch, or similar
- Experience working with Git-based repositories (GitHub, GitLab)
- Experience working with JSON and XML data formats
Nice to Have
- Experience with Azure and/or GCP
- Configuration management experience (Chef or similar tools)
- Experience designing highly available, fault-tolerant architectures
Interested candidates may submit their resume online or call at 310-906-4780 for further information regarding the position.
NS-SRED-NS_1771901776
