Senior Site Reliability Engineer
Site Reliability Engineer responsible for application resiliency, reliability, security, incident recovery, automation, and service-level objective development.
Requirements
- Experience with site reliability engineering
- Experience with application resiliency, reliability, and security
- Experience anticipating application failure modes
- Experience automating failure recovery solutions
- Experience serving as Incident Commander for large or complex systems
- Experience solving complex resiliency, reliability, and security problems across multiple systems
- Experience developing Service Level Objectives across multiple parts of a system
- Experience evaluating and implementing software application enhancements
- Experience gathering metrics for software application cost, quality, performance, and security
- Experience creating support documentation
- Experience creating playbook documentation
- Experience leading postmortems for application or system incidents
- Experience with AWS Cloud Services
- Experience with Kubernetes
- Experience with Datadog
- Experience with Terraform
Preferred Skills
- Experience with Java
- Experience with Scala
- Experience with JavaScript
- Experience with .NET
- Experience with Go
- Experience with Python
- Knowledge of technical innovations in site reliability engineering, cloud infrastructure, automation, and software operations
Responsibilities
- Develop solutions for application resiliency, reliability, and security
- Anticipate product failure modes and automate recovery solutions
- Lead incident response as Incident Commander for large or complex systems
- Coordinate failure recovery for large or complex systems
- Lead resolution of division-wide technical problems involving resiliency, reliability, and security
- Identify and organize technical resources needed to resolve complex system issues
- Serve as technical advisor for reliability, resiliency, and security initiatives
- Lead development of Service Level Objectives across multiple parts of the system
- Evaluate and implement enhancement design solutions to improve cost, quality, performance, and security of software applications
- Evaluate and implement enhancement design solutions for application metrics collection
- Maintain knowledge of industry and technical innovations
- Create and maintain support and playbook documentation
- Collaborate with site reliability engineers and product team members to validate feature alignment with business needs
- Lead incident postmortems
- Document changes to automation, recovery processes, and operational documentation after incidents
Pay Details: $70.00 to $81.00 per hour
Benefit offerings available for our associates include medical, dental, vision, life insurance, short-term disability, additional voluntary benefits, EAP program, commuter benefits and a 401K plan. Our benefit offerings provide employees the flexibility to choose the type of coverage that meets their individual needs. In addition, our associates may be eligible for paid leave including Paid Sick Leave or any other paid leave required by Federal, State, or local law, as well as Holiday pay where applicable.
Equal Opportunity Employer/Veterans/Disabled
Military connected talent encouraged to apply
To read our Candidate Privacy Information Statement, which explains how we will use your information, please
The Company will consider qualified applicants with arrest and conviction records in accordance with federal, state, and local laws and/or security clearance requirements, including, as applicable:
- The California Fair Chance Act
- Los Angeles City Fair Chance Ordinance
- Los Angeles County Fair Chance Ordinance for Employers
- San Francisco Fair Chance Ordinance
Massachusetts Candidates Only: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
Equal Opportunity Employer/Veterans/Disabled
The Company will consider qualified applicants with arrest and conviction records.
-
Flex your reach.
When you work with us, you gain access to our expansive network of top companies that are searching for engineering and technical talent just like you.
-
Leverage our reputation.
Sometimes it's not about what you know, but who you know. And when you know us, you're getting your foot in the right doors, shaking the right hands, and landing in the right spots.
-
Let us go to bat for you.
We'll make sure your resume, interview techniques, and technical training and certification are in line to shine with your next potential employer. We know what they're looking for, and we know how to help you stand out.