Senior Site Reliability Engineer

Senior Site Reliability Engineer
Company:

Sparibis


Details of the offer

Senior Site Reliability Engineer Location Remote in Washington, DC : Location: 100% remote Years' Experience: 10+ Year's of experience Education: Bachelor's degree Work Authorization: United States Citizenship is required as part of the eligibility criteria to be able to obtain a security clearance. Clearance: Applicants must be able to obtain and maintain a Public Trust security clearance. Key Skills: Must experience serving as a SRE Prior leadership and experience with leading a team Deep understanding of SRE principles for highly scalable and reliable systems. Configuration Management and Infrastructure as Code expertise Responsibilities Responsible for incident response, monitoring, alerting, triaging and closing of real problems Ensure platform stability and availability Responsible for the metrics reporting and tracking, evaluation of proper function, support to the teams for enhance performance Design and implement end-to-end continuous delivery pipelines. Leverage extensive AWS cloud experience in a production environment (e.g., network, security, deployment, automation, serverless technologies). Utilize a deep understanding of SRE principles for highly scalable and reliable systems. Leverage extensive experience with Configuration Management and Infrastructure as Code. Works with application teams to document application internal/external interface requirements for Development, Testing, Staging and Production environments Works with application teams to ensure compliance with High Availability and Disaster Recovery related concept of operations. Build service level requirements for SLA's Implements middleware application specific requirements as needed Implements migration efforts with application teams, including data migration Serve as a thought leader for agile development teams. Establish clarity of direction and a shared vision of success that is championed by team members, stakeholders, and product owners. Build relationships, and work in collaboration with team members, stakeholders, product owners, and technical team leads. Help enhance processes, communication, and delivery through new norms that improve how work is done — from discovery to delivery. Provides technical guidance to application teams to take advantage of cloud technologies, and implement cloud infrastructure, as needed. Qualifications 10+ years of software engineering and DevOps experience Bachelor degree or higher education required Must be able to obtain and maintain a Public Trust security clearance Must have experience with highly scalable and reliable systems by implementing and maintaining processes and tools Incident response, monitoring performance and releases, alerting, and triaging expertise ServiceNow, AWS Insight, Splunk, VictorOPS, CloudWatch, New Relic, and Confluence expertise preferred Experience in designing and implementing end-to-end continuous delivery pipelines. A deep AWS cloud experience in a production environment (e.g., network, security, deployment, automation, serverless technologies). Experience and understanding in SRE principles for highly scalable and reliable systems. A strong experience with Configuration Management and Infrastructure as a Code. Experience designing and implementing end to end CI/CD pipelines AWS Cloud experience in the production environment (ie. network, security, deployment, automation, serverless technologies) Experience designing and building web application environments on AWS including services such as EC2, S3, Lambda, ELB, ECS etc. Experience in deploying of the cloud resources using IaC tools like Terraform. Experience with monitoring and logging tools such as Cloud Watch, App Dynamics and Splunk. Create CloudWatch rules to capture the apps alerts and send notifications Previous experience migrating application teams from on-prem to cloud infrastructure (AWS, Azure) preferred. Experience with CI/CD frameworks (ie. Jenkins, Docker, Ansible, Chef, Puppet, Git) Experience in at least one automation and scripting tool experience (ie. Bash, Python, Shell, Perl) Experience in designing and building of CIFS and NFS on-premises File share migration using AWS Datasync and VPC endpoints to AWS storage services S3, EFS or FSx. Experience in creating build plans for AWS deployment by listing out compute resources, Security groups, LB, target group, NACL and all other components for various environments (Dev, TQA, and Prod etc.) Experience maintaining and administering configuration management systems such as Enterprise GitHub. Experience maintaining and administering software build systems such as Jenkins. Experience maintaining and administering artifact repository systems such as Artifactory. Ability to automate workflows through scripting or other technologies such as Ansible or Puppet. Expertise in Agile and DevSecOps approaches About Sparibis Sparibis LLC is a professional solution firm that Clients rely on to access the best talent to drive their business success. Sparibis is an equal opportunity employer that values diversity at all levels. All individuals, regardless of personal characteristics, are encouraged to apply.


Source: Grabsjobs_Co

Job Function:

Requirements

Senior Site Reliability Engineer
Company:

Sparibis


Senior Validation Engineer

Position Description:  The CQV Engineer develops the documentation to support Commissioning, Qualifications, and Validation. These people are responsible for...


From Cai - Distrito de Columbia

Published 15 days ago

Ai/Ml Engineer (Midlevel)

AI/ML Engineer (Mid-Level) Location: Washington, DC (Hybrid) Required: US Citizens Node. Digital is an innovative solutions development company committed tha...


From Node.Digital - Distrito de Columbia

Published 15 days ago

Applications Engineer

Are you ready to put your skills to work in a dynamic and growing company? Are you passionate about technology and want to see video, imagery, and data come ...


From Planar Systems - Distrito de Columbia

Published 15 days ago

Senior Service Technician

When you join the Allied Universal® Technology Services, you are joining one of the fastest growing security systems integrators in North America. Build your...


From Allied Universal - Distrito de Columbia

Published 15 days ago

Built at: 2024-05-17T09:32:00.166Z