Site Reliability Engineer - Apptio

Details of the offer

You:
You are passionate about observability, automation, and reliability. Your team can count on you to deliver creative and inventive solutions to hard problems. You are comfortable working with developers, senior leadership, and non-technical individuals to help provide value to the broader organization. You take opportunities to fix problems, mentor your peers, and step outside your comfort zone to develop your skillset.
Us:
Apptio Targetprocess empowers businesses to adopt and scale agile across the enterprise. We develop an Agile tool that connects teams, products, and portfolios to business objectives using SAFe, LeSS, and other Agile frameworks. In the 2021 Gartner Magic Quadrant for Enterprise Agile Planning Tools report, Targetprocess, recently acquired by Apptio, was recognized as a "Leader".
SRE Team:
The Apptio Targetprocess SRE team's main responsibility is to make sure that the company's infrastructure and applications run smoothly and stably. We count on our site reliability engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance so they can pursue their missions. That mostly means working proactively on system reliability: preventing outages, keeping an eye on key metrics, taking urgent mitigation measures when needed, and assisting other teams on infrastructure-related topics.
On a typical day in this role, you will interact with Kubernetes, Docker, Helm, Elasticsearch, DataDog, Grafana, Sensu, Puppet, Ansible/AWX, AWS, Azure, Python/Bash/PowerShell, and Terraform/Terragrunt. If you don't know all of these tools, don't worry: we don't expect you to know them all, and we understand that technology evolves quickly.
Major Responsibilities:
Scale systems sustainably through mechanisms like automation.
Own the monitoring system.
Maintain services in production by measuring and monitoring availability, latency, and overall system health.
Application expansion and horizontal scaling.
Work closely with developers, support, and QA teams to maintain and improve the whole lifecycle of services.
Practice sustainable incident response and blameless post-mortems.
Provide primary operational support and engineering for multiple large distributed software applications.

Nominal Salary: To be agreed

Source: Brassring

Job Function:

Engineering

Requirements
