Data Center Devops Manager

Details of the offer

Pure Storageis seeking an IT professional with strong leadership, management, communication, problem-solving, and technical skills to oversee day-to-day hardware deployment and repair systems operations. As a supervisor for the dynamic team of DevOps Engineers, you will lead efforts in deploying, troubleshooting, repairing, and ensuring the server fleet operates at its maximum capacity. You will also be responsible for monitoring the task queue to identify tickets that require escalation or additional expertise, driving efficient resolution.
As part of a Data Center DevOps team, you will oversee work in a local data center, interfacing directly with the datacenter operations team to ensure hardware is validated and returned to the production fleet post-repair. As an autonomous, proactive leader, you will take initiative to drive process improvements and represent the Infrastructure and Shared Services (ISS) team within the datacenter. This supervisory role requires a strong ability to mentor team members, coordinate workloads, and maintain high performance standards.
SHOULD YOU ACCEPT THIS CHALLENGE...

Oversee day-to-day server administration (physical, virtual), storage administration, network configuration, applications support, and health and performance monitoring. Ensure quick turnaround times, performance levels, availability, and security.
Lead the team in meeting service delivery SLAs, ensuring validation for storage, network, testing, and application-level operations.
Refine and automate commonly performed tasks to gain efficiencies and scalability using automation platforms such as Ansible or Puppet.
Document complex processes using tools such as Visio, LucidChart (preferred), Sunbird DCTrack DCIM (preferred), Microsoft Word, and Excel.
Use ticketing systems like Jira to track team issues and document changes, ensuring a clear and efficient resolution process.
Provide technical mentorship and cross-training for team members, sharing domain expertise and fostering growth.
Deploy infrastructure manually and via configuration management/automation platforms while ensuring proper team execution.
Monitor and manage infrastructure utilization and performance across the team.
Supervise troubleshooting of hardware, software, and network-related issues, ensuring quick resolution and root cause analysis.
Communicate and coordinate the infrastructure maintenance schedule and downtime to users, while ensuring team alignment.
Guide implementation of infrastructure solutions to meet or exceed internal and external SLAs and expectations.
Participate in software development processes as necessary, offering operational infrastructure designs and recommendations.

WHAT YOU'LL NEED TO BRING TO THIS ROLE...

Strong hands-on working knowledge of Linux.
Scripting skills in Python or Ansible programming are desirable.
Deep understanding of networking and storage infrastructure.
Proven experience managing a Jira ticket queue, troubleshooting hardware failure issues, and overseeing team execution of repair and validation processes.
Ability to manage day-to-day operations of the DC DevOps team, including workload distribution and performance management.
Familiarity with identifying basic code-related problems in validation processes and escalating to appropriate teams.
Strong communication skills to collaborate with datacenter operations teams and resolve hardware issues.
Ability to maintain and contribute to technical documentation, troubleshooting manuals, and run books.
Commitment to continuously review, learn, and understand internal services and tools relevant to IT workflows.
Proven ability to mentor and train team members, fostering a culture of continuous improvement.
Experience working with hardware from vendors such as Cisco, Brocade, and Supermicro.
Resourcefulness and problem-solving aptitude.

GOOD TO HAVE...

Basic knowledge of Pure Storage products like Flash Array and Flash Blade.
Working knowledge of Windows administration.
Familiarity with automated booting in a Linux environment.
Experience with automation tools such as Jenkins or Docker.
Proficiency with the Google Suite, including Sheets, Docs, etc.
Familiarity with ESXi/VMware.
Education in Computer Science or Computer Hardware Engineering, or at least five years in the industry.
Excellent interpersonal and teamwork skills, with strong written and verbal communication.
Attention to detail and a well-organized, self-starter attitude.
Ability to accept constructive criticism and improve.
Capacity to explain complex hardware issues, configurations, and BIOS settings effectively.
Exceptional problem-solving skills, especially for server hardware.
Proven ability to take ownership of issues, managing them from escalation to resolution.
Demonstrated experience as a DevOps Engineer, System Operator, Administrator, or similar role.

We are primarily an in-office environment and therefore, you will be expected to work from Pure's Sunnyvale Data Center office in compliance with Pure's policies, unless you are on PTO, or work travel, or other approved leave.The annual base salary range is: $163,000 – $246,000. Salary ranges are determined based on role, level and location. For positions open to candidates in multiple geographical locations, the base salary range is reflective of the labor market across the applicable locations.  This role may be eligible for incentive pay and/or equity.  And because we understand the value of bringing your full and best self to work, we offer a variety of perks to manage a healthy balance, including flexible time off, wellness resources, and company-sponsored team events - check out purebenefits.com for more information. 
#LI-HYBRID, #LI-ONSITE, #LI-BR


Nominal Salary: To be agreed

Source: Greenhouse

Requirements

Enterprise Architect, Information Architecture

Keurig Dr Pepper is seeking an Enterprise Architect, Information Architecture who can play an instrumental role in driving our data-driven business transform...


Keurig Dr Pepper - California

Published 14 days ago

Machine Learning

W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to appl...


Ethereum Technologies Llc - California

Published 14 days ago

Software Engineer - Sensors

Hivemapper is a decentralized global mapping data network built by 10s of thousands of people. 2024 will see us 10x the number of contributors to the network...


Hive - California

Published 14 days ago

Staff Software Engineer - Incident Management San Francisco

Staff Software Engineer - Incident ManagementPagerDuty empowers teams of all kinds to do the critical work that moves business forward through the PagerDuty ...


Pager - California

Published 14 days ago

Built at: 2024-12-18T08:53:31.709Z