Who You’ll Work With
Arista's Network Detection and Response (NDR) platform is a mission-critical security tool for our customers. Its reliability is paramount. We are hiring a mid-level, Customer Reliability Engineer (CRE) to join our team. This role is critical to the evolution of our customer-facing infrastructure and operational posture.
What You’ll Do:
This is not a traditional operations role. You will inherit a set of critical, manual, and hands-on operational responsibilities essential to our customers' success. We need you to help with the effort to systematically dismantle this operational burden through automation, tooling, and systems. You will have a collaborative team of excellent engineers to work with.
The short-term needs are: manual deployments, reactive troubleshooting, and on-call escalations. But we need you to help us build a system where programmatic solutions have replaced human intervention. You must have the pragmatism to manage the current reality and the systematic impatience and technical skill to build its replacement.
Success in this role requires a dual mindset. You must be a skilled incident leader who can stabilize a crisis and a deliberate systems architect who can prevent the next one. You will work closely with our internal tools, platform, and product engineering teams to channel your direct operational knowledge into durable, long-term solutions.
Your First Year and Beyond
Your work will follow a deliberate trajectory from reactive execution to proactive design.
Phase 1: Stabilize and Map - You will embed with the team, taking on the existing operational workload alongside the other customer SRE team members covering the USA and India time zones. This includes customer deployments, upgrades, and incident response. You will be expected to go on-site for our airgapped customers, occasionally, to assist on-prem deployments. Your initial goal is to achieve stability while mapping the landscape of our operational toil.
Phase 2: Automate and Influence - Armed with your map of toil, you will begin to automate. You will write code, build tooling, and deploy declarative infrastructure to eliminate the most critical operational burdens. For larger projects, you will act as a primary stakeholder, providing clear requirements to our internal tooling and platform teams and ensuring their solutions meet the operational need. Your success will be measured by a demonstrable reduction in the overall support effort, fewer pages, support escalations, and manual tasks.
Arista Networks
https://careers.smartrecruiters.com/AristaNetworks