This role will be based in Bangalore, India.
At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.
At LinkedIn, the Productivity Engineering Site Reliability Engineering (SRE) team plays a critical role in ensuring our enterprise business applications are reliable, scalable, secure, and highly automated.
We are seeking a Senior Manager, Site Reliability Engineering, to lead a high-performing team of SREs, software engineers, enterprise engineers, and test automation engineers responsible for system health, observability, and operational excellence across both development and production environments.
In this role, you will partner closely with Development and Test Automation teams from early design through production, driving improvements in reliability, performance, and scalability across complex application ecosystems. You will also collaborate with cross-functional infrastructure teams to scale and modernize financial systems infrastructure.
You will lead strategic initiatives across application, database, and middleware platforms, including performance optimization and the transformation of systems from on-premises environments to modern multi-cloud architectures.
This is a key leadership opportunity for someone passionate about building high-performing teams, driving automation at scale, and delivering resilient, efficient platforms that power mission-critical business operations.
Responsibilities:
Build, lead, and scale a high-performing SRE organization, including hiring, mentoring, and organizational development
Act as a role model and coach with a strong bias for action, engineering craftsmanship, and operational excellence
Partner with senior leadership to define and drive the long-term technology vision, strategy, and roadmap aligned with business priorities
Establish and foster a culture of ownership, accountability, continuous improvement, and high operational standards
Collaborate closely with cross-functional partners across development, infrastructure, testing, and business teams to drive impactful roadmaps
Influence and align senior stakeholders across engineering, infrastructure, and business domains
Own availability, reliability, performance, and scalability of enterprise business applications and financial systems
Define and implement SRE best practices, including SLOs, SLAs, error budgets, incident management, and operational frameworks
Lead end-to-end incident response, root cause analysis, and long-term remediation strategies to improve system resilience
Drive operational maturity through metrics, observability, automation, and continuous improvement initiatives
Oversee application, database, and middleware platform performance, reliability, and capacity planning
Lead modernization efforts, including migration from legacy environments to modern infrastructure
Evaluate and implement new technologies and architectural patterns to improve scalability, resilience, and efficiency
Define and drive observability strategy across monitoring, logging, tracing, and alerting systems
Champion an automation-first mindset to eliminate manual processes and improve operational efficiency
Drive development of internal tools and self-service platforms to enhance engineering productivity and reduce operational overhead
Improve deployment, release, and operational workflows through engineering-led automation and standardization
Own infrastructure cost management, capacity planning, and financial forecasting for financial systems
Optimize infrastructure and licensing investments (e.g., the Oracle ecosystem) in line with business and financial goals