Role Overview
The DevOps Engineering Manager, SRE and Cloud Services is responsible for leading the teams that ensure the reliability, scalability, and performance of IntegriChain鈥檚 cloud platforms and production systems. This role manages DevOps and Site Reliability Engineering functions, with a strong focus on cloud infrastructure, automation, and operational excellence.
You will work closely with application engineering, platform, security, and IT teams to support product delivery while maintaining high standards for availability, resilience, and security. This role balances people leadership with hands-on technical engagement and is critical to the success of our SaaS platforms in a healthcare and life sciences environment.
How a Day in This Role Looks
Your day typically starts with connecting to the team through daily standups or operational check-ins. You review system health, active work, incidents, and priorities, making sure the team is focused on what matters most and that risks are addressed early. You stay close to production systems through dashboards, alerts, and direct conversations with engineers.
Throughout the day, you work directly with DevOps, SRE, and application engineering teams to remove roadblocks and keep work moving forward. This may involve helping troubleshoot issues, guiding technical decisions, or coordinating across teams to resolve dependencies. You are regularly involved in design and architecture discussions, helping teams think through reliability, scalability, performance, and operational readiness.
Because the team operates across multiple time zones, you spend time coordinating work and maintaining clear communication across regions. You help establish shared processes, clear handoffs, and consistent expectations so work continues smoothly around the clock.
When incidents or operational challenges arise, you support response efforts, help coordinate resolution, and ensure follow-up actions are completed. Over time, you help turn recurring issues into lasting improvements by strengthening automation, cloud practices, and reliability standards.
Key Responsibilities
DevOps and SRE Leadership
Cloud and Platform Operations
Reliability and Operational Excellence
Architecture and Engineering Collaboration
Cross-Functional Collaboration