Manage cloud operational support services (AWS, GCP, Azure, or SaaS) and data center colocation services, with a focus on monitoring.
Oversee and optimize service monitoring tools such as SolarWinds Orion, Microsoft System Center Operations Manager, and Nagios.
Lead and manage global 24/7 operations covering Red Hat Linux, Windows, VMware, EMC and NetApp storage, backups, and Cisco networking devices.
Implement and maintain Change and Production Control frameworks in line with ITIL v3, ensuring compliance with security best practices and audit requirements (SOX, PCI, etc.).
Build, influence, and motivate effective teams, managing both onsite and offshore technical personnel.
Conduct thorough root cause analysis to resolve complex infrastructure issues and drive service restoration during critical outages.
Negotiate with customers to establish service level agreements and manage vendor relationships and budgets.
Create and present infrastructure and operations (I&O) information to executive management, including data analysis and trend reports.
Coordinate and support disaster recovery procedures and assist in developing disaster recovery plans.
Drive continuous improvement initiatives and stay current with emerging technologies in cloud and infrastructure management.