We are seeking a skilled MuleSoft Production Support Engineer to support and maintain enterprise integration solutions built on MuleSoft Anypoint Platform. The ideal candidate will have strong experience in monitoring, troubleshooting, and incident management, and in ensuring the high availability and performance of APIs and integrations in production environments. The role focuses on operational excellence, stability, and continuous improvement of deployed integrations.
Key Responsibilities
- Provide L2/L3 production support for MuleSoft APIs and integrations deployed on CloudHub / Runtime Fabric.
- Monitor Mule applications and APIs using Anypoint Monitoring, Runtime Manager, CloudHub logs, and Splunk.
- Troubleshoot and resolve production incidents, defects, and performance issues within SLA timelines.
- Perform root cause analysis (RCA) and implement preventive fixes.
- Manage incident, problem, and change tickets through ticketing tools (ServiceNow / Jira).
- Support API policies, certificates, keystores, and platform configurations in Anypoint Platform.
- Handle deployments, hotfix releases, and rollback activities in production and lower environments.
- Analyze logs, thread dumps, heap dumps, and integration flows to identify failures.
- Monitor message queues, batch jobs, schedulers, and integrations with external systems.
- Ensure uptime, reliability, and performance of MuleSoft integrations.
- Implement and maintain alerting, dashboards, and health checks.
- Coordinate with development, DevOps, infrastructure, and business teams during incidents.
- Support security configurations (OAuth, TLS, certificates, client ID enforcement).
- Document known issues, runbooks, SOPs, and recovery procedures.
- Participate in on-call support and in weekend releases and maintenance windows.
- Perform capacity monitoring and recommend scaling improvements.
- Support Anypoint Platform administration tasks (alerts, VPC, VPN, certificates).