Anticipated Contract End Date/Length: November 30, 2026
Work Set Up: Hybrid
Our client in the Information Technology and Services industry is looking for a SPLUNK Enterprise and ITSI Expert to design, deploy and operate Splunk Enterprise and ITSI within hybrid Kubernetes and OpenShift environments. This role will focus on scalable data onboarding, ITSI service modelling, observability design and performance optimisation while enforcing governance, security and cost guardrails.
The ideal candidate will bring deep SPL expertise, strong ITSI configuration experience and advanced knowledge of Kubernetes and OpenShift observability. This role requires hands-on capability across ingestion pipelines, reliability engineering, security controls and automation integrations within complex enterprise environments.
What you will do:
- Design, deploy and operate Splunk Enterprise and ITSI within hybrid Kubernetes and OpenShift environments.
- Onboard data at scale using HEC, Universal Forwarder and Deployment Server, align to CIM standards and enforce RBAC, retention and cost guardrails.
- Build ITSI service decompositions, KPIs, multi-KPI thresholds, NEAP policies, glass tables, deep dives and service health scoring models.
- Create OpenShift-focused executive and operational dashboards covering cluster health, node readiness, pod restart hotspots, network and storage errors and capacity visibility.
- Tune search and platform performance using workload rules, concurrency controls, DMA, summary indexing and scheduling optimisation.
- Implement alerting, enrichment and routing to ITSM and ChatOps platforms, including suppression windows, maintenance schedules and runbook automation.
- Govern data ingest and security controls including allow and deny lists, PII handling, TLS configuration, token governance, index and role mapping and data quality SLAs.
- Integrate upstream pipelines including OpenTelemetry, Prometheus exporters, Fluentd, Fluent Bit, Vector, Kafka, CMDB and ITSM enrichments and AIOps or ML anomaly detection capabilities.
- Align reliability practices to golden signals, SLO and KPI mapping and rollout and rollback health checks.
- Optimise license usage and cost controls through ingestion governance and workload management strategies.