Design, implement, and maintain Splunk-based monitoring and observability solutions across the enterprise.
Configure and optimize Splunk Enterprise, ITSI, APM, RUM, and Synthetic Monitoring to ensure accurate and actionable visibility into applications and infrastructure.
Develop and maintain custom dashboards, alerts, reports, and service health scores in ITSI for stakeholders including DevOps, SREs, and business units.
Integrate logs, metrics, traces, and real user data from a variety of platforms including cloud, on-prem, and hybrid environments.
Assist in the onboarding of data sources and develop efficient indexing and data retention strategies.
Collaborate with application, network, and infrastructure teams to define monitoring requirements and improve system performance and reliability.
Proactively identify system anomalies and performance bottlenecks using APM, RUM, and synthetic tests.
Develop automation scripts for alerting and response using Splunk SOAR or other automation tools (if applicable).
Stay up to date with the latest Splunk features and best practices and mentor junior team members.
Support troubleshooting, RCA, and incident response efforts using Splunk-based insights.
Required Qualifications:
3+ years of hands-on experience with Splunk Enterprise architecture, configuration, and administration.
2+ years of experience in Splunk ITSI, including KPI creation, service design, and correlation searches.
Proven experience in Splunk Observability, including:
Splunk APM (Application Performance Monitoring)
Real User Monitoring (RUM)
Synthetic Monitoring
Strong understanding of monitoring best practices, SRE principles, and DevOps workflows.
Experience with distributed systems, microservices, and monitoring in cloud environments (AWS, Azure, GCP).
Proficient in search processing language (SPL) and dashboard development.
Familiarity with data onboarding techniques (via UF, HF, or APIs).