0% found this document useful (0 votes)
353 views34 pages

Splunk ITSI Implementation Guide

- The document provides an agenda for Day 1 of a Splunk development training course. It includes an introduction to machine data, an overview of why Splunk is needed, and the core concepts of Splunk including what it is, its benefits and features, architecture and components. - The agenda also covers introducing the instructor, discussing machine data and use cases for Splunk, as well as a hands-on lab installing and starting Splunk.

Uploaded by

Loga K
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
353 views34 pages

Splunk ITSI Implementation Guide

- The document provides an agenda for Day 1 of a Splunk development training course. It includes an introduction to machine data, an overview of why Splunk is needed, and the core concepts of Splunk including what it is, its benefits and features, architecture and components. - The agenda also covers introducing the instructor, discussing machine data and use cases for Splunk, as well as a hands-on lab installing and starting Splunk.

Uploaded by

Loga K
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
  • Agenda
  • Introduction
  • About Me
  • Data Everywhere
  • Why We Need Splunk
  • Introduction to Splunk
  • Benefits and Features of Splunk
  • Use Cases
  • Splunk Architecture and Components
  • Lab

Splunk

Development
Day 1
- Vikram Yadav (VY)

Your attendance & progress of the course will be calculated based on


the completion of the assignments and quiz.
Agenda – Day 1
• Machine Data
• Why we need a Splunk?
• Introduction to Splunk
• What is Splunk?
• Benefits of using Splunk?
• Features & Use cases
• Architecture and components
• Lab
About me !
• 3+ years of Splunk experience

• Splunk Developer & Administrator at Avotrix

• Enterprise Security & ITSI

• IOT & AWS

• Leader of Splunk Mumbai User Group

• Watching movies, learning new technology

- thevikramyadav
Data Everywhere

• What is Machine data?

• How big it is?


Why do we need Splunk ?
Introduction to Splunk
• What is Splunk?
• Splunk is a software platform to search, analyze and visualize the machine-generated
data gathered from the websites, applications, sensors, devices etc. which make up
your IT infrastructure and business.

• Big Data Tool, Log Analytics Platform

• Machine-generated data is information automatically generated by a computer


process, application, or other mechanism without the active intervention of a human
• Complex to understand
• In an unstructured format
• Not suitable for making analysis / visualization
Got Data?
Visuals - Reports and Dashborads
Introduction to Splunk
Splunk in Various Area

Network QoS & KPI Analytics


Android Logs Analytics
Operation Intelligence & Visualization
Mobile Device Product Performance Visibility
Customer Loyalty & Churn Analysis
Field Test Data Analytics
Network Planning & Optimization
QXDM Logs Analysis
Fault Prediction & Alerts

Media &
Network
Entertainment
Infrastructure

Performance Monitoring
Operational Health Visualization STB & CMTS Process Logs Analytics
Security Intelligence Content & Search Analytics
Storage &
Alert Monitoring Customer Usage Pattern Insights
Computing
Resource Utilization Analytics
Benefits of Using Splunk
• Your input data can be in any format for e.g. .csv, or json or other human readable
formats
• You can configure Splunk to give Alerts / Events notification at the onset of a machine
state
• You can accurately predict the resources needed for scaling up the infrastructure
• You can create knowledge objects for Operational Intelligence
• No need of Data Mapping or any structure specific conversion
• Works on any Machine Generated Data
• One Tool for Data Uploading, Data Searching & Visualization
• Supports Machine Learning, Prediction & Forecasting
• On Premise & Cloud Solution
Splunk Features
Analyze system
Analyze system
performance
performance
Store and retrieve data for Troubleshoot any failure
Store and retrieve
use data for Troubleshoot
later conditionany failure
later use condition

SPLUNK
SPLUNK
Create dashboard to
Create dashboard to Monitor business metrics
visualize & analyze results Monitor business metrics
visualize & analyze results

Search and investigate


Search andoutcome
particular investigate
particular outcome
Use Cases 1: Hospital Service
• Reporting health conditions in real time

• Delve deeper into the patient’s health record and


analyse patterns

• Alarms / Alerts to both the doctor and patient when


the patient’s health degrades
Use Case 2 : Data Electrified - Porsche
Use Case 3: Dominos Story
• Lot of unstructured data was generated because:
• They had an omni-channel presence for driving sales
• They had a huge customer base
• They had several touch points for customer service
• They provided multiple systems for delivery: Order food in-store, order via
telephone, via  their website and through cross-platform mobile applications
• They upgraded their mobile apps with a new tool to support ‘voice ordering’ and
enable tracking of their orders

• The excess data generated gave rise to the following problems:


• Manual searches being tedious and error prone
• Less visibility into how customer need/preference varies
• Unpreparedness and thus working in reactive mode to fix any problem
Dominos Story
Business Scenario

• Domino’s had no clear visibility into which offer works best – in


terms of:
• Offer type (Whether their customers preferred a 10% discount or a
flat $2 discount?)
• Cultural differences at a regional level (Do cultural differences play a
role in offer choice?)
• Device used for buying products (Do devices used for ordering play a
role in offer choices?)
• Time of Purchase (What is the best time for the order to be live?)
• Order revenue (Will offer response change wrt to order revenue size?)
Introduction to Splunk
Indexers

Search head
Offline orders

Mobile orders

Forwarder
Different Stages In Data Pipeline
Data Processing Stages

• Different Stages In Data Pipeline with respective Splunk


Components
• There are 3 main components in Splunk:
• Splunk Forwarder
• Splunk Indexer
• Search Head

DATA INPUT PARSING INDEXING SEARCHING

Forwarder Indexer Search Head


 Architecture and components

• Universal Forwarder

• Heavy Forwarder
Heavy Forwarder

Indexer
Heavy Forwarder

DATA Parsing Indexing Routing


Architecture and components

• Splunk Indexer
Architecture and components

• Splunk Indexer ( Event Processing)


• Separating the data stream into individual, searchable events
• Creating or identifying timestamps
• Extracting fields such as host, source, and sourcetype
• Performing user-defined actions on the incoming data
• identifying custom fields
• masking sensitive data
• writing new or modified keys
• applying breaking rules for multi-line events
• filtering unwanted events
• routing events to specified indexes or servers
Architecture and components

• Splunk Search Head


Architecture and components

Universal Forwarder Heavy Forwarder Indexer Search Head


Architecture and components
Example of Splunk Deployment
Splunk Architecture - simple
Distributed Search and
Summary Indexing Tier

Indexing Tier
License Capacity: 500 MB/day … x5

Forwarders or
Forwarding Tier

Data Sources

desktops laptops servers/VMs proxy applications syslog firewall config

28
Architecture and components
Key components of Splunk

One Splunk
Many Uses
Organizations Using Splunk
Products of splunk

Core Products
• Splunk Enterprise
• Splunk Cloud
• Splunk Light

Premium Application
• Splunk Enterprise Security (ES)
• Splunk IT Service Intelligence(ITSI)
• Splunk User Behavior Analytics(UBA)
LAB
• Download Splunk Package
• Install Splunk in Windows/MAC Env
• Operations: Start, Stop and Restart

Please submit an email with a screenshot of the assignment before the commencement of the next class, Also in order to
move forward with the course, it is important to complete this assignment. If you are unable to complete the assignment or
need any assistance regarding the same, please email me at vikram@[Link]
Question & Answer.....

Common questions

Powered by AI

Splunk's architecture is designed to accept data in various formats, such as .csv, JSON, and other human-readable forms, without needing prior structuring or conversion . It uses forwarders to ingest data, indexes it for efficient search, and applies user-defined rules to parse and extract necessary fields . By processing the raw data directly, Splunk can seamlessly convert it into searchable events, allowing for immediate analysis and visualization. This capability eliminates the need for cumbersome data transformation processes typically required by other analytics tools .

In Splunk's architecture, the Search Head is responsible for performing searches and generating reports and dashboards that visualize data for analysis . It acts as the interface for users to query indexed data stored in the Splunk Indexer, facilitating data exploration and insight generation . The Search Head also manages user authentication and authorization, ensuring secure access to sensitive data. By enabling complex searches and rich visualizations, the Search Head plays a crucial role in delivering actionable insights to stakeholders using Splunk .

In healthcare, Splunk is used to report health conditions in real-time and analyze patient records to identify patterns . It can trigger alarms or alerts to both doctors and patients if a patient's health condition deteriorates, ensuring proactive healthcare management . By analyzing health data, Splunk enhances service delivery by identifying critical issues early and enabling timely interventions, thereby improving patient outcomes and optimizing healthcare operations .

The main components of Splunk architecture include the Splunk Forwarder, Splunk Indexer, and Search Head. The Splunk Forwarder collects data from various sources and forwards it to the Indexer . The Splunk Indexer processes the data by parsing, indexing, and transforming it into searchable events, while extracting useful information like timestamps and fields . The Search Head enables users to search, visualize, and analyze the data through dashboards and reports . These components work in tandem to handle and process machine-generated data efficiently in a Splunk environment .

Splunk's resource utilization analytics provide insights into infrastructure performance by monitoring business metrics and system performance in real-time . It can predict resource needs, enabling IT managers to scale infrastructure proactively before demand exceeds capacity . By identifying bottlenecks and inefficiencies, Splunk aids in optimizing resource allocation and enhancing overall IT management. Its ability to store and retrieve data for future analysis ensures continuous monitoring and improvement of IT operations .

Splunk architecture processes data through multiple components: the Splunk Forwarder, Splunk Indexer, and Search Head . The data input is received by forwarders, which parse, index, and route the data through heavy or universal forwarders . The Splunk Indexer is crucial for event processing, as it separates data streams into individual, searchable events, timestamps them, and extracts fields such as host, source, and sourcetype . It also performs user-defined actions, masks sensitive data, and applies breaking rules for multi-line events . Finally, the Search Head enables data searching and reporting .

Splunk enhances operational intelligence through its ability to analyze machine-generated data from various sources, such as websites, applications, sensors, and devices . It enables users to monitor business metrics, troubleshoot failures, and analyze system performance in real-time. By creating dashboards for visualization, Splunk allows businesses to gain insights into their operations more efficiently . Moreover, with machine learning capabilities, Splunk facilitates predictive analytics to forecast potential issues, optimize resource utilization, and improve data-driven decision-making .

Domino’s faced challenges from unstructured data due to its omni-channel presence, large customer base, and various customer service touchpoints . The complexity of manually searching unstructured data led to error-prone processes and limited visibility into customer preferences, resulting in reactive problem-solving . Using Splunk, Domino’s could automate data parsing, indexing, and search processes, thus improving data visibility and accuracy. Splunk's ability to create dashboards allows for better monitoring of customer behavior and preferences, enabling data-driven decisions to enhance customer service and operational efficiency .

Splunk offers numerous benefits for handling machine-generated data. It supports input data in any format, such as .csv or JSON, and eliminates the need for data mapping or conversion to a specific structure . Splunk accurately predicts the resources needed for scaling up infrastructure and creates knowledge objects for operational intelligence . Additionally, it is a versatile tool for uploading, searching, and visualizing data, supporting machine learning, prediction, and forecasting . It offers both on-premise and cloud solutions, making it adaptable to different IT environments .

Splunk can enhance customer experience for companies like Domino's by integrating data from multiple channels—websites, phone orders, and mobile applications—into a cohesive dataset . By using Splunk's analytics capabilities, companies can track customer interactions across different platforms to gain a unified view of customer behavior and preferences . This enables personalized marketing strategies, improves customer service responsiveness, and allows for the timely resolution of service issues. Furthermore, real-time data analysis helps in adapting to customer needs more dynamically, thus creating a more seamless and satisfying customer experience .

Splunk 
Development
Day 1
- Vikram Yadav (VY)
Your attendance & progress of the course will be calculated based on 
the compl
Agenda – Day 1
• Machine Data
• Why we need a Splunk?
• Introduction to Splunk
• What is Splunk?
• Benefits of using Splunk?
•
3+ years of Splunk experience 
•
Splunk Developer & Administrator at Avotrix
•
Enterprise Security & ITSI
•
IOT & AWS
•
Lea
Data Everywhere
• What is Machine data?
• How big it is?
Why do we need Splunk ?
Introduction to Splunk
• What is Splunk?
• Splunk is a software platform to search, analyze and visualize the machine-generat
Got Data?
Visuals - Reports and Dashborads
Introduction to Splunk
Splunk in Various Area
Network QoS & KPI Analytics
Operation Intelligence & Visualization
Customer Loyalty & Churn Analysis
N

You might also like