0% found this document useful (0 votes)
299 views8 pages

Business Analytics Complete Notes PDF

The document provides comprehensive notes on Business Analytics, covering key concepts such as data science, types of analytics, applications in business, and challenges in data analytics. It also includes practical guidance on data preparation, visualization using spreadsheets, and using R for statistical analysis and predictive analytics. Additionally, it addresses textual analytics and its applications, challenges, and techniques.

Uploaded by

ankitanand8
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
299 views8 pages

Business Analytics Complete Notes PDF

The document provides comprehensive notes on Business Analytics, covering key concepts such as data science, types of analytics, applications in business, and challenges in data analytics. It also includes practical guidance on data preparation, visualization using spreadsheets, and using R for statistical analysis and predictive analytics. Additionally, it addresses textual analytics and its applications, challenges, and techniques.

Uploaded by

ankitanand8
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Business Analytics - Complete Notes

Unit 1: Introduction

Data and Data Science:

- Data: Raw facts and figures (e.g., sales records, temperatures).

- Data Science: Field that uses math, statistics, and computer science to analyze and gain insights from data.

Data Analytics vs. Data Analysis:

- Data Analytics: Broad, strategic use of data for decision-making.

- Data Analysis: Detailed process of examining datasets.

Classification of Analytics:

1. Descriptive ? What happened?

2. Diagnostic ? Why did it happen?

3. Predictive ? What is likely to happen?

4. Prescriptive ? What should we do?

Application of Analytics in Business:

- Marketing: Customer segmentation, campaign effectiveness.

- Finance: Credit scoring, fraud detection.

- HR: Employee attrition prediction.

- Operations: Inventory optimization.

Types of Data:

- Nominal: Categories without order (e.g., gender).


Business Analytics - Complete Notes

- Ordinal: Categories with order (e.g., ranking).

- Scale: Interval (temperature) or ratio (weight).

Big Data & Its Characteristics (5Vs):

1. Volume ? Large quantity

2. Velocity ? Speed of generation

3. Variety ? Different types (text, video)

4. Veracity ? Uncertainty

5. Value ? Usefulness

Applications of Big Data:

- E-commerce, healthcare, smart cities, etc.

Challenges in Data Analytics:

- Data quality, storage, security, skill gaps, integration.

Unit 2: Data Preparation, Summarisation and Visualisation Using Spreadsheet

Data Preparation and Cleaning:

- Remove errors, fix missing values.

Spreadsheet Tools:

- Sort and Filter: Organize or filter rows.

- Conditional Formatting: Highlight important cells.


Business Analytics - Complete Notes

- Text to Column: Split data in one column into multiple.

- Remove Duplicates: Clean repeated values.

- Data Validation: Control input types.

- Identify Outliers: Use scatter plots or z-scores.

- Covariance & Correlation: Relationships between variables.

- Moving Averages: Smooth trends.

- Finding Missing Values: Use filters or functions (IF, ISBLANK).

Summarisation:

- Use formulas (SUM, AVERAGE, COUNT).

Visualisation Tools:

- Scatter Plots: Two variable relation.

- Line Charts: Trends.

- Histograms: Data distribution.

- Pivot Tables: Quick summaries.

- Pivot Charts: Visual pivot table data.

- Dashboards: Interactive visuals.

Unit 3: Getting Started with R

Introduction to R:

- A free programming language for statistical analysis.


Business Analytics - Complete Notes

Advantages of R:

- Open-source, data visualization, strong statistical packages.

Installation of R Packages:

- [Link]("package_name")

- library(package_name)

Import Data:

- [Link]("[Link]")

- readxl::read_excel("[Link]")

Basic Commands & Syntax:

- Assignment: <- or =

- Print: print(), head(), summary()

Data Structures:

- Vectors: One-dimensional

- Matrices: 2D with same data type

- Arrays: Multi-dimensional

- Lists: Different types of elements

- Factors: Categorical data

- Data Frames: Tabular data (like Excel)

Control Flows:

- if, else, ifelse()


Business Analytics - Complete Notes

Loops:

- for, while, repeat

Functions & Apply Family:

- Custom: myfunc <- function(x){...}

- Apply: apply(), lapply(), sapply()

Unit 4: Descriptive Statistics Using R

Importing Data:

- [Link]("[Link]")

Visualization:

- hist(), barplot(), boxplot(), plot(), line charts

Measure of Central Tendency:

- Mean: average()

- Median: middle value

- Mode: most frequent (custom function)

Measure of Dispersion:

- Variance: var()

- Standard Deviation: sd()


Business Analytics - Complete Notes

- Range: max() - min()

Relationship Between Variables:

- Covariance: cov(x, y)

- Correlation: cor(x, y)

- Coefficient of Determination (R²): summary(model)$[Link]

Unit 5: Predictive and Textual Analytics

Predictive Analytics:

Simple Linear Regression:

- Formula: Y = a + bX

- lm(Y ~ X, data = df)

Confidence & Prediction Intervals:

- predict(model, newdata, interval = "confidence/prediction")

Multiple Linear Regression:

- lm(Y ~ X1 + X2 + ..., data = df)

Interpretation:

- Each coefficient shows effect on Y when other variables are fixed


Business Analytics - Complete Notes

Heteroscedasticity:

- Non-constant error variance

- plot(model$residuals)

Multicollinearity:

- High correlation among Xs

- Check using vif(model)

Textual Analytics:

Basics:

- Analyze unstructured data (e.g., tweets, reviews)

Applications:

- Customer feedback, brand monitoring

Challenges:

- Noise, sarcasm, multi-language

Text Analysis in R:

- Packages: tm, wordcloud, syuzhet, tidytext

Techniques:

- Text Mining: Clean & extract words

- Categorization: Grouping text (e.g., spam)


Business Analytics - Complete Notes

- Sentiment Analysis: get_nrc_sentiment()

Common questions

Powered by AI

R programming plays a vital role in enhancing statistical analysis due to its strong statistical packages, ability to handle large datasets, and advanced data visualization capabilities. It simplifies complex statistical analyses through built-in functions for various statistical measures and models. R's open-source nature allows for continuous enhancement and customization suited for specific business needs, making it a preferred choice for statisticians and data analysts .

Key challenges in data analytics related to data quality include data errors, missing values, and inconsistencies. These can be addressed through rigorous data preparation and cleaning processes such as removing errors, fixing missing values, and ensuring data integrity. Using tools like data validation, duplicate removal, and proper data governance policies can significantly enhance the quality of data for analytical purposes .

Data visualization is crucial as it transforms complex data analyses into understandable visual formats, aiding quicker decision-making. Methods like scatter plots reveal relationships between variables, line charts identify trends over time, histograms show distribution patterns, and pivot tables provide summary insights. Effective visualization ensures that stakeholders can grasp data insights intuitively and act on them efficiently .

Moving averages smooth out short-term fluctuations and highlight longer-term trends or cycles in historical data, making them effective for business forecasting. They function by computing the average of data points over a specified period and progressing over time. However, their limitations include a lag effect, where the moving average reacts slowly to new data, and they can mask significant short-duration changes, thus not capturing dynamic shifts in trends promptly .

Challenges in text analytics include handling noise, sarcasm, and multiple languages. These can be mitigated using R's robust text mining packages like tm and tidytext. By performing data cleaning and preprocessing such as removing stopwords, stemming, and tokenization, R helps manage noise. Sentiment analysis functions and language-specific models can address sarcasm and multi-language issues. R offers a repeatable framework to manage these challenges effectively .

Predictive analytics in HR can predict employee attrition, identify potential leaders, and optimize training investments. By analyzing data on employee performance, engagement, and turnover patterns, HR can proactively address issues leading to attrition, develop retention strategies, and tailor training programs to employee needs. Predictive insights help align HR strategies with organizational goals and enhance talent management processes .

Predictive analytics can enhance marketing strategies by enabling customer segmentation, predicting campaign effectiveness, and optimizing customer interactions. By analyzing historical customer data, businesses can forecast buying behaviors and tailor marketing efforts towards the most lucrative segments. Predictive insights help in personalizing marketing messages, improving customer experience, and hence, increasing campaign ROI .

Data science focuses on the methodologies to analyze data using mathematical, statistical, and computer science tools to gain insights, while data analytics is a broader, strategic usage of data that assists decision-making. Data analytics encompasses data analysis as part of its process but extends to applying insights strategically across business functions .

Descriptive analytics concentrates on explaining what has happened in the past through historical data analysis. Diagnostic examines why something happened by identifying causal factors and relationships. Predictive analytics forecasts future events by identifying trends and patterns. Prescriptive analytics suggests actions based on predictions to optimize outcomes. Businesses use these analytics to respond to what has happened, understand causality, anticipate future trends, and make informed actions, respectively .

The '5Vs' — Volume, Velocity, Variety, Veracity, and Value — present opportunities by enabling more comprehensive and timely data analysis, leading to better business insights and competitive advantage. However, they also pose challenges: managing large volumes requires robust infrastructure, high velocity demands real-time processing capabilities, variety necessitates tools to handle diverse data types, veracity involves ensuring data quality and reducing uncertainty, and deriving value requires strategic integration with business processes .

You might also like