Chapter

Descriptive Statistics
Hazhar Talaat Abubaker Blbas

Abstract

Descriptive statistics is a branch of statistics that deals with summarizing and

describing the main features of a dataset. This chapter will cover the value of statistics,
how data analysis occurs in scientific study, the distinction between a sample and the
population, the different types of variables, sampling techniques, measures of central
tendency, and measures of dispersion. The summary of descriptive statistics gives a
succinct overview of various metrics and visual representations, enabling researchers
and analysts to learn more about the features of the dataset and draw accurate
conclusions.

Keywords: mean, median, mode, standard deviation, coefficient of variation, probability sampling, non-probability sampling

1. Introduction

Descriptive statistics involves summarizing and describing data using numerical
measures and graphical representations. It provides a concise and meaningful way to
understand and communicate the main characteristics of a dataset. This introduction
explores the basics of descriptive statistics, including measures of central tendency,
measures of dispersion, and graphical representations. By examining these statistical
tools, we can gain insights into the patterns, variability, and distribution of data,
allowing us to make informed interpretations and draw meaningful conclusions.

2. What is the process of analyzing data in statistics?

Statistics is the science of collecting, organizing, analyzing, and interpreting data

in order to make decisions as shown in Figure 1.

3. Sample and population

A population is the collection of all outcomes, responses, measurements, or counts
that are of interest, whereas a sample is a subset of a population, as shown in Figure 2 [1–3].

Recent Advances in Biostatistics

Figure 1.
Process of data analysis in scientific research. Source: created by the author as a new work.

Figure 2.
Difference between sample and population. Source: created by the author as a new work.

Descriptive Statistics
DOI: [Link]

4. Type of variables

A variable is a characteristic that can assume different values, which may be numeric
or alphabetic. There are two common types of variables: quantitative variables and
qualitative variables, as shown in Figure 3 [4, 5].

1. Quantitative variables (numerical variables) are variables that represent

measurable quantities or amounts. They can be further classified into two types:

i. Discrete variables: Discrete variables are numerical variables that can

only take on specific, separate values. These values are typically whole
numbers or counts and cannot be subdivided further. Examples of
discrete variables include the number of children in a family, the
number of customers in a store, or the number of items sold.

ii. Continuous variables: Continuous variables are numerical variables that

can take on any value within a certain range. They can be measured with
a high degree of precision and can have infinite possible values between
any two points. Examples of continuous variables include height,
weight, temperature, and income.

2. Qualitative variables (categorical variables) are variables that represent data
that can be divided into distinct categories or groups. Examples include gender,
ethnicity, marital status, and level of education. They can be further classified
into two types:

i. Nominal variables: Nominal variables are categorical variables that

represent data with no ranking. Examples of nominal variables include
gender (male/female), ethnicity (Asian, African, European, etc.),
marital status (single, married, divorced), and eye color (blue, brown,
green).

Figure 3.
Types of variables. Source: created by the author as a new work.


ii. Ordinal variables: Ordinal variables represent data that has a natural
order or ranking, but the differences between the categories may not be
consistent or measurable. The categories can be ranked or ordered based
on some criterion, but the magnitude of the difference between
categories is not known. Examples of ordinal variables include
satisfaction levels (very satisfied, satisfied, neutral, dissatisfied, very
dissatisfied), educational attainment (high school diploma, bachelor’s
degree, master’s degree), and survey responses using Likert scales
(“strongly agree,” “agree,” “neutral,” “disagree,” “strongly disagree”).

5. Sampling plan

Once the target population has been identified, the sampling plan must be
devised. The goal is to select a small percentage of the population that will in turn
represent the population as a whole. There are two general types of
sampling techniques [1, 2, 5]:

5.1 Probability (random) sampling

All members of the population must be specified prior to drawing the sample, and
each member of the population has a known, nonzero probability of being chosen or
included in the sample (in simple random sampling, these probabilities are equal).
There are four common types of probability (random) sampling:

5.1.1 Simple random sampling

Simple random sampling is a statistical sampling technique in which each member
of a population has an equal probability of being selected to be part of the sample. The
selection process is conducted randomly, without any bias or preference toward
certain individuals or elements in the population.
For example, a researcher wants to conduct a survey to understand the opinions of
students at a university regarding a new policy. The university has a total population
of 1500 students, and the researcher wants to select 100 of them as a sample. Assign a
unique identifier to each student, such as a student ID number. Then, randomly select
100 students as the sample, as in a lottery.
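The lottery-style draw described above can be sketched in a few lines of Python. This is a minimal illustration, not the chapter's own procedure: the student IDs 1 to 1500 and the fixed seed are assumptions made only so the example is self-contained and reproducible.

```python
import random

# Hypothetical sampling frame: student ID numbers 1..1500 (assumed for illustration).
population = list(range(1, 1501))

# A seeded generator is used only so that the draw can be reproduced.
rng = random.Random(42)

# Draw 100 distinct IDs; every student has the same chance of selection.
sample = rng.sample(population, k=100)

print(len(sample), "students selected; first five IDs:", sorted(sample)[:5])
```

Because `random.Random.sample` draws without replacement, no student can appear in the sample twice.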

5.1.2 Systematic sampling

Systematic sampling is a statistical sampling technique that involves selecting
every kth element from a population, where k is a predetermined interval. It is similar
to simple random sampling but incorporates a systematic approach to the selection
process.
Continuing the previous example of simple random sampling, the researcher
wants to select 100 students using systematic sampling. First, calculate the sampling
interval k by dividing the population size by the desired sample size. In this case, the
sampling interval is 1500/100 = 15. Next, select a random starting point within the
first k elements (in this case, the first 15 students). Then, starting from the random
starting point, select every 15th student thereafter. For example, if the starting point
is the 7th student, you would select the 7th, 22nd, 37th, and so on, until you
reach the desired sample size.
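As a rough sketch of this procedure, the following Python snippet computes the interval for a population of 1500 and a sample of 100 (1500 / 100 = 15), picks a random start among the first 15 students, and then takes every 15th student. The ID list and the seed are assumptions introduced for illustration.

```python
import random

# Hypothetical sampling frame: students listed in some fixed order (assumed IDs).
population = list(range(1, 1501))
n = 100

# Sampling interval k = population size / desired sample size.
k = len(population) // n

# Random starting position within the first k students (1-based position).
rng = random.Random(0)
start = rng.randint(1, k)

# Select positions start, start + k, start + 2k, ... via list slicing.
systematic_sample = population[start - 1::k]

print("interval:", k, "start:", start, "sample size:", len(systematic_sample))
```

Only the starting point is random; once it is fixed, the rest of the sample is fully determined by the interval.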

5.1.3 Stratified sampling

Stratified sampling is a statistical sampling technique that involves dividing the
population into two or more homogeneous groups (strata). The desired number of
cases is then randomly selected from each group using simple random sampling.
Continuing the previous example of simple random sampling, the researcher
wants to select 100 students using stratified sampling.
First, students can be stratified based on their academic disciplines into four strata:
the statistics, accounting, business, and economics departments.

1. Determine the sample size: Decide on the desired sample size for each stratum.
Let us say you want to sample 25 students from each stratum (department),
resulting in a total sample size of 100 students.

2. Divide the population into four strata: Categorize the students into the respective
strata based on their academic disciplines. Each student should belong to only
one stratum.

3. Determine the allocation: Calculate each stratum's share of the sample
by dividing the desired sample size for that stratum by the total sample size. In
this case, since each stratum has the same desired sample size (25 students), the
allocation is 1/4 (25%) of the sample for each stratum. This is an equal
allocation; under proportionate allocation, each stratum's share would instead
match its share of the population.

4. Sample within each stratum: Perform simple random sampling within each
stratum separately. Randomly select 25 students (25% of the sample) from the
statistics stratum, 25 from the accounting stratum, 25 from the business stratum,
and 25 from the economics stratum.

5. Collect data: Once the samples are selected, collect the relevant data or
information from the students in each stratum.
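The steps above can be sketched as follows. The chapter does not state how many students belong to each department, so the stratum sizes below are hypothetical values chosen only to sum to 1500; the student labels and the seed are likewise assumptions.

```python
import random

rng = random.Random(1)  # seeded only for reproducibility

# Hypothetical stratum sizes (not given in the chapter); they sum to 1500.
stratum_sizes = {"statistics": 300, "accounting": 450,
                 "business": 400, "economics": 350}

# Build one list of (assumed) student labels per stratum.
strata = {dept: [f"{dept}-{i}" for i in range(1, size + 1)]
          for dept, size in stratum_sizes.items()}

# Equal allocation: simple random sample of 25 students within each stratum.
per_stratum = 25
stratified_sample = {dept: rng.sample(members, per_stratum)
                     for dept, members in strata.items()}

total = sum(len(chosen) for chosen in stratified_sample.values())
print("total sample size:", total)
```

Each stratum is sampled independently, so every department is guaranteed exactly 25 students in the final sample regardless of its size.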

5.1.4 Cluster sampling

Cluster sampling involves dividing the population into clusters
or groups, often based on geographical proximity, and randomly selecting one or more
entire clusters as the sampling units. This technique is useful when it is impractical
or costly to sample individuals one by one, and it can provide cost and time
efficiencies.
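A minimal sketch of cluster sampling is shown below. The chapter gives no concrete example, so the 30 dormitories of 50 students each, the number of clusters drawn, and the seed are all hypothetical choices made for illustration.

```python
import random

rng = random.Random(7)  # seeded only for reproducibility

# Hypothetical clusters: 30 dormitories with 50 students each (assumed).
clusters = {f"dorm_{c}": [f"dorm_{c}_student_{i}" for i in range(1, 51)]
            for c in range(1, 31)}

# Randomly select 3 whole clusters; the cluster, not the student, is the sampling unit.
chosen_clusters = rng.sample(sorted(clusters), k=3)

# Every student in a chosen cluster enters the sample.
cluster_sample = [student for c in chosen_clusters for student in clusters[c]]

print(chosen_clusters, len(cluster_sample))
```

In contrast to stratified sampling, where every group contributes some members, here most groups contribute nobody and the selected groups contribute everybody.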

5.2 Nonprobability sampling

In nonprobability sampling, the elements of the population do not have a known
probability of being chosen. Inclusion in the sample is based on the judgment of the
person selecting the sample rather than on random selection. There are four common
types of nonprobability sampling.

5.2.1 Judgment sampling

Purposive sampling, also known as judgmental or selective
sampling, involves handpicking individuals based on specific criteria or the
researcher’s judgment. This technique is often used in qualitative research or when a
specific subgroup of the population is of particular interest. Purposive sampling allows
the researcher to target individuals who possess the desired characteristics or have
relevant experiences.

5.2.2 Convenience sampling

Convenience sampling involves selecting individuals who
are easily accessible or readily available to the researcher. This method is convenient
and often used in situations where time, cost, or accessibility is a constraint. However,
convenience sampling can introduce bias, as the sample may not be representative of
the entire population.

5.2.3 Quota sampling

Quota sampling involves setting specific quotas or targets for
certain characteristics or subgroups within the population. The researcher selects
individuals until each predetermined quota is filled. Quota sampling allows for
control over sample proportions but does not involve random selection.

5.2.4 Snowball sampling

Snowball sampling is a technique where initial participants are
selected, and then they help identify and recruit additional participants from their
social networks. This method is useful when studying hard-to-reach or hidden
populations. Snowball sampling relies on referrals and networks to expand the
sample size.

6. Measures of central tendency

A measure of central tendency is a statistical measure that represents the central or
middle value of a dataset. The three common measures of central tendency are the mean,
median, and mode [4–6].

1. Mean (average): The mean is calculated by summing all the values in a dataset and
dividing by the number of values. It represents the balancing point of the dataset
and is sensitive to outliers. For example, in a survey of 894 people from the
Kurdistan Region of Iraq about depression and anxiety during the outbreak of
COVID-19, the average age of respondents was 33 years [8].
$$\bar{X} = \frac{\sum X_i}{n} \tag{1}$$

Example: Consider the following dataset of exam scores: 85, 90, 92, 88, 95. The
mean is calculated as (85 + 90 + 92 + 88 + 95) / 5 = 90.

2. Median: The median is the middle value in a dataset when it is arranged in

ascending or descending order. If there is an even number of values, the median
is the average of the two middle values. The median is less influenced by outliers
compared to the mean.
Example: Using the same dataset of exam scores: 85, 90, 92, 88, 95. When
arranged in ascending order (85, 88, 90, 92, 95), the middle value is 90.
Therefore, the median is 90.

3. Mode: The mode represents the most frequently occurring value(s) in a dataset.
It is the value that appears with the highest frequency. A dataset can have no
mode (when all values occur equally) or multiple modes (when multiple values
have the same highest frequency).

Example: Consider the following dataset of exam scores: 85, 90, 92, 88, 90.
The mode is 90 because it appears twice, which is more frequently than any
other value.
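The three worked examples above can be checked with Python's standard `statistics` module. This is only an illustrative sketch; the variable names are assumptions, and the data are the exam scores used in the examples.

```python
import statistics

scores = [85, 90, 92, 88, 95]

mean_score = statistics.mean(scores)      # (85 + 90 + 92 + 88 + 95) / 5 = 90
median_score = statistics.median(scores)  # middle of 85, 88, 90, 92, 95 -> 90

# Dataset from the mode example, where 90 occurs twice.
scores_with_repeat = [85, 90, 92, 88, 90]
modes = statistics.multimode(scores_with_repeat)  # -> [90]

# When all values occur equally often, multimode returns every value,
# which corresponds to the "no mode" case described in the text.
all_tied = statistics.multimode(scores)

print(mean_score, median_score, modes, all_tied)
```

`statistics.multimode` is used rather than `statistics.mode` because it returns a list and so handles datasets with several modes.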

7. Measures of dispersion (variation)

Measures of dispersion (variation) provide information about the spread of data
points around the central tendency. The first three measures, namely the range,
variance, and standard deviation, are used to compare datasets measured in the
same units, whereas the coefficient of variation can be used to compare datasets
measured in different units [4–8].

1. Range (R): It is the difference between the maximum and minimum values in a
dataset.

$$R = \text{Highest value} - \text{Lowest value} \tag{2}$$

Example: Consider the following dataset of exam scores: 85, 90, 92, 88, 95. The
range is calculated as 95 − 85 = 10.

2. Variance (S²): It measures the average squared deviation of each data point from
the mean. It provides a more precise measure of dispersion by considering the
differences between individual data points and the mean. However, it is in
squared units and is sensitive to outliers.

$$S^2 = \frac{\sum (X_i - \bar{X})^2}{n - 1} \tag{3}$$

Example: Using the same dataset of exam scores: 85, 90, 92, 88, 95. The variance is
calculated as follows:

• Calculate the mean: (85 + 90 + 92 + 88 + 95) / 5 = 90.

• Calculate the squared deviation of each data point from the mean: (85 − 90)^2 = 25,
(90 − 90)^2 = 0, (92 − 90)^2 = 4, (88 − 90)^2 = 4, (95 − 90)^2 = 25.

• Sum the squared deviations and divide by n − 1, as in Eq. (3):
(25 + 0 + 4 + 4 + 25) / (5 − 1) = 58 / 4 = 14.5.

Therefore, the variance is 14.5.

3. Standard deviation (S): It is the square root of the variance. It is the most
commonly used measure of dispersion as it is in the original units of the data,
making it more interpretable. It provides a measure of how much the data
deviates from the mean.

$$S = \sqrt{\frac{\sum (X_i - \bar{X})^2}{n - 1}} \tag{4}$$

Example: Using the same dataset of exam scores: 85, 90, 92, 88, 95. The sum of
squared deviations from the mean is 58, so with n − 1 = 4 the variance is 58 / 4 = 14.5,
and the standard deviation is its square root, which is approximately 3.81.

4. Coefficient of variation (CV): It is a relative measure of dispersion that
expresses the standard deviation as a percentage of the mean. It is used to
compare the variability of datasets with different means or scales. The formula
for calculating the coefficient of variation is:

$$CV = \frac{S}{\bar{X}} \times 100 \tag{5}$$

Here’s an example to illustrate the calculation of the coefficient of variation.

Consider two datasets representing the monthly sales of two stores:
Store A: Mean = $10,000, Standard Deviation = $2000.
Store B: Mean = $15,000, Standard Deviation = $3000

• CV for Store A = (2000 / 10,000) * 100 = 20%

• CV for Store B = (3000 / 15,000) * 100 = 20%

In this example, both stores have the same coefficient of variation of 20%. It
indicates that the relative variability or dispersion of sales is the same for both stores,
even though Store B has a higher mean and standard deviation compared to Store A.
A lower coefficient of variation indicates less variability relative to the mean, while
a higher coefficient of variation suggests greater relative variability.
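As a sketch, Eqs. (2) to (5) can be applied to the exam scores and the store figures with Python's standard `statistics` module. Note that `statistics.variance` and `statistics.stdev` use the n − 1 divisor of Eqs. (3) and (4), while `statistics.pvariance` divides by n. The helper name `coefficient_of_variation` is an assumption introduced for this illustration.

```python
import statistics

scores = [85, 90, 92, 88, 95]

data_range = max(scores) - min(scores)            # Eq. (2): 95 - 85 = 10
sample_variance = statistics.variance(scores)     # Eq. (3): 58 / 4 = 14.5
sample_std = statistics.stdev(scores)             # Eq. (4): sqrt(14.5), about 3.81
population_variance = statistics.pvariance(scores)  # divides by n: 58 / 5 = 11.6


def coefficient_of_variation(std_dev, mean):
    """Eq. (5): standard deviation expressed as a percentage of the mean."""
    return 100 * std_dev / mean


cv_store_a = coefficient_of_variation(2000, 10_000)  # 20.0
cv_store_b = coefficient_of_variation(3000, 15_000)  # 20.0
cv_scores = coefficient_of_variation(sample_std, statistics.mean(scores))

print(data_range, sample_variance, round(sample_std, 2), population_variance)
print(cv_store_a, cv_store_b, round(cv_scores, 2))
```

Printing both the n − 1 and the n versions side by side makes it easy to see which divisor a reported variance was computed with.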

Additional information

ORCID account: [Link]

Google Scholar Citation: [Link]
JoAAAAJ&hl=en&authuser=1


Author details

Hazhar Talaat Abubaker Blbas

Department of Statistics, College of Administration and Economics, Salahaddin
University, Erbil, Kurdistan Region, Iraq

*Address all correspondence to: [Link]@[Link]

© 2023 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of
the Creative Commons Attribution License ([Link]
which permits unrestricted use, distribution, and reproduction in any medium, provided
the original work is properly cited.

References

[1] Aroian K, Uddin N, Blbas H. Longitudinal study of stress, social support, and depression in married Arab immigrant women. Health Care for Women International. 2017 Feb 1;38(2):100-117

[2] Rosner B. Fundamentals of Biostatistics. Cengage Learning; 2015

[3] Bluman A. Elementary Statistics: A Step by Step Approach. 9th ed. McGraw Hill; 2014

[4] Blbas H. Statistical analysis for the most influential reasons for divorce between men and women in Erbil-Iraq. International Journal. Malmö, Sweden. 2019

[5] Triola MF, Iossi L. Essentials of Statistics. Boston, MA, USA: Pearson Addison Wesley; 2008

[6] Hanif M, Ahmed M, Ahmed AM. Biostatistics for Health Students with Manual on Software Applications. Islamic Society of Statistical Sciences; 2006

[7] Rowe P. Essential Statistics for the Pharmaceutical Sciences. John Wiley & Sons; 2015

[8] Blbas HT, Aziz KF, Nejad SH, Barzinjy AA. Phenomenon of depression and anxiety related to precautions for prevention among population during the outbreak of COVID-19 in Kurdistan region of Iraq: Based on questionnaire survey. Journal of Public Health. 2020;10:1-5
