0% found this document useful (0 votes)

22 views35 pages

STA 249: Intro to Statistics & Data Analysis

Uploaded by

berre.k8107

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views35 pages

STA 249: Intro to Statistics & Data Analysis

Uploaded by

berre.k8107

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

STA 249 Probability and Statistics

Lecture 1: Introduction to Statistics and Data Analysis

A S S I S T. P R O F. D R . Y E T K İ N T U A Ç
A N K A R A U N I V E R S I T Y, FA C U LT Y O F S C I E N C E , D E PA R T M E N T O F
S TAT I S T I C S
Y T U A C @ A N K A R A . E D U.T R
2 0 2 5 - 2 0 2 6 FA L L
How to contact?
[Link]
Email: ytuac@[Link]
Room number: Block A Dekan Yardımcıları Office
Office Hours: Thursday, 14:00 – 17:00
BREAKDOWN OF GRADES:
Here is the plan:
The grade for STA 249 will be composed of the grades on:
One midterm (40%),
Final exam (60%),

Attendance (min. 50%). (Unless you are not exempt from attendance)
Reference Books
Some of this lecture notes are prepared according to the contents of

1. PROBABILITY & STATISTICS FOR ENGINEERS & SCIENTISTS by Walpole,

Myers, Myers and Ye
2. Statistics for Biomedical Engineers and Scientists How to Visualize and Analyze
Data» by Andrew P. King and Robert J. Eckersley
3. Course Notes on [Link]
How do we analyse our data?
Throughout the course, we will practice data analysis using IBM SPSS
(Statistical Package for the Social Sciences)
Students are expected to install SPSS on their personal computers from
University database at, [Link]
COURSE OBJECTIVES:
We will attempt to cover some or all of the following topics in general:
Give basic statistical concepts,
Understand randomness,
Model the phenomenon of randomness,
Establish the relationship between the problem in the real world and statistical theory,
Have knowledge about some concepts of probability theory,
Learn how to make data analysis with SPSS.
Course Content
Week 1. Introduction To Statistics And Data Analysis
Week 2. Summarizing Data: Tables And Diagrams
Week 3. Summarizing Data: Measures Of Tendency And Dispersion
Week 4. Probability
Week 5. Discrete Random Variables And Their Probability Distributions Probability
Week 6. Continuous Random Variables And Their Probability Distributions
Week 7. Sampling Distributions and Central Limit Theorem
Week 8. Properties of Point Estimators and Methods of Estimation
Week 9-10. Hypothesis Testing Statistics
Week 11-12. Simple Linear Regression and Correlation
Statistical Thinking
Engineers solve problems of interest to society by the efficient application of scientific
principles.
The engineering or scientific method is the approach to formulating and solving these
problems. (Chemometrics)
Statistical Thinking
The field of Probability
Used to quantify likelihood or chance
Used to represent risk or uncertainty in engineering applications
Can be interpreted as our degree of belief or relative frequency
The field of Statistics
Deals with the collection, presentation, analysis, and use of data to
Make decisions
Solve problems.
Definitions
Definitions
Statistics is the science of
◦ collection of methods for planning experiments,
◦ obtaining data, and then organizing,
◦ summarizing,
◦ analyzing,
◦ interpreting,
◦ drawing conclusions.
Definitions
The study of statistics has two major branches – descriptive(exploratory) statistics and inferential
statistics.
• Descriptive statistics is the branch of statistics that involves the organization, summarization,
and display of the data.
• Inferential statistics is the branch of statistics that involves using a sample to draw conclusions
about population. A basic tool in the study of inferential statistics is probability (i.e. α= 0.05).
Definitions
Population
◦ All subjects possessing a common characteristic that is being studied.
There are different types of population.
They are:
Finite Population
Infinite Population
Existent Population
Hypothetical Population
Sample
◦ A subgroup or subset of the population.

Individuals are the objects described by a set of data. Individuals may be people, but they may
also be animals or things (experiment units).
The term sample size simply means the number of elements in the sample.

Often in statistics, we compare samples from two different populations and try to determine
statistically if the populations are significantly different (Comparison tests).
Sampling
Sampling
Sampling consists of selecting some part of a population to observe so that one may estimate
something about the whole population.

Some questions:
– How best to obtain the sample and make the observations?
– Once the sample data are in hand, how best to use them to estimate the characteristic of the
whole population?
Sampling
Basically, there are two types of sampling. They are:

Probability sampling
Non-probability sampling
Probability Sampling
In probability sampling, the population units cannot be selected at the discretion of the
researcher. This can be dealt with following certain procedures which will ensure that every unit
of the population consists of one fixed probability being included in the sample. Such a method is
also called random sampling.

Some of the techniques used for probability sampling are:

Simple Random Sampling
Stratified Sampling
Cluster Sampling
Systematic Sampling
Simple Random Sampling
Every individual or item from the frame has an equal chance of being selected

Selection may be with replacement or without replacement

Samples obtained from table of random numbers or computer random number generators
Stratified Sampling
Stratified sampling is a method of dividing a
population into distinct subgroups (strata)
based on shared characteristics, and then
randomly sampling from each subgroup to
ensure representation across the entire
population.
Example: Imagine you're studying dietary
habits in a city. Rather than randomly picking
people from the whole population, you first
split them into age groups (e.g., teens, adults,
seniors), then randomly select participants
from each group so all age ranges are fairly
represented.
Cluster Sampling
Cluster sampling is a method where the population is divided into naturally occurring groups
(clusters), and then entire clusters are randomly selected for inclusion in the sample rather than
sampling individuals across the whole population.

Imagine you're studying school performance across a city. Instead of randomly selecting students
from every school, you randomly pick 5 schools (clusters) and include all students from those
schools in your sample. This saves time and resources while still capturing group-level variation.
Systematic Sampling
Systematic sampling is a method where you
select every nᵗʰ individual from a list or sequence
after choosing a random starting point. It’s
simple, efficient, and often used when the
population is ordered or evenly spaced
Imagine you have a list of 64 patients in a
hospital database. You want to select a sample of
8. You randomly pick a starting point — say,
patient #3 — and then select every 8ᵗʰ patient N = 64
from there: #3, #11, #19, #27… until you reach 8
n=8 First Group
patients. This ensures a spread-out, evenly
spaced sample. k=8

BASIC BUSINESS STATISTICS, 8E © 2002 PRENTICE-HALL, INC.

Non Probability Sampling
In non-probability sampling, the population units can be selected at the discretion of the
researcher. Those samples will use the human judgements for selecting units and has no
theoretical basis for estimating the characteristics of the population. Some of the techniques
used for non-probability sampling are

Quota sampling
Judgement sampling
Purposive sampling
Population and Sample Examples
All the students in the class are population whereas the top 10 students in the class are the
sample.

All the members of the parliament is population and the female candidates present there is the
sample.
Types of Variables
Variable
◦ Characteristic or attribute that can assume different values.

• Random Variable
◦ A variable whose values are determined by chance (throw a dice or flip a coin)
Types of Variables
A variable is any characteristic of an individual. A variable can take different values for different
individuals.
Qualitative Variables
◦ Variables which assume non-numerical (categorical) values.
◦ Nominal
◦ Ordinal
Quantitative Variable
◦ Variables which assume numerical values.
Discrete Variables
Variables which assume a finite or countable number of possible values. Usually obtained by
counting.
 Continuous Variables
Variables which assume an infinite number of possible values. Usually obtained by measurement.
Types of Variables
Categorical variables
Categorical variables have values that describe a 'quality' or 'characteristic' of a data unit, like 'what
type' or 'which category’.
Categorical variables further described as nominal or ordinal:
A nominal variable is a categorical variable. Observations can take a value that is not able to be
organised in a logical sequence. Examples of nominal categorical variables include gender, business
type, eye colour, religion and brand.
An ordinal variable is a categorical variable. Observations can take a value that can be logically
ordered or ranked. The categories associated with ordinal variables can be ranked higher or lower
than another, but do not necessarily establish a numeric difference between each category. Examples
of ordinal categorical variables include academic grades (i.e. A, B1, B2, C1,…), clothing size (i.e. small,
medium, large, extra large) and attitudes (i.e. strongly agree, agree, disagree, strongly disagree).
The data collected for a categorical variable are qualitative data.
Types of Variables
Numeric variables
Numeric variables have values that describe a measurable quantity as a number, like 'how many' or
'how much'. Therefore numeric variables are quantitative variables.
Numeric variables further described as either continuous or discrete:
A continuous variable is a numeric variable. Observations can take any value between a certain set of
real numbers. The value given to an observation for a continuous variable can include values as small
as the instrument of measurement allows. Examples of continuous variables include height, time,
age, and temperature.
A discrete variable is a numeric variable. Observations can take a value based on a count from a set of
distinct whole values. A discrete variable cannot take the value of a fraction between one value and
the next closest value. Examples of discrete variables include the number of registered cars, number
of business locations, and number of children in a family, all of which measured as whole units (i.e. 1,
2, 3 cars).
The data collected for a numeric variable are quantitative data.
Types of Variables

Qualitative
Quantitative
(non-numerical-
(numerical)
categorical)

Nominal (sex, color Continuous

of eyes) (Height, weight, age)

Discrete
Ordinal (stage of
cancer, education (numbers of sisters
levels) or brothers, phones,
cars in a park)
Some Definitions
Parameter
◦ Characteristic or measure obtained from a population.
Statistic (not to be confused with Statistics)
◦ Characteristic or measure obtained from a sample.
Descriptive Statistics
◦ Collection, organization, summarization, and presentation of data.
Inferential Statistics
◦ Generalizing from samples to populations using probabilities. Performing hypothesis testing,
determining relationships between variables, and making predictions.
Scale
◦ It is the tools and equipment used to obtain numerical data
Example (Descriptive Statistics)
Collect data
◦ e.g. Survey

Present data
◦ e.g. Tables and graphs

Characterize data
◦ e.g. Sample mean = X i

n
Example (Inferential Statistics)
Estimation
◦ e.g.: Estimate the population mean weight using the
sample mean weight

Hypothesis testing
◦ e.g.: Test the claim that the population mean weight is
120 pounds

Drawing conclusions and/or making decisions concerning a population based on

sample results.
Example-1
Consider the following dataset with information about 10 different basketball players:
Solution-1
Qualitative Qualitative Quantitative Quantitative Quantitative
Nominal Nominal Discrete Continuous Discrete

Introduction to Statistics and Data Analysis
No ratings yet
Introduction to Statistics and Data Analysis
32 pages
Understanding Statistics and Its Applications
No ratings yet
Understanding Statistics and Its Applications
85 pages
Intro to Statistics and Data Collection
No ratings yet
Intro to Statistics and Data Collection
22 pages
Probability and Statistics Lecture Notes
No ratings yet
Probability and Statistics Lecture Notes
229 pages
Understanding Probability and Statistics
No ratings yet
Understanding Probability and Statistics
22 pages
Introduction to Basic Statistics Concepts
No ratings yet
Introduction to Basic Statistics Concepts
40 pages
Introduction to Statistical Inference
No ratings yet
Introduction to Statistical Inference
30 pages
Understanding Statistics: Types & Techniques
No ratings yet
Understanding Statistics: Types & Techniques
49 pages
Understanding Statistics: Definitions & Types
No ratings yet
Understanding Statistics: Definitions & Types
18 pages
Understanding Statistics and Sampling Methods
No ratings yet
Understanding Statistics and Sampling Methods
97 pages
STA2023 Statistics Summary Notes
No ratings yet
STA2023 Statistics Summary Notes
58 pages
Understanding Basic Statistical Concepts
No ratings yet
Understanding Basic Statistical Concepts
71 pages
Understanding Statistics: Key Concepts
No ratings yet
Understanding Statistics: Key Concepts
11 pages
Understanding Statistics and Data Analysis
No ratings yet
Understanding Statistics and Data Analysis
13 pages
Introduction to Statistics and Data Analysis
No ratings yet
Introduction to Statistics and Data Analysis
12 pages
Statistics
No ratings yet
Statistics
50 pages
Grade Requirement for Top 30% in Stats
No ratings yet
Grade Requirement for Top 30% in Stats
90 pages
Understanding Statistics and Data Analysis
No ratings yet
Understanding Statistics and Data Analysis
16 pages
Statistics for Research Overview
No ratings yet
Statistics for Research Overview
29 pages
Agricultural Statistics Overview
No ratings yet
Agricultural Statistics Overview
41 pages
Introduction to Statistics Basics
No ratings yet
Introduction to Statistics Basics
47 pages
Introduction to Business Statistics
No ratings yet
Introduction to Business Statistics
31 pages
Statistics Fundamentals and Applications
No ratings yet
Statistics Fundamentals and Applications
170 pages
Biostatistics: Key Concepts Overview
No ratings yet
Biostatistics: Key Concepts Overview
21 pages
Understanding Statistics and Data Analysis
No ratings yet
Understanding Statistics and Data Analysis
41 pages
Introduction to Statistics and Data Analysis
No ratings yet
Introduction to Statistics and Data Analysis
13 pages
Basic Statistics: Data Types & Sampling
No ratings yet
Basic Statistics: Data Types & Sampling
36 pages
Intro to Probability and Statistics
No ratings yet
Intro to Probability and Statistics
10 pages
Introduction to Statistics and Probability
No ratings yet
Introduction to Statistics and Probability
88 pages
Biostatistics Course Overview and Requirements
No ratings yet
Biostatistics Course Overview and Requirements
41 pages
Biostatistics Basics: Concepts & Sampling
No ratings yet
Biostatistics Basics: Concepts & Sampling
67 pages
Sampling Bias in Teen Nicotine Use Poll
No ratings yet
Sampling Bias in Teen Nicotine Use Poll
10 pages
Understanding Statistics and Its Applications
No ratings yet
Understanding Statistics and Its Applications
7 pages
Introduction to Statistics and Data Analysis
No ratings yet
Introduction to Statistics and Data Analysis
29 pages
Overview of SMAT3 Statistics Concepts
No ratings yet
Overview of SMAT3 Statistics Concepts
8 pages
Research Design and Data Collection Methods
No ratings yet
Research Design and Data Collection Methods
42 pages
Understanding Basic Statistics Concepts
No ratings yet
Understanding Basic Statistics Concepts
9 pages
Business Statistics Course Overview
No ratings yet
Business Statistics Course Overview
132 pages
Introduction to Statistics for Management
No ratings yet
Introduction to Statistics for Management
32 pages
Data and Statistics Overview Guide
No ratings yet
Data and Statistics Overview Guide
49 pages
Data Collection and Analysis Framework
No ratings yet
Data Collection and Analysis Framework
66 pages
Essential Statistical Techniques for Education
No ratings yet
Essential Statistical Techniques for Education
33 pages
CH 01
No ratings yet
CH 01
6 pages
Introduction to Statistics Overview
No ratings yet
Introduction to Statistics Overview
14 pages
3 Introduction To Probablities
No ratings yet
3 Introduction To Probablities
25 pages
Data Management and Statistical Methods
No ratings yet
Data Management and Statistical Methods
66 pages
Essential Statistics Guide for FUTO Students
No ratings yet
Essential Statistics Guide for FUTO Students
57 pages
Engineering Probability and Statistics Course
No ratings yet
Engineering Probability and Statistics Course
34 pages
Overview of Descriptive Statistics
No ratings yet
Overview of Descriptive Statistics
10 pages
Engineering Data Analysis Overview
No ratings yet
Engineering Data Analysis Overview
64 pages
Introduction to Statistics Concepts
No ratings yet
Introduction to Statistics Concepts
3 pages
Statistical Techniques for Sampling Theory
No ratings yet
Statistical Techniques for Sampling Theory
53 pages
Business Statistics Course Overview
No ratings yet
Business Statistics Course Overview
132 pages
Constructing Frequency Polygons in Statistics
No ratings yet
Constructing Frequency Polygons in Statistics
20 pages
Introduction to Statistics Concepts
No ratings yet
Introduction to Statistics Concepts
27 pages
Descriptive Statistics: Populations & Samples
No ratings yet
Descriptive Statistics: Populations & Samples
2 pages
Biostatistics in Clinical Research
No ratings yet
Biostatistics in Clinical Research
39 pages
2024 QTS105D Study Package Overview
No ratings yet
2024 QTS105D Study Package Overview
184 pages
Nonlinear Regression Curve Fitting
No ratings yet
Nonlinear Regression Curve Fitting
4 pages
Business & Economics Statistics Exam
No ratings yet
Business & Economics Statistics Exam
7 pages
ANOVA Assumptions and Example Analysis
No ratings yet
ANOVA Assumptions and Example Analysis
3 pages
CH Chapter 4 Test Bank CH Chapter 4 Test Bank
No ratings yet
CH Chapter 4 Test Bank CH Chapter 4 Test Bank
32 pages
Applied Linear Regression Study Guide
No ratings yet
Applied Linear Regression Study Guide
58 pages
Profile of Anil K. Bera, Economist
No ratings yet
Profile of Anil K. Bera, Economist
55 pages
Factors Influencing Unemployment in Indonesia
No ratings yet
Factors Influencing Unemployment in Indonesia
10 pages
Sales Forecasting with Decomposition Models
No ratings yet
Sales Forecasting with Decomposition Models
2 pages
Correlation and Regression Analysis Guide
No ratings yet
Correlation and Regression Analysis Guide
7 pages
Understanding Multicollinearity in Regression
No ratings yet
Understanding Multicollinearity in Regression
24 pages
Testbank Business Statistics 4th Canadian Edition Norean R Sharpe Richard D de Veaux Paul Velleman David Wright ISBN10 0136726542 ISBN13 9780136726548 Download
No ratings yet
Testbank Business Statistics 4th Canadian Edition Norean R Sharpe Richard D de Veaux Paul Velleman David Wright ISBN10 0136726542 ISBN13 9780136726548 Download
260 pages
Forecasting Methods for Demand Estimation
No ratings yet
Forecasting Methods for Demand Estimation
36 pages
Factor Analysis in Research Methodology
No ratings yet
Factor Analysis in Research Methodology
8 pages
Statistical Methods for Engineering Analysis
No ratings yet
Statistical Methods for Engineering Analysis
11 pages
Hedge Fund Predictability via GARCH Model
No ratings yet
Hedge Fund Predictability via GARCH Model
28 pages
Bayesian Insights on Autistic Perception
No ratings yet
Bayesian Insights on Autistic Perception
27 pages
Wilcoxon Signed-Rank Test Explained
No ratings yet
Wilcoxon Signed-Rank Test Explained
29 pages
Fuzzy Inference for Student Performance Evaluation
No ratings yet
Fuzzy Inference for Student Performance Evaluation
7 pages
Standard Deviation in Pharma Analysis
No ratings yet
Standard Deviation in Pharma Analysis
8 pages
Analyzing Paired Sample T-Test in SPSS
No ratings yet
Analyzing Paired Sample T-Test in SPSS
5 pages
Key Properties of Estimators Explained
No ratings yet
Key Properties of Estimators Explained
5 pages
تقييم نظام معلومات الموارد البشرية
No ratings yet
تقييم نظام معلومات الموارد البشرية
16 pages
Call Volume Forecasting Methods
0% (2)
Call Volume Forecasting Methods
9 pages
CEO Salary Regression Analysis Results
No ratings yet
CEO Salary Regression Analysis Results
1 page
Statistical Inference for Rayleigh Distributions
No ratings yet
Statistical Inference for Rayleigh Distributions
6 pages
UP BIT Information Systems Admission Guide
No ratings yet
UP BIT Information Systems Admission Guide
36 pages
Portfolio Projects Roadmap - Beginner To Advanced Economist-Data Scientist
No ratings yet
Portfolio Projects Roadmap - Beginner To Advanced Economist-Data Scientist
18 pages
Understanding ANOVA in Inferential Stats
No ratings yet
Understanding ANOVA in Inferential Stats
102 pages
ANOVA and F-Test in Regression Analysis
No ratings yet
ANOVA and F-Test in Regression Analysis
15 pages
SPSS Data Analysis Guide
No ratings yet
SPSS Data Analysis Guide
133 pages

STA 249: Intro to Statistics & Data Analysis

Uploaded by

STA 249: Intro to Statistics & Data Analysis

Uploaded by

STA 249 Probability and Statistics

Lecture 1: Introduction to Statistics and Data Analysis

1. PROBABILITY & STATISTICS FOR ENGINEERS & SCIENTISTS by Walpole,

Some of the techniques used for probability sampling are:

Selection may be with replacement or without replacement

BASIC BUSINESS STATISTICS, 8E © 2002 PRENTICE-HALL, INC.

Nominal (sex, color Continuous

Drawing conclusions and/or making decisions concerning a population based on

You might also like