Problem Set Supervised Learning
490 - Spring 2025
American University of Beirut
Important Notes
For any questions regarding the assignment, please contact the Teaching Assistants:
• Mohammad Zbeeb — mbz02@[Link]
• Mariam Salman — mcs12@[Link]
Note: Do not contact the professor regarding this assignment.
Overview
This assignment is divided into two parts:
1. Toolbox Tasks (Data Collection & Modeling): Use the Physics Toolbox Sensor
Suite application to collect sensor data, and design two machine learning tasks (one
regression and one classification).
2. Theoretical Questions: Answer theoretical questions related to supervised learning.
All questions in this section are required.
1 Part I: Toolbox Tasks for Regression and Classification
1.1 Overview
In this part, you will use real-world sensor data collected from your smartphone using the
Physics Toolbox Sensor Suite application. The aim is to design and implement two separate
machine learning tasks:
• Regression Task: Choose a physical phenomenon (e.g., harmonic motion, acceleration
changes, etc.) and analyze it using regression techniques.
• Classification Task: Collect labeled data for an activity recognition problem (e.g., detecting push-ups over time) and apply classification methods.
1.2 Task Requirements
1. Download the Physics Toolbox Sensor Suite application on your smartphone.
2. Collect appropriate sensor data for each task.
3. Preprocess and analyze the collected data using Python in a Google Colab notebook.
4. Implement the corresponding regression and classification models.
5. Evaluate and interpret the results.
6. Document your entire workflow in a well-organized Colab notebook.
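Hint (optional sketch): the snippet below illustrates steps 2–3 above on a small made-up excerpt of a Physics Toolbox g-force export, parsing the CSV and computing the total acceleration magnitude per sample. The column names (gFx, gFy, gFz) are an assumption; your export may use different names depending on the sensor and app version.

```python
import csv
import io
import math

# Hypothetical excerpt of a Physics Toolbox g-force CSV export.
# Real exports may have different column names and a longer recording.
raw = """time,gFx,gFy,gFz
0.00,0.01,-0.02,0.99
0.05,0.03,0.00,1.01
0.10,-0.01,0.04,0.98
"""

rows = list(csv.DictReader(io.StringIO(raw)))

# Preprocess: total acceleration magnitude per sample,
# a common feature for both regression and activity classification.
magnitudes = [
    math.sqrt(float(r["gFx"]) ** 2 + float(r["gFy"]) ** 2 + float(r["gFz"]) ** 2)
    for r in rows
]
print([round(m, 3) for m in magnitudes])
```

In your Colab notebook you would load the real CSV from Google Drive instead of the inline string and feed features like this into your models.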
1.3 Submission Guidelines
Submit the following:
• A publicly accessible link to your Google Colab notebook. Ensure that anyone with
the link can view it.
• A publicly accessible Google Drive link to your dataset. Ensure that the dataset is
shared with anyone with the link.
Submission Format: Provide both links in a plain text file or a PDF document in the
submission box on Moodle.
2 Part II: Theoretical Questions
In this part, you will answer theoretical questions related to supervised learning. All questions
in this section are required. Provide your answers in the same Colab Notebook as Part I.
Theoretical Question 1: Multivariate Least Squares
For a dataset where each target y^{(i)} is vector-valued with p outputs, the cost function is:

$$J(\Theta) = \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{p} \left( \left(\Theta^{T} x^{(i)}\right)_{j} - y^{(i)}_{j} \right)^{2}.$$
Tasks:
1. Express this cost function in matrix-vector notation.
2. Derive the normal equations for Θ.
3. Compare this solution to solving p independent least squares problems.
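Hint (one possible starting point, not a complete answer): under the common convention of stacking the inputs as rows of a design matrix $X \in \mathbb{R}^{m \times n}$ and the targets as rows of $Y \in \mathbb{R}^{m \times p}$, the double sum can be written with the Frobenius norm:

$$J(\Theta) = \frac{1}{2} \left\| X\Theta - Y \right\|_{F}^{2}.$$

Setting the gradient $\nabla_{\Theta} J = X^{T}(X\Theta - Y)$ to zero is one route to the normal equations; observing which columns of $Y$ each column of $\Theta$ actually depends on is a useful starting point for Task 3.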
Theoretical Question 3: Losses, Error Estimates, and k-NN Review
This exercise will help you review the notions of losses, prediction error, and estimates of error
(training error, error using an independent test set, cross-validation estimates of error), as well
as remind you of basic k-Nearest Neighbor (k-NN) ideas.
Setup. We consider a training set T = {(y_i, x_i)}_{i=1,...,n}, where for simplicity we assume the x_i are fixed and y_i ∈ {−1, 1} is a binary random variable with p_i = Pr(y_i = 1). Given a training set, we assume our estimate for each p_i will be

$$\hat{p}_i = \frac{1 + a\, y_i}{2},$$

where 0 ≤ a ≤ 1 is a parameter that controls the degree of fit to the training data. Larger values of a provide a closer fit.
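Hint (optional sketch): the two extremes of a can be checked directly. At a = 0 the estimate ignores the observed label entirely, while at a = 1 it fits it exactly.

```python
# Plug-in estimate p_hat = (1 + a*y) / 2 for a label y in {-1, +1}.
def p_hat(y, a):
    return (1 + a * y) / 2

# a = 0: no fit to the data, p_hat is always 1/2.
assert p_hat(+1, 0.0) == 0.5 and p_hat(-1, 0.0) == 0.5

# a = 1: exact fit, p_hat is 1 when y = +1 and 0 when y = -1.
assert p_hat(+1, 1.0) == 1.0 and p_hat(-1, 1.0) == 0.0

print("p_hat behaves as described")
```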
We want to compute training and test errors under different losses to parallel the zero-one
loss case discussed in class. The losses are:
• Exponential loss:
L(y, f ) = exp(−yf ),
• Squared error (L2) loss:
L(y, f ) = (y − f )2 ,
• Absolute error (L1) loss:
L(y, f ) = |y − f |,
• Logistic (likelihood) loss:
$$L(y, f) = \log\left(1 + e^{-2yf}\right).$$
Tasks (Choose 3 out of the 4 losses):
1. For each of your chosen losses, obtain f ∗ , the population minimizer of the corresponding
population risk:
$$f^{*} = \arg\min_{f} \; \mathbb{E}_{(X,Y)}\left[ L\left(Y, f(X)\right) \right],$$
where the expectation is with respect to the distribution of (X, Y ). In this simplified
scenario, X is fixed and the randomness is from Y only, so effectively pi is all we need.
2. Using p̂i in place of pi , obtain the corresponding estimate fˆ (i.e., show how f ∗ depends
on pi ; then plug in p̂i ).
3. Compute the training error R̂ (as a function of a) and the average test error R = Err of
the rule F̂ . Comment on how the errors vary with a. For R, find also (if it exists) an a∗
that minimizes R. To simplify your expression for R, recall the 0-1 loss error term:
$$e = \frac{2}{n} \sum_{i=1}^{n} p_i (1 - p_i),$$
as discussed in the class notes.
4. Compute the mean and variance of fˆ at a point X ∗ = x.
(Hint: Try to parallel the worked example in the lecture notes where the 0-1 loss was used.
Most of the reasoning remains similar, but the minimizers f ∗ differ by loss function.)
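Hint (optional numeric sketch for one loss only): for the squared-error loss with Y ∈ {−1, +1} and Pr(Y = 1) = p, the population minimizer is the conditional mean, f* = E[Y] = 2p − 1, and plugging in p̂ = (1 + ay)/2 gives the plug-in rule f̂ = ay. The check below verifies both facts numerically; it does not replace the derivations asked for above.

```python
# E[(Y - f)^2] for a binary Y in {-1, +1} with Pr(Y = 1) = p.
def expected_sq_loss(p, f):
    return p * (1 - f) ** 2 + (1 - p) * (-1 - f) ** 2

p = 0.7
f_star = 2 * p - 1  # candidate population minimizer E[Y]

# f* should beat nearby candidates under the population risk.
for f in (f_star - 0.1, f_star + 0.1):
    assert expected_sq_loss(p, f_star) < expected_sq_loss(p, f)

# With p_hat = (1 + a*y)/2, the plug-in rule 2*p_hat - 1 equals a*y.
a, y = 0.8, 1
p_hat = (1 + a * y) / 2
assert abs((2 * p_hat - 1) - a * y) < 1e-12

print("squared-loss sanity checks passed")
```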
Theoretical Question 4: A Simple Non-Convex Optimization
In this question, you will analyze a small non-convex function and explore its critical points.
Consider the function:
$$J(w_1, w_2) = w_1^2 + w_2^2 - \alpha\, w_1 w_2 - \log\left(1 + e^{w_1 + w_2}\right),$$
where α is a real constant (e.g., α > 0).
Tasks:
1. Take partial derivatives with respect to w1 and w2 ; set them equal to zero to find the
critical points.
2. Discuss the nature of the critical points (local minima, maxima, or saddle points).
You can use the Hessian or any other preferred method to analyze the curvature around
each critical point.
3. Implementation (Optional): Provide a small code snippet (e.g., in Python) using
symbolic differentiation to verify your analytical derivatives.
Example Python Code (Optional): Symbolic Derivatives
Listing 1: Symbolic Differentiation of a Non-Convex Function

import sympy

# Define the symbols
w1, w2, alpha = sympy.symbols('w1 w2 alpha', real=True)

# Define the function J(w1, w2)
J = w1**2 + w2**2 - alpha*w1*w2 - sympy.log(1 + sympy.exp(w1 + w2))

# Partial derivatives
dJ_w1 = sympy.diff(J, w1)
dJ_w2 = sympy.diff(J, w2)

# Display the derivatives
print("dJ/dw1 =", dJ_w1)
print("dJ/dw2 =", dJ_w2)
You can then set dJ_w1 = 0 and dJ_w2 = 0 numerically (e.g., with [Link]) or analyze
them by hand.
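As a hedged sketch of the numeric route, note that the gradient components are 2w_1 − αw_2 − σ(w_1 + w_2) and 2w_2 − αw_1 − σ(w_1 + w_2), where σ is the logistic sigmoid. For the illustrative choice α = 1, symmetry suggests looking for a critical point with w_1 = w_2 = w, which reduces the system to the single equation (2 − α)w = σ(2w); the snippet below solves it by fixed-point iteration using only the standard library.

```python
import math

def sigmoid(s):
    # Logistic sigmoid, the derivative of log(1 + e^s).
    return 1.0 / (1.0 + math.exp(-s))

alpha = 1.0  # illustrative choice; the question allows any real alpha

# Fixed-point iteration for (2 - alpha) * w = sigmoid(2 * w).
w = 0.5
for _ in range(200):
    w = sigmoid(2 * w) / (2 - alpha)

# Verify both partial derivatives vanish at the symmetric point (w, w).
g1 = 2 * w - alpha * w - sigmoid(w + w)
g2 = 2 * w - alpha * w - sigmoid(w + w)
assert abs(g1) < 1e-8 and abs(g2) < 1e-8

print(f"critical point near w1 = w2 = {w:.4f}")
```

Other values of α, or asymmetric critical points, would need a genuine 2-D solver or the Hessian analysis asked for in Task 2.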