Course Information Sheet
Course Title: CT-583 Tools and Techniques for Data Science
Course Description:
CT-583 Tools and Techniques for Data Science is designed to provide students with a
comprehensive understanding of the fundamental tools and techniques used in the field of data
science. The course will cover a wide range of topics, including statistical inference, exploratory
data analysis, machine learning algorithms, feature generation and selection, recommendation
systems, social-network graph mining, data visualization, and ethical considerations. Students
will learn how to collect, clean, preprocess, analyze, and interpret data using various statistical
and machine learning techniques. Practical applications and case studies will be used to
demonstrate how data science techniques can be applied to solve real-world problems in
different domains such as business, healthcare, finance, and social media. Throughout the course,
students will also develop practical skills in programming languages commonly used in data
science, such as Python and R, as well as in data visualization tools and libraries. Emphasis will
be placed on hands-on learning through lab exercises, projects, and assignments to reinforce
theoretical concepts and enhance practical proficiency. By the end of the course, students will be
equipped with the knowledge, skills, and ethical understanding necessary to pursue careers in
data science or related fields, and to critically evaluate and contribute to advancements in
the field.
Course Objectives:
The objectives of this course are:
To introduce students to the fundamental concepts and principles of data science,
including statistical inference, exploratory data analysis, and machine learning
algorithms.
To provide students with hands-on experience in collecting, cleaning, preprocessing,
analyzing, and interpreting data using various tools and techniques.
To familiarize students with programming languages commonly used in data science,
such as Python and R, and data visualization tools and libraries.
To enable students to apply data science techniques to solve real-world problems in
different domains, including business, healthcare, finance, and social media.
To develop students' critical thinking and problem-solving skills in the context of data
science, including the ability to evaluate and interpret data, develop predictive models,
and make data-driven decisions.
To cultivate students' awareness of the ethical considerations and implications of data
science, including privacy concerns, bias in algorithms, and responsible data handling
practices.
To foster collaboration and teamwork skills through group projects and collaborative
learning activities, allowing students to work effectively in interdisciplinary teams to
solve complex data science problems.
To prepare students for careers in data science or related fields by providing them with a
solid foundation in data science concepts, skills, and ethical principles, as well as
opportunities for practical experience and professional development.
Course Outline:
1. Introduction to Data Science:
Definition and Scope of Data Science
Role of Data Scientists
Importance of Data Science in Industry and Research
2. Statistical Inference:
Basics of Probability and Statistics
Hypothesis Testing
Confidence Intervals
Regression Analysis
3. Exploratory Data Analysis and the Data Science Process:
Data Cleaning and Preprocessing
Data Exploration Techniques
Data Transformation and Normalization
Steps in the Data Science Process: From Data Collection to Model Deployment
4. Basic Machine Learning Algorithms:
Overview of Supervised and Unsupervised Learning
Classification Algorithms (e.g., Decision Trees, Naive Bayes)
Regression Algorithms (e.g., Linear Regression, Polynomial Regression)
5. Advanced Machine Learning Algorithms:
Introduction to Additional Machine Learning Algorithms (e.g., Support Vector Machines,
Neural Networks)
Applications of Advanced Algorithms in Real-world Scenarios
6. Feature Generation and Feature Selection:
Importance of Features in Machine Learning
Techniques for Generating Meaningful Features
Methods for Selecting Relevant Features
7. Recommendation Systems:
Understanding Recommendation Systems
Collaborative Filtering and Content-Based Filtering
Building User-Facing Data Products
8. Mining Social-Network Graphs:
Introduction to Social Network Analysis
Graph Representation of Social Networks
Analysis Techniques for Extracting Insights from Social Networks
9. Data Visualization:
Importance of Data Visualization in Data Science
Types of Visualization Techniques (e.g., Bar Charts, Scatter Plots, Heatmaps)
Tools for Creating Effective Data Visualizations
10. Data Science and Ethical Issues:
Ethical Considerations in Data Collection, Storage, and Usage
Privacy Concerns and Data Security
Responsible Data Science Practices and Guidelines
Prerequisites:
While there are no formal prerequisites for this course, a basic understanding of data analysis and
statistics is beneficial.
Assessment Methods:
Quiz 10%
Mid-term 20%
Presentation with Report 10%
Final term 60%
Instructor:
Dr. Khalid Mahboob
LinkedIn Profile: [Link]/in/dr-khalid-mahboob-ph-d-bb833b1b
ResearchGate Profile: [Link]
Google Scholar Profile: [Link]
Course Duration:
Sunday, 12:00 PM to 3:00 PM (3 hours)