Bachelor of Computer Applications
Course Code: BCA 312                                  L  T  C
Course Name: Data Visualization & Analytics           4  1  5
INSTRUCTIONS TO PAPER SETTERS:
1. Question No. 1 should be compulsory and cover the entire syllabus. There should be 10
questions of short answer type of 2.5 marks each, having at least 2 questions from each unit.
2. Apart from Question No. 1, the rest of the paper shall consist of four units as per the syllabus.
Every unit should have two questions to evaluate the analytical/technical skills of the candidate.
However, the student may be asked to attempt only 1 question from each unit. Each question
should be of 12.5 marks, including its subparts, if any.
3. Examiners are requested to go through the Course Outcomes (CO) of this course and prepare
the question paper accordingly, using Bloom's Taxonomy (BT), in such a way that every
question is mapped to at least one CO and all the questions, taken together, achieve the
mapping to all the CO(s) in a balanced way.
LEARNING OBJECTIVES:
In this course, the learners will be able to develop expertise related to the following:
1. Develop relevant programming abilities.
2. Demonstrate proficiency with statistical analysis of data.
3. Conduct exploratory data analysis using visualization.
4. Craft visual presentations of data for effective communication.
PRE-REQUISITES:
1. Basics of Python Programming (BCA-206)
COURSE OUTCOMES (COs):
After completion of this course, the learners will be able to:
CO#   Detailed Statement of the CO                                            *BT Level   Mapping to PO #
CO1   Illustrating the features of multithreading in Python.                  BTL2        PO1, PO2, PO3, PO5
CO2   Analyzing data using a suitable Python library.                         BTL2        PO1, PO2, PO3, PO7, PO10
CO3   Visualizing data using the Matplotlib and Seaborn libraries.            BTL3        PO1, PO2, PO3, PO4
CO4   Developing Python applications with database connectivity operations.  BTL3        PO1, PO2, PO3, PO4
UNIT–I
No. of Hours: 11 Chapter/Book Reference: TB1 [Chapters - 1, 2], TB2 [Chapters - 1, 2]
Analytics: Basic Nomenclature, Analytics Process Model, Analytics part in different profiles,
Analytical Model Requirements.
Data Sources for data collection, Sampling and Sampling distribution, Types of data elements,
Missing Values, Outlier Detection and Treatment, Standardization using Min/Max and Z-score,
Categorization, Segmentation.
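As an illustration of the standardization and outlier-detection topics in this unit, the following minimal sketch uses only the Python standard library. The sample values, the function names and the z-score threshold of 3 are assumptions chosen for illustration; they are not prescribed by the syllabus.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1] (min/max standardization)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def z_scores(values):
    """Subtract the mean and divide by the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def flag_outliers(values, threshold=3.0):
    """Return the values whose absolute z-score exceeds the threshold."""
    return [v for v, z in zip(values, z_scores(values)) if abs(z) > threshold]


ages = [20, 30, 40, 50, 60]          # hypothetical sample data
scaled = min_max_scale(ages)         # smallest value maps to 0, largest to 1
standardized = z_scores(ages)        # resulting values have mean 0
print(scaled)
print(standardized)
```

Min/max scaling preserves the shape of the distribution but is sensitive to extreme values, whereas z-scores express each value in units of standard deviation, which is why they are also commonly used to flag outliers.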
UNIT–II
No. of Hours: 11 Chapter/Book Reference: TB2 [Chapter - 3], TB3 [Chapter - 7]
Statistical Hypothesis Testing, p-Values, Confidence Intervals.
Correlation, Simpson's Paradox, Some Other Correlational Caveats, Correlation and Causation,
Correlation Statistics-ANOVA.
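The correlation and ANOVA topics of this unit can be sketched with plain Python as follows. This is a minimal illustration, assuming Pearson's correlation coefficient and a one-way ANOVA F statistic; the function names and sample data are hypothetical, and in practice a statistics library would normally be used to obtain the p-value as well.

```python
from math import sqrt
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def anova_f(*groups):
    """One-way ANOVA F statistic: between-group over within-group mean square."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    k = len(groups)          # number of groups
    n = len(all_values)      # total number of observations
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))


# Hypothetical data: study hours against marks, and marks of three sections.
hours = [1, 2, 3, 4, 5]
marks = [52, 55, 61, 70, 72]
print("r =", pearson_r(hours, marks))
print("F =", anova_f([1, 2, 3], [2, 3, 4], [3, 4, 5]))
```

A value of r close to +1 or -1 indicates a strong linear relationship, but it does not by itself establish causation. A large F value suggests that the group means differ by more than the within-group variation would explain.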
UNIT–III
No. of Hours: 11 Chapter/Book Reference: TB3[Chapter - 3], TB4 [Chapter - 8]
Data Visualization: Graphs in Python: Line Graph, Bar charts, Pie-charts, Scatter plots, multiple
plots, Subplots, Legends, Changing figure Size, Styling plots using Matplotlib Library.
Seaborn Library: Introduction, Line plot, Dist plot, Lmplot, Count plot, Color palettes. Functions like
relplot(), displot() and catplot().
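Several of the Matplotlib topics in this unit (line graph, bar chart, subplots, legends and figure size) can be combined in one short sketch. It assumes that Matplotlib is installed; the marks are hypothetical sample data, the medal counts are those given in the list of practicals, and the output file name is an arbitrary choice.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt

tests = list(range(1, 11))
marks = [12, 15, 14, 18, 17, 19, 16, 20, 18, 19]    # hypothetical unit-test marks
medals = {"Gold": 26, "Silver": 20, "Bronze": 20}   # CWG-2018 medal tally

# One figure with two subplots side by side; figsize is given in inches.
fig, (ax_line, ax_bar) = plt.subplots(1, 2, figsize=(10, 4))

# Line graph with markers and a legend.
ax_line.plot(tests, marks, marker="o", label="Marks")
ax_line.set_xlabel("Unit test")
ax_line.set_ylabel("Marks")
ax_line.set_title("Performance in unit tests")
ax_line.legend()

# Bar chart with one bar per medal type.
ax_bar.bar(list(medals.keys()), list(medals.values()),
           color=["gold", "silver", "peru"])
ax_bar.set_ylabel("Medals")
ax_bar.set_title("Medal tally")

fig.tight_layout()
fig.savefig("unit3_demo.png")
```

In an interactive session, the backend line can be dropped and `plt.show()` used instead of `savefig()`.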
UNIT–IV
No. of Hours: 11 Chapter/Book Reference: TB5 [Chapter - 4], TB3 [Chapter - 7]
GUI Programming: Creating User-interface, GUI Widgets with Tkinter, Creating Layouts, Check
Box, Radio Buttons, List Box, Menus, Menu Options, Dialog Boxes.
Database Access: Database Connectivity Operations: Create, Insert, Select, Delete, Drop, Update.
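The database connectivity operations listed above can be illustrated with the following minimal sketch. The syllabus does not name a particular database, so the use of the standard-library `sqlite3` module with an in-memory database is an assumption, as are the table name, column names and sample rows.

```python
import sqlite3

# An in-memory database keeps the example self-contained (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Create
cur.execute(
    "CREATE TABLE student (roll_no INTEGER PRIMARY KEY, name TEXT, marks REAL)"
)

# Insert (parameter placeholders avoid building SQL by string concatenation)
cur.executemany(
    "INSERT INTO student VALUES (?, ?, ?)",
    [(1, "Asha", 81.5), (2, "Ravi", 64.0)],
)

# Update
cur.execute("UPDATE student SET marks = ? WHERE roll_no = ?", (70.0, 2))

# Delete
cur.execute("DELETE FROM student WHERE roll_no = ?", (1,))

# Select
cur.execute("SELECT roll_no, name, marks FROM student")
rows = cur.fetchall()
print(rows)

# Drop
cur.execute("DROP TABLE student")

conn.commit()
conn.close()
```

The same sequence of statements applies to other databases reached through a Python DB-API driver, although the connection call and the placeholder style may differ.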
TEXT BOOKS:
TB1. Analytics in a Big Data World: The Essential Guide to Data Science and its Applications, Bart
Baesens, Wiley Big Data Series.
TB2. Data Science & Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data,
John Wiley & Sons, Inc.
TB3. Data Science from Scratch, 2nd Edition, Joel Grus, O'Reilly Media, Inc.
TB4. Python for Data Analysis: Agile Tools for Real World Data, Wes McKinney, O'Reilly.
TB5. Python and Tkinter Programming, John E. Grayson.
REFERENCE BOOKS:
RB1. Allen Downey, "Think Python: How to Think Like a Computer Scientist", O'Reilly, 2nd
Edition, 2015.
RB2. Reema Thareja, "Python Programming using Problem Solving Approach", Oxford University
Press, 1st Edition, 2017.
RB3. Tony Gaddis, "Starting Out with Python", Pearson, 3rd Edition, 2014.
RB4. Y. Daniel Liang, "Introduction to Programming Using Python", Pearson, 1st Edition, 2013.
RB5. Mark Summerfield, "Programming in Python 3: A Complete Introduction to the Python
Language", 2nd Edition.
RB6. Joel Grus, "Data Science from Scratch", O'Reilly, 2nd Edition, 2019.
List of Practicals
S. No.   Detailed Statement   Mapping to CO#
Core Practicals
1. Write a program to create a DataFrame holding e-commerce data and
perform selection of rows/columns using loc[] and iloc[].
   Mapping to CO#: CO1, CO2, CO3
2. Create a Series object S5 containing numbers. Write a program to
store the squares of the series values in object S6. Display the values
of S6 that are greater than 15.
   Mapping to CO#: CO1, CO2, CO4
3. Write a program to fill all missing values in a DataFrame with zero.
   Mapping to CO#: CO1, CO2
4. Write a program to combine DataFrames using concat(), join() and merge().
   Mapping to CO#: CO1, CO2
5. Write a program to draw a bar graph for the following medal tally
of CWG-2018:
   Gold   Silver   Bronze   Total
   26     20       20       66
   Mapping to CO#: CO1, CO2, CO3
6. Implement Line plot, Dist plot, Lmplot and Count plot using the
Seaborn library.
   Mapping to CO#: CO1, CO2, CO3
Application Based Practicals (Implement minimum 5 out of 10 practicals)
7. Create a DataFrame named aid that stores the aid (toys, books,
uniforms, shoes) given by NGOs to different states. Write a
program to display the aid for:
   (a) Books and uniforms only
   (b) Shoes only
   Mapping to CO#: CO1, CO2, CO5
8. Create a DataFrame ndf having Name, Gender, Position, City, Age,
Projects. Write a program to summarize how many projects are being
handled by each position for each city. Use pivot().
   Mapping to CO#: CO1, CO2, CO5
9. Marks is a list that stores the marks of a student in 10 unit tests. Write a
program to plot a line chart of the student's performance in these
10 tests.
   Mapping to CO#: CO1, CO2
10. Write a program to plot a horizontal bar chart of the heights of
some students.
   Mapping to CO#: CO1, CO2, CO3
11. Write a program to implement ANOVA.
   Mapping to CO#: CO1, CO2, CO3, CO5
12. Write a program to show the correlation between two sets of randomly
generated numbers.
   Mapping to CO#: CO1, CO2, CO4, CO5
13. Write a program to implement Covariance.
   Mapping to CO#: CO1, CO2, CO5
14. Create a GUI-based admission form for your college.
   Mapping to CO#: CO1, CO2, CO3
15. Connect the GUI-based application form created above to a
database and use an insert query to enter data.
   Mapping to CO#: CO1, CO2, CO5
Note:
1. In total, 10 practicals are to be implemented. Two additional practicals may be given by the
course instructor.
2. This is a suggestive list of programs. However, the instructor may add programs as per
the requirement of the course.
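As a starting point for the pandas-based core practicals (squaring a Series and filtering it, filling missing values, and selecting with loc and iloc), a minimal sketch is given here. It assumes that pandas is installed; the variable names other than S5 and S6, and all sample values, are hypothetical.

```python
import pandas as pd

# Series S5 and its squares S6; display only the squares greater than 15.
s5 = pd.Series([2, 3, 4, 5, 6])
s6 = s5 ** 2
print(s6[s6 > 15])

# A small e-commerce style DataFrame with some missing values (None -> NaN).
df = pd.DataFrame(
    {
        "product": ["pen", "book", "bag"],
        "price": [10.0, None, 450.0],
        "qty": [5, 2, None],
    },
    index=["a", "b", "c"],
)

# Fill every missing value with zero; fillna returns a new DataFrame.
filled = df.fillna(0)
print(filled)

# loc selects by label, iloc selects by integer position.
row_b = filled.loc["b"]            # row with index label "b"
first_cell = filled.iloc[0, 0]     # first row, first column
price_and_qty = filled.loc[:, ["price", "qty"]]
print(row_b, first_cell, price_and_qty, sep="\n")
```

Note that loc and iloc are indexers used with square brackets rather than functions called with parentheses.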