0% found this document useful (0 votes)

72 views4 pages

Key Data Mining Concepts and Techniques

The document contains a list of important questions related to data mining, covering definitions, applications, analysis techniques, and statistical methods. Each question is assigned a specific mark value, indicating the weight of the question in an assessment context. Topics include data types, data preprocessing, similarity measures, and various data mining functions.

Uploaded by

rohithsd0222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views4 pages

Key Data Mining Concepts and Techniques

Uploaded by

rohithsd0222

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Important Questions

Sr. No Questions Marks

1. Define Data Mining. 2M

2. Why is data mining required? 2M

3. Enlist the applications of Data Mining. 2M

4. What is Cluster Analysis 2M

5. What is Outlier Analysis 2M

6. Define data, Information, Knowledge 2M

7. Define Correlation, Covariance 2M

8. Compute the similarity between Chicken and Bird using SMC coefficient for 2M
the given data.

Chicken={0,1,1,0,1,0,0,1,1,1}

Bird= {0,1,1,0,0,0,0,1,0,1}

9. Define Time Series Data 2M

10. Define the following (I)Object. (II) Attribute. 2M

11. List data reduction techniques in data mining. 2M

12. Define Ordinal data Attribute. 2M

13. Enlist types of Datasets 2M

14. Define Qualitative Data and Quantitative data 2M

15. Define Data Redundancy 2M

16. Define Data scrubbing, Data auditing 2M

17. Which tools are used for Data Mitigation 2M

18. Explain Ordered Data 2M

19. Explain about knowledge discovery in database process with a neat diagram. 5M

20. Discuss Different Data Mining Function in detail 5M

21. Explain the Multidimensional view of data mining. 5M

22. Explain how data mining works. 5M

23. Explain role of Data Mining in Business Intelligence 5M

24. Illustrate 5 applications of data mining that has been used to solve specific 5 M

Page 1 of 4
problems

25. List and explain the goals of data mining. 5M

26. Discuss about confluence of multiple disciplines in Data Mining. 5M

27. Illustrate the typical view in ML and statistics with a neat diagram 5M

28. Illustrate 5 applications of data mining that have been used to solve specific 5 M
problems

29. How to search for knowledge and interesting patterns in data? 5M

30. Discuss the major issues of Data mining 5M

31. Compare quantitative data and qualitative data. 5M

32. Explain Attribute subset selection methods with an example 5M

33. How to perform correlation analysis between categorical Variable using chi 5 M
square test.

34. A survey on car has had conducted in 2011 and determined that 60% of car 5 M
owners have only one car, 28% have two cars, and 12% have three or more.
Supposing that you have decided to conduct your own survey and have
collected the data below, determine whether your data supports the results of
the study. Use a significance level of 0.05. Also, given that, out of 129 car
owners, 73 had one car and 38 had two cars. df = 2 is 5.99. Apply the chi
square test to get nominal data.

35. Suppose two stocks A and B have the following values in one week: (2, 5), 5 M
(3, 8), (5, 10), (4, 11), (6, 14). If the stocks are affected by the same industry
trends, will their prices rise or fall together using covariance?

36. What is dimensionality Reduction. Explain methods used for reduction the 5 M
dimensionality

37. Illustrate why data preprocessing is a major step in data mining. 5M

38. Consider the following salaries: 5M

25, 30, 28, 55, 60, 42, 70, 75, 50, 48

Apply the binning technique to remove noisy data.

39. Explain about quality measures of data preprocessing. 5M

40. Illustrate similarity, dissimilarity and their properties 5M

41. Define noisy data. Explain how noisy data can be handled in data mining 5M

42. Calculate the cosine similarity distance between d1 and d2 vectors. 5M

d1 3 2 0 5 0 0 0 2 0 0

Page 2 of 4
d2 1 0 0 0 0 0 0 1 0 2

43. Illustrate why data preprocessing is a major step in data mining. 5M

44. Describe quality measures of data preprocessing. 5M

45. List and explain the major task in data preprocessing. 5M

46. Normalize the following group of data: 200 , 300 , 400 , 600, 1000 using 5M

i. Min-Max
ii. Z-Score
iii. Decimal Scaling

47. Explain Data Cube Aggregation 5M

48. Below dataset describes the rate of economic growth (ai) and the rate of return 5M
on the S&P 500(bi). Using the covariance formula, determine whether
economic growth and S&P 500 returns have positive or negative relationship?

Economic Growth % S&P 500 Returns %

(ai) (bi)
2.1 8
2.5 12
4.0 14
3.6 10
49. Explain Data Discretization in detail, Supervised and Unsupervised 5M
Discretization
50. Describe Binarization with example 5M

51. Explain Linear relationship between variables 5M

52. Describe Similarity And Dissimilarity in details 5M

53. Apply entropy-based discretization on the given set S= (16, n), (0, y), (4, y), 10M
(12, y), (16, n), (26, n), (18, y), (24, n), (28, n). If S has partitioned into 2
intervals S1 & S2 with 2 possible split points 14 & 21. Find the Best split
point.

54. Calculate the minkowski distance and Euclidean distance between the 10M
following pairs of points to determine their dissimilarity:

Point X Y
p1 0 2
p2 2 0
p3 3 1
p4 5 1
55. Explain data Reduction methods in Detail 10M

Calculate the entropy discretization for the following data set. If S has 10M
partitioned into 2 intervals S1 & S2 with 2 possible split points 14 & 17. Find

Page 3 of 4
the Best split point.

0 4 12 16 16 18 24 26 28

Y Y Y N N Y N N N

Page 4 of 4

Types of Data Mining Tasks Explained
No ratings yet
Types of Data Mining Tasks Explained
26 pages
Similarity and Dissimilarity Measures
No ratings yet
Similarity and Dissimilarity Measures
2 pages
Data Similarity and Dissimilarity Measures
No ratings yet
Data Similarity and Dissimilarity Measures
27 pages
Understanding Online Analytical Processing
No ratings yet
Understanding Online Analytical Processing
18 pages
Decision Tree Classification Overview
No ratings yet
Decision Tree Classification Overview
43 pages
Understanding Multidimensional Modeling
No ratings yet
Understanding Multidimensional Modeling
29 pages
Classification Techniques in Machine Learning
No ratings yet
Classification Techniques in Machine Learning
41 pages
Comparing MOLAP, ROLAP, and HOLAP
No ratings yet
Comparing MOLAP, ROLAP, and HOLAP
9 pages
Classifier Accuracy Metrics Overview
No ratings yet
Classifier Accuracy Metrics Overview
35 pages
9 Prime and Primality Testing
No ratings yet
9 Prime and Primality Testing
49 pages
Playfair Matrix for "Balloon" Encryption
No ratings yet
Playfair Matrix for "Balloon" Encryption
66 pages
Data Warehouse & OLAP Overview Guide
No ratings yet
Data Warehouse & OLAP Overview Guide
36 pages
Understanding Decision Trees: Gain Metrics
No ratings yet
Understanding Decision Trees: Gain Metrics
13 pages
ElGamal Cryptography Overview and Applications
No ratings yet
ElGamal Cryptography Overview and Applications
10 pages
Mining Equipment Performance Analysis
No ratings yet
Mining Equipment Performance Analysis
7 pages
Classification Techniques in Data Mining
No ratings yet
Classification Techniques in Data Mining
67 pages
Association Rule Mining in Data Mining
No ratings yet
Association Rule Mining in Data Mining
11 pages
Decision Tree Induction in Data Science
No ratings yet
Decision Tree Induction in Data Science
15 pages
Clustering Techniques in Data Mining
No ratings yet
Clustering Techniques in Data Mining
5 pages
Classical Encryption Techniques Overview
No ratings yet
Classical Encryption Techniques Overview
64 pages
Big Data and Analytics Course Overview
No ratings yet
Big Data and Analytics Course Overview
34 pages
Understanding Data Quality Issues
No ratings yet
Understanding Data Quality Issues
7 pages
Naïve Bayes Classifier Overview
No ratings yet
Naïve Bayes Classifier Overview
64 pages
Overview of Advanced Encryption Standard
No ratings yet
Overview of Advanced Encryption Standard
32 pages
Knowledge Representation in AI Systems
No ratings yet
Knowledge Representation in AI Systems
28 pages
Major Challenges in Data Mining
No ratings yet
Major Challenges in Data Mining
2 pages
Data Mining for Retail Decisions
No ratings yet
Data Mining for Retail Decisions
40 pages
Gini Index and Gain Calculations in Trees
No ratings yet
Gini Index and Gain Calculations in Trees
24 pages
Efficient Association Rule Mining Techniques
No ratings yet
Efficient Association Rule Mining Techniques
15 pages
Data Mining: Classification Techniques
No ratings yet
Data Mining: Classification Techniques
72 pages
Data Warehousing Lab Manual
No ratings yet
Data Warehousing Lab Manual
118 pages
Network Security and Cryptography Guide
No ratings yet
Network Security and Cryptography Guide
6 pages
Lecture Notes For Chapter 6: by Tan, Steinbach, Kumar
No ratings yet
Lecture Notes For Chapter 6: by Tan, Steinbach, Kumar
65 pages
Cryptography and Network Security Overview
No ratings yet
Cryptography and Network Security Overview
96 pages
Introduction to Data Mining Concepts
No ratings yet
Introduction to Data Mining Concepts
10 pages
Association Rule Mining Overview
No ratings yet
Association Rule Mining Overview
61 pages
Data Mining: Characterization & Discrimination
No ratings yet
Data Mining: Characterization & Discrimination
4 pages
DES Algorithm Overview and Process
No ratings yet
DES Algorithm Overview and Process
25 pages
Introduction to Data Mining Concepts
No ratings yet
Introduction to Data Mining Concepts
30 pages
Market Basket Analysis Overview
No ratings yet
Market Basket Analysis Overview
24 pages
Association Rule Mining Overview
No ratings yet
Association Rule Mining Overview
30 pages
Evolution of Database Technology and Data Mining
No ratings yet
Evolution of Database Technology and Data Mining
27 pages
Introduction to Decision Trees and CHAID
100% (1)
Introduction to Decision Trees and CHAID
50 pages
Data Mining Tools Lab Manual
No ratings yet
Data Mining Tools Lab Manual
100 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
26 pages
Mining Frequent Patterns and Associations
100% (1)
Mining Frequent Patterns and Associations
60 pages
Data Preprocessing Techniques in Mining
No ratings yet
Data Preprocessing Techniques in Mining
11 pages
Understanding Elliptic Curve Cryptography
No ratings yet
Understanding Elliptic Curve Cryptography
4 pages
Frequent Itemset Mining Overview
No ratings yet
Frequent Itemset Mining Overview
15 pages
Data Quality: Noise, Outliers, and Issues
No ratings yet
Data Quality: Noise, Outliers, and Issues
4 pages
Data Analytics Question Bank for KDS-501
No ratings yet
Data Analytics Question Bank for KDS-501
5 pages
Data Mining Exam Review Guide
100% (1)
Data Mining Exam Review Guide
6 pages
Data Mining: Overview and Applications
No ratings yet
Data Mining: Overview and Applications
48 pages
Clustering Techniques in Data Mining
No ratings yet
Clustering Techniques in Data Mining
18 pages
Data Mining Exam Instructions 2007
No ratings yet
Data Mining Exam Instructions 2007
5 pages
Mining Frequent Patterns, Association and Correlations
No ratings yet
Mining Frequent Patterns, Association and Correlations
42 pages
Key Concepts in Data Mining Explained
No ratings yet
Key Concepts in Data Mining Explained
47 pages
Data Mining Midterm Exam 2021/2022
100% (2)
Data Mining Midterm Exam 2021/2022
4 pages
Data Mining Practice Questions Unit 1
No ratings yet
Data Mining Practice Questions Unit 1
3 pages
Association vs. Clustering Explained
No ratings yet
Association vs. Clustering Explained
28 pages
Blockchain Traceability for Ayurvedic Herbs
No ratings yet
Blockchain Traceability for Ayurvedic Herbs
6 pages
Search Algorithms: Key Concepts & Comparisons
No ratings yet
Search Algorithms: Key Concepts & Comparisons
3 pages
UiPath String Manipulation Guide
No ratings yet
UiPath String Manipulation Guide
1 page
Mobile App Development (MAD) Overview
No ratings yet
Mobile App Development (MAD) Overview
31 pages
Datagram Options in Network Routing
No ratings yet
Datagram Options in Network Routing
86 pages
Java Code for Tower of Hanoi
No ratings yet
Java Code for Tower of Hanoi
2 pages
12-Channel ESP-NOW Transmitter Project
No ratings yet
12-Channel ESP-NOW Transmitter Project
1 page
Postfix Expression Evaluation in Java
No ratings yet
Postfix Expression Evaluation in Java
2 pages
Unique LinkedIn Post Ideas for Engagement
No ratings yet
Unique LinkedIn Post Ideas for Engagement
24 pages
TR-H/TR-W Series Setup Guide
No ratings yet
TR-H/TR-W Series Setup Guide
356 pages
Full Stack Developer Profile: Kevin Francisco
No ratings yet
Full Stack Developer Profile: Kevin Francisco
3 pages
Ripple Effect in Class Stability Metrics
No ratings yet
Ripple Effect in Class Stability Metrics
12 pages
Number Theory and Cryptography Basics
No ratings yet
Number Theory and Cryptography Basics
35 pages
Android App Download Status Logs
No ratings yet
Android App Download Status Logs
56 pages
Family Information and Privacy Policy
No ratings yet
Family Information and Privacy Policy
2 pages
Karunya University Admission Guide 2020
No ratings yet
Karunya University Admission Guide 2020
18 pages
Enhance Your EEE Vocabulary Skills
No ratings yet
Enhance Your EEE Vocabulary Skills
11 pages
Software Engineering Process Models Guide
No ratings yet
Software Engineering Process Models Guide
8 pages
Extract IPs from Spam to ServerConfig
No ratings yet
Extract IPs from Spam to ServerConfig
20 pages
College Algebra Textbook Analysis
No ratings yet
College Algebra Textbook Analysis
42 pages
Filipino Entrepreneurs in ICT & OSH Standards
No ratings yet
Filipino Entrepreneurs in ICT & OSH Standards
28 pages
MWD Scribing Process Guidelines
No ratings yet
MWD Scribing Process Guidelines
4 pages
Protostar Arms & Equipment Guide
No ratings yet
Protostar Arms & Equipment Guide
26 pages
SEO Poisoning: An In-Depth Analysis
No ratings yet
SEO Poisoning: An In-Depth Analysis
8 pages
Change Point Detection Metrics Analysis
No ratings yet
Change Point Detection Metrics Analysis
65 pages
Experimental Cinema in The Digital Age
No ratings yet
Experimental Cinema in The Digital Age
356 pages
K-Means & Hierarchical Clustering in Python
No ratings yet
K-Means & Hierarchical Clustering in Python
4 pages
Manual PC Troubleshooting Steps
No ratings yet
Manual PC Troubleshooting Steps
22 pages
VCO Applications in Op-Amp Circuits
No ratings yet
VCO Applications in Op-Amp Circuits
11 pages
Riya Sari's Customer Service Resume
No ratings yet
Riya Sari's Customer Service Resume
2 pages
Air Cargo Handling Services Overview
No ratings yet
Air Cargo Handling Services Overview
14 pages
PT9 C-Proof Beacon Label Update
No ratings yet
PT9 C-Proof Beacon Label Update
1 page
Robotics Transforming Law Enforcement
No ratings yet
Robotics Transforming Law Enforcement
9 pages
How to Access Your Facebook Activity Log
No ratings yet
How to Access Your Facebook Activity Log
1 page
NRC 2025: Future Innovators Guide
No ratings yet
NRC 2025: Future Innovators Guide
16 pages
GSM Cell Index Configuration Guide
No ratings yet
GSM Cell Index Configuration Guide
67 pages
SQL Queries for Data Analysis
No ratings yet
SQL Queries for Data Analysis
9 pages
Project Approval Matrix and Investment Types
No ratings yet
Project Approval Matrix and Investment Types
9 pages
Exploring Publication Design PDF Guide
33% (3)
Exploring Publication Design PDF Guide
2 pages

Key Data Mining Concepts and Techniques

Uploaded by

Key Data Mining Concepts and Techniques

Uploaded by

Important Questions

Sr. No Questions Marks

1. Define Data Mining. 2M

2. Why is data mining required? 2M

3. Enlist the applications of Data Mining. 2M

4. What is Cluster Analysis 2M

5. What is Outlier Analysis 2M

6. Define data, Information, Knowledge 2M

7. Define Correlation, Covariance 2M

9. Define Time Series Data 2M

10. Define the following (I)Object. (II) Attribute. 2M

11. List data reduction techniques in data mining. 2M

12. Define Ordinal data Attribute. 2M

13. Enlist types of Datasets 2M

14. Define Qualitative Data and Quantitative data 2M

15. Define Data Redundancy 2M

16. Define Data scrubbing, Data auditing 2M

17. Which tools are used for Data Mitigation 2M

18. Explain Ordered Data 2M

20. Discuss Different Data Mining Function in detail 5M

21. Explain the Multidimensional view of data mining. 5M

22. Explain how data mining works. 5M

23. Explain role of Data Mining in Business Intelligence 5M

25. List and explain the goals of data mining. 5M

26. Discuss about confluence of multiple disciplines in Data Mining. 5M

29. How to search for knowledge and interesting patterns in data? 5M

30. Discuss the major issues of Data mining 5M

31. Compare quantitative data and qualitative data. 5M

32. Explain Attribute subset selection methods with an example 5M

37. Illustrate why data preprocessing is a major step in data mining. 5M

38. Consider the following salaries: 5M

Apply the binning technique to remove noisy data.

39. Explain about quality measures of data preprocessing. 5M

40. Illustrate similarity, dissimilarity and their properties 5M

42. Calculate the cosine similarity distance between d1 and d2 vectors. 5M

43. Illustrate why data preprocessing is a major step in data mining. 5M

44. Describe quality measures of data preprocessing. 5M

45. List and explain the major task in data preprocessing. 5M

47. Explain Data Cube Aggregation 5M

Economic Growth % S&P 500 Returns %

51. Explain Linear relationship between variables 5M

52. Describe Similarity And Dissimilarity in details 5M

You might also like