Subject Code: 2170715
GUJARAT TECHNOLOGICAL UNIVERSITY
--- Content provided by FirstRanker.com ---
BE - SEMESTER-VII (NEW) EXAMINATION - WINTER 2018
Subject Name: Data Mining and Business Intelligence
Time: 10:30 AM TO 01:00 PM
Date: 03/12/2018
Total Marks: 70
--- Content provided by FirstRanker.com ---
Instructions:
- Attempt all questions.
- Make suitable assumptions wherever necessary.
- Figures to the right indicate full marks.
Q.1
--- Content provided by FirstRanker.com ---
- (a) What is Data Mining? Why is it called data mining rather than knowledge mining? (03)
- (b) Explain various features of Data Warehouse? (04)
- (c) Differentiate between Operational Database System and Data Warehouse (07)
Q.2
- (a) What is the difference between KDD and Data Mining? (03)
- (b) What is Concept Hierarchy? List and briefly explain types of Concept Hierarchy (04)
- (c) Explain Mean, Median, Mode, Variance, Standard Deviation & five number summary with suitable database example. (07)
--- Content provided by FirstRanker.com ---
OR
- (a) What is noise? Explain data smoothing methods as noise removal technique (03)
- (b) to divide given data into bins of size 3 by bin partition (equal frequency), by bin means, by bin medians and by bin boundaries. (04) Consider the data: 10, 2, 19, 18, 20,18, 25, 28, 22
- (c) Differentiate Fact table vs. Dimension table (07)
--- Content provided by FirstRanker.com ---
Q.3
- (a) Suppose that the data for analysis includes the attribute age. The age values for the data tuples are (in increasing order): 13, 15, 16, 16,19, 20, 23, 29, 35, 41, 44, 53, 62, 69, 72 Use min-max normalization to transform the value 45 for age onto the range [0:0, 1:0] (03)
- (b) Explain mining in following Databases with example. (04)
- Temporal Databases
- Sequence Databases
- Spatial Databases
- Spatiotemporal Databases.
--- Content provided by FirstRanker.com ---
- (c) List and describe methods for handling missing values in data cleaning. (07)
OR
--- Content provided by FirstRanker.com ---
- (a) Explain the following as attribute selection measure: (03)
- Information Gain
- Gain Ratio
Q.4
- (a) How K-Mean clustering method differs from K-Medoid clustering method? (03)
- (b) Define data cube and explain 3 operations on it. (04)
- (c) State the Apriori Property. Generate large itemsets and association rules using Apriori algorithm on the following data set with minimum support value and minimum confidence value set as 50% and 75% respectively (07)
TID Items Purchased T101 Cheese, Milk, Cookies T102 Butter, Milk, Bread T103 Cheese, Butter, Milk, Bread T104 Butter, Bread
--- Content provided by FirstRanker.com ---
OR
- (a) Define following terms : (03) Data Mart , Enterprise Warehouse & Virtual Warehouse
- (b) Discuss the application of data warehousing and data mining (04)
- (c) What is web log? Explain web structure mining and web usage mining in detail (07)
--- Content provided by FirstRanker.com ---
Q.5
- (a) Draw the topology of a multilayer, feed-forward Neural Network. (03)
- (b) Explain Linear regression with example. (04)
- (c) Explain the major issues in data mining (07)
--- Content provided by FirstRanker.com ---
OR
- (a) Briefly explain text mining (03)
- (b) What is market basket analysis? Explain the two measures of rule interestingness: support and confidence (04)
- (c) What is Big Data? What is big data analytic? Explain the big data-distributed file system. (07)
--- Content provided by FirstRanker.com ---
This download link is referred from the post: GTU BE/B.Tech 2018 Winter Question Papers || Gujarat Technological University
--- Content provided by FirstRanker.com ---