Subject Code: 2170715
GUJARAT TECHNOLOGICAL UNIVERSITY
--- Content provided by FirstRanker.com ---
BE - SEMESTER- VII (New) EXAMINATION — WINTER 2019Subject Name: Data Mining and Business Intelligence
Time: 10:30 AM TO 01:00 PM
Instructions:
- Attempt all questions.
- Make suitable assumptions wherever necessary.
- Figures to the right indicate full marks.
--- Content provided by FirstRanker.com ---
Q.1
- Define data mining and list its features. [03]
- Differentiate between OLTP and OLAP. [04]
- Describe the steps involved in data mining when viewed as a process of knowledge discovery. [07]
--- Content provided by FirstRanker.com ---
Q.2
- Can BI be used for DM? Or vice versa? Justify. [03]
- Explain in detail the extract/transform/load (ETL) design of an automated warehouse. [04]
- Explain Mean, Median, Mode, Variance, Standard Deviation & five number summary with suitable database example. [07]
--- Content provided by FirstRanker.com ---
OR
- Is Graphical visualization better than text data? Justify your answer and explain different data visualization techniques. [07]
Q.3
- Explain why data warehouses are needed for developing business solutions from today’s perspective. Discuss the role of data marts. [03]
- Draw and Explain Snowflakes and Fact constellations Schema. [04]
- Define outlier analysis? Why outlier mining is important? Briefly describe the different approaches: statistical-based outlier detection, distance-based outlier detection and deviation-based outlier detection. [07]
--- Content provided by FirstRanker.com ---
OR
- Discuss Following: (i) Meta Data/(ii) Virtual Warehouse [03]
- Briefly outline the major steps of decision tree classification. Why is tree pruning useful in decision tree induction? [04]
- In data pre-processing why do we need data smoothing? Discuss data smoothing by Binning. [07]
--- Content provided by FirstRanker.com ---
Q.4
- Draw the topology of a multilayer, feed-forward Neural Network. [03]
- Describe Concept Hierarchy? List and briefly explain types of Concept Hierarchy. [04]
- In real-world data, tuples with missing values for some attributes are a common occurrence. Describe various methods for handling this problem. [07]
OR
--- Content provided by FirstRanker.com ---
- Define “clustering”? Mention any two applications of clustering. [03]
- Briefly explain Linear and Non-linear regression. [04]
- Consider the following dataset and find frequent item sets and generate association rules for them using Apriori Algorithm. [07]
Date: 30/11/2019
Total Marks: 70
--- Content provided by FirstRanker.com ---
T1 {I1,I2, I5}
T2 {I2,I4}
T3 {I2,I3}
T4 {I1,I2,I4}
--- Content provided by FirstRanker.com ---
T5 {I1,I3}T6 {I2,I3}
T7 {I1,I3}
T8 {I1,I2,I3,I5}
T9 {I1,I2,I3}
--- Content provided by FirstRanker.com ---
minimum support count is 2 minimum confidence is 60% .
Q.5
- Differentiate Fact table vs. Dimension table [03]
- What is market basket analysis? Explain the two measures of rule interestingness: support and confidence [04]
- Briefly explain the life-cycle of Data Analytics and discuss the role of data scientists. [07]
--- Content provided by FirstRanker.com ---
OR
- Explain text mining using example. [03]
- Explain data mining application for fraud detection. [04]
- Discuss the main features of Hadoop Distributed File System. [07]
--- Content provided by FirstRanker.com ---
This download link is referred from the post: GTU BE/B.Tech 2019 Winter Question Papers || Gujarat Technological University
--- Content provided by FirstRanker.com ---