Download AKTU (Dr. A.P.J. Abdul Kalam Technical University (AKTU), formerly Uttar Pradesh Technical University (UPTU) B-Tech 6th Semester (Sixth Semester) 2015-2016 NCS 066 Data Warehousing Data Mining Question Paper
(Following Paper 1]) and Roll No. to be ?lled in your
Answer Books)
RollNoLlJ I I II I I I
B. TECH.
Theory Examination (Semester?VI) 2015-16
DATA WAREHOUSING & DATA MINING
Time : 3 Hours Max. Marks : 100
Note: Attempt questions ?om all Sections as per directions.
Section-A
1. Attempt all parts of this section.Answerin brief. [2X10=20]
(a) Write some of the facts of the association rule mining.
(b) Brie?y explain the concept of Frequent Item sets and
Closed Item sets.
(c) Brie?y explain important approaches to build the data
warehouse.
(d) De?ne KDD. Identify the phases in KDD process.
(6) Why data warehouse is maintained separately ?om
database? '
(1) P.T.O.
3005/249/258/6450
(t)
(g)
(h)
(i)
(1?)
Describe the market basket analysis.
Write short notes on Linear Regression.
Sketch a neat diagram of architecture of a typical Data
Mining System.
Compare Roll up, Dn'll down, Slice and Dice operations.
Name and describe the main features of Genetic
Algon'thm
Section-B
Attempt any ?ve questions from this section. (110X5=50)
(a)
(b)
(C)
(d)
Draw the 3-tier data warehouse architecture. Explain
ETL process.
Explain the various types of OLAP servers. What are
the steps for ef?cient processing of OLAP queries?
Write the algorithm of decision tree induction. What are
the methods that can be used for selecting the splitting
cn'teria?
Draw a box-and-whisker plot for the following data
set :
126, 132, 138, 140, 141, 141, 142, 143, 144, 144,
144, 145, 146, 147, 148, 148, 149, 149, 150, 150,
150, 154, 155, 158, 158.
Also ?nd the outliers.
(2)
3005/249/258/6450
(e) What is Hierarchical method for clustering? Explain
BIRCH method.
(1) Give the di??erence between the star and fact constellation
multidimensional data model.
(g) Descn'be the types of data that often occur in cluster
analysis and brie?y explain ' how to pre process that
data for clustering:
(h) Explain how query performance can be improved by
cascading the operations.
Section-C
Attempt any two questions from this section. (15X2=301)
3. Classify the tuple X= {Color = 'RED?, Type = 'SUV?
Origin = ?DOMESTIC?} using Naive Bayesian classi?cation.
Training data is given in the following table where class label
is {STOLEN}.
Color Type Origin Stolen?
Red Sports Domestic Yes
Red Sports Domestic No
Red Sports Domestic Yes
Yellow Sports Domestic No
Yellow Sports Imported Yes
(3) P.T.O.
30m 58/6450
Yellow SUV Imported No
Yellow SUV Imported Yes
Yellow SUV ' Domestic No
Red SUV Imported No
Red Sports Imported Yes
4. (i) Describe the difference between the following
approaches for the integration of data mining system
with database or data warehouse systems: no coupling,
loose coupling and semi tight coupling.
(ii) ' De?ne and describe the basic similarities and
differences among ROLAP, MOEAP and HOLAP.
5. Explain Chi-square test method. Show using chi-square test
that gender and prefelred reading are independent or not ?'om
given table. (Given are the observed counts).
Male Female Total
Fiction 250 200 450
Non-Fiction 50 1000 1050
Total 300 1200 1500
(4)
3005/279/258/6450
This post was last modified on 29 January 2020