Download JNTUH MCA 4th Sem R17 2019 April-May 844AD Aprilmay Data Warehousing And Datamining Question Paper

Download JNTUH (Jawaharlal nehru technological university) MCA (Master of Computer Applications) 4th Sem (Fourth Semester) Regulation-R17 2019 April-May 844AD Aprilmay Data Warehousing And Datamining Previous Question Paper


R17

Code No: 844AD















JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY HYDERABAD

MCA IV Semester Examinations, April/May - 2019

DATA WAREHOUSING AND DATAMINING

Time: 3hrs













Max.Marks:75

Note: This question paper contains two parts A and B.

Part A is compulsory which carries 25 marks. Answer all questions in Part A. Part B

consists of 5 Units. Answer any one full question from each unit. Each question carries

10 marks and may have a, b, c as sub questions.



PART - A



















5 ? 5 Marks = 25



1.a) What is a pattern? What are the characteristics of an interesting pattern?

[5]

b) Compare and contrast database management system with data warehouse.

[5]

c) What characteristics of neural networks make them good classifiers?



[5]

d) Give the advantages and disadvantages of partition based methods for clustering. [5]

e) Provide examples for spatial and non-spatial data in database.





[5]



PART - B

















5 ? 10 Marks = 50

2.

What is the need of preprocessing of data for mining? Briefly explain various forms of

preprocessing.

















[10]

OR

3.

Discuss the major issues pertaining to mining methodology and user interaction in data

mining.



















[10]



4.

Illustrate online analytical processing operations.







[10]

OR

5.

Demonstrate the working of BUC algorithm for data cube computation.

[10]


6.

What are the limitations of Apriori algorithm? Suggest mechanisms to improve the

accuracy of Apriori algorithm.













[10]

OR

7.

Consider the following data and classify the new sample X= < youth, medium, yes,

fair> using Na?ve Bayesian classification.









[10]

RID Age

Income

Student

Credit_rating Class: buys_computer

1

Youth

High

No

Fair

No

2

Youth

High

No

excellent

No

3

Middle_aged High

No

Fair

Yes

4

Senior

Medium

No

Fair

Yes

5

Senior

Low

Yes

Fair

Yes

6

Senior

Low

Yes

excellent

No

7

Middle_aged Low

Yes

excellent

Yes

8

Youth

Medium

No

Fair

No

9

Youth

Low

Yes

Fair

Yes

10

Senior

Medium

Yes

Fair

Yes

11

Youth

Medium

Yes

excellent

Yes

12

Middle_aged Medium

No

excellent

Yes

13

Middle_aged High

Yes

Fair

Yes

14

Senior

Medium

No

excellent

No







8.a) What is - neighbourhood in density based methods?

b) List the challenges raised by high dimensional data for clustering.



[5+5]

OR

9.

Write BIRCH algorithm. Does BIRCH follow agglomerative nesting? Justify your

answer.



















[10]


10.

What is the importance of sequence mining? Explain Prefix Span algorithm for

sequence mining.

















[10]

OR

11. Demonstrate Latent semantic indexing for text mining with simple example text corpus.

















[10]

---ooOoo---


This post was last modified on 17 March 2023