Learning Objectives:
--- Content provided by FirstRanker.com ---
After reading this lesson, you should be able to understand:
--- Content provided by FirstRanker.com ---
?
Meaning, objectives and types of research
--- Content provided by FirstRanker.com ---
?
Qualities of researcher
--- Content provided by FirstRanker.com ---
?Significance of research
?
--- Content provided by FirstRanker.com ---
Research process
?
--- Content provided by FirstRanker.com ---
Research problem?
Features, importance, characteristics, concepts and types of Research
--- Content provided by FirstRanker.com ---
design
?
--- Content provided by FirstRanker.com ---
Case study research?
Hypothesis and its testing
--- Content provided by FirstRanker.com ---
?
Sample survey and sampling methods
--- Content provided by FirstRanker.com ---
1.1 Meaning of Research:
Research in simple terms refers to search for knowledge. It is a scientific
--- Content provided by FirstRanker.com ---
and systematic search for information on a particular topic or issue. It is also
known as the art of scientific investigation. Several social scientists have
--- Content provided by FirstRanker.com ---
defined research in different ways.In
the
--- Content provided by FirstRanker.com ---
Encyclopedia of Social Sciences, D. Slesinger and M. Stephension
(1930) defined research as "the manipulation of things, concepts or symbols for
--- Content provided by FirstRanker.com ---
the purpose of generalizing to extend, correct or verify knowledge, whether thatknowledge aids in the construction of theory or in the practice of an art".
--- Content provided by FirstRanker.com ---
According to Redman and Mory (1923), research is a "systematized
--- Content provided by FirstRanker.com ---
effort to gain new knowledge". It is an academic activity and therefore the term
should be used in a technical sense. According to Clifford Woody (Kothari,
--- Content provided by FirstRanker.com ---
1988), research comprises "defining and redefining problems, formulatinghypotheses or suggested solutions; collecting, organizing and evaluating data;
making deductions and reaching conclusions; and finally, carefully testing the
--- Content provided by FirstRanker.com ---
conclusions to determine whether they fit the formulated hypotheses".
--- Content provided by FirstRanker.com ---
Thus, research is an original addition to the available knowledge, whichcontributes to its further advancement. It is an attempt to pursue truth through
the methods of study, observation, comparison and experiment. In sum,
--- Content provided by FirstRanker.com ---
research is the search for knowledge, using objective and systematic methods to
find solution to a problem.
--- Content provided by FirstRanker.com ---
1.1.1 Objectives of Research:
The objective of research is to find answers to the questions by applying
--- Content provided by FirstRanker.com ---
scientific procedures. In other words, the main aim of research is to find out the
truth which is hidden and has not yet been discovered. Although every research
--- Content provided by FirstRanker.com ---
study has its own specific objectives, the research objectives may be broadlygrouped as follows:
1. to gain familiarity with new insights into a phenomenon (i.e., formulative
--- Content provided by FirstRanker.com ---
research studies);
2. to accurately portray the characteristics of a particular individual, group, or a
--- Content provided by FirstRanker.com ---
situation (i.e., descriptive research studies);3. to analyse the frequency with which something occurs (i.e., diagnostic research
studies); and
--- Content provided by FirstRanker.com ---
4. to examine the hypothesis of a causal relationship between two variables (i.e.,
hypothesis-testing research studies).
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.1.2 Research Methods versus Methodology:Research methods include all those techniques/methods that are adopted
--- Content provided by FirstRanker.com ---
for conducting research. Thus, research techniques or methods are the methods
that the researchers adopt for conducting the research studies.
--- Content provided by FirstRanker.com ---
On the other hand, research methodology is the way in which research
problems are solved systematically. It is a science of studying how research is
--- Content provided by FirstRanker.com ---
conducted scientifically. Under it, the researcher acquaints himself/herself with
the various steps generally adopted to study a research problem, along with the
--- Content provided by FirstRanker.com ---
underlying logic behind them. Hence, it is not only important for the researcherto know the research techniques/methods, but also the scientific approach called
methodology.
--- Content provided by FirstRanker.com ---
1.1.3 Research Approaches:
--- Content provided by FirstRanker.com ---
There are two main approaches to research, namely quantitative
approach and qualitative approach. The quantitative approach involves the
--- Content provided by FirstRanker.com ---
collection of quantitative data, which are put to rigorous quantitative analysis in
a formal and rigid manner. This approach further includes experimental,
--- Content provided by FirstRanker.com ---
inferential, and simulation approaches to research. Meanwhile, the qualitativeapproach uses the method of subjective assessment of opinions, behaviour and
attitudes. Research in such a situation is a function of the researcher's
--- Content provided by FirstRanker.com ---
impressions and insights. The results generated by this type of research are
either in non-quantitative form or in the form which cannot be put to rigorous
--- Content provided by FirstRanker.com ---
quantitative analysis. Usually, this approach uses techniques like indepthinterviews, focus group interviews, and projective techniques.
--- Content provided by FirstRanker.com ---
1.1.4 Types of Research:
There are different types of research. The basic ones are as follows:
--- Content provided by FirstRanker.com ---
1) Descriptive versus Analytical:--- Content provided by FirstRanker.com ---
Descriptive research consists of surveys and fact-finding enquiries ofdifferent types. The main objective of descriptive research is describing the
state of affairs as it prevails at the time of study. The term `ex post facto
--- Content provided by FirstRanker.com ---
research' is quite often used for descriptive research studies in social sciences
and business research. The most distinguishing feature of this method is that the
--- Content provided by FirstRanker.com ---
researcher has no control over the variables here. He/she has to only report whatis happening or what has happened. Majority of the ex post facto research
projects are used for descriptive studies in which the researcher attempts to
--- Content provided by FirstRanker.com ---
examine phenomena, such as the consumers' preferences, frequency of
purchases, shopping, etc. Despite the inability of the researchers to control the
--- Content provided by FirstRanker.com ---
variables, ex post facto studies may also comprise attempts by them to discoverthe causes of the selected problem. The methods of research adopted in
conducting descriptive research are survey methods of all kinds, including
--- Content provided by FirstRanker.com ---
correlational and comparative methods.
Meanwhile in the Analytical research, the researcher has to use the
--- Content provided by FirstRanker.com ---
already available facts or information, and analyse them to make a criticalevaluation of the subject.
--- Content provided by FirstRanker.com ---
2)
Applied versus Fundamental:
--- Content provided by FirstRanker.com ---
Research can also be applied or fundamental in nature. An attempt tofind a solution to an immediate problem encountered by a firm, an industry, a
business organisation, or the society is known as Applied Research. Researchers
--- Content provided by FirstRanker.com ---
engaged in such researches aim at drawing certain conclusions confronting a
concrete social or business problem.
--- Content provided by FirstRanker.com ---
On the other hand, Fundamental Research mainly concernsgeneralizations and formulation of a theory. In other words, "Gathering
knowledge for knowledge's sake is termed `pure' or `basic' research" (Young in
--- Content provided by FirstRanker.com ---
Kothari, 1988). Researches relating to pure mathematics or concerning some
--- Content provided by FirstRanker.com ---
natural phenomenon are instances of Fundamental Research. Likewise, studies
focusing on human behaviour also fall under the category of fundamental
--- Content provided by FirstRanker.com ---
research.Thus, while the principal objective of applied research is to find a
solution to some pressing practical problem, the objective of basic research is to
--- Content provided by FirstRanker.com ---
find information with a broad base of application and add to the already existing
organized body of scientific knowledge.
--- Content provided by FirstRanker.com ---
3)
Quantitative versus Qualitative:
--- Content provided by FirstRanker.com ---
Quantitative research relates to aspects that can be quantified or can be
expressed in terms of quantity. It involves the measurement of quantity or
--- Content provided by FirstRanker.com ---
amount. The various available statistical and econometric methods are adoptedfor analysis in such research. Some such includes correlation, regressions and
time series analysis.
--- Content provided by FirstRanker.com ---
On the other hand, Qualitative research is concerned with qualitative
phenomena, or more specifically, the aspects related to or involving quality or
--- Content provided by FirstRanker.com ---
kind. For example, an important type of qualitative research is `MotivationResearch', which investigates into the reasons for human behaviour. The main
aim of this type of research is discovering the underlying motives and desires of
--- Content provided by FirstRanker.com ---
human beings by using in-depth interviews. The other techniques employed in
such research are story completion tests, sentence completion tests, word
--- Content provided by FirstRanker.com ---
association tests, and other similar projective methods. Qualitative research isparticularly significant in the context of behavioural sciences, which aim at
discovering the underlying motives of human behaviour. Such research helps to
--- Content provided by FirstRanker.com ---
analyse the various factors that motivate human beings to behave in a certain
manner, besides contributing to an understanding of what makes individuals like
--- Content provided by FirstRanker.com ---
or dislike a particular thing. However, it is worth noting that conductingqualitative research in practice is considerably a difficult task. Hence, while
--- Content provided by FirstRanker.com ---
undertaking such research, seeking guidance from experienced expert
researchers is important.
--- Content provided by FirstRanker.com ---
4)
--- Content provided by FirstRanker.com ---
Conceptual versus Empirical:The research related to some abstract idea or theory is known as
Conceptual Research. Generally, philosophers and thinkers use it for
--- Content provided by FirstRanker.com ---
developing new concepts or for reinterpreting the existing ones. Empirical
Research, on the other hand, exclusively relies on the observation or experience
--- Content provided by FirstRanker.com ---
with hardly any regard for theory and system. Such research is data based,which often comes up with conclusions that can be verified through experiments
or observation. Empirical research is also known as experimental type of
--- Content provided by FirstRanker.com ---
research, in which it is important to first collect the facts and their sources, and
actively take steps to stimulate the production of desired information. In this
--- Content provided by FirstRanker.com ---
type of research, the researcher first formulates a working hypothesis, and thengathers sufficient facts to prove or disprove the stated hypothesis. He/she
formulates the experimental design, which according to him/her would
--- Content provided by FirstRanker.com ---
manipulate the variables, so as to obtain the desired information. This type of
research is thus characterized by the researcher's control over the variables
--- Content provided by FirstRanker.com ---
under study. Empirical research is most appropriate when an attempt is made toprove that certain variables influence the other variables in some way.
Therefore, the results obtained by using the experimental or empirical studies
--- Content provided by FirstRanker.com ---
are considered to be the most powerful evidences for a given hypothesis.
--- Content provided by FirstRanker.com ---
5)Other Types of Research:
The remaining types of research are variations of one or more of the
--- Content provided by FirstRanker.com ---
afore-mentioned methods. They vary in terms of the purpose of research, or the
time required to complete it, or may be based on some other similar factor. On
--- Content provided by FirstRanker.com ---
the basis of time, research may either be in the nature of one-time or--- Content provided by FirstRanker.com ---
longitudinal research. While the research is restricted to a single time-period inthe former case, it is conducted over several time-periods in the latter case.
Depending upon the environment in which the research is to be conducted, it can
--- Content provided by FirstRanker.com ---
also be laboratory research or field-setting research, or simulation research,
besides being diagnostic or clinical in nature. Under such research, in-depth
--- Content provided by FirstRanker.com ---
approaches or case study method may be employed to analyse the basic causalrelations. These studies usually undertake a detailed in-depth analysis of the
causes of certain events of interest, and use very small samples and sharp data
--- Content provided by FirstRanker.com ---
collecting methods. The research may also be explanatory in nature.
--- Content provided by FirstRanker.com ---
Formalized research studies consist of substantial structure and specifichypotheses to be verified. As regards historical research, sources like historical
documents, remains, etc. are utilized to study past events or ideas. It also
--- Content provided by FirstRanker.com ---
includes philosophy of persons and groups of the past or any remote point of
time.
--- Content provided by FirstRanker.com ---
Research has also been classified into decision-oriented and conclusion-oriented categories. The Decision-oriented research is always carried out as per
the need of a decision maker and hence, the researcher has no freedom to
--- Content provided by FirstRanker.com ---
conduct the research according to his/her own desires. On the other hand, in the
case of Conclusion-oriented research, the researcher is free to choose the
--- Content provided by FirstRanker.com ---
problem, redesign the enquiry as it progresses and even changeconceptualization as he/she wishes to. Further, Operations research is a kind of
decision-oriented research, because it is a scientific method of providing the
--- Content provided by FirstRanker.com ---
departments, a quantitative basis for decision-making with respect to the
activities under their purview.
--- Content provided by FirstRanker.com ---
1.1.5 Importance of Knowing How to Conduct Research:
--- Content provided by FirstRanker.com ---
The importance of knowing how to conduct research is listed below:
(i) the knowledge of research methodology provides training to new
--- Content provided by FirstRanker.com ---
researchers and enables them to do research properly. It helps them to
--- Content provided by FirstRanker.com ---
develop disciplined thinking or a `bent of mind' to objectively observethe field;
(ii) the knowledge of doing research inculcates the ability to evaluate and
--- Content provided by FirstRanker.com ---
utilise the research findings with confidence;
(iii) the knowledge of research methodology equips the researcher with the
--- Content provided by FirstRanker.com ---
tools that help him/her to make the observations objectively; and(iv) the knowledge of methodology helps the research consumer to evaluate
research and make rational decisions.
--- Content provided by FirstRanker.com ---
1.1.6 Qualities of a Researcher:
--- Content provided by FirstRanker.com ---
It is important for a researcher to possess certain qualities to conductresearch. First and foremost, he being a scientist should be firmly committed to
the `articles of faith' of the scientific methods of research. This implies that a
--- Content provided by FirstRanker.com ---
researcher should be a social science person in the truest sense. Sir Michael
Foster (Wilkinson and Bhandarkar, 1979) identified a few distinct qualities of a
--- Content provided by FirstRanker.com ---
scientist. According to him, a true research scientist should possess thefollowing qualities:
(1) First of all, the nature of a researcher must be of the temperament that
--- Content provided by FirstRanker.com ---
vibrates in unison with the theme which he is searching. Hence, the seeker of
knowledge must be truthful with truthfulness of nature, which is much more
--- Content provided by FirstRanker.com ---
important, much more exacting than what is sometimes known as truthfulness.The truthfulness relates to the desire for accuracy of observation and precision
of statement. Ensuring facts is the principle rule of science, which is not an easy
--- Content provided by FirstRanker.com ---
matter. The difficulty may arise due to untrained eye, which fails to see
anything beyond what it has the power of seeing and sometimes even less than
--- Content provided by FirstRanker.com ---
that. This may also be due to the lack of discipline in the method of science. Anunscientific individual often remains satisfied with the expressions like
--- Content provided by FirstRanker.com ---
approximately, almost, or nearly, which is never what nature is. It cannot see
two things which differ, however minutely, as the same.
--- Content provided by FirstRanker.com ---
(2) A researcher must possess an alert mind. Nature is constantly
--- Content provided by FirstRanker.com ---
changing and revealing itself through various ways. A scientific researcher mustbe keen and watchful to notice such changes, no matter how small or
insignificant they may appear. Such receptivity has to be cultivated slowly and
--- Content provided by FirstRanker.com ---
patiently over time by the researcher through practice. An individual who is
ignorant or not alert and receptive during his research will not make a good
--- Content provided by FirstRanker.com ---
researcher. He will fail as a good researcher if he has no keen eyes or mind toobserve the unusual behind the routine. Research demands a systematic
immersion into the subject matter for the researcher to be able to grasp even the
--- Content provided by FirstRanker.com ---
slightest hint that may culminate into significant research problems. In this
context, Cohen and Negal (Selltiz et al, 1965; Wilkinson and Bhandarkar, 1979)
--- Content provided by FirstRanker.com ---
state that "the ability to perceive in some brute experience the occasion of aproblem is not a common talent among men... It is a mark of scientific genius to
be sensitive to difficulties where less gifted people pass by untroubled by
--- Content provided by FirstRanker.com ---
doubt".
--- Content provided by FirstRanker.com ---
(3) Scientific enquiry is pre-eminently an intellectual effort. It requiresthe moral quality of courage, which reflects the courage of a steadfast
endurance. The science of conducting research is not an easy task. There are
--- Content provided by FirstRanker.com ---
occasions when a research scientist might feel defeated or completely lost. This
is the stage when a researcher would need immense courage and the sense of
--- Content provided by FirstRanker.com ---
conviction. The researcher must learn the art of enduring intellectual hardships.In the words of Darwin, "It's dogged that does it".
--- Content provided by FirstRanker.com ---
In order to cultivate the afore-mentioned three qualities of a researcher, a
--- Content provided by FirstRanker.com ---
fourth one may be added. This is the quality of making statements cautiously.
According to Huxley, the assertion that outstrips the evidence is not only a
--- Content provided by FirstRanker.com ---
blunder but a crime (Thompson, 1975). A researcher should cultivate the habitof reserving judgment when the required data are insufficient.
--- Content provided by FirstRanker.com ---
1.1.7 Significance of Research:
According to a famous Hudson Maxim, "All progress is born of inquiry.
--- Content provided by FirstRanker.com ---
Doubt is often better than overconfidence, for it leads to inquiry, and inquiryleads to invention". It brings out the significance of research, increased amounts
of which make the progress possible. Research encourages scientific and
--- Content provided by FirstRanker.com ---
inductive thinking, besides promoting the development of logical habits of
thinking and organisation. The role of research in applied economics in the
--- Content provided by FirstRanker.com ---
context of an economy or business is greatly increasing in modern times. Theincreasingly complex nature of government and business has raised the use of
research in solving operational problems. Research assumes significant role in
--- Content provided by FirstRanker.com ---
the formulation of economic policy for both, the government and business. It
provides the basis for almost all government policies of an economic system.
--- Content provided by FirstRanker.com ---
Government budget formulation, for example, depends particularly on theanalysis of needs and desires of people, and the availability of revenues, which
requires research. Research helps to formulate alternative policies, in addition
--- Content provided by FirstRanker.com ---
to examining the consequences of these alternatives. Thus, research also
facilitates the decision-making of policy-makers, although in itself it is not a part
--- Content provided by FirstRanker.com ---
of research. In the process, research also helps in the proper allocation of acountry's scarce resources.
Research is also necessary for collecting information on the social and
--- Content provided by FirstRanker.com ---
economic structure of an economy to understand the process of change
occurring in the country. Collection of statistical information, though not a
--- Content provided by FirstRanker.com ---
routine task, involves various research problems. Therefore, large staff of
--- Content provided by FirstRanker.com ---
research technicians or experts is engaged by the government these days toundertake this work. Thus, research as a tool of government economic policy
formulation involves three distinct stages of operation: (i) investigation of
--- Content provided by FirstRanker.com ---
economic structure through continual compilation of facts; (ii) diagnosis of
events that are taking place and analysis of the forces underlying them; and (iii)
--- Content provided by FirstRanker.com ---
the prognosis i.e., the prediction of future developments (Wilkinson andBhandarkar, 1979).
--- Content provided by FirstRanker.com ---
Research also assumes a significant role in solving various operational
and planning problems associated with business and industry. In several ways,
--- Content provided by FirstRanker.com ---
operations research, market research and motivational research are vital andtheir results assist in taking business decisions. Market research refers to the
investigation of the structure and development of a market for the formulation of
--- Content provided by FirstRanker.com ---
efficient policies relating to purchases, production and sales. Operational
research relates to the application of logical, mathematical, and analytical
--- Content provided by FirstRanker.com ---
techniques to find solution to business problems, such as cost minimization orprofit maximization, or the optimization problems. Motivational research helps
to determine why people behave in the manner they do with respect to market
--- Content provided by FirstRanker.com ---
characteristics. More specifically, it is concerned with the analysis of the
motivations underlying consumer behaviour. All these researches are very
--- Content provided by FirstRanker.com ---
useful for business and industry, and are responsible for business decision-making.
--- Content provided by FirstRanker.com ---
Research is equally important to social scientists for analyzing the social
relationships and seeking explanations to various social problems. It gives
--- Content provided by FirstRanker.com ---
intellectual satisfaction of knowing things for the sake of knowledge. It alsopossesses the practical utility for the social scientist to gain knowledge so as to
--- Content provided by FirstRanker.com ---
be able to do something better or in a more efficient manner. The research in
social sciences is concerned with both knowledge for its own sake, and
--- Content provided by FirstRanker.com ---
knowledge for what it can contribute to solve practical problems.
--- Content provided by FirstRanker.com ---
1.2 Research Process:Research process consists of a series of steps or actions required for
--- Content provided by FirstRanker.com ---
effectively conducting research. The following are the steps that provide useful
procedural guidelines regarding the conduct of research:
--- Content provided by FirstRanker.com ---
(1) formulating the research problem;
--- Content provided by FirstRanker.com ---
(2) extensive literature survey;
--- Content provided by FirstRanker.com ---
(3) developing hypothesis;(4) preparing the research design;
--- Content provided by FirstRanker.com ---
(5)
determining
--- Content provided by FirstRanker.com ---
sampledesign;
(6)
--- Content provided by FirstRanker.com ---
collecting
data;
--- Content provided by FirstRanker.com ---
(7) execution of the project;
--- Content provided by FirstRanker.com ---
(8) analysis of data;
(9)
--- Content provided by FirstRanker.com ---
hypothesistesting;
--- Content provided by FirstRanker.com ---
(10) generalization and interpretation, and
--- Content provided by FirstRanker.com ---
(11) preparation of the report or presentation of the results. In otherwords, it involves the formal write-up of conclusions.
--- Content provided by FirstRanker.com ---
1.3 Research Problem:
The first and foremost stage in the research process is to select and
--- Content provided by FirstRanker.com ---
properly define the research problem. A researcher should first identify aproblem and formulate it, so as to make it amenable or susceptible to research.
In general, a research problem refers to an unanswered question that a researcher
--- Content provided by FirstRanker.com ---
might encounter in the context of either a theoretical or practical situation,
--- Content provided by FirstRanker.com ---
which he/she would like to answer or find a solution to. A research problem is
generally said to exist if the following conditions emerge (Kothari, 1988):
--- Content provided by FirstRanker.com ---
(i) there should be an individual or an organisation, say X, to whom the
problem can be attributed. The individual or the organization is situated
--- Content provided by FirstRanker.com ---
in an environment Y, which is governed by certain uncontrolled variables
Z;
--- Content provided by FirstRanker.com ---
(ii)there should be atleast two courses of action to be pursued, say A1 and
A2. These courses of action are defined by one or more values of the
--- Content provided by FirstRanker.com ---
controlled variables. For example, the number of items purchased at a
specified time is said to be one course of action.
--- Content provided by FirstRanker.com ---
(iii)there should be atleast two alternative possible outcomes of the said
courses of action, say B1 and B2. Of them, one alternative should be
--- Content provided by FirstRanker.com ---
preferable to the other. That is, atleast one outcome should be what the
researcher wants, which becomes an objective.
--- Content provided by FirstRanker.com ---
(iv)the courses of possible action available must offer a chance to the
researcher to achieve the objective, but not the equal chance. Therefore,
--- Content provided by FirstRanker.com ---
if P(Bj / X, A, Y) represents the probability of the occurrence of an
outcome Bj when X selects Aj in Y, then P(B1 / X, A1,Y) P (B1 / X, A2,
--- Content provided by FirstRanker.com ---
Y). Putting it in simple words, it means that the choices must not haveequal efficiencies for the desired outcome.
Above all these conditions, the individual or organisation may be said to have
--- Content provided by FirstRanker.com ---
arrived at the research problem only if X does not know what course of action to
be taken is the best. In other words, X should have a doubt about the solution.
--- Content provided by FirstRanker.com ---
Thus, an individual or a group of persons can be said to have a problem if theyhave more than one desired outcome. They should have two or more alternative
courses of action, which have some but not equal efficiency. This is required for
--- Content provided by FirstRanker.com ---
probing the desired objectives, such that they have doubts about the best course
--- Content provided by FirstRanker.com ---
of action to be taken. Thus, the components of a research problem may be
summarised as:
--- Content provided by FirstRanker.com ---
(i)there should be an individual or a group who have some difficulty or
problem.
--- Content provided by FirstRanker.com ---
(ii)
there should be some objective(s) to be pursued. A person or an
--- Content provided by FirstRanker.com ---
organization who wants nothing cannot have a problem.(iii)
there should be alternative ways of pursuing the objective the researcher
--- Content provided by FirstRanker.com ---
wants to pursue. This implies that there should be more than one
alternative means available to the researcher. This is because if the
--- Content provided by FirstRanker.com ---
researcher has no choice of alternative means, he/she would not have aproblem.
(iv)
--- Content provided by FirstRanker.com ---
there should be some doubt in the mind of the researcher about the
choice of alternative means. This implies that research should answer
--- Content provided by FirstRanker.com ---
the question relating to the relative efficiency or suitability of thepossible alternatives.
(v)
--- Content provided by FirstRanker.com ---
there should be a context to which the difficulty relates.
Thus, identification of a research problem is the pre-condition to conducting
--- Content provided by FirstRanker.com ---
research. A research problem is said to be the one which requires a researcher tofind the best available solution to the given problem. That is, the researcher
needs to find out the best course of action through which the research objective
--- Content provided by FirstRanker.com ---
may be achieved optimally in the context of a given situation. Several factors
may contribute to making the problem complicated. For example, the
--- Content provided by FirstRanker.com ---
environment may alter, thus affecting the efficiencies of the alternative coursesof action taken or the quality of the outcomes. The number of alternative courses
of action might be very large and the individual not involved in making the
--- Content provided by FirstRanker.com ---
decision may be affected by the change in environment and may react to it
favorably or unfavorably. Other similar factors are also likely to cause such
--- Content provided by FirstRanker.com ---
changes in the context of research, all of which may be considered from the
--- Content provided by FirstRanker.com ---
point of view of a research problem.1.4 Research Design:
--- Content provided by FirstRanker.com ---
The most important step after defining the research problem is preparing the
design of the research project, which is popularly known as the `research
--- Content provided by FirstRanker.com ---
design'. A research design helps to decide upon issues like what, when, where,how much, by what means etc. with regard to an enquiry or a research study.
A research design is the arrangement of conditions for collection and analysis of
--- Content provided by FirstRanker.com ---
data in a manner that aims to combine relevance to the research purpose with
economy in procedure. Infact, research design is the conceptual structure within
--- Content provided by FirstRanker.com ---
which research is conducted; it constitutes the blueprint for the collection,measurement and analysis of data (Selltiz et al, 1962). Thus, research design
provides an outline of what the researcher is going to do in terms of framing the
--- Content provided by FirstRanker.com ---
hypothesis, its operational implications and the final data analysis. Specifically,
the research design highlights decisions which include:
--- Content provided by FirstRanker.com ---
(i)the nature of the study
(ii)
--- Content provided by FirstRanker.com ---
the purpose of the study
(iii)
--- Content provided by FirstRanker.com ---
the location where the study would be conducted(iv)
the nature of data required
--- Content provided by FirstRanker.com ---
(v)
from where the required data can be collected
--- Content provided by FirstRanker.com ---
(vi)what time period the study would cover
(vii) the type of sample design that would be used
--- Content provided by FirstRanker.com ---
(viii) the techniques of data collection that would be used
(ix)
--- Content provided by FirstRanker.com ---
the methods of data analysis that would be adopted and(x)
the manner in which the report would be prepared
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
In view of the stated research design decisions, the overall research
design may be divided into the following (Kothari 1988):
--- Content provided by FirstRanker.com ---
(a)
the sampling design that deals with the method of selecting items to be
--- Content provided by FirstRanker.com ---
observed for the selected study;
--- Content provided by FirstRanker.com ---
(b)the observational design that relates to the conditions under which the
observations are to be made;
--- Content provided by FirstRanker.com ---
(c)
--- Content provided by FirstRanker.com ---
the statistical design that concerns with the question of how many items areto be observed, and how the information and data gathered are to be
analysed; and
--- Content provided by FirstRanker.com ---
(d)
--- Content provided by FirstRanker.com ---
the operational design that deals with the techniques by which theprocedures specified in the sampling, statistical and observational designs
can be carried out.
--- Content provided by FirstRanker.com ---
1.4.1 Features of Research Design:
--- Content provided by FirstRanker.com ---
The important features of research design may be outlined as follows:(i) it constitutes a plan that identifies the types and sources of information
required for the research problem;
--- Content provided by FirstRanker.com ---
(ii) it constitutes a strategy that specifies the methods of data collection and
--- Content provided by FirstRanker.com ---
analysis which would be adopted; and(iii) it also specifies the time period of research and monetary budget involved
--- Content provided by FirstRanker.com ---
in conducting the study, which comprise the two major constraints of
undertaking any research.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.4.2
Concepts Relating to Research Design:
--- Content provided by FirstRanker.com ---
Some of the important concepts relating to Research Design are
discussed below:
--- Content provided by FirstRanker.com ---
1. Dependent and Independent Variables:A magnitude that varies is known as a variable. The concept may
--- Content provided by FirstRanker.com ---
assume different quantitative values like height, weight, income etc. Qualitative
variables are not quantifiable in the strictest sense of the term. However, the
--- Content provided by FirstRanker.com ---
qualitative phenomena may also be quantified in terms of the presence orabsence of the attribute(s) considered. The phenomena that assume different
values quantitatively even in decimal points are known as `continuous
--- Content provided by FirstRanker.com ---
variables'. But all variables need not be continuous. Values that can be
expressed only in integer values are called `non-continuous variables'. In
--- Content provided by FirstRanker.com ---
statistical terms, they are also known as `discrete variables'. For example, ageis a continuous variable, whereas the number of children is a non-continuous
variable. When changes in one variable depend upon the changes in other
--- Content provided by FirstRanker.com ---
variable or variables, it is known as a dependent or endogenous variable, and the
variables that cause the changes in the dependent variable are known as the
--- Content provided by FirstRanker.com ---
independent or explanatory or exogenous variables. For example, if demanddepends upon price, then demand is a dependent variable, while price is the
independent variable. And, if more variables determine demand, like income
--- Content provided by FirstRanker.com ---
and price of the substitute commodity, then demand also depends upon them in
addition to the price of original commodity. In other words, demand is a
--- Content provided by FirstRanker.com ---
dependent variable which is determined by the independent variables like priceof the original commodity, income and price of substitutes.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2 Extraneous Variable:
The independent variables which are not directly related to the purpose
--- Content provided by FirstRanker.com ---
of the study but affect the dependent variable are known as extraneous variables.For instance, assume that a researcher wants to test the hypothesis that there is a
relationship between children's school performance and their self-concepts, in
--- Content provided by FirstRanker.com ---
which case the latter is an independent variable and the former, a dependent
variable. In this context, intelligence may also influence the school
--- Content provided by FirstRanker.com ---
performance. However, since it is not directly related to the purpose of thestudy undertaken by the researcher, it would be known as an extraneous
variable. The influence caused by the extraneous variable(s) on the dependent
--- Content provided by FirstRanker.com ---
variable is technically called the `experimental error'. Therefore, a research
study should always be framed in such a manner that the influence of extraneous
--- Content provided by FirstRanker.com ---
variables on the dependent variable/s is completely controlled, and the influenceof independent variable/s is clearly evident.
--- Content provided by FirstRanker.com ---
3. Control:
One of the most important features of a good research design is to
--- Content provided by FirstRanker.com ---
minimize the effect of extraneous variable(s). Technically, the term `control' isused when a researcher designs the study in such a manner that it minimizes the
effects of extraneous variables. The term `control' is used in experimental
--- Content provided by FirstRanker.com ---
research to reflect the restrain in experimental conditions.
--- Content provided by FirstRanker.com ---
4. Confounded Relationship:The relationship between the dependent and independent variables is
said to be confounded by an extraneous variable, when the dependent variable is
--- Content provided by FirstRanker.com ---
not free from its effects.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
5. Research Hypothesis:When a prediction or a hypothesized relationship is tested by adopting
scientific methods, it is known as research hypothesis. The research hypothesis
--- Content provided by FirstRanker.com ---
is a predictive statement which relates to a dependent variable and an
independent variable. Generally, a research hypothesis must consist of at least
--- Content provided by FirstRanker.com ---
one dependent variable and one independent variable. Whereas, therelationships that are assumed but not to be tested are predictive statements that
are not to be objectively verified, thus are not classified as research hypotheses.
--- Content provided by FirstRanker.com ---
6. Experimental and Non-experimental Hypothesis Testing Research:
--- Content provided by FirstRanker.com ---
When the objective of a research is to test a research hypothesis, it is known ashypothesis-testing research. Such research may be in the nature of experimental
design or non-experimental design. The research in which the independent
--- Content provided by FirstRanker.com ---
variable is manipulated is known as `experimental hypothesis-testing research',
whereas the research in which the independent variable is not manipulated is
--- Content provided by FirstRanker.com ---
termed as `non-experimental hypothesis-testing research'. For example, assumethat a researcher wants to examine whether family income influences the school
attendance of a group of students, by calculating the coefficient of correlation
--- Content provided by FirstRanker.com ---
between the two variables. Such an example is known as a non-experimental
hypothesis-testing research, because the independent variable - family income is
--- Content provided by FirstRanker.com ---
not manipulated here. Again assume that the researcher randomly selects 150students from a group of students who pay their school fees regularly and then
classifies them into two sub-groups by randomly including 75 in Group A,
--- Content provided by FirstRanker.com ---
whose parents have regular earning, and 75 in group B, whose parents do not
have regular earning. Assume that at the end of the study, the researcher
--- Content provided by FirstRanker.com ---
conducts a test on each group in order to examine the effects of regular earningsof the parents on the school attendance of the student. Such a study is an
example of experimental hypothesis-testing research, because in this particular
--- Content provided by FirstRanker.com ---
study the independent variable regular earnings of the parents have been
--- Content provided by FirstRanker.com ---
manipulated.
--- Content provided by FirstRanker.com ---
7. Experimental and Control Groups:When a group is exposed to usual conditions in an experimental
hypothesis-testing research, it is known as `control group'. On the other hand,
--- Content provided by FirstRanker.com ---
when the group is exposed to certain new or special condition, it is known as an
`experimental group'. In the afore-mentioned example, Group A can be called
--- Content provided by FirstRanker.com ---
as control group and Group B as experimental group. If both the groups, A andB are exposed to some special feature, then both the groups may be called as
`experimental groups'. A research design may include only the experimental
--- Content provided by FirstRanker.com ---
group or both the experimental and control groups together.
--- Content provided by FirstRanker.com ---
8. Treatments:Treatments refer to the different conditions to which the experimental
and control groups are subject to. In the example considered, the two treatments
--- Content provided by FirstRanker.com ---
are the parents with regular earnings and those with no regular earnings.
Likewise, if a research study attempts to examine through an experiment the
--- Content provided by FirstRanker.com ---
comparative effect of three different types of fertilizers on the yield of rice crop,then the three types of fertilizers would be treated as the three treatments.
--- Content provided by FirstRanker.com ---
9. Experiment:
Experiment refers to the process of verifying the truth of a statistical
--- Content provided by FirstRanker.com ---
hypothesis relating to a given research problem. For instance, an experimentmay be conducted to examine the yield of a certain new variety of rice crop
developed. Further, Experiments may be categorized into two types, namely,
--- Content provided by FirstRanker.com ---
`absolute experiment' and `comparative experiment'. If a researcher wishes to
determine the impact of a chemical fertilizer on the yield of a particular variety
--- Content provided by FirstRanker.com ---
of rice crop, then it is known as absolute experiment. Meanwhile, if the
--- Content provided by FirstRanker.com ---
researcher wishes to determine the impact of chemical fertilizer as compared tothe impact of bio-fertilizer, then the experiment is known as a comparative
experiment.
--- Content provided by FirstRanker.com ---
10. Experimental Unit(s):
Experimental Units refer to the pre-determined plots, characteristics or
--- Content provided by FirstRanker.com ---
the blocks, to which different treatments are applied. It is worth mentioninghere that such experimental units must be selected with great caution.
--- Content provided by FirstRanker.com ---
1.4.3 Types of Research Design:
There are different types of research designs. They may be broadly categorized
--- Content provided by FirstRanker.com ---
as:(1) Exploratory Research Design;
(2) Descriptive and Diagnostic Research Design; and
--- Content provided by FirstRanker.com ---
(3) Hypothesis-Testing Research Design.
--- Content provided by FirstRanker.com ---
1. Exploratory Research Design:The Exploratory Research Design is known as formulative research design.
The main objective of using such a research design is to formulate a research
--- Content provided by FirstRanker.com ---
problem for an in-depth or more precise investigation, or for developing a
working hypothesis from an operational aspect. The major purpose of such
--- Content provided by FirstRanker.com ---
studies is the discovery of ideas and insights. Therefore, such a research designsuitable for such a study should be flexible enough to provide opportunity for
considering different dimensions of the problem under study. The in-built
--- Content provided by FirstRanker.com ---
flexibility in research design is required as the initial research problem would be
transformed into a more precise one in the exploratory study, which in turn may
--- Content provided by FirstRanker.com ---
necessitate changes in the research procedure for collecting relevant data.Usually, the following three methods are considered in the context of a research
--- Content provided by FirstRanker.com ---
design for such studies. They are (a) a survey of related literature; (b)
experience survey; and (c) analysis of `insight-stimulating' instances.
--- Content provided by FirstRanker.com ---
2.
--- Content provided by FirstRanker.com ---
Descriptive and Diagnostic Research Design:A Descriptive Research Design is concerned with describing the
characteristics of a particular individual or a group. Meanwhile, a diagnostic
--- Content provided by FirstRanker.com ---
research design determines the frequency with which a variable occurs or its
relationship with another variable. In other words, the study analyzing whether
--- Content provided by FirstRanker.com ---
a certain variable is associated with another comprises a diagnostic researchstudy. On the other hand, a study that is concerned with specific predictions or
with the narration of facts and characteristics related to an individual, group or
--- Content provided by FirstRanker.com ---
situation, are instances of descriptive research studies. Generally, most of the
social research design falls under this category. As a research design, both the
--- Content provided by FirstRanker.com ---
descriptive and diagnostic studies share common requirements, hence they aregrouped together. However, the procedure to be used and the research design
must be planned carefully. The research design must also make appropriate
--- Content provided by FirstRanker.com ---
provision for protection against bias and thus maximize reliability, with due
regard to the completion of the research study in an economical manner. The
--- Content provided by FirstRanker.com ---
research design in such studies should be rigid and not flexible. Besides, it mustalso focus attention on the following:
--- Content provided by FirstRanker.com ---
(a) formulation of the objectives of the study,
(b) proper designing of the methods of data collection ,
--- Content provided by FirstRanker.com ---
(c) sample selection,(d) data collection,
(e) processing and analysis of the collected data, and
--- Content provided by FirstRanker.com ---
(f) reporting the findings.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3. Hypothesis-testing Research Design:Hypothesis-testing Research Designs are those in which the researcher tests
the hypothesis of causal relationship between two or more variables. These
--- Content provided by FirstRanker.com ---
studies require procedures that would not only decrease bias and enhance
reliability, but also facilitate deriving inferences about the causality. Generally,
--- Content provided by FirstRanker.com ---
experiments satisfy such requirements. Hence, when research design isdiscussed in such studies, it often refers to the design of experiments.
--- Content provided by FirstRanker.com ---
1.4.4 Importance of Research Design:
The need for a research design arises out of the fact that it facilitates the
--- Content provided by FirstRanker.com ---
smooth conduct of the various stages of research. It contributes to makingresearch as efficient as possible, thus yielding the maximum information with
minimum effort, time and expenditure. A research design helps to plan in
--- Content provided by FirstRanker.com ---
advance, the methods to be employed for collecting the relevant data and the
techniques to be adopted for their analysis. This would help in pursuing the
--- Content provided by FirstRanker.com ---
objectives of the research in the best possible manner, provided the availablestaff, time and money are given. Hence, the research design should be prepared
with utmost care, so as to avoid any error that may disturb the entire project.
--- Content provided by FirstRanker.com ---
Thus, research design plays a crucial role in attaining the reliability of the results
obtained, which forms the strong foundation of the entire process of the research
--- Content provided by FirstRanker.com ---
work.Despite its significance, the purpose of a well-planned design is not
realized at times. This is because it is not given the importance that it deserves.
--- Content provided by FirstRanker.com ---
As a consequence, many researchers are not able to achieve the purpose for
which the research designs are formulated, due to which they end up arriving at
--- Content provided by FirstRanker.com ---
misleading conclusions. Therefore, faulty designing of the research projecttends to render the research exercise meaningless. This makes it imperative that
--- Content provided by FirstRanker.com ---
an efficient and suitable research design must be planned before commencing
the process of research. The research design helps the researcher to organize
--- Content provided by FirstRanker.com ---
his/her ideas in a proper form, which in turn facilitates him/her to identify the
inadequacies and faults in them. The research design is also discussed with
--- Content provided by FirstRanker.com ---
other experts for their comments and critical evaluation, without which it wouldbe difficult for any critic to provide a comprehensive review and comments on
the proposed study.
--- Content provided by FirstRanker.com ---
1.4.5 Characteristics of a Good Research Design:
--- Content provided by FirstRanker.com ---
A good research design often possesses the qualities of being flexible,suitable, efficient, economical and so on. Generally, a research design which
minimizes bias and maximizes the reliability of the data collected and analysed
--- Content provided by FirstRanker.com ---
is considered a good design (Kothari 1988). A research design which does not
allow even the smallest experimental error is said to be the best design for
--- Content provided by FirstRanker.com ---
investigation. Further, a research design that yields maximum information andprovides an opportunity of viewing the various dimensions of a research
problem is considered to be the most appropriate and efficient design. Thus, the
--- Content provided by FirstRanker.com ---
question of a good design relates to the purpose or objective and nature of the
research problem studied. While a research design may be good, it may not be
--- Content provided by FirstRanker.com ---
equally suitable to all studies. In other words, it may be lacking in one aspect orthe other in the case of some other research problems. Therefore, no single
research design can be applied to all types of research problems.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
A research design suitable for a specific research problem would usuallyinvolve the following considerations:
(i) the methods of gathering the information;
--- Content provided by FirstRanker.com ---
(ii) the skills and availability of the researcher and his/her staff, if any;
(iii) the objectives of the research problem being studied;
--- Content provided by FirstRanker.com ---
(iv) the nature of the research problem being studied; and
--- Content provided by FirstRanker.com ---
(v) the available monetary support and duration of time for the researchwork.
--- Content provided by FirstRanker.com ---
1.5 Case Study Research:
The method of exploring and analyzing the life or functioning of a social
--- Content provided by FirstRanker.com ---
or economic unit, such as a person, a family, a community, an institution, a firmor an industry is called case study method. The objective of case study method
is to examine the factors that cause the behavioural patterns of a given unit and
--- Content provided by FirstRanker.com ---
its relationship with the environment. The data for a study are always gathered
with the purpose of tracing the natural history of a social or economic unit, and
--- Content provided by FirstRanker.com ---
its relationship with the social or economic factors, besides the forces involvedin its environment. Thus, a researcher conducting a study using the case study
method attempts to understand the complexity of factors that are operative
--- Content provided by FirstRanker.com ---
within a social or economic unit as an integrated totality. Burgess (Kothari,
1988) described the special significance of the case study in understanding the
--- Content provided by FirstRanker.com ---
complex behaviour and situations in specific detail. In the context of socialresearch, he called such data as social microscope.
--- Content provided by FirstRanker.com ---
1.5.1 Criteria for Evaluating Adequacy of Case Study:
John Dollard (Dollard, 1935) specified seven criteria for evaluating the
--- Content provided by FirstRanker.com ---
adequacy of a case or life history in the context of social research. They are:(i)
The subject being studied must be viewed as a specimen in a cultural set
--- Content provided by FirstRanker.com ---
up. That is, the case selected from its total context for the purpose of study
should be considered a member of the particular cultural group or community.
--- Content provided by FirstRanker.com ---
The scrutiny of the life history of the individual must be carried out with a viewto identify the community values, standards and shared ways of life.
--- Content provided by FirstRanker.com ---
(ii) The organic motors of action should be socially relevant. This is to say
--- Content provided by FirstRanker.com ---
that the action of the individual cases should be viewed as a series of
reactions to social stimuli or situations. To Put in simple words, the social
--- Content provided by FirstRanker.com ---
meaning of behaviour should be taken into consideration.(iii) The crucial role of the family-group in transmitting the culture should be
--- Content provided by FirstRanker.com ---
recognized. This means, as an individual is the member of a family, the
role of the family in shaping his/her behaviour should never be ignored.
--- Content provided by FirstRanker.com ---
(iv) The specific method of conversion of organic material into social
behaviour should be clearly demonstrated. For instance, case-histories that
--- Content provided by FirstRanker.com ---
discuss in detail how basically a biological organism, that is man,
gradually transforms into a social person are particularly important.
--- Content provided by FirstRanker.com ---
(v) The constant transformation of character of experience from childhood to
adulthood should be emphasized. That is, the life-history should portray
--- Content provided by FirstRanker.com ---
the inter-relationship between the individual's various experiences during
his/her life span. Such a study provides a comprehensive understanding of
--- Content provided by FirstRanker.com ---
an individual's life as a continuum.(vi) The `social situation' that contributed to the individual's gradual
--- Content provided by FirstRanker.com ---
transformation should carefully and continuously be specified as a factor.
One of the crucial criteria for life-history is that an individual's life should
--- Content provided by FirstRanker.com ---
be depicted as evolving itself in the context of a specific social situationand partially caused by it.
--- Content provided by FirstRanker.com ---
(vii) The life-history details themselves should be organized according to some
--- Content provided by FirstRanker.com ---
conceptual framework, which in turn would facilitate their generalizations
at higher levels.
--- Content provided by FirstRanker.com ---
These criteria discussed by Dollard emphasize the specific link of co-
--- Content provided by FirstRanker.com ---
ordinated, related, continuous and configured experience in a cultural pattern
that motivated the social and personal behaviour. Although, the criteria
--- Content provided by FirstRanker.com ---
indicated by Dollard are principally perfect, some of them are difficult to put topractice.
--- Content provided by FirstRanker.com ---
Dollard (1935) attempted to express the diverse events depicted in the
life-histories of persons during the course of repeated interviews by utilizing
--- Content provided by FirstRanker.com ---
psycho-analytical techniques in a given situational context. His criteria of life-history originated directly from this experience. While the life-histories possess
independent significance as research documents, the interviews recorded by the
--- Content provided by FirstRanker.com ---
investigators can afford, as Dollard observed, "rich insights into the nature of the
social situations experienced by them".
--- Content provided by FirstRanker.com ---
It is a well-known fact that an individual's life is very complex. Till date
there is hardly any technique that can establish some kind of uniformity, and as
--- Content provided by FirstRanker.com ---
a result ensure the cumulative of case-history materials by isolating the complex
totality of a human life. Nevertheless, although case history data are difficult to
--- Content provided by FirstRanker.com ---
put to rigorous analysis, a skilful handling and interpretation of such data couldhelp in developing insights into cultural conflicts and problems arising out of
cultural-change.
--- Content provided by FirstRanker.com ---
Gordon Allport (Kothari 1988) has recommended the following aspects
so as to broaden the perspective of case-study data:
--- Content provided by FirstRanker.com ---
(i) if the life-history is written in first person, it should be as comprehensiveand coherent as possible.
--- Content provided by FirstRanker.com ---
(ii) Life-histories must be written for knowledgeable persons. That is, if the
--- Content provided by FirstRanker.com ---
enquiry of study is sociological in nature, the researcher should write it on
the assumption that it would be read largely by sociologists only.
--- Content provided by FirstRanker.com ---
(iii) It would be advisable to supplement case study data by observational,statistical and historical data, as they provide standards for assessing the
reliability and consistency of the case study materials. Further, such data
--- Content provided by FirstRanker.com ---
offer a basis for generalizations.
(iv) Efforts must be made to verify the reliability of life-history data by
--- Content provided by FirstRanker.com ---
examining the internal consistency of the collected material, and byrepeating the interviews with the concerned person. Besides this, personal
interviews with the persons who are well-acquainted with him/her,
--- Content provided by FirstRanker.com ---
belonging to his/her own group should be conducted.
(v) A judicious combination of different techniques for data-collection is
--- Content provided by FirstRanker.com ---
crucial for collecting data that are culturally meaningful and scientificallysignificant.
(vi) Life-histories or case-histories may be considered as an adequate basis for
--- Content provided by FirstRanker.com ---
generalization to the extent that they are typical or representative of a
certain group.
--- Content provided by FirstRanker.com ---
(vii) The researcher engaged in the collection of case study data should neverignore the unique or typical cases. He/she should include them as
exceptional cases.
--- Content provided by FirstRanker.com ---
Case histories are filled with valuable information of a personal or
--- Content provided by FirstRanker.com ---
private nature. Such information not only helps the researcher to portray thepersonality of the individual, but also the social background that contributed to
it. Besides, it also helps in the formulation of relevant hypotheses. In general,
--- Content provided by FirstRanker.com ---
although Blummer (in Wilkinson and Bhandarkar, 1979) was critical of
documentary material, he gave due credit to case histories by acknowledging the
--- Content provided by FirstRanker.com ---
fact that the personal documents offer an opportunity to the researcher to--- Content provided by FirstRanker.com ---
develop his/her spirit of enquiry. The analysis of a particular subject would bemore effective if the researcher acquires close acquaintance with it through
personal documents. However, Blummer also acknowledges the limitations of
--- Content provided by FirstRanker.com ---
the personal documents. According to him, such documents do not entirely
fulfill the criteria of adequacy, reliability, and representativeness. Despite these
--- Content provided by FirstRanker.com ---
shortcomings, avoiding their use in any scientific study of personal life would bewrong, as these documents become necessary and significant for both theory-
building and practice.
--- Content provided by FirstRanker.com ---
In spite of these formidable limitations, case study data are used by
--- Content provided by FirstRanker.com ---
anthropologists, sociologists, economists and industrial psychiatrists. GordonAllport (Kothari, 1988) strongly recommends the use of case study data for in-
depth analysis of a subject. For, it is one's acquaintance with an individual that
--- Content provided by FirstRanker.com ---
instills a desire to know his/her nature and understand them. The first stage
involves understanding the individual and all the complexity of his/her nature.
--- Content provided by FirstRanker.com ---
Any haste in analyzing and classifying the individual would create the risk ofreducing his/her emotional world into artificial bits. As a consequence, the
important emotional organizations, anchorages and natural identifications
--- Content provided by FirstRanker.com ---
characterizing the personal life of the individual might not yield adequate
representation. Hence, the researcher should understand the life of the subject.
--- Content provided by FirstRanker.com ---
Therefore, the totality of life-processes reflected in the well-ordered life-historydocuments become invaluable source of stimulating insights. Such life-history
documents provide the basis for comparisons that contribute to statistical
--- Content provided by FirstRanker.com ---
generalizations and help to draw inferences regarding the uniformities in human
behaviour, which are of great value. Even if some personal documents do not
--- Content provided by FirstRanker.com ---
provide ordered data about personal lives of people, which is the basis ofpsychological science, they should not be ignored. This is because the final aim
of science is to understand, control and make predictions about human life. Once
--- Content provided by FirstRanker.com ---
they are satisfied, the theoretical and practical importance of personal
--- Content provided by FirstRanker.com ---
documents must be recognized as significant. Thus, a case study may be
considered as the beginning and the final destination of abstract knowledge.
--- Content provided by FirstRanker.com ---
1.6 Hypothesis:"Hypothesis may be defined as a proposition or a set of propositions set
forth as an explanation for the occurrence of some specified group of
--- Content provided by FirstRanker.com ---
phenomena either asserted merely as a provisional conjecture to guide some
investigation in the light of established facts" (Kothari, 1988). A research
--- Content provided by FirstRanker.com ---
hypothesis is quite often a predictive statement, which is capable of being testedusing scientific methods that involve an independent and some dependent
variables. For instance, the following statements may be considered:
--- Content provided by FirstRanker.com ---
i) "students who take tuitions perform better than the others who do not receive
tuitions" or,
--- Content provided by FirstRanker.com ---
ii) "the female students perform as well as the male students".These two statements are hypotheses that can be objectively verified and tested.
Thus, they indicate that a hypothesis states what one is looking for. Besides, it
--- Content provided by FirstRanker.com ---
is a proposition that can be put to test in order to examine its validity.
--- Content provided by FirstRanker.com ---
1.6.1 Characteristics of Hypothesis:
--- Content provided by FirstRanker.com ---
A hypothesis should have the following characteristic features:-
--- Content provided by FirstRanker.com ---
(i) A hypothesis must be precise and clear. If it is not precise and clear, thenthe inferences drawn on its basis would not be reliable.
--- Content provided by FirstRanker.com ---
(ii) A hypothesis must be capable of being put to test. Quite often, the
research programmes fail owing to its incapability of being subject to
--- Content provided by FirstRanker.com ---
testing for validity. Therefore, some prior study may be conducted by the--- Content provided by FirstRanker.com ---
researcher in order to make a hypothesis testable. A hypothesis "is testedif other deductions can be made from it, which in turn can be confirmed or
disproved by observation" (Kothari, 1988).
--- Content provided by FirstRanker.com ---
(iii) A hypothesis must state relationship between two variables, in the case of
relational hypotheses.
--- Content provided by FirstRanker.com ---
(iv) A hypothesis must be specific and limited in scope. This is because asimpler hypothesis generally would be easier to test for the researcher.
And therefore, he/she must formulate such hypotheses.
--- Content provided by FirstRanker.com ---
(v) As far as possible, a hypothesis must be stated in the simplest language, so
as to make it understood by all concerned. However, it should be noted
--- Content provided by FirstRanker.com ---
that simplicity of a hypothesis is not related to its significance.(vi) A hypothesis must be consistent and derived from the most known facts.
In other words, it should be consistent with a substantial body of
--- Content provided by FirstRanker.com ---
established facts. That is, it must be in the form of a statement which
Judges accept as being the most likely to occur.
--- Content provided by FirstRanker.com ---
(vii) A hypothesis must be amenable to testing within a stipulated or reasonableperiod of time. No matter how excellent a hypothesis, a researcher should
not use it if it cannot be tested within a given period of time, as no one can
--- Content provided by FirstRanker.com ---
afford to spend a life-time on collecting data to test it.
(viii) A hypothesis should state the facts that give rise to the necessity of looking
--- Content provided by FirstRanker.com ---
for an explanation. This is to say that by using the hypothesis, and otherknown and accepted generalizations, a researcher must be able to derive
the original problem condition. Therefore, a hypothesis should explain
--- Content provided by FirstRanker.com ---
what it actually wants to explain, and for this it should also have an
empirical reference.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.6.2
--- Content provided by FirstRanker.com ---
Concepts Relating to Testing of Hypotheses:
Testing of hypotheses requires a researcher to be familiar with various
--- Content provided by FirstRanker.com ---
concepts concerned with it such as:1)
--- Content provided by FirstRanker.com ---
Null Hypothesis and Alternative Hypothesis:
In the context of statistical analysis, hypothesis is of two types viz., null
--- Content provided by FirstRanker.com ---
hypothesis and alternative hypothesis. When two methods A and B arecompared on their relative superiority, and it is assumed that both the methods
are equally good, then such a statement is called as the null hypothesis. On the
--- Content provided by FirstRanker.com ---
other hand, if method A is considered relatively superior to method B, or vice-
versa, then such a statement is known as an alternative hypothesis. The null
--- Content provided by FirstRanker.com ---
hypothesis is expressed as H0, while the alternative hypothesis is expressed asHa. For example, if a researcher wants to test the hypothesis that the population
mean (?) is equal to the hypothesized mean (H0) = 100, then the null hypothesis
--- Content provided by FirstRanker.com ---
should be stated as the population mean is equal to the hypothesized mean 100.
Symbolically it may be written as:-
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
H0: = ? = ? H0 = 100
--- Content provided by FirstRanker.com ---
If sample results do not support this null hypothesis, then it should be
concluded that something else is true. The conclusion of rejecting the null
--- Content provided by FirstRanker.com ---
hypothesis is called as alternative hypothesis. To put it in simple words, the set
of alternatives to the null hypothesis is termed as the alternative hypothesis. If
--- Content provided by FirstRanker.com ---
H0 is accepted, then it implies that Ha is being rejected. On the other hand, if H0is rejected, it means that Ha is being accepted. For H0: ? = ? H0 = 100, the
following three possible alternative hypotheses may be considered:
--- Content provided by FirstRanker.com ---
Alternative hypothesis
to be read as follows
--- Content provided by FirstRanker.com ---
the alternative hypothesis is that theHa: ? ? H0
population mean is not equal to 100,
--- Content provided by FirstRanker.com ---
i.e., it could be greater than or less
--- Content provided by FirstRanker.com ---
than 100
the alternative hypothesis is that the
--- Content provided by FirstRanker.com ---
Ha : ? > ? H0population mean is greater than 100
the alternative hypothesis is that the
--- Content provided by FirstRanker.com ---
Ha : ? < ? H0
population mean is less than 100
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Before the sample is drawn, the researcher has to state the null
hypothesis and the alternative hypothesis. While formulating the null
--- Content provided by FirstRanker.com ---
hypothesis, the following aspects need to be considered:(a) Alternative hypothesis is usually the one which a researcher wishes to prove,
whereas the null hypothesis is the one which he/she wishes to disprove. Thus, a
--- Content provided by FirstRanker.com ---
null hypothesis is usually the one which a researcher tries to reject, while an
alternative hypothesis is the one that represents all other possibilities.
--- Content provided by FirstRanker.com ---
(b) The rejection of a hypothesis when it is actually true involves great risk, as itindicates that it is a null hypothesis because then the probability of rejecting it
when it is true is (i.e., the level of significance) which is chosen very small.
--- Content provided by FirstRanker.com ---
(c) Null hypothesis should always be specific hypothesis i.e., it should not state
about or approximately a certain value.
--- Content provided by FirstRanker.com ---
(2) The Level of Significance:In the context of hypothesis testing, the level of significance is a very
important concept. It is a certain percentage that should be chosen with great
--- Content provided by FirstRanker.com ---
care, reason and thought. If for instance, the significance level is taken at 5 per
cent, then it means that H0 would be rejected when the sampling result has a less
--- Content provided by FirstRanker.com ---
than 0.05 probability of occurrence when H0 is true. In other words, the five percent level of significance implies that the researcher is willing to take a risk of
five per cent of rejecting the null hypothesis, when (H0) is actually true. In sum,
--- Content provided by FirstRanker.com ---
the significance level reflects the maximum value of the probability of rejecting
--- Content provided by FirstRanker.com ---
H0 when it is actually true, and which is usually determined prior to testing the
hypothesis.
--- Content provided by FirstRanker.com ---
(3) Test of Hypothesis or Decision Rule:Suppose the given hypothesis is H0 and the alternative hypothesis Ha,
then the researcher has to make a rule known as the decision rule. According to
--- Content provided by FirstRanker.com ---
the decision rule, the researcher accepts or rejects H0. For example, if the H0 is
that certain students are good against the Ha that all the students are good, then
--- Content provided by FirstRanker.com ---
the researcher should decide the number of items to be tested and the criteria onthe basis of which to accept or reject the hypothesis.
(4) Type I and Type II Errors:
--- Content provided by FirstRanker.com ---
As regards the testing of hypotheses, a researcher can make basically two
types of errors. He/she may reject H0 when it is true, or accept H0 when it is
--- Content provided by FirstRanker.com ---
not true. The former is called as Type I error and the latter is known as Type IIerror. In other words, Type I error implies the rejection of a hypothesis when it
must have been accepted, while Type II error implies the acceptance of a
--- Content provided by FirstRanker.com ---
hypothesis which must have been rejected. Type I error is denoted by (alpha)
and is known as error, while Type II error is usually denoted by (beta) and is
--- Content provided by FirstRanker.com ---
known as error.(5) One-tailed and two-tailed Tests:
These two types of tests are very important in the context of hypothesis
--- Content provided by FirstRanker.com ---
testing. A two-tailed test rejects the null hypothesis, when the sample mean is
significantly greater or lower than the hypothesized value of the mean of the
--- Content provided by FirstRanker.com ---
population. Such a test is suitable when the null hypothesis is some specifiedvalue, the alternative hypothesis is a value that is not equal to the specified value
of the null hypothesis.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.6.3 Procedure of Hypothesis Testing:Testing a hypothesis refers to verifying whether the hypothesis is valid
or not. Hypothesis testing attempts to check whether to accept or not to accept
--- Content provided by FirstRanker.com ---
the null hypothesis. The procedure of hypothesis testing includes all the steps
that a researcher undertakes for making a choice between the two alternative
--- Content provided by FirstRanker.com ---
actions of rejecting or accepting a null hypothesis. The various steps involved inhypothesis testing are as follows:
(i)
--- Content provided by FirstRanker.com ---
Making a Formal Statement:
This step involves making a formal statement of the null hypothesis (H0)
--- Content provided by FirstRanker.com ---
and the alternative hypothesis (Ha). This implies that the hypotheses should beclearly stated within the purview of the research problem. For example, suppose
a school teacher wants to test the understanding capacity of the students which
--- Content provided by FirstRanker.com ---
must be rated more than 90 per cent in terms of marks, the hypotheses may be
stated as follows:
--- Content provided by FirstRanker.com ---
Null Hypothesis H0 : = 100Alternative Hypothesis Ha : > 100
--- Content provided by FirstRanker.com ---
(ii)
Selecting a Significance Level:
--- Content provided by FirstRanker.com ---
The hypotheses should be tested on a pre-determined level of
significance, which should be specified. Usually, either 5% level or 1% level is
--- Content provided by FirstRanker.com ---
considered for the purpose. The factors that determine the levels of significance
are: (a) the magnitude of difference between the sample means; (b) the sample
--- Content provided by FirstRanker.com ---
size: (c) the variability of measurements within samples; and (d) whether thehypothesis is directional or non-directional (Kothari, 1988). In sum, the level of
significance should be sufficient in the context of the nature and purpose of
--- Content provided by FirstRanker.com ---
enquiry.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(iii)Deciding the Distribution to Use:
After making decision on the level of significance for hypothesis testing,
--- Content provided by FirstRanker.com ---
the researcher has to next determine the appropriate sampling distribution. The
choice to be made generally relates to normal distribution and the t-distribution.
--- Content provided by FirstRanker.com ---
The rules governing the selection of the correct distribution are similar to theones already discussed with respect to estimation.
--- Content provided by FirstRanker.com ---
(iv)
Selection of a Random Sample and Computing an Appropriate
--- Content provided by FirstRanker.com ---
Value:Another step involved in hypothesis testing is the selection of a random
sample and then computing a suitable value from the sample data relating to test
--- Content provided by FirstRanker.com ---
statistic by using the appropriate distribution. In other words, it involves
drawing a sample for furnishing empirical data.
--- Content provided by FirstRanker.com ---
(v)
Calculation of the Probability:
--- Content provided by FirstRanker.com ---
The next step for the researcher is to calculate the probability that the
sample result would diverge as far as it can from expectations, under the
--- Content provided by FirstRanker.com ---
situation when the null hypothesis is actually true.(vi) Comparing the Probability:
--- Content provided by FirstRanker.com ---
Another step involved consists of making a comparison of the
probability calculated with the specified value for , the significance level. If
--- Content provided by FirstRanker.com ---
the calculated probability works out to be equal to or smaller than the value incase of one-tailed test, then the null hypothesis is to be rejected. On the other
hand, if the calculated probability is greater, then the null hypothesis is to be
--- Content provided by FirstRanker.com ---
accepted. In case the null hypothesis H0 is rejected, the researcher runs the risk
of committing the Type I error. But, if the null hypothesis H0 is accepted, then it
--- Content provided by FirstRanker.com ---
involves some risk (which cannot be specified in size as long as H0 is vague and
--- Content provided by FirstRanker.com ---
not specific) of committing the Type II error.1.7 Sample Survey:
--- Content provided by FirstRanker.com ---
A sample design is a definite plan for obtaining a sample from a given
population (Kothari, 1988). Sample constitutes a certain portion of the
--- Content provided by FirstRanker.com ---
population or universe. Sampling design refers to the technique or theprocedure the researcher adopts for selecting items for the sample from the
population or universe. A sample design helps to decide the number of items to
--- Content provided by FirstRanker.com ---
be included in the sample, i.e., the size of the sample. The sample design should
be determined prior to data collection. There are different kinds of sample
--- Content provided by FirstRanker.com ---
designs which a researcher can choose. Some of them are relatively moreprecise and easier to adopt than the others. A researcher should prepare or select
a sample design, which must be reliable and suitable for the research study
--- Content provided by FirstRanker.com ---
proposed to be undertaken.
--- Content provided by FirstRanker.com ---
1.8.1 Steps in Sampling Design:
A researcher should take into consideration the following aspects while
--- Content provided by FirstRanker.com ---
developing a sample design:
--- Content provided by FirstRanker.com ---
(i) Type of universe:The first step involved in developing sample design is to clearly define the
number of cases, technically known as the Universe, to be studied. A universe
--- Content provided by FirstRanker.com ---
may be finite or infinite. In a finite universe the number of items is certain,
whereas in the case of an infinite universe the number of items is infinite (i.e.,
--- Content provided by FirstRanker.com ---
there is no idea about the total number of items). For example, while thepopulation of a city or the number of workers in a factory comprise finite
--- Content provided by FirstRanker.com ---
universes, the number of stars in the sky, or throwing of a dice represent infinite
universe.
--- Content provided by FirstRanker.com ---
(ii) Sampling Unit:
--- Content provided by FirstRanker.com ---
Prior to selecting a sample, decision has to be made about the sampling unit. Asampling unit may be a geographical area like a state, district, village, etc., or a
social unit like a family, religious community, school, etc., or it may also be an
--- Content provided by FirstRanker.com ---
individual. At times, the researcher would have to choose one or more of such
units for his/her study.
--- Content provided by FirstRanker.com ---
(iii) Source List:
Source list is also known as the `sampling frame', from which the sample is to
--- Content provided by FirstRanker.com ---
be selected. The source list consists of names of all the items of a universe. The
researcher has to prepare a source list when it is not available. The source list
--- Content provided by FirstRanker.com ---
must be reliable, comprehensive, correct, and appropriate. It is important thatthe source list should be as representative of the population as possible.
--- Content provided by FirstRanker.com ---
(iv) Size of Sample:
Size of the sample refers to the number of items to be chosen from the universe
--- Content provided by FirstRanker.com ---
to form a sample. For a researcher, this constitutes a major problem. The size ofsample must be optimum. An optimum sample may be defined as the one that
satisfies the requirements of representativeness, flexibility, efficiency, and
--- Content provided by FirstRanker.com ---
reliability. While deciding the size of sample, a researcher should determine the
desired precision and the acceptable confidence level for the estimate. The size
--- Content provided by FirstRanker.com ---
of the population variance should be considered, because in the case of a largervariance generally a larger sample is required. The size of the population should
be considered, as it also limits the sample size. The parameters of interest in a
--- Content provided by FirstRanker.com ---
research study should also be considered, while deciding the sample size.
--- Content provided by FirstRanker.com ---
Besides, costs or budgetary constraint also plays a crucial role in deciding the
sample size.
--- Content provided by FirstRanker.com ---
(a) Parameters of Interest:
The specific population parameters of interest should also be considered
--- Content provided by FirstRanker.com ---
while determining the sample design. For example, the researcher may want to
make an estimate of the proportion of persons with certain characteristic in the
--- Content provided by FirstRanker.com ---
population, or may be interested in knowing some average regarding thepopulation. The population may also consist of important sub-groups about
whom the researcher would like to make estimates. All such factors have strong
--- Content provided by FirstRanker.com ---
impact on the sample design the researcher selects.
--- Content provided by FirstRanker.com ---
(b) Budgetary Constraint:From the practical point of view, cost considerations exercise a major
influence on the decisions related to not only the sample size, but also on the
--- Content provided by FirstRanker.com ---
type of sample selected. Thus, budgetary constraint could also lead to the
adoption of a non-probability sample design.
--- Content provided by FirstRanker.com ---
(c) Sampling Procedure:
Finally, the researcher should decide the type of sample or the technique
--- Content provided by FirstRanker.com ---
to be adopted for selecting the items for a sample. This technique or procedure
itself may represent the sample design. There are different sample designs from
--- Content provided by FirstRanker.com ---
which a researcher should select one for his/her study. It is clear that theresearcher should select that design which, for a given sample size and budget
constraint, involves a smaller error.
--- Content provided by FirstRanker.com ---
1.7.2 Criteria for Selecting a Sampling Procedure:
--- Content provided by FirstRanker.com ---
Basically, two costs are involved in a sampling analysis, which govern
--- Content provided by FirstRanker.com ---
the selection of a sampling procedure. They are:(i) the cost of data collection, and
(ii) the cost of drawing incorrect inference from the selected data.
--- Content provided by FirstRanker.com ---
There are two causes of incorrect inferences, namely systematic bias and
sampling error. Systematic bias arises out of errors in the sampling procedure.
--- Content provided by FirstRanker.com ---
They cannot be reduced or eliminated by increasing the sample size. Utmost,the causes of these errors can be identified and corrected. Generally, a
systematic bias arises out of one or more of the following factors:
--- Content provided by FirstRanker.com ---
a. inappropriate sampling frame,
b. defective measuring device,
--- Content provided by FirstRanker.com ---
c. non-respondents,d. indeterminacy principle, and
e. natural bias in the reporting of data.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Sampling error refers to the random variations in the sample estimatesaround the true population parameters. Because they occur randomly and likely
to be equally in either direction, they are of compensatory type, the expected
--- Content provided by FirstRanker.com ---
value of which errors tend to be equal to zero. Sampling error tends to decrease
with the increase in the size of the sample. It also becomes smaller in magnitude
--- Content provided by FirstRanker.com ---
when the population is homogenous.--- Content provided by FirstRanker.com ---
Sampling error can be computed for a given sample size and design. The
measurement of sampling error is known as `precision of the sampling plan'.
--- Content provided by FirstRanker.com ---
When the sample size is increased, the precision can be improved. However,increasing the sample size has its own limitations. The large sized sample not
only increases the cost of data collection, but also increases the systematic bias.
--- Content provided by FirstRanker.com ---
Thus, an effective way of increasing the precision is generally to choose a better
--- Content provided by FirstRanker.com ---
sampling design, which has smaller sampling error for a given sample size at a
specified cost. In practice, however, researchers generally prefer a less precise
--- Content provided by FirstRanker.com ---
design owing to the ease in adopting the same, in addition to the fact thatsystematic bias can be controlled better way in such designs.
--- Content provided by FirstRanker.com ---
In sum, while selecting the sample, a researcher should ensure that the
procedure adopted involves a relatively smaller sampling error and helps to
--- Content provided by FirstRanker.com ---
control systematic bias.1.7.3 Characteristics of a Good Sample Design:
The following are the characteristic features of a good sample design:
--- Content provided by FirstRanker.com ---
(a)
the sample design should yield a truly representative sample;
--- Content provided by FirstRanker.com ---
(b)the sample design should be such that it results in small sampling error;
(c)
--- Content provided by FirstRanker.com ---
the sample design should be viable in the context of budgetary
constraints of the research study;
--- Content provided by FirstRanker.com ---
(d)the sample design should be such that the systematic bias can be
controlled; and
--- Content provided by FirstRanker.com ---
(e)
the sample must be such that the results of the sample study would be
--- Content provided by FirstRanker.com ---
applicable, in general, to the universe at a reasonable level of confidence.--- Content provided by FirstRanker.com ---
1.7.4 Different Types of Sample Designs:
--- Content provided by FirstRanker.com ---
Sample designs may be classified into different categories based on twofactors, namely, the representation basis and the element selection technique.
Under the representation basis, the sample may be classified as:
--- Content provided by FirstRanker.com ---
I.
non-probability sampling
--- Content provided by FirstRanker.com ---
II.probability sampling
--- Content provided by FirstRanker.com ---
While probability sampling is based on random selection, the non-
--- Content provided by FirstRanker.com ---
probability sampling is based on `non-random' sampling.
--- Content provided by FirstRanker.com ---
I. Non-Probability Sampling:Non-probability sampling is the sampling procedure that does not afford any
basis for estimating the probability that each item in the population would have
--- Content provided by FirstRanker.com ---
an equal chance of being included in the sample. Non-probability sampling is
also known as deliberate sampling, judgment sampling and purposive sampling.
--- Content provided by FirstRanker.com ---
Under this type of sampling, the items for the sample are deliberately chosen bythe researcher; and his/her choice concerning the choice of items remains
supreme. In other words, under non-probability sampling the researchers select
--- Content provided by FirstRanker.com ---
a particular unit of the universe for forming a sample on the basis that the small
number that is thus selected out of a huge one would be typical or representative
--- Content provided by FirstRanker.com ---
of the whole population. For example, to study the economic conditions ofpeople living in a state, a few towns or village may be purposively selected for
an intensive study based on the principle that they are representative of the
--- Content provided by FirstRanker.com ---
entire state. In such a case, the judgment of the researcher of the study assumes
prime importance in this sampling design.
--- Content provided by FirstRanker.com ---
Quota Sampling:
Quota sampling is also an example of non-probability sampling. Under
--- Content provided by FirstRanker.com ---
this sampling, the researchers simply assume quotas to be filled from different
strata, with certain restrictions imposed on how they should be selected. This
--- Content provided by FirstRanker.com ---
type of sampling is very convenient and is relatively less expensive. However,the samples selected using this method certainly do not satisfy the characteristics
of random samples. They are essentially judgment samples and inferences
--- Content provided by FirstRanker.com ---
drawn based on that would not be amenable to statistical treatment in a formal
way.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
II.Probability Sampling:
Probability sampling is also known as `choice sampling' or `random sampling'.
--- Content provided by FirstRanker.com ---
Under this sampling design, every item of the universe has an equal chance of
being included in the sample. In a way, it is a lottery method under which
--- Content provided by FirstRanker.com ---
individual units are selected from the whole group, not deliberately, but by usingsome mechanical process. Therefore, only chance would determine whether an
item or the other would be included in the sample or not. The results obtained
--- Content provided by FirstRanker.com ---
from probability or random sampling would be assured in terms of probability.
That is, the researcher can measure the errors of estimation or the significance of
--- Content provided by FirstRanker.com ---
results obtained from the random sample. This is the superiority of randomsampling design over the deliberate sampling design. Random sampling
satisfies the law of Statistical Regularity, according to which if on an average
--- Content provided by FirstRanker.com ---
the sample chosen is random, then it would have the same composition and
characteristics of the universe. This is the reason why the random sampling
--- Content provided by FirstRanker.com ---
method is considered the best technique of choosing a representative sample.The following are the implications of the random sampling:
--- Content provided by FirstRanker.com ---
(i) it provides each element in the population an equal probability chance of
being chosen in the sample, with all choices being independent of one another
--- Content provided by FirstRanker.com ---
and(ii) it offers each possible sample combination an equal probability
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
opportunity of being selected.1.7.5 Method of Selecting a Random Sample:
--- Content provided by FirstRanker.com ---
The process of selecting a random sample involves writing the name of
--- Content provided by FirstRanker.com ---
each element of a finite population on a slip of paper and putting them into a box--- Content provided by FirstRanker.com ---
or a bag. Then they have to be thoroughly mixed and then the required numberof slips for the sample should be picked one after the other without replacement.
While doing this, it has to be ensured that in successive drawings each of the
--- Content provided by FirstRanker.com ---
remaining elements of the population has an equal chance of being chosen. This
method results in the same probability for each possible sample.
--- Content provided by FirstRanker.com ---
1.7.6 Complex random sampling designs:Under restricted sampling technique, the probability sampling may result in
complex random sampling designs. Such designs are known as mixed sampling
--- Content provided by FirstRanker.com ---
designs. Many of such designs may represent a combination of non-probability
and probability sampling procedures in choosing a sample.
--- Content provided by FirstRanker.com ---
Some of the prominent complex random sampling designs are as follows:(i) Systematic sampling: In some cases, the best way of sampling is to select
every first item on a list. Sampling of this kind is called as systematic sampling.
--- Content provided by FirstRanker.com ---
An element of randomness is introduced in this type of sampling by using
random numbers to select the unit with which to start. For example, if a 10 per
--- Content provided by FirstRanker.com ---
cent sample is required, the first item would be selected randomly from the firstand thereafter every 10th item. In this kind of sampling, only the first unit is
selected randomly, while rest of the units of the sample is chosen at fixed
--- Content provided by FirstRanker.com ---
intervals.
(ii) Stratified Sampling: When a population from which a sample is to be
--- Content provided by FirstRanker.com ---
selected does not comprise a homogeneous group, stratified sampling techniqueis generally employed for obtaining a representative sample. Under stratified
sampling, the population is divided into many sub-populations in such a manner
--- Content provided by FirstRanker.com ---
that they are individually more homogeneous than the rest of the total
population. Then, items are selected from each stratum to form a sample. As
--- Content provided by FirstRanker.com ---
each stratum is more homogeneous than the remaining total population, theresearcher is able to obtain a more precise estimate for each stratum and by
estimating each of the component parts more accurately, he/she is able to obtain
--- Content provided by FirstRanker.com ---
a better estimate of the whole. In sum, stratified sampling method yields more
--- Content provided by FirstRanker.com ---
reliable and detailed information.
(iii) Cluster Sampling: When the total area of research interest is large, a
--- Content provided by FirstRanker.com ---
convenient way in which a sample can be selected is to divide the area into anumber of smaller non-overlapping areas and then randomly selecting a number
of such smaller areas. In the process, the ultimate sample would consist of all
--- Content provided by FirstRanker.com ---
the units in these small areas or clusters. Thus in cluster sampling, the total
population is sub-divided into numerous relatively smaller subdivisions, which
--- Content provided by FirstRanker.com ---
in themselves constitute clusters of still smaller units. And then, some of suchclusters are randomly chosen for inclusion in the overall sample.
(iv) Area Sampling: When clusters are in the form of some geographic
--- Content provided by FirstRanker.com ---
subdivisions, then cluster sampling is termed as area sampling. That is, when
the primary sampling unit represents a cluster of units based on geographic area,
--- Content provided by FirstRanker.com ---
the cluster designs are distinguished as area sampling. The merits and demeritsof cluster sampling are equally applicable to area sampling.
(v) Multi-stage Sampling: A further development of the principle of cluster
--- Content provided by FirstRanker.com ---
sampling is multi-stage sampling. When the researcher desires to investigate the
working efficiency of nationalized banks in India and a sample of few banks is
--- Content provided by FirstRanker.com ---
required for this purpose, the first stage would be to select large primarysampling unit like the states in the country. Next, certain districts may be
selected and all banks interviewed in the chosen districts. This represents a two-
--- Content provided by FirstRanker.com ---
stage sampling design, with the ultimate sampling units being clusters of
districts.
--- Content provided by FirstRanker.com ---
On the other hand, if instead of taking census of all banks within the
selected districts, the researcher chooses certain towns and interviews all banks
--- Content provided by FirstRanker.com ---
in it, this would represent three-stage sampling design. Again, if instead of
taking a census of all banks within the selected towns, the researcher randomly
--- Content provided by FirstRanker.com ---
selects sample banks from each selected town, then it represents a case of using--- Content provided by FirstRanker.com ---
a four-stage sampling plan. Thus, if the researcher selects randomly at allstages, then it is called as multi-stage random sampling design.
(vi) Sampling with Probability Proportional to Size: When the case of cluster
--- Content provided by FirstRanker.com ---
sampling units does not have exactly or approximately the same number of
elements, it is better for the researcher to adopt a random selection process,
--- Content provided by FirstRanker.com ---
where the probability of inclusion of each cluster in the sample tends to beproportional to the size of the cluster. For this, the number of elements in each
cluster has to be listed, irrespective of the method used for ordering it. Then the
--- Content provided by FirstRanker.com ---
researcher should systematically pick the required number of elements from the
cumulative totals. The actual numbers thus chosen would not however reflect
--- Content provided by FirstRanker.com ---
the individual elements, but would indicate as to which cluster and how manyfrom them are to be chosen by using simple random sampling or systematic
sampling. The outcome of such sampling is equivalent to that of simple random
--- Content provided by FirstRanker.com ---
sample. The method is also less cumbersome and is also relatively less
expensive.
--- Content provided by FirstRanker.com ---
Thus, a researcher has to pass through various stages of conducting
research once the problem of interest has been selected. Research methodology
--- Content provided by FirstRanker.com ---
familiarizes a researcher with the complex scientific methods of conducting
research, which yield reliable results that are useful to policy-makers,
--- Content provided by FirstRanker.com ---
government, industries etc. in decision-making.References:
--- Content provided by FirstRanker.com ---
Claire Sellitiz and others, Research Methods in Social Sciences, 1962, p.50
--- Content provided by FirstRanker.com ---
Dollard,J., Criteria for the Life-history, Yale University Press, New York,1935,pp.8-31.
--- Content provided by FirstRanker.com ---
C.R. Kothari, Research Methodology, Methods and Techniques, Wiley Eastern
--- Content provided by FirstRanker.com ---
Limited, New Delhi, 1988.
--- Content provided by FirstRanker.com ---
Marie Jahoda, Morton Deutsch and Staurt W. Cook, Research Methods inSocial Relations, p.4.
--- Content provided by FirstRanker.com ---
Pauline V. Young, Scientific Social Surveys and Research, p.30
--- Content provided by FirstRanker.com ---
L.V. Redman and A.V.H. Mory, The Romance of Research, 1923.The Encylopaedia of Social Sciences, Vol. IX, MacMillan, 1930.
--- Content provided by FirstRanker.com ---
T.S. Wilkinson and P.L. Bhandarkar, Methodology and Techniques of Social
--- Content provided by FirstRanker.com ---
Research, Himalaya Publishing House, Bombay, 1979.Questions:
--- Content provided by FirstRanker.com ---
1. Define research.
2. What are the objectives of research?
--- Content provided by FirstRanker.com ---
3. State the significance of research.4. What is the importance of knowing how to do research?
5. Briefly outline research process.
--- Content provided by FirstRanker.com ---
6. Highlight the different research approaches.
7. Discuss the qualities of a researcher.
--- Content provided by FirstRanker.com ---
8. Explain the different types of research.9. What is a research problem?
10. Outline the features of research design.
--- Content provided by FirstRanker.com ---
11. Discuss the features of a good research design.
12. Describe the different types of research design.
--- Content provided by FirstRanker.com ---
13. Explain the significance of research design.
--- Content provided by FirstRanker.com ---
14. What is a case study?15. Discuss the criteria for evaluating case study.
16. Define hypothesis.
--- Content provided by FirstRanker.com ---
17. What are the characteristic features of a hypothesis?
18. Distinguish between null and alternative hypothesis.
--- Content provided by FirstRanker.com ---
19. Differentiate Type I error and Type II error.20. How is a hypothesis tested?
21. Define the concept of sampling design.
--- Content provided by FirstRanker.com ---
22. Describe the steps involved in sampling design.
23. Discuss the criteria for selecting a sampling procedure.
--- Content provided by FirstRanker.com ---
24. Distinguish between probability and non-probability sampling.25. How is a random sample selected?
26. Explain complex random sampling designs.
--- Content provided by FirstRanker.com ---
***
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT--IIDATA COLLECTION
1. SOURCES OF DATA
--- Content provided by FirstRanker.com ---
Lesson Outline:
Primary data
--- Content provided by FirstRanker.com ---
investigation
Indirect oral Methods of collecting primary data
--- Content provided by FirstRanker.com ---
Direct personal interviewsInformation received through local
agencies
--- Content provided by FirstRanker.com ---
Mailed questionnaire method
Schedules sent through enumerators
--- Content provided by FirstRanker.com ---
Learning Objectives:
--- Content provided by FirstRanker.com ---
After reading this lesson, you should be able to
? Understand the meaning of primary data
? Preliminaries of data collection
--- Content provided by FirstRanker.com ---
? Method of data collection? Methods of collecting primary data
? Usefulness of primary data
? Merits and demerits of different methods of primary data collection
? Precautions while collecting primary data.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Introduction:
It is important for a researcher to know the sources of data which he
--- Content provided by FirstRanker.com ---
requires for different purposes. Data are nothing but the information. There aretwo sources of information or data - Primary data and Secondary data. Primary
data refers to the data collected for the first time, whereas secondary data refers
--- Content provided by FirstRanker.com ---
to the data that have already been collected and used earlier by somebody or
some agency. For example, the statistics collected by the Government of India
--- Content provided by FirstRanker.com ---
relating to the population is primary data for the Government of India since ithas been collected for the first time. Later when the same data are used by a
researcher for his study of a particular problem, then the same data become the
--- Content provided by FirstRanker.com ---
secondary data for the researcher. Both the sources of information have their
merits and demerits. The selection of a particular source depends upon the (a)
--- Content provided by FirstRanker.com ---
purpose and scope of enquiry, (b) availability of time, (c) availability of finance,(d) accuracy required, (e) statistical tools to be used, (f) sources of information
(data), and (g) method of data collection.
--- Content provided by FirstRanker.com ---
(a)
--- Content provided by FirstRanker.com ---
Purpose and Scope of Enquiry: The purpose and scope of datacollection or survey should be clearly set out at the very beginning. It requires
the clear statement of the problem indicating the type of information which is
--- Content provided by FirstRanker.com ---
needed and the use for which it is needed. If for example, the researcher is
interested in knowing the nature of price change over a period of time, it would
--- Content provided by FirstRanker.com ---
be necessary to collect data of commodity prices. It must be decided whether itwould be helpful to study wholesale or retail prices and the possible uses to
which such information could be put. The objective of an enquiry may be either
--- Content provided by FirstRanker.com ---
to collect specific information relating to a problem or adequate data to test a
hypothesis. Failure to set out clearly the purpose of enquiry is bound to lead to
--- Content provided by FirstRanker.com ---
confusion and waste of resources.--- Content provided by FirstRanker.com ---
After the purpose of enquiry has been clearly defined, the next step is todecide about the scope of the enquiry. Scope of the enquiry means the coverage
with regard to the type of information, the subject-matter and geographical area.
--- Content provided by FirstRanker.com ---
For instance, an enquiry may relate to India as a whole or a state or an industrial
town wherein a particular problem related to a particular industry can be studied.
--- Content provided by FirstRanker.com ---
(b)Availability of Time: - The investigation should be carried out within a
reasonable period of time, failing which the information collected may become
--- Content provided by FirstRanker.com ---
outdated, and would have no meaning at all. For instance, if a producer wants to
know the expected demand for a product newly launched by him and the result
--- Content provided by FirstRanker.com ---
of the enquiry that the demand would be meager takes two years to reach him,then the whole purpose of enquiry would become useless because by that time
he would have already incurred a huge loss. Thus, in this respect the information
--- Content provided by FirstRanker.com ---
is quickly required and hence the researcher has to choose the type of enquiry
accordingly.
--- Content provided by FirstRanker.com ---
(c) Availability of Resources: The investigation will greatly depend on the
resources available like number of skilled personnel, the financial position etc. If
--- Content provided by FirstRanker.com ---
the number of skilled personnel who will carry out the enquiry is quite sufficient
and the availability of funds is not a problem, then enquiry can be conducted
--- Content provided by FirstRanker.com ---
over a big area covering a good number of samples, otherwise a small samplesize will do.
--- Content provided by FirstRanker.com ---
(d)The Degree of Accuracy Desired: Deciding the degree of accuracy required
is a must for the investigator, because absolute accuracy in statistical work is
--- Content provided by FirstRanker.com ---
seldom achieved. This is so because (i) statistics are based on estimates, (ii)tools of measurement are not always perfect and (iii) there may be unintentional
bias on the part of the investigator, enumerator or informant. Therefore, a desire
--- Content provided by FirstRanker.com ---
of 100% accuracy is bound to remain unfulfilled. Degree of accuracy desired
--- Content provided by FirstRanker.com ---
primarily depends upon the object of enquiry. For example, when we buy gold,
even a difference of 1/10th gram in its weight is significant, whereas the same
--- Content provided by FirstRanker.com ---
will not be the case when we buy rice or wheat. However, the researcher mustaim at attaining a higher degree of accuracy, otherwise the whole purpose of
research would become meaningless.
--- Content provided by FirstRanker.com ---
(e) Statistical Tools to be used: A well defined and identifiable object or a
group of objects with which the measurements or counts in any statistical
--- Content provided by FirstRanker.com ---
investigation are associated is called a statistical unit. For example, in socio-economic survey the unit may be an individual, a family, a household or a block
of locality. A very important step before the collection of data begins is to define
--- Content provided by FirstRanker.com ---
clearly the statistical units on which the data are to be collected. In number of
situations the units are conventionally fixed like the physical units of
--- Content provided by FirstRanker.com ---
measurement, such as meters, kilometers, quintals, hours, days, weeks etc.,which are well defined and do not need any elaboration or explanation.
However, in many statistical investigations, particularly relating to socio-
--- Content provided by FirstRanker.com ---
economic studies, arbitrary units are used which must be clearly defined. This is
a must because in the absence of a clear cut and precise definition of the
--- Content provided by FirstRanker.com ---
statistical units, serious errors in the data collection may be committed in thesense that we may collect irrelevant data on the items, which should have, in
fact, been excluded and omit data on certain items which should have been
--- Content provided by FirstRanker.com ---
included. This will ultimately lead to fallacious conclusions.
--- Content provided by FirstRanker.com ---
(f) Sources of Information (data): After deciding about the unit, a researcherhas to decide about the source from which the information can be obtained or
collected. For any statistical inquiry, the investigator may collect the data first
--- Content provided by FirstRanker.com ---
hand or he may use the data from other published sources, such as publications
--- Content provided by FirstRanker.com ---
of the government/semi-government organizations or journals and magazines
etc.
--- Content provided by FirstRanker.com ---
(g) Method of Data Collection: - There is no problem if secondary data areused for research. However, if primary data are to be collected, a decision has to
be taken whether (i) census method or (ii) sample technique is to be used for
--- Content provided by FirstRanker.com ---
data collection. In census method, we go for total enumeration i.e., all the units
of a universe have to be investigated. But in sample technique, we inspect or
--- Content provided by FirstRanker.com ---
study only a selected representative and adequate fraction of the population andafter analyzing the results of the sample data we draw conclusions about the
characteristics of the population. Selection of a particular technique becomes
--- Content provided by FirstRanker.com ---
difficult because where population or census method is more scientific and
100% accuracy can be attained through this method, choosing this becomes
--- Content provided by FirstRanker.com ---
difficult because it is time taking, it requires more labor and it is very expensive.Therefore, for a single researcher or for a small institution it proves to be
unsuitable. On the other hand, sample method is less time taking, less laborious
--- Content provided by FirstRanker.com ---
and less expensive but a 100% accuracy cannot be attained through this method
because of sampling and non-sampling errors attached to this method. Hence, a
--- Content provided by FirstRanker.com ---
researcher has to be very cautious and careful while choosing a particularmethod.
Methods of Collecting Primary Data:
--- Content provided by FirstRanker.com ---
Primary data may be obtained by applying any of the following methods:
1. Direct Personal Interviews.
--- Content provided by FirstRanker.com ---
2. Indirect oral interviews.3. Information from correspondents.
4. Mailed questionnaire methods.
--- Content provided by FirstRanker.com ---
5. Schedule sent through enumerators.
--- Content provided by FirstRanker.com ---
1. Direct personal interviews: A face to face contact is made with the
--- Content provided by FirstRanker.com ---
informants (persons from whom the information is to be obtained) under thismethod of collecting data. The interviewer asks them questions pertaining to the
survey and collects the desired information. Thus, if a person wants to collect
--- Content provided by FirstRanker.com ---
data about the working conditions of the workers of the Tata Iron and Steel
Company, Jamshedpur, he would go to the factory, contact the workers and
--- Content provided by FirstRanker.com ---
obtain the desired information. The information collected in this manner is firsthand and also original in character. There are many merits and demerits of this
method, which are discussed as under:
--- Content provided by FirstRanker.com ---
Merits:
1.
--- Content provided by FirstRanker.com ---
Most often respondents are happy to pass on the information requiredfrom them when contacted personally and thus response is encouraging.
2.
--- Content provided by FirstRanker.com ---
The information collected through this method is normally more accurate
because interviewer can clear doubts of the informants about certain
--- Content provided by FirstRanker.com ---
questions and thus obtain correct information. In case the interviewerapprehends that the informant is not giving accurate information, he may
cross-examine him and thereby try to obtain the information.
--- Content provided by FirstRanker.com ---
3.
This method also provides the scope for getting supplementary
--- Content provided by FirstRanker.com ---
information from the informant, because while interviewing it is possibleto ask some supplementary questions which may be of greater use later.
4.
--- Content provided by FirstRanker.com ---
There might be some questions which the interviewer would find
difficult to ask directly, but with some tactfulness, he can mingle such
--- Content provided by FirstRanker.com ---
questions with others and get the desired information. He can twist thequestions keeping in mind the informant's reaction. Precisely, a delicate
situation can usually he handled more effectively by a personal interview
--- Content provided by FirstRanker.com ---
than by other survey techniques.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
5.The interviewer can adjust the language according to the status and
educational level of the person interviewed, and thereby can avoid
--- Content provided by FirstRanker.com ---
inconvenience and misinterpretation on the part of the informant.
--- Content provided by FirstRanker.com ---
Demerits:1. This method can prove to be expensive if the number of informants is large
--- Content provided by FirstRanker.com ---
and the area is widely spread.
--- Content provided by FirstRanker.com ---
2. There is a greater chance of personal bias and prejudice under this method ascompared to other methods.
--- Content provided by FirstRanker.com ---
3. The interviewers have to be thoroughly trained and experienced; otherwise
they may not be able to obtain the desired information. Untrained or poorly
--- Content provided by FirstRanker.com ---
trained interviewers may spoil the entire work.4. This method is more time taking as compared to others. This is because
--- Content provided by FirstRanker.com ---
interviews can be held only at the convenience of the informants. Thus, if
information is to be obtained from the working members of households,
--- Content provided by FirstRanker.com ---
interviews will have to be held in the evening or on week end. Even duringevening only an hour or two can be used for interviews and hence, the work
may have to be continued for a long time, or a large number of people may
--- Content provided by FirstRanker.com ---
have to be employed which may involve huge expenses.
--- Content provided by FirstRanker.com ---
Conclusion:Though there are some demerits in this method of data collection still we cannot
say that it is not useful. The matter of fact is that this method is suitable for
--- Content provided by FirstRanker.com ---
intensive rather than extensive field surveys. Hence, it should be used only in
--- Content provided by FirstRanker.com ---
those cases where intensive study of a limited field is desired.
In the present time of extreme advancement in the communication system,
--- Content provided by FirstRanker.com ---
the investigator instead of going personally and conducting a face to faceinterview may also obtain information over telephone. A good number of
surveys are being conducted every day by newspapers and television channels
--- Content provided by FirstRanker.com ---
by sending the reply either by e-mail or SMS. This method has become very
popular nowadays as it is less expensive and the response is extremely quick.
--- Content provided by FirstRanker.com ---
But this method suffers from some serious defects, such as (a) very few peopleown a phone or a television and hence a limited number of people can be
approached by this method, (b) only few questions can be asked over phone or
--- Content provided by FirstRanker.com ---
through television, (c) the respondents may give a vague and reckless answers
because answers on phone or through SMS would have to be very short.
--- Content provided by FirstRanker.com ---
2. Indirect Oral Interviews: Under this method of data collection, theinvestigator contacts third parties generally called `witnesses' who are capable
of supplying necessary information. This method is generally adopted when the
--- Content provided by FirstRanker.com ---
information to be obtained is of a complex nature and informants are not
inclined to respond if approached directly. For example, when the researcher is
--- Content provided by FirstRanker.com ---
trying to obtain data on drug addiction or the habit of taking liquor, there is highprobability that the addicted person will not provide the desired data and hence
will disturb the whole research process. In this situation taking the help of such
--- Content provided by FirstRanker.com ---
persons or agencies or the neighbours who know them well becomes necessary.
Since these people know the person well, they can provide the desired data.
--- Content provided by FirstRanker.com ---
Enquiry Committees and Commissions appointed by the Government generallyadopt this method to get people's views and all possible details of the facts
related to the enquiry.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Though this method is very popular, its correctness depends upon a number of
factors which are discussed below:
--- Content provided by FirstRanker.com ---
1. The person or persons or agency whose help is solicited must be of provenintegrity; otherwise any bias or prejudice on their part will not bring the correct
information and the whole process of research will become useless.
--- Content provided by FirstRanker.com ---
2. The ability of the interviewers to draw information from witnesses by means
of appropriate questions and cross-examination.
--- Content provided by FirstRanker.com ---
3. It might happen that because of bribery, nepotism or certain other reasonsthose who are collecting the information give it such a twist that correct
conclusions are not arrived at.
--- Content provided by FirstRanker.com ---
Therefore, for the success of this method it is necessary that the evidence of
one person alone is not relied upon. Views from other persons and related
--- Content provided by FirstRanker.com ---
agencies should also be ascertained to find the real position .Utmost care mustbe exercised in the selection of these persons because it is on their views that the
final conclusions are reached.
--- Content provided by FirstRanker.com ---
3. Information from Correspondents: The investigator appoints local agents
or correspondents in different places to collect information under this method.
--- Content provided by FirstRanker.com ---
These correspondents collect and transmit the information to the central officewhere data are processed. This method is generally adopted by news paper
agencies. Correspondents who are posted at different places supply information
--- Content provided by FirstRanker.com ---
relating to such events as accidents, riots, strikes, etc., to the head office. The
correspondents are generally paid staff or sometimes they may be honorary
--- Content provided by FirstRanker.com ---
correspondents also. This method is also adopted generally by the governmentdepartments in such cases where regular information is to be collected from a
wide area. For example, in the construction of a wholesale price index numbers
--- Content provided by FirstRanker.com ---
regular information is obtained from correspondents appointed in different areas.
The biggest advantage of this method is that it is cheap and appropriate for
--- Content provided by FirstRanker.com ---
extensive investigation. But a word of caution is that it may not always ensure--- Content provided by FirstRanker.com ---
accurate results because of the personal prejudice and bias of thecorrespondents. As stated earlier, this method is suitable and adopted in those
cases where the information is to be obtained at regular intervals from a wide
--- Content provided by FirstRanker.com ---
area.
4. Mailed Questionnaire Method: Under this method, a list of questions
--- Content provided by FirstRanker.com ---
pertaining to the survey which is known as `Questionnaire' is prepared andsent to the various informants by post. Sometimes the researcher himself too
contacts the respondents and gets the responses related to various
--- Content provided by FirstRanker.com ---
questions in the questionnaire. The questionnaire contains questions and
provides space for answers. A request is made to the informants through a
--- Content provided by FirstRanker.com ---
covering letter to fill up the questionnaire and send it back within a specifiedtime. The questionnaire studies can be classified on the basis of:
i.
--- Content provided by FirstRanker.com ---
The degree to which the questionnaire is formalized or structured.
ii.
--- Content provided by FirstRanker.com ---
The disguise or lack of disguise of the questionnaire andiii. The communication method used.
--- Content provided by FirstRanker.com ---
When no formal questionnaire is used, interviewers adapt their questioning
to each interview as it progresses. They might even try to elicit responses by
--- Content provided by FirstRanker.com ---
indirect methods, such as showing pictures on which the respondent comments.When a researcher follows a prescribed sequence of questions, it is referred to as
structured study. On the other hand, when no prescribed sequence of questions
--- Content provided by FirstRanker.com ---
exists, the study is non-structured.
When questionnaires are constructed in such a way that the objective is clear
--- Content provided by FirstRanker.com ---
to the respondents then these questionnaires are known as non- disguised; on theother hand, when the objective is not clear, the questionnaire is a disguised one.
On the basis of these two classifications, four types of studies can he
--- Content provided by FirstRanker.com ---
distinguished:
i.
--- Content provided by FirstRanker.com ---
Non-disguised structured,--- Content provided by FirstRanker.com ---
ii.Non-disguised non-structured,
iii. Disguised structured and
--- Content provided by FirstRanker.com ---
iv. Disguised non-structured.
--- Content provided by FirstRanker.com ---
There are certain merits and demerits or limitations of this method of datacollection which are discussed below:
Merits:
--- Content provided by FirstRanker.com ---
1. Questionnaire method of data collection can be easily adopted where the
field of investigation is very vast and the informants are spread over a
--- Content provided by FirstRanker.com ---
wide geographical area.2. This method is relatively cheap and expeditious provided the informants
respond in time.
--- Content provided by FirstRanker.com ---
3. This method has proved to be superior when compared to other methods like
personal interviews or telephone method. This is because when questions
--- Content provided by FirstRanker.com ---
pertaining to personal nature or the ones requiring reaction by the family areput forth to the informants, there is a chance for them to be embarrassed in
answering them.
--- Content provided by FirstRanker.com ---
Demerits:
1. This method can be adopted only where the informants are literate
--- Content provided by FirstRanker.com ---
people so that they can understand written questions and lend theanswers in writing.
2. It involves some uncertainty about the response. Co-operation on the part of
--- Content provided by FirstRanker.com ---
informants may be difficult to presume.
3. The information provided by the informants may not be correct and it may
--- Content provided by FirstRanker.com ---
be difficult to verify the accuracy.However, by following the guidelines given below, this method can be made
--- Content provided by FirstRanker.com ---
more effective:
--- Content provided by FirstRanker.com ---
The questionnaires should be made in such a manner that they do not
become an undue burden on the respondents; otherwise the respondents may not
--- Content provided by FirstRanker.com ---
return them back.i.
Prepaid postage stamp should be affixed
--- Content provided by FirstRanker.com ---
ii.
The sample should be large
--- Content provided by FirstRanker.com ---
iii.It should be adopted in such enquiries where it is expected that the
respondents would return the questionnaire because of their own interest
--- Content provided by FirstRanker.com ---
in the enquiry.
iv.
--- Content provided by FirstRanker.com ---
It should be preferred in such enquiries where there could be a legalcompulsion to provide the information.
--- Content provided by FirstRanker.com ---
5. Schedules sent through Enumerators: Another method of data collection is
sending schedules through the enumerators or interviewers. The enumerators
--- Content provided by FirstRanker.com ---
contact the informants, get replies to the questions contained in a schedule andfill them in their own handwriting in the questionnaire form. There is difference
between questionnaire and schedule. Questionnaire refers to a device for
--- Content provided by FirstRanker.com ---
securing answers to questions by using a form which the respondent fills in him
self, whereas Schedule is the name usually applied to a set of questions which
--- Content provided by FirstRanker.com ---
are asked in a face-to face situation with another person. This method is freefrom most of the limitations of the mailed questionnaire method.
--- Content provided by FirstRanker.com ---
Merits:
The main merits or advantages of this method are listed below:
--- Content provided by FirstRanker.com ---
i.It can be adopted in those cases where informants are illiterate.
ii.
--- Content provided by FirstRanker.com ---
There is very little scope of non-response as the enumerators go personally
to obtain the information.
--- Content provided by FirstRanker.com ---
iii. The information received is more reliable as the accuracy of statementscan be checked by supplementary questions wherever necessary.
--- Content provided by FirstRanker.com ---
This method too like others is not free from defects or limitations. The
main limitations are listed below:
--- Content provided by FirstRanker.com ---
Demerits:
--- Content provided by FirstRanker.com ---
i.In comparison to other methods of collecting primary data, this method is
quite costly as enumerators are generally paid persons.
--- Content provided by FirstRanker.com ---
ii.
The success of the method depends largely upon the training imparted to
--- Content provided by FirstRanker.com ---
the enumerators.iii. Interviewing is a very skilled work and it requires experience and training.
Many statisticians have the tendency to neglect this extremely important
--- Content provided by FirstRanker.com ---
part of the data collecting process and this result in bad interviews.
Without good interviewing most of the information collected is of doubtful
--- Content provided by FirstRanker.com ---
value.iv. Interviewing is not only a skilled work but it also requires a great degree of
politeness and thus the way the enumerators conduct the interview would
--- Content provided by FirstRanker.com ---
affect the data collected. When questions are asked by a number of
different interviewers, it is possible that variations in the personalities of
--- Content provided by FirstRanker.com ---
the interviewers will cause variation in the answers obtained. Thisvariation will not be obvious. Hence, every effort must be made to remove
as much of variation as possible due to different interviewers.
--- Content provided by FirstRanker.com ---
Secondary Data: As stated earlier, secondary data are those data which have
--- Content provided by FirstRanker.com ---
already been collected and analyzed by some earlier agency for its own use, andlater the same data are used by a different agency. According to
W.A.Neiswanger, "A primary source is a publication in which the data are
--- Content provided by FirstRanker.com ---
published by the same authority which gathered and analyzed them. A
secondary source is a publication, reporting the data which was gathered by
--- Content provided by FirstRanker.com ---
other authorities and for which others are responsible."--- Content provided by FirstRanker.com ---
Sources of secondary data:-The various sources of secondary data can be
divided into two broad categories:
--- Content provided by FirstRanker.com ---
1.
Published sources, and
--- Content provided by FirstRanker.com ---
2.Unpublished sources.
--- Content provided by FirstRanker.com ---
1. Published Sources: The governmental, international and local agencies
publish statistical data, and chief among them are explained below:
--- Content provided by FirstRanker.com ---
(a) International Publications: There are some international institutions and
bodies like I.M.F, I.B.R.D, I.C.A.F.E and U.N.O who publish regular and
--- Content provided by FirstRanker.com ---
occasional reports on economic and statistical matters.
--- Content provided by FirstRanker.com ---
(b) Official publications of Central and State Governments: Severaldepartments of the Central and State Governments regularly publish reports on a
number of subjects. They gather additional information. Some of the important
--- Content provided by FirstRanker.com ---
publications are: The Reserve Bank of India Bulletin, Census of India, Statistical
Abstracts of States, Agricultural Statistics of India, Indian Trade Journal, etc.
--- Content provided by FirstRanker.com ---
(c) Semi-official publications: Semi-Government institutions like MunicipalCorporations, District Boards, Panchayats, etc. publish reports relating to
different matters of public concern.
--- Content provided by FirstRanker.com ---
(d) Publications of Research Institutions: Indian Statistical Institute (I.S.I),
Indian Council of Agricultural Research (I.C.A.R), Indian Agricultural Statistics
--- Content provided by FirstRanker.com ---
Research Institute (I.A.S.R.I), etc. publish the findings of their researchprogrammes.
(e) Publications of various Commercial and Financial Institutions
--- Content provided by FirstRanker.com ---
(f) Reports of various Committees and Commissions appointed by the
Government as the Raj Committee's Report on Agricultural Taxation, Wanchoo
--- Content provided by FirstRanker.com ---
Committee's Report on Taxation and Black Money, etc. are also important
--- Content provided by FirstRanker.com ---
sources of secondary data.(g) Journals and News Papers: Journals and News Papers are very important
and powerful source of secondary data. Current and important materials on
--- Content provided by FirstRanker.com ---
statistics and socio-economic problems can be obtained from journals and
newspapers like Economic Times, Commerce, Capital, Indian Finance, Monthly
--- Content provided by FirstRanker.com ---
Statistics of trade etc.2. Unpublished Sources: Unpublished data can be obtained from many
--- Content provided by FirstRanker.com ---
unpublished sources like records maintained by various government and private
offices, the theses of the numerous research scholars in the universities or
--- Content provided by FirstRanker.com ---
institutions etc.Precautions in the Use of Secondary Data: Since secondary data have already
--- Content provided by FirstRanker.com ---
been obtained, it is highly desirable that a proper scrutiny of such data is made
before they are used by the investigator. In fact the user has to be extra-cautious
--- Content provided by FirstRanker.com ---
while using secondary data. In this context Prof. Bowley rightly points out that"Secondary data should not be accepted at their face value." The reason being
that data may be erroneous in many respects due to bias, inadequate size of the
--- Content provided by FirstRanker.com ---
sample, substitution, errors of definition, arithmetical errors etc. Even if there is
no error such data may not be suitable and adequate for the purpose of the
--- Content provided by FirstRanker.com ---
enquiry. Prof. Simon Kuznet's view in this regard is also of great importance.According to him, "The degree of reliability of secondary source is to be
assessed from the source, the compiler and his capacity to produce correct
--- Content provided by FirstRanker.com ---
statistics and the users also, for the most part, tend to accept a series particularly
one issued by a government agency at its face value without enquiring its
--- Content provided by FirstRanker.com ---
reliability".--- Content provided by FirstRanker.com ---
Therefore, before using the secondary data the investigators should
consider the following factors:
--- Content provided by FirstRanker.com ---
4.
The suitability of data: The investigator must satisfy himself that the data
--- Content provided by FirstRanker.com ---
available are suitable for the purpose of enquiry. It can be judged by thenature and scope of the present enquiry with the original enquiry. For
example, if the object of the present enquiry is to study the trend in retail
--- Content provided by FirstRanker.com ---
prices, and if the data provide only wholesale prices, such data are
unsuitable.
--- Content provided by FirstRanker.com ---
(a) Adequacy of data: If the data are suitable for the purpose of investigationthen we must consider whether the data are useful or adequate for the
present analysis. It can be studied by the geographical area covered by the
--- Content provided by FirstRanker.com ---
original enquiry. The time for which data are available is very important
element. In the above example, if our object is to study the retail price
--- Content provided by FirstRanker.com ---
trend of India, and if the available data cover only the retail price trend inthe State of Bihar, then it would not serve the purpose.
--- Content provided by FirstRanker.com ---
(b) Reliability of data: The reliability of data is must. Without which there is
no meaning in research. The reliability of data can be tested by finding out
--- Content provided by FirstRanker.com ---
the agency that collected such data. If the agency has used proper methodsin collection of data, statistics may be relied upon.
--- Content provided by FirstRanker.com ---
It is not enough to have baskets of data in hand. In fact, data in a raw form are
--- Content provided by FirstRanker.com ---
nothing but a handful of raw material waiting for proper processing so that theycan become useful. Once data have been obtained from primary or secondary
source, the next step in a statistical investigation is to edit the data i.e. to
--- Content provided by FirstRanker.com ---
scrutinize the same. The chief objective of editing is to detect possible errors and
irregularities. The task of editing is a highly specialized one and requires great
--- Content provided by FirstRanker.com ---
care and attention. Negligence in this respect may render useless the findings of
--- Content provided by FirstRanker.com ---
an otherwise valuable study. Editing data collected from internal records andpublished sources is relatively simple but the data collected from a survey need
excessive editing.
--- Content provided by FirstRanker.com ---
While editing primary data, the following considerations should be borne in
mind:
--- Content provided by FirstRanker.com ---
1. The data should be complete in every respect2. The data should be accurate
3. The data should be consistent, and
--- Content provided by FirstRanker.com ---
4. The data should be homogeneous.
--- Content provided by FirstRanker.com ---
Data to posses the above mentioned characteristics have to undergo thesame type of editing which is discussed below:
5.
--- Content provided by FirstRanker.com ---
Editing for completeness: While editing, the editor should see that each
schedule and questionnaire is complete in all respects. He should see to it that
--- Content provided by FirstRanker.com ---
the answers to each and every question have been furnished. If some questionsare not answered and if they are of vital importance, the informants should be
contacted again either personally or through correspondence. Even after all the
--- Content provided by FirstRanker.com ---
efforts it may happen that a few questions remain unanswered. In such
questions, the editor should mark `No answer' in the space provided for answers
--- Content provided by FirstRanker.com ---
and if the questions are of vital importance then the schedule or questionnaireshould be dropped.
1.
--- Content provided by FirstRanker.com ---
Editing for Consistency: At the time of editing the data for consistency,
the editor should see that the answers to questions are not contradictory in
--- Content provided by FirstRanker.com ---
nature. If they are mutually contradictory answers, he should try to obtain thecorrect answers either by referring back the questionnaire or by contacting,
wherever possible, the informant in person. For example, if amongst others, two
--- Content provided by FirstRanker.com ---
questions in questionnaire are (a) Are you a student? (b) Which class do you
--- Content provided by FirstRanker.com ---
study and the reply to the first question is `no' and to the latter `tenth' then there
is contradiction and it should be clarified.
--- Content provided by FirstRanker.com ---
2.Editing for Accuracy: The reliability of conclusions depends basically
on the correctness of information. If the information supplied is wrong,
--- Content provided by FirstRanker.com ---
conclusions can never be valid. It is, therefore, necessary for the editor to see
that the information is accurate in all respects. If the inaccuracy is due to
--- Content provided by FirstRanker.com ---
arithmetical errors, it can be easily detected and corrected. But if the cause ofinaccuracy is faulty information supplied, it may be difficult to verify it and an
example of this kind is information relating to income, age etc.
--- Content provided by FirstRanker.com ---
3.
Editing for Homogeneity: Homogeneity means the condition in which
--- Content provided by FirstRanker.com ---
all the questions have been understood in the same sense. The editor must checkall the questions for uniform interpretation. For example, as to the question of
income, if some informants have given monthly income, others annual income
--- Content provided by FirstRanker.com ---
and still others weekly income or even daily income, no comparison can be
made. Therefore, it becomes an essential duty of the editor to check up that the
--- Content provided by FirstRanker.com ---
information supplied by the various people is homogeneous and uniform.Choice between Primary and Secondary Data: As we have already seen,
--- Content provided by FirstRanker.com ---
there are a lot of differences in the methods of collecting Primary and Secondary
data. Primary data which is to be collected originally involves an entire scheme
--- Content provided by FirstRanker.com ---
of plan starting with the definitions of various terms used, units to be employed,type of enquiry to be conducted, extent of accuracy aimed at etc. For the
collection of secondary data, a mere compilation of the existing data would be
--- Content provided by FirstRanker.com ---
sufficient. A proper choice between the type of data needed for any particular
statistical investigation is to be made after taking into consideration the nature,
--- Content provided by FirstRanker.com ---
objective and scope of the enquiry; the time and the finances at the disposal ofthe agency; the degree of precision aimed at and the status of the agency
(whether government- state or central-or private institution of an individual).
--- Content provided by FirstRanker.com ---
In using the secondary data, it is best to obtain the data from the primary source
--- Content provided by FirstRanker.com ---
as far as possible. By doing so, we would at least save ourselves from the errors
of transcription which might have inadvertently crept in the secondary source.
--- Content provided by FirstRanker.com ---
Moreover, the primary source will also provide us with detailed discussion aboutthe terminology used, statistical units employed, size of the sample and the
technique of sampling (if sampling method was used), methods of data
--- Content provided by FirstRanker.com ---
collection and analysis of results and we can ascertain ourselves if these would
suit our purpose.
--- Content provided by FirstRanker.com ---
Now-a-days in a large number of statistical enquiries, secondary data aregenerally used because fairly reliable published data on a large number of
diverse fields are now available in the publications of governments, private
--- Content provided by FirstRanker.com ---
organizations and research institutions, agencies, periodicals and magazines etc.
In fact, primary data are collected only if there do not exist any secondary data
--- Content provided by FirstRanker.com ---
suited to the investigation under study. In some of the investigations bothprimary as well as secondary data may be used.
--- Content provided by FirstRanker.com ---
SUMMARY:
There are two types of data, primary and secondary. Data which are collected
--- Content provided by FirstRanker.com ---
first hand are called Primary data and data which have already been collectedand used by somebody are called Secondary data. There are two methods of
collecting data: (a) Survey method or total enumeration method and (b) Sample
--- Content provided by FirstRanker.com ---
method. When a researcher goes for investigating all the units of the subject, it is
called as survey method. On the other hand if he/she resorts to investigating only
--- Content provided by FirstRanker.com ---
a few units of the subject and gives the result on the basis of that, it is known assample survey method. There are different sources of collecting Primary and
Secondary data. Some of the important sources of Primary data are--Direct
--- Content provided by FirstRanker.com ---
Personal Interviews, Indirect Oral Interviews, Information from correspondents,
Mailed questionnaire method, Schedules sent through enumerators and so on.
--- Content provided by FirstRanker.com ---
Though all these sources or methods of Primary data have their relative merits
--- Content provided by FirstRanker.com ---
and demerits, a researcher should use a particular method with lot of care. Thereare basically two sources of collecting secondary data- (a) Published sources and
(b) Unpublished sources. Published sources are like publications of different
--- Content provided by FirstRanker.com ---
government and semi-government departments, research institutions and
agencies etc. whereas unpublished sources are like records maintained by
--- Content provided by FirstRanker.com ---
different government departments and unpublished theses of differentuniversities etc. Editing of secondary data is necessary for different purposes as
? editing for completeness, editing for consistency, editing for accuracy and
--- Content provided by FirstRanker.com ---
editing for homogeneity.
--- Content provided by FirstRanker.com ---
It is always a tough task for the researcher to choose between primary
and secondary data. Though primary data are more authentic and accurate, time,
--- Content provided by FirstRanker.com ---
money and labor involved in obtaining these more often prompt the researcher
to go for the secondary data. There are certain amount of doubt about its
--- Content provided by FirstRanker.com ---
authenticity and suitability, but after the arrival of many government and semigovernment agencies and some private institutions in the field of data collection,
most of the apprehensions in the mind of the researcher have been removed.
--- Content provided by FirstRanker.com ---
SELF ASSESMENT QUESTIONS (SAQs):
--- Content provided by FirstRanker.com ---
1. Explain primary and secondary data and distinguish between them.(Refer the introduction part of this lesson.)
2. Explain the different methods of collecting primary data.
--- Content provided by FirstRanker.com ---
(Explain direct personal, indirect oral interview, information received
through agencies etc.)
--- Content provided by FirstRanker.com ---
3. Explain the merits and demerits of different methods of collecting primarydata.
(Refer the methods of collecting primary data)
--- Content provided by FirstRanker.com ---
4. Explain the different sources of secondary data and the precautions in using
--- Content provided by FirstRanker.com ---
secondary data.
5. What is editing of secondary data? Why is it required?
--- Content provided by FirstRanker.com ---
6. What are the different types of editing of secondary data?GLOSSARY OF TERMS:
--- Content provided by FirstRanker.com ---
Primary Source: It is one that itself collects the data.
Secondary Source: It is one that makes available data collected by some other
--- Content provided by FirstRanker.com ---
agency.Collection of Statistics: Collection means the assembling for the purpose of
particular investigation of entirely new data presumably not already available in
--- Content provided by FirstRanker.com ---
published sources.
Questionnaire: A list of questions properly selected and arranged pertaining to
--- Content provided by FirstRanker.com ---
the investigation.Investigator: Investigator is a person who collects the information.
Respondent: A person who fills the questionnaire or provides the required
--- Content provided by FirstRanker.com ---
information.
***
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT II
QUESTIONNAIRE AND SAMPLING
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Lesson OutlineMeaning of questionnaire.
Drafting of questionnaire.
--- Content provided by FirstRanker.com ---
Size of questions
Clarity of questions
--- Content provided by FirstRanker.com ---
Logical sequence of questionsSimple meaning questions
Other requirements of a good questionnaire
--- Content provided by FirstRanker.com ---
Meaning and essentials of sampling.
--- Content provided by FirstRanker.com ---
Learning Objectives
After reading this lesson you should be able to
--- Content provided by FirstRanker.com ---
Understand the meaning of questionnaire
Different requirements and characteristics of a good questionnaire
--- Content provided by FirstRanker.com ---
Meaning of samplingEssentials of sampling
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Introduction:
--- Content provided by FirstRanker.com ---
Nowadays questionnaire is widely used for data collection in social research. Itis a reasonably fair tool for gathering data from large, diverse, varied and
scattered social groups. The questionnaire is the media of communication
--- Content provided by FirstRanker.com ---
between the investigator and the respondents. According to Bogardus, a
questionnaire is a list of questions sent to a number of persons for their answers
--- Content provided by FirstRanker.com ---
and which obtains standardized results that can be tabulated and treatedstatistically. The Dictionary of Statistical Terms defines it as a "group of or
sequence of questions designed to elicit information upon a subject or sequence
--- Content provided by FirstRanker.com ---
of subjects from information." A questionnaire should be designed or drafted
with utmost care and caution so that all the relevant and essential information
--- Content provided by FirstRanker.com ---
for the enquiry may be collected without any difficulty, ambiguity andvagueness. Drafting of a good questionnaire is a highly specialized job and
requires great care skill, wisdom, efficiency and experience. No hard and fast
--- Content provided by FirstRanker.com ---
rule can be laid down for designing or framing a questionnaire. However, in this
connection, the following general points may be borne in mind:
--- Content provided by FirstRanker.com ---
1. Size of the Questionnaire Should be Small: A researcher should try his
best to keep the number of questions as small as possible, keeping in view the
--- Content provided by FirstRanker.com ---
nature, objectives and scope of the enquiry. Respondent's time should not be
wasted by asking irrelevant and unimportant questions. A large number of
--- Content provided by FirstRanker.com ---
questions would involve more work for the investigator and thus result in delayon his part in collecting and submitting the information. A large number of
unnecessary questions may annoy the respondent and he may refuse to
--- Content provided by FirstRanker.com ---
cooperate. A reasonable questionnaire should contain from 15 to 25 questions at
large. If a still larger number of questions are a must in any enquiry, then the
--- Content provided by FirstRanker.com ---
questionnaire should be divided into various sections or parts.--- Content provided by FirstRanker.com ---
2. The Questions Should be Clear: The questions should be easy, brief,
unambiguous, non-offending, courteous in tone, corroborative in nature and to
--- Content provided by FirstRanker.com ---
the point, so that much scope of guessing is left on the part of the respondents.
--- Content provided by FirstRanker.com ---
3. The Questions Should be Arranged in a Logical Sequence: Logicalarrangement of questions reduces lot of unnecessary work on the part of the
researcher because it not only facilitates the tabulation work but also does not
--- Content provided by FirstRanker.com ---
leave any chance for omissions or commissions. For example, to find if a person
owns a television, the logical order of questions would be: Do you own a
--- Content provided by FirstRanker.com ---
television? When did you buy it? What is its make? How much did it cost you?Is its performance satisfactory? Have you ever got it serviced?
--- Content provided by FirstRanker.com ---
4. Questions Should be Simple to Understand: The vague words like good,
--- Content provided by FirstRanker.com ---
bad, efficient, sufficient, prosperity, rarely, frequently, reasonable, poor, richetc., should not be used since these may be interpreted differently by different
persons and as such might give unreliable and misleading information. Similarly
--- Content provided by FirstRanker.com ---
the use of words having double meaning like price, assets, capital income etc.,
should also be avoided.
--- Content provided by FirstRanker.com ---
5. Questions Should be Comprehensive and Easily Answerable: Questions
should be designed in such a way that they are readily comprehensible and easy
--- Content provided by FirstRanker.com ---
to answer for the respondents. They should not be tedious nor should they tax
the respondents' memory. At the same time questions involving mathematical
--- Content provided by FirstRanker.com ---
calculations like percentages, ratios etc., should not be asked.--- Content provided by FirstRanker.com ---
6. Questions of Personal and Sensitive Nature Should Not be Asked: There
--- Content provided by FirstRanker.com ---
are some questions which disturb the respondents and he/she may be shy or
irritated by hearing such questions. Therefore, every effort should be made to
--- Content provided by FirstRanker.com ---
avoid such questions. For example, `do you cook yourself or your wife cooks?'`Or do you drink?' Such questions will certainly irk the respondents and thus be
avoided at any cost. If unavoidable then highest amount of politeness should be
--- Content provided by FirstRanker.com ---
used.
--- Content provided by FirstRanker.com ---
7. Types of Questions: Under this head, the questions in the questionnaire maybe classified as follows:
(a) Shut Questions: Shut questions are those where possible answers are
--- Content provided by FirstRanker.com ---
suggested by the framers of the questionnaire and the respondent is required to
tick one of them. Shut questions can further be subdivided into the following
--- Content provided by FirstRanker.com ---
forms:(i) Simple Alternate Questions: In this type of questions the respondent has to
choose from the two clear cut alternatives like `Yes' or `No', `Right or Wrong'
--- Content provided by FirstRanker.com ---
etc. Such questions are also called as dichotomous questions. This technique can
be applied with elegance to situations where two clear cut alternatives exist.
--- Content provided by FirstRanker.com ---
(ii) Multiple Choice Questions: Many a times it becomes difficult to define aclear cut alternative and accordingly in such a situation additional answers
between Yes and No, like Do not know, No opinion, Occasionally, Casually,
--- Content provided by FirstRanker.com ---
Seldom etc. are added. For example, in order to find if a person smokes or
drinks, the following multiple choice answers may be used:
--- Content provided by FirstRanker.com ---
Do you smoke?(a)Yes regularly [ ] (b) No never [ ]
(c) Occasionally [ ] (d) Seldom [ ]
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Multiple choice questions are very easy and convenient for the respondents to
answer. Such questions save time and also facilitate tabulation. This method
--- Content provided by FirstRanker.com ---
should be used if only a selected few alternative answers exist to a particularquestion.
--- Content provided by FirstRanker.com ---
8. Leading Questions Should be Avoided: Questions like `Why do you use a
particular type of car, say Maruti car' should preferably be framed into two
--- Content provided by FirstRanker.com ---
questions-(i) Which car do you use?
(ii) Why do you prefer it?
--- Content provided by FirstRanker.com ---
It gives smooth ride [ ]
It gives more mileage [ ]
--- Content provided by FirstRanker.com ---
It is cheaper [ ]It is maintenance free [ ]
--- Content provided by FirstRanker.com ---
9 Cross Checks: The questionnaire should be so designed as to provide
internal checks on the accuracy of the information supplied by the respondents
--- Content provided by FirstRanker.com ---
by including some connected questions at least with respect to matters which arefundamental to the enquiry.
--- Content provided by FirstRanker.com ---
10 Pre testing the Questionnaire: It would be practical in every sense to try
out the questionnaire on a small scale before using it for the given enquiry on a
--- Content provided by FirstRanker.com ---
large scale. This has been found extremely useful in practice. The givenquestionnaire can be improved or modified in the light of the drawbacks,
shortcomings and problems faced by the investigator in the pre test.
--- Content provided by FirstRanker.com ---
11 A Covering Letter: A covering letter from the organizers of the enquiry
--- Content provided by FirstRanker.com ---
should be enclosed along with the questionnaire for the purposes regarding--- Content provided by FirstRanker.com ---
definitions, units, concepts used in the questionnaire, for taking the respondent'sconfidence, self addressed envelop in case of mailed questionnaire, mention
about award or incentives for the quick response, a promise to send a copy of the
--- Content provided by FirstRanker.com ---
survey report etc.
--- Content provided by FirstRanker.com ---
SAMPLINGThough sampling is not new, the sampling theory has been developed
recently. People knew or not but they have been using the sampling technique in
--- Content provided by FirstRanker.com ---
their day to day life. For example a house wife tests a small quantity of rice to
see whether it has been well-cooked and gives the generalized result about the
--- Content provided by FirstRanker.com ---
whole rice boiling in the vessel. The result arrived at is most of the times 100%correct. In another example, when a doctor wants to examine the blood for any
deficiency, takes only a few drops of blood of the patient and examines. The
--- Content provided by FirstRanker.com ---
result arrived at is most of the times correct and represent the whole amount of
blood available in the body of the patient. In all these cases, by inspecting a few,
--- Content provided by FirstRanker.com ---
they simply believe that the samples give a correct idea about the population.Most of our decision are based on the examination of a few items only i.e.
sample studies. In the words of Croxton and Cowdon, "It may be too expensive
--- Content provided by FirstRanker.com ---
or too time consuming to attempt either a complete or a nearly complete
coverage in a statistical study. Further to arrive at valid conclusions, it may not
--- Content provided by FirstRanker.com ---
be necessary to enumerate all or nearly all of a population. We may study asample drawn from the large population and if that sample is adequately
representative of the population, we should be able to arrive at valid
--- Content provided by FirstRanker.com ---
conclusions."
According to Rosander, "The sample has many advantages over a census
--- Content provided by FirstRanker.com ---
or complete enumeration. If carefully designed, the sample is not onlyconsiderably cheaper but may give results which are just accurate and
--- Content provided by FirstRanker.com ---
sometimes more accurate than those of a census. Hence a carefully designed
sample may actually be better than a poorly planned and executed census."
--- Content provided by FirstRanker.com ---
Merits:
--- Content provided by FirstRanker.com ---
1.It saves time: Sampling method of data collection saves time because
fewer items are collected and processed. When the results are urgently required,
--- Content provided by FirstRanker.com ---
this method is very helpful.
2.
--- Content provided by FirstRanker.com ---
It reduces cost: Since only a few and selected items are studied insampling, there is reduction in cost of money and reduction in terms of man
hours.
--- Content provided by FirstRanker.com ---
3.
More reliable results can be obtained: Through sampling, more
--- Content provided by FirstRanker.com ---
reliable results can be obtained because (a) there are fewer chances of samplingstatistical errors. If there is sampling error, it is possible to estimate and control
the results.(b) Highly experienced and trained persons can be employed for
--- Content provided by FirstRanker.com ---
scientific processing and analyzing of relatively limited data and they can use
their high technical knowledge and get more accurate and reliable results.
--- Content provided by FirstRanker.com ---
4.It provides more detailed information: As it saves time, money and
labor, more detail information can be collected in a sample survey.
--- Content provided by FirstRanker.com ---
5.
Sometimes only Sampling method to depend upon: Some times it so
--- Content provided by FirstRanker.com ---
happens that one has to depend upon sampling method alone because if thepopulation under study is finite, sampling method is the only method to be used.
For example, if someone's blood has to be examined, it will become fatal to take
--- Content provided by FirstRanker.com ---
all the blood out from the body and study depending upon the total enumeration
method.
--- Content provided by FirstRanker.com ---
6.Administrative convenience: The organization and administration of
sample survey are easy for the reasons which have been discussed earlier.
--- Content provided by FirstRanker.com ---
7.
--- Content provided by FirstRanker.com ---
More scientific: Since the methods used to collect data are based on
scientific theory and results obtained can be tested, sampling is a more scientific
--- Content provided by FirstRanker.com ---
method of collecting data.It is not that sampling is free from demerits or shortcomings. There are certain
shortcomings of this method which are discussed below:
--- Content provided by FirstRanker.com ---
1.
Illusory conclusion: If a sample enquiry is not carefully planned and
--- Content provided by FirstRanker.com ---
executed, the conclusions may be inaccurate and misleading.2.
Sample not representative: To make the sample representative is a
--- Content provided by FirstRanker.com ---
difficult task. If a representative sample is taken from the universe, the result is
applicable to the whole population. If the sample is not representative of the
--- Content provided by FirstRanker.com ---
universe the result may be false and misleading.3.
Lack of experts: As there are lack of experts to plan and conduct a
--- Content provided by FirstRanker.com ---
sample survey, its execution and analysis, and its results would be
unsatisfactory and not trustworthy.
--- Content provided by FirstRanker.com ---
4.Sometimes more difficult than census method: Sometimes the
sampling plan may be complicated and requires more money, labor and time
--- Content provided by FirstRanker.com ---
than a census method.
5.
--- Content provided by FirstRanker.com ---
Personal bias: There may be personal biases and prejudices with regardto the choice of technique and drawing of sampling units.
6.
--- Content provided by FirstRanker.com ---
Choice of sample size: If the size of the sample is not appropriate then it
may lead to untrue characteristics of the population.
--- Content provided by FirstRanker.com ---
7.Conditions of complete coverage: If the information is required for
each and every item of the universe, then a complete enumeration survey is
--- Content provided by FirstRanker.com ---
better.
Essentials of sampling: In order to reach a clear conclusion, the sampling
--- Content provided by FirstRanker.com ---
should possess the following essentials:1.
It must be representative: The sample selected should possess the
--- Content provided by FirstRanker.com ---
similar characteristics of the original universe from which it has been drawn.
--- Content provided by FirstRanker.com ---
2.
Homogeneity: Selected samples from the universe should have similar
--- Content provided by FirstRanker.com ---
nature and should mot have any difference when compared with the universe.3.
Adequate samples: In order to have a more reliable and representative
--- Content provided by FirstRanker.com ---
result, a good number of items are to be included in the sample.
4.
--- Content provided by FirstRanker.com ---
Optimization: All efforts should be made to get maximum results bothin terms of cost as well as efficiency. If the size of the sample is larger, there is
better efficiency and at the same time the cost is more. A proper size of sample
--- Content provided by FirstRanker.com ---
is maintained in order to have optimized results in terms of cost and efficiency.
--- Content provided by FirstRanker.com ---
STATISTICAL LAWSOne of the basic reasons for undertaking a sample survey is to predict
and generalize the results for the population as a whole. The logical process of
--- Content provided by FirstRanker.com ---
drawing general conclusions from a study of representative items is called
induction. In statistics, induction is a generalization of facts on the assumption
--- Content provided by FirstRanker.com ---
that the results provided by an adequate sample may be taken as applicable tothe whole. The fact that the characteristics of the sample provide a fairly good
idea about the population characteristics is borne out by the theory of
--- Content provided by FirstRanker.com ---
probability. Sampling is based on two fundamental principles of statistics theory
viz, (i) the Law of Statistical Regularity and (ii) the Law of Inertia of Large
--- Content provided by FirstRanker.com ---
Numbers.THE LAW OF STATISTICAL REGULARITY
--- Content provided by FirstRanker.com ---
The Law of Statistical Regularity is derived from the mathematical theory of
probability. According to W.I.King, "The Law of Statistical Regularity
--- Content provided by FirstRanker.com ---
formulated in the mathematical theory of probability lays down that amoderately large number of items chosen at random from a very large group are
almost sure to have the characteristics of the large group." For example, if we
--- Content provided by FirstRanker.com ---
want to find out the average income of 10,000 people, we take a sample of 100
--- Content provided by FirstRanker.com ---
people and find the average. Suppose another person takes another sample of
100 people from the same population and finds the average, the average income
--- Content provided by FirstRanker.com ---
found out by both the persons will have the least difference. On the other hand ifthe average income of the same 10,000 people is found out by the census
method, the result will be more or less the same.
--- Content provided by FirstRanker.com ---
Characteristics
1. The item selected will represent the universe and the result is generalized
--- Content provided by FirstRanker.com ---
to universe as a whole.2. Since sample size is large, it is representative of the universe.
--- Content provided by FirstRanker.com ---
3. There is a very remote chance of bias.
--- Content provided by FirstRanker.com ---
LAW OF INERTIA OF LARGE NUMBERSThe Law of inertia of Large Numbers is an immediate deduction from
the Principle of Statistical Regularity. Law of Inertia of Large Numbers states,
--- Content provided by FirstRanker.com ---
"Other things being equal, as the sample size increases, the results tend to be
more reliable and accurate." This is based on the fact that the behavior or a
--- Content provided by FirstRanker.com ---
phenomenon en masse. i.e., on a large scale is generally stable. It implies thatthe total change is likely to be very small, when a large number or items are
taken in a sample. The law will be true on an average. If sufficient large samples
--- Content provided by FirstRanker.com ---
are taken from the patent population, the reverse movements of different parts in
the same will offset by the corresponding movements of some other parts.
--- Content provided by FirstRanker.com ---
Sampling Errors: In a sample survey, since only a small portion of thepopulation is studied its results are bound to differ from the census results and
thus, have a certain amount of error. In statistics the word error is used to denote
--- Content provided by FirstRanker.com ---
the difference between the true value and the estimated or approximated value.
This error would always be there no matter that the sample is drawn at random
--- Content provided by FirstRanker.com ---
and that it is highly representative. This error is attributed to fluctuations of--- Content provided by FirstRanker.com ---
sampling and is called sampling error. Sampling error is due to the fact that onlya sub set of the population has been used to estimate the population parameters
and draw inferences about the population. Thus, sampling error is present only
--- Content provided by FirstRanker.com ---
in a sample survey and is completely absent in census method.
Sampling errors occur primarily due to the following reasons:
--- Content provided by FirstRanker.com ---
1.Faulty selection of the sample: Some of the bias is introduced by the
use of defective sampling technique for the selection of a sample e.g. purposive
--- Content provided by FirstRanker.com ---
or judgment sampling in which the investigator deliberately selects a
representative sample to obtain certain results. This bias can be easily overcome
--- Content provided by FirstRanker.com ---
by adopting the technique of simple random sampling.2.
Substitution: When difficulties arise in enumerating a particular
--- Content provided by FirstRanker.com ---
sampling unit included in the random sample, the investigators usually substitute
a convenient member of the population. This obviously leads to some bias since
--- Content provided by FirstRanker.com ---
the characteristics possessed by the substituted unit will usually be differentfrom those possessed by the unit originally included in the sample.
3.
--- Content provided by FirstRanker.com ---
Faulty demarcation of sampling units: Bias due to defective
demarcation of sampling units is particularly significant in area surveys such as
--- Content provided by FirstRanker.com ---
agricultural experiments in the field of crop cutting surveys etc. In such surveys,while dealing with border line cases, it depends more or less on the discretion of
the investigator whether to include them in the sample or not.
--- Content provided by FirstRanker.com ---
4.
Error due to bias in the estimation method: Sampling method consists
--- Content provided by FirstRanker.com ---
in estimating the parameters of the population by appropriate statistics computedfrom the sample. Improper choice of the estimation techniques might introduce
the error.
--- Content provided by FirstRanker.com ---
5.
Variability of the population: Sampling error also depends on the
--- Content provided by FirstRanker.com ---
variability or heterogeneity of the population to be sampled.Sampling errors are of two types: Biased Errors and Unbiased Errors
--- Content provided by FirstRanker.com ---
Biased Errors: The errors that occur due to a bias of prejudice on the part of
--- Content provided by FirstRanker.com ---
the informant or enumerator in selecting, estimating measuring instruments are
called biased errors. Suppose for example, the enumerator used the deliberate
--- Content provided by FirstRanker.com ---
sampling method in the place of simple random sampling method, then it iscalled biased errors. These errors are cumulative in nature and increase when the
sample size also increases. These errors arise due to defect in the methods of
--- Content provided by FirstRanker.com ---
collection of data, defect in the method of organization of data and defect in the
method of analysis of data.
--- Content provided by FirstRanker.com ---
Unbiased errors: Errors which occur in the normal course of investigation orenumeration on account of chance are called unbiased errors. They may arise
accidentally without any bias or prejudice. These errors occur due to faulty
--- Content provided by FirstRanker.com ---
planning of statistical investigation.
To avoid these errors, the statistician must take proper precaution and care in
--- Content provided by FirstRanker.com ---
using the correct measuring instrument. He must see that the enumerators arealso not biased. Unbiased errors can be removed with the proper planning of
statistical investigations. Both these errors should be avoided by the statisticians.
--- Content provided by FirstRanker.com ---
Reducing Sampling Errors:
Errors in sampling can be reduced if the size of sample is increased. This is
--- Content provided by FirstRanker.com ---
shown in the following diagram.From the above diagram it is clear that when the size of the sample
--- Content provided by FirstRanker.com ---
increases, sampling error decreases. And by this process samples can be made
more representatives to the population.
--- Content provided by FirstRanker.com ---
Testing of Hypothesis:
As a part of investigation, samples are drawn from the population and
--- Content provided by FirstRanker.com ---
results are derived to help in taking the decisions. But such decisions involve an
element of uncertainty causing wrong decisions. Hypothesis is an assumption
--- Content provided by FirstRanker.com ---
which may or may not be true about a population parameter. For example, if we
--- Content provided by FirstRanker.com ---
toss a coin 200 times, we may get 110 heads and 90 tails. At this instance, weare interested in testing whether the coin is unbiased or not.
Therefore, we may conduct a test to judge the significance of the difference of
--- Content provided by FirstRanker.com ---
sampling or otherwise. To carry out a test of significance, the following
procedure has to be followed:
--- Content provided by FirstRanker.com ---
1. Framing the Hypothesis: To verify the assumption, which is based onsample study, we collect data and find out the difference between the sample
value and the population value. If there is no difference found or the difference
--- Content provided by FirstRanker.com ---
is very small then the hypothetical value is correct. Generally two hypotheses
are constructed, and if one is found correct, the other is rejected.
--- Content provided by FirstRanker.com ---
(a) Null Hypothesis: The random selection of the samples from the givenpopulation makes the tests of significance valid for us. For applying any test of
significance we first set up a hypothesis- a definite statement about the
--- Content provided by FirstRanker.com ---
population parameter/s. Such a statistical hypothesis, which is under test, is
usually a hypothesis of no difference and hence is called Null hypothesis. It is
--- Content provided by FirstRanker.com ---
usually denoted by Ho. In the words of Prof. R.A.Fisher "Null hypothesis is thehypothesis which is tested for possible rejection under the assumption that
it is true."
--- Content provided by FirstRanker.com ---
(b)
Alternative Hypothesis. Any hypothesis which is complementary to the
--- Content provided by FirstRanker.com ---
null hypothesis is called an alternative hypothesis. It is usually denoted by H1. Itis very important to explicitly state the alternative hypothesis in respect of any
null hypothesis H0 because the acceptance or rejection of Ho is meaningful only
--- Content provided by FirstRanker.com ---
if it is being tested against a rival hypothesis. For example, if we want to test the
null hypothesis that the population has a specified mean ?0(say), i.e.,
--- Content provided by FirstRanker.com ---
H0:?=? then the alternative hypothesis could be:(i) H1:?0 (i.e., ?>?0 or ?<?0)
(ii) H1: ?>?0
--- Content provided by FirstRanker.com ---
(iii) H1: ?<?0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The alternative hypothesis (i) is known as a two-tailed alternative and thealternatives in (ii) and (iii) are known as right-tailed and left-tailed alternatives.
Accordingly, the corresponding tests of significance are called two-tailed, right-
--- Content provided by FirstRanker.com ---
tailed and left-tailed tests respectively.
The null hypothesis consists of only a single parameter value and is
--- Content provided by FirstRanker.com ---
usually simple while alternative hypothesis is usually composite.Types of Errors in Testing of Hypothesis: As stated earlier, the inductive
--- Content provided by FirstRanker.com ---
inference consists in arriving at a decision to accept or reject a null hypothesis
(Ho) after inspecting only a sample from it. As such an element of risk ? the risk
--- Content provided by FirstRanker.com ---
of taking wrong decision is involved. In any test procedure, the four possiblemutually disjoint and exhaustive decisions are:
(i) Reject Ho when actually it is not true i.e., when Ho is false.
--- Content provided by FirstRanker.com ---
(ii) Accept Ho when it is true.
(iii) Reject Ho when it is true.
--- Content provided by FirstRanker.com ---
(iv)Accept Ho when it is false.
The decisions in (i) and (ii) are correct decisions while the decisions in
--- Content provided by FirstRanker.com ---
(iii) and (iv) are wrong decisions. These decisions may be expressed in the
following dichotomous table:
--- Content provided by FirstRanker.com ---
Decision from sample
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Reject HoAccept Ho
--- Content provided by FirstRanker.com ---
True State
--- Content provided by FirstRanker.com ---
Ho TrueWrong
Correct
--- Content provided by FirstRanker.com ---
Type I Error
--- Content provided by FirstRanker.com ---
Ho False
Correct Wrong
--- Content provided by FirstRanker.com ---
(H1True)
--- Content provided by FirstRanker.com ---
Type II Error.--- Content provided by FirstRanker.com ---
Thus, in testing of hypothesis we are likely to commit two types of
errors. The error of rejecting Ho when Ho is true is known as Type I error and
--- Content provided by FirstRanker.com ---
the error of accepting Ho when Ho is false is known as Type II Error.
For example, in the Industrial Quality Control, while inspecting the quality of a
--- Content provided by FirstRanker.com ---
manufactured lot, the Inspector commits Type I Error when he rejects a good lotand he commits Type II Error when he accepts a bad lot.
--- Content provided by FirstRanker.com ---
SUMMARY
Nowadays questionnaire method of data collection has become very popular. It
--- Content provided by FirstRanker.com ---
is a very powerful tool to collect required data in shortest period of time andwith little expense. It is scientific too. But drafting of questionnaire is a very
skilled and careful work. Therefore, there are certain requirements and essentials
--- Content provided by FirstRanker.com ---
which should be followed at the time of framing the questionnaire. They include
the following viz., (i) the size of the questionnaire should be small, (ii) questions
--- Content provided by FirstRanker.com ---
should be very clear in understanding, (iii) questions should be put in a logicalorder, (iv) questions should have simple meaning etc. Apart from this, multiple
choice questions should be asked. Questionnaire should be pre tested before
--- Content provided by FirstRanker.com ---
going for final data collection. Information supplied should be cross checked for
any false or insufficient information. After all these formalities have been
--- Content provided by FirstRanker.com ---
completed, a covering note should accompany the questionnaire explainingvarious purposes, designs, units and incentives.
--- Content provided by FirstRanker.com ---
There are two ways of survey- Census survey and Sample survey through
which data can be collected. Census survey means total enumeration i.e.,
--- Content provided by FirstRanker.com ---
collecting data from each and every unit of the universe, whereas sample surveyconcentrates on collecting data from a few units of the universe selected
scientifically for the purpose. Since census method is more time taking,
--- Content provided by FirstRanker.com ---
expensive and labor intensive, it becomes impractical to depend on it. Therefore,
--- Content provided by FirstRanker.com ---
sample survey is preferred which is scientific, less expensive, less time taking
and less labor intensive too.
--- Content provided by FirstRanker.com ---
But there are merits and demerits of this method which are detailed below:Merits - It reduces cost, saves time and is more reliable. It provides
more detailed information and is sometimes the only method to depend upon for
--- Content provided by FirstRanker.com ---
administrative convenience and scientifically.
Demerits - Sometimes samples may not be representative and may give
--- Content provided by FirstRanker.com ---
illusory conclusions. There are lack of experts and sometimes it is more difficultthan the census method, since there might arise personal bias and the
determination of the size of the sample might be very difficult.
--- Content provided by FirstRanker.com ---
Apart from these, there are some essentials of sampling which must be
followed. They are: Samples must be representative, samples must be
--- Content provided by FirstRanker.com ---
homogeneous and the number of samples must be adequate. When a researcherresorts to sampling, he intends to collect some data which would help him to
draw results and finally take a decision. When he takes a decision it's on the
--- Content provided by FirstRanker.com ---
basis of hypothesis which is precisely assumption and is prone to two types of
errors-Type I Error and Type II Error. When a researcher rejects a correct
--- Content provided by FirstRanker.com ---
hypothesis, he commits type I error and when he accepts a wrong hypothesis hecommits type II error. The researcher should try to avoid both types of errors but
committing type II error is more harmful than type I error.
--- Content provided by FirstRanker.com ---
SELF ASSESMENT QUESTIONS (SAQs)
--- Content provided by FirstRanker.com ---
1.Explain questionnaire and examine its main characteristics.
(Refer to the introduction part of the questionnaire section)
--- Content provided by FirstRanker.com ---
2.
--- Content provided by FirstRanker.com ---
Explain main requirements of a good questionnaire.(Refer to the sub points from 1 to 11)
--- Content provided by FirstRanker.com ---
3. What is sampling? Explain its main merits and demerits.
--- Content provided by FirstRanker.com ---
(Refer to the introduction and the following part of the lesson)
--- Content provided by FirstRanker.com ---
4What are null and alternative hypothesis? Explain.
(Refer the point Framing the Hypothesis)
--- Content provided by FirstRanker.com ---
6.
--- Content provided by FirstRanker.com ---
What are Type I error and Type II error? (Refer to types of error inhypothesis)
***
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT II
--- Content provided by FirstRanker.com ---
3. EXPERIMENTSLesson Outline
--- Content provided by FirstRanker.com ---
Procedures adopted in experiments
Meaning of Experiments
--- Content provided by FirstRanker.com ---
Research design in case of hypothesis testingresearch studies
Basic principles in experimental designs
--- Content provided by FirstRanker.com ---
Prominent experimental designs
--- Content provided by FirstRanker.com ---
Learning Objectives
--- Content provided by FirstRanker.com ---
After reading this lesson you should be able to understand the
--- Content provided by FirstRanker.com ---
Nature and meaning of ExperimentsKinds of experiments
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Introduction
The meaning of experiment lies in the process of examining the truth of
--- Content provided by FirstRanker.com ---
a statistical hypothesis related to some research problem. For example, aresearcher can conduct an experiment to examine the newly developed
medicine. Experiment is of two types: absolute experiment and comparative
--- Content provided by FirstRanker.com ---
experiment. When a researcher wants to determine the impact of a fertilizer on
the yield of a crop it is a case of absolute experiment. On the other hand, if he
--- Content provided by FirstRanker.com ---
wants to determine the impact of one fertilizer as compared to the impact ofsome other fertilizer, the experiment will then be called as a comparative
experiment. Normally, a researcher conducts a comparative experiment when he
--- Content provided by FirstRanker.com ---
talks of designs of experiments.
--- Content provided by FirstRanker.com ---
Research design can be of three types:(a)
Research design in the case of descriptive and diagnostic research
--- Content provided by FirstRanker.com ---
studies,
(b)
--- Content provided by FirstRanker.com ---
Research design in the case of exploratory research studies, and(c)
Research design in the case of hypothesis testing research studies.
--- Content provided by FirstRanker.com ---
Here we are mainly concerned with the third one which is Research design
in the case of hypothesis testing research studies.
--- Content provided by FirstRanker.com ---
Research design in the case of hypothesis testing research studies:
Hypothesis testing research studies are generally known as experimental studies.
--- Content provided by FirstRanker.com ---
This is a study where a researcher tests the hypothesis of causal relationships
between variables. This type of study requires some procedures which will not
--- Content provided by FirstRanker.com ---
only reduce bias and increase reliability, but will also permit drawing inferencesabout causality. Most of the times, experiments meet these requirements. Prof.
Fisher is considered as the pioneer of this type of studies (experimental studies).
--- Content provided by FirstRanker.com ---
He did pioneering work when he was working at Rothamsted Experimental
--- Content provided by FirstRanker.com ---
Station in England which was a centre for Agricultural Research. While working
there, Prof. Fisher found that by dividing plots into different blocks and then by
--- Content provided by FirstRanker.com ---
conducting experiments in each of these blocks whatever information iscollected and inferences drawn from them happened to be more reliable. This
was where he was inspired to develop certain experimental designs for testing
--- Content provided by FirstRanker.com ---
hypotheses concerning scientific investigations. Nowadays, the experimental
design is used in researches relating to almost every discipline of knowledge.
--- Content provided by FirstRanker.com ---
Prof. Fisher laid three principles of experimental designs:(1) The Principle of Replication
(2) The Principle of Randomization and
--- Content provided by FirstRanker.com ---
(3) The Principle of Local Control.
--- Content provided by FirstRanker.com ---
(1) The Principle of Replication:According to this principle, the experiment should be repeated more than
once. Thus, each treatment is applied in many experimental units instead of one.
--- Content provided by FirstRanker.com ---
This way the statistical accuracy of the experiments is increased. For example,
suppose we are going to examine the effect of two varieties of wheat.
--- Content provided by FirstRanker.com ---
Accordingly, we divide the field into two parts and grow one variety in one partand the other variety in the other. Then we compare the yield of the two parts
and draw conclusion on that basis. But if we are to apply the principle of
--- Content provided by FirstRanker.com ---
replication to this experiment, then we first divide the field into several parts,
grow one variety in half of these parts and the other variety in the remaining
--- Content provided by FirstRanker.com ---
parts. Then we collect the data of yield of the two varieties and draw conclusionby comparing the same. The result so obtained will be more reliable in
comparison to the conclusion we draw without applying the principle of
--- Content provided by FirstRanker.com ---
replication. The entire experiment can be repeated several times for better
results.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(2) The Principle of Randomization:When we conduct an experiment, the principle of randomization
provides us a protection against the effects of extraneous factors by
--- Content provided by FirstRanker.com ---
randomization. This means that this principle indicates that the researcher
should design or plan the experiment in such a way that the variations caused by
--- Content provided by FirstRanker.com ---
extraneous factors can all be combined under the general heading of `chance'.For example, when a researcher grows one variety of wheat , say , in the first
half of the parts of a field and the other variety he grows in the other half, then it
--- Content provided by FirstRanker.com ---
is just possible that the soil fertility may be different in the first half in
comparison to the other half. If this is so the researcher's result is not realistic.
--- Content provided by FirstRanker.com ---
In this situation, he may assign the variety of wheat to be grown in differentparts of the field on the basis of some random sampling technique i.e., he may
apply randomization principle and protect himself against the effects of the
--- Content provided by FirstRanker.com ---
extraneous factors. Therefore, by using the principle of randomization, he can
draw a better estimate of the experimental error.
--- Content provided by FirstRanker.com ---
(3). The Principle of Local Control:
This is another important principle of experimental designs. Under this
--- Content provided by FirstRanker.com ---
principle, the extraneous factor which is the known source of variability is made
to vary deliberately over as wide a range as necessary. This needs to be done in
--- Content provided by FirstRanker.com ---
such a way that the variability it causes can be measured and hence eliminatedfrom the experimental error. The experiment should be planned in such a way
that the researcher can perform a two-way analysis of variance, in which the
--- Content provided by FirstRanker.com ---
total variability of the data is divided into three components attributed to
treatments (varieties of wheat in this case), the extraneous factor (soil fertility in
--- Content provided by FirstRanker.com ---
this case) and experimental error. In short, through the principle of local controlwe can eliminate the variability due to extraneous factors from the experimental
error.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Kinds of experimental Designs and Control
Experimental designs refer to the framework of structure of an
--- Content provided by FirstRanker.com ---
experiment and as such there are several experimental designs. Generally,experimental designs are classified into two broad categories: informal
experimental designs and formal experimental designs. Informal experimental
--- Content provided by FirstRanker.com ---
designs are those designs that normally use a less sophisticated form of analysis
based on differences in the magnitudes, whereas formal experimental designs
--- Content provided by FirstRanker.com ---
offer relatively more control and use precise statistical procedures for analysis.Important experimental designs are discussed below:
(1) Informal experimental designs:
--- Content provided by FirstRanker.com ---
(i) Before and after without control design
(ii) After only with control design
--- Content provided by FirstRanker.com ---
(iii) Before and after with control design(2) Formal experimental designs:
--- Content provided by FirstRanker.com ---
(i) Completely randomized design (generally called C.R design)
(ii) Randomized block design (generally called R.B design)
--- Content provided by FirstRanker.com ---
(iii) Latin square design (generally called L.S design)(iv) Factorial designs.
--- Content provided by FirstRanker.com ---
(1)Before and after without control design:
In this design, a single test group or area is selected and the dependent
--- Content provided by FirstRanker.com ---
variable is measured before introduction of the treatment. Then the treatment isintroduced and the dependent variable is measured again after the treatment has
been introduced. The effect of the treatment would be equal to the level of the
--- Content provided by FirstRanker.com ---
phenomenon after the treatment minus the level of the phenomenon before the
treatment. Thus, the design can be presented in the following manner:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Test area Level of phenomenon Treatment Level of phenomenonBefore treatment(X) introduced after treatment(Y)
--- Content provided by FirstRanker.com ---
Treatment Effect =(Y)-(X)
--- Content provided by FirstRanker.com ---
The main difficulty of such a design is that with the passage of timeconsiderable extraneous variations may be there in its treatment effect.
--- Content provided by FirstRanker.com ---
(2) After-only with control design:
Two groups or areas are selected in this design and the treatment is
--- Content provided by FirstRanker.com ---
introduced into the test area only. Then the dependent variable is measured inboth the areas at the same time. Treatment impact is assessed by subtracting the
value of the dependent variable in the control area from its value in the test area.
--- Content provided by FirstRanker.com ---
The design can be presented in the following manner:
Test area: Treatment introduced Level of phenomenon
--- Content provided by FirstRanker.com ---
after Treatment (Y)Control area: Level of phenomenon
Without treatment (Z)
--- Content provided by FirstRanker.com ---
Treatment Effect = (Y)-(Z)
--- Content provided by FirstRanker.com ---
The basic assumption in this type of design is that the two areas are identicalwith respect to their behavior towards the phenomenon considered. If this
assumption is not true, there is the possibility of extraneous variation entering
--- Content provided by FirstRanker.com ---
into the treatment effect.
--- Content provided by FirstRanker.com ---
(3) Before and after with control design:--- Content provided by FirstRanker.com ---
In this design, two areas are selected and the dependent variable ismeasured in both the areas for an identical time-period before the treatment.
Thereafter, the treatment is introduced into the test area only, and the dependent
--- Content provided by FirstRanker.com ---
variable id measured in both for and identical time ?period after the introduction
of the treatment. The effect of the treatment is determined by subtracting the
--- Content provided by FirstRanker.com ---
change in the dependent variable in the control area from the change in thedependent variable in test area. This design can be shown in the following way:
Time Period I Time Period II
--- Content provided by FirstRanker.com ---
Test area: Level of phenomenon Treatment Level of phenomenon
--- Content provided by FirstRanker.com ---
Before treatment (X) introduced after treatment (Y)Control area: Level of phenomenon Level of phenomenon
Without treatment without treatment
--- Content provided by FirstRanker.com ---
(A) (Z)
--- Content provided by FirstRanker.com ---
Treatment Effect = (Y-X)-(Z-A)This design is superior to the previous two designs because it avoids extraneous
--- Content provided by FirstRanker.com ---
variation resulting both from the passage of time and from non-comparability of
the rest and control areas. But at times, due to lack of historical data time or a
--- Content provided by FirstRanker.com ---
comparable control area, we should prefer to select one of the first two informaldesigns stated above.
--- Content provided by FirstRanker.com ---
(2) Formal Experimental Design
(i) Completely randomized design: -
--- Content provided by FirstRanker.com ---
This design involves only two principles i.e., the principle of replicationand the principle of randomization of experimental designs. Among all other
designs this is the simpler and easier because it's procedure and analysis are
--- Content provided by FirstRanker.com ---
simple. The important characteristic of this design is that the subjects are
--- Content provided by FirstRanker.com ---
randomly assigned to experimental treatments. For example, if the researcher
has 20 subjects and if he wishes to test 10 under treatment A and 10 under
--- Content provided by FirstRanker.com ---
treatment B, the randomization process gives every possible group of 10subjects selected from a set of 20 an equal opportunity of being assigned to
treatment A and treatment B. One way analysis of variance (one way ANOVA)
--- Content provided by FirstRanker.com ---
is used to analyze such a design.
--- Content provided by FirstRanker.com ---
(ii) Randomized block design:-R. B. design is an improvement over the C.R. design. In the R .B. design,
the principle of local control can be applied along with the other two principles
--- Content provided by FirstRanker.com ---
of experimental designs. In the R.B. design, subjects are first divided into
groups, known as blocks, such that within each group the subjects are relatively
--- Content provided by FirstRanker.com ---
homogenous in respect to some selected variable. The number of subjects in agiven block would be randomly assigned to each treatment. Blocks are the levels
at which we hold the extraneous factor fixed, so that its contribution to the total
--- Content provided by FirstRanker.com ---
variability of data can be measured. The main feature of the R.B. design is that,
in this, each treatment appears the same number of times in each block. This
--- Content provided by FirstRanker.com ---
design is analyzed by the two-way analysis of variance (two-way ANOVA)technique.
--- Content provided by FirstRanker.com ---
(3) Latin squares design:-
The Latin squares design (L.S design) is an experimental design which is
--- Content provided by FirstRanker.com ---
very frequently used in agricultural research. Since agriculture depends uponnature to a large extent, the condition of research and investigation in agriculture
is different than the other studies. For example, an experiment has to be made
--- Content provided by FirstRanker.com ---
through which the effects of fertilizers on the yield of a certain crop, say wheat,
is to be judged. In this situation, the varying fertility of the soil in different
--- Content provided by FirstRanker.com ---
blocks in which the experiment has to be performed must be taken into
--- Content provided by FirstRanker.com ---
consideration; otherwise the results obtained may not be very dependablebecause the output happens to be the effects of not only of fertilizers, but also of
the effect of fertility of soil. Similarly there may be the impact of varying seeds
--- Content provided by FirstRanker.com ---
of the yield. In order to overcome such difficulties, the L.S. design is used when
there are two major extraneous factors such as the varying soil fertility and
--- Content provided by FirstRanker.com ---
varying seeds. The Latin square design is such that each fertilizer will appearfive times but will be used only once in each row and in each column of the
design. In other words, in this design, the treatment is so allocated among the
--- Content provided by FirstRanker.com ---
plots that no treatment occurs more than once in any one row or any one
column. This experiment can be shown with the help of the following diagram:
--- Content provided by FirstRanker.com ---
FERTILITY LEVEL
I II III IV V
--- Content provided by FirstRanker.com ---
X1
A B
--- Content provided by FirstRanker.com ---
CD E
X2
--- Content provided by FirstRanker.com ---
B
C
--- Content provided by FirstRanker.com ---
DE
A
--- Content provided by FirstRanker.com ---
X3
C
--- Content provided by FirstRanker.com ---
DE
A
--- Content provided by FirstRanker.com ---
B
X4
--- Content provided by FirstRanker.com ---
DE
A
--- Content provided by FirstRanker.com ---
B
C
--- Content provided by FirstRanker.com ---
X5E
A
--- Content provided by FirstRanker.com ---
B
C
--- Content provided by FirstRanker.com ---
DFrom the above diagram, it is clear that in L.S. design the field is divided into as
--- Content provided by FirstRanker.com ---
many blocks as there are varieties of fertilizers. Then, each block is again
divided into as many parts as there are varieties of fertilizers in such a way that
--- Content provided by FirstRanker.com ---
each of the fertilizer variety is used in each of the block only once. The analysisof L.S. design is very similar to the two-way ANOVA technique.
--- Content provided by FirstRanker.com ---
4. Factorial design:
--- Content provided by FirstRanker.com ---
Factorial designs are used in experiments where the effects of varying
more than one factor are to be determined. These designs are used more in
--- Content provided by FirstRanker.com ---
economic and social matters where usually a large number of factors affect aparticular problem. Factorial designs are usually of two types:
--- Content provided by FirstRanker.com ---
(i) Simple factorial designs and (ii) complex factorial designs.
(i) Simple factorial design:
--- Content provided by FirstRanker.com ---
In simple factorial design, the effects of varying two factors on thedependent variable are considered but when an experiment is done with more
than two factors, complex factorial designs are used. Simple factorial design is
--- Content provided by FirstRanker.com ---
also termed as a `two-factor-factorial design,' whereas complex factorial design
is known as `multi-factor-factorial design.
--- Content provided by FirstRanker.com ---
(ii) Complex factorial designs:-
When the experiments with more than two factors at a time are
--- Content provided by FirstRanker.com ---
conducted, it involves the use of complex factorial designs. A design which
considers three or more independent variables simultaneously is called a
--- Content provided by FirstRanker.com ---
complex factorial design. In case of three factors with one experimentalvariable, two treatments and two levels, complex factorial design will contain a
total of eight cells. This can be seen through the following diagram:
--- Content provided by FirstRanker.com ---
2x2x2 COMPLEX FACTORIAL DESIGN
--- Content provided by FirstRanker.com ---
Experimental Variable
--- Content provided by FirstRanker.com ---
Treatment A
--- Content provided by FirstRanker.com ---
Treatment BControl
Control
--- Content provided by FirstRanker.com ---
Control
Control
--- Content provided by FirstRanker.com ---
Variable 2Variable 2
Variable 2
--- Content provided by FirstRanker.com ---
Variable 2
Level I
--- Content provided by FirstRanker.com ---
Level IILevel I
Level II
--- Content provided by FirstRanker.com ---
Level I
Cell 1
--- Content provided by FirstRanker.com ---
Cell 3Cell 5
Cell 7
--- Content provided by FirstRanker.com ---
Control
--- Content provided by FirstRanker.com ---
Cell 2
Cell 4
--- Content provided by FirstRanker.com ---
Cell 6Cell 8
Variable 2
--- Content provided by FirstRanker.com ---
Level II
--- Content provided by FirstRanker.com ---
A pictorial presentation is given of the design shown above in the following:Experimental Variable
--- Content provided by FirstRanker.com ---
Treatment
--- Content provided by FirstRanker.com ---
TreatmentA B
Control Variable II
--- Content provided by FirstRanker.com ---
Level II
Level I
--- Content provided by FirstRanker.com ---
Level ILevel II
--- Content provided by FirstRanker.com ---
Control Variable I
--- Content provided by FirstRanker.com ---
The dotted line cell in this diagram corresponds to cell I of the above stated2x2x2 design and is for treatment A, level I of the control variable 1, and level I
of the control variable 2. From this design, it is possible to determine the main
--- Content provided by FirstRanker.com ---
effects for three variables i.e., one experimental and true control variables. The
researcher can also determine the interaction between each possible pair of
--- Content provided by FirstRanker.com ---
variables (such interactions are called `first order interactions') and interactionbetween variable taken in triplets (such interactions are called second order
interactions). In case of a 2x2x2 design, the further given first order interactions
--- Content provided by FirstRanker.com ---
are possible:
--- Content provided by FirstRanker.com ---
Experimental variable with control variable 1 (or EV x CV 1);Experimental variable with control variable 2 (or EV x CV 2);
Control variable 1 with control variable 2 (or CV 1 x CV 2);
--- Content provided by FirstRanker.com ---
There will be one second order interaction as well in the given design (it is
--- Content provided by FirstRanker.com ---
between all the three variables i.e., EV x CV 1 x CV 2).
--- Content provided by FirstRanker.com ---
To determine the main effect for the experimental variable, theresearcher must necessarily compare the combined mean of data in cells 1, 2, 3
and 4 for Treatment A with the combined mean of data in cells 5,6,7 and 8 for
--- Content provided by FirstRanker.com ---
Treatment B. In this way the main effect experimental variable, independent of
control variable 1 and variable 2, is obtained. Similarly, the main effect for
--- Content provided by FirstRanker.com ---
control variable 1, independent experimental variable and control variable 2, isobtained if we compare the combined mean of data in cells 1, 3, 5 and 7 with the
combined mean of data in cells 2, 4, 6 and 8 of our 2x2x2 factorial design. On
--- Content provided by FirstRanker.com ---
similar lines, one can determine the effect of control variable 2 independent of
experimental variable and control variable 1, if the combined mean of data in
--- Content provided by FirstRanker.com ---
cells 1,2,5 and 6 are compared with the combined mean of data in cells 3,4,7 and8.
--- Content provided by FirstRanker.com ---
To obtain the first order interaction, say, for EV x CV 1 in the above
stated design, the researcher must necessarily ignore control variable 2 for which
--- Content provided by FirstRanker.com ---
purpose he may develop 2x2 design from the 2x2x2 design by combining thedata of the relevant cells of the latter design as has been shown on next page:
--- Content provided by FirstRanker.com ---
Experimental Variable
--- Content provided by FirstRanker.com ---
Treatment ATreatment B
Control
--- Content provided by FirstRanker.com ---
Level I
Cells 1.3
--- Content provided by FirstRanker.com ---
Cells 5,7Variable 1
Level II
--- Content provided by FirstRanker.com ---
Cells 2,4
Cells 6,8
--- Content provided by FirstRanker.com ---
Similarly, the researcher can determine other first order interactions. The
analysis of the first order interaction in the manner described above is essentially
--- Content provided by FirstRanker.com ---
a simple factorial analysis as only two variables are considered at a time and the
remaining ones are ignored. But the analysis of the second order interaction
--- Content provided by FirstRanker.com ---
would not ignore one of the three independent variables in case of a 2x2x2
--- Content provided by FirstRanker.com ---
design. The analysis would be termed as a complex factorial analysis.--- Content provided by FirstRanker.com ---
It may, however, be remembered that the complex factorial design need not
necessarily be of 2x2x2 type design, but can be generalized to any number and
--- Content provided by FirstRanker.com ---
combinations of experimental and control independent variables. Of course, thegreater the number of independent variables included in a complex factorial
design, the higher the order of the interaction analysis possible. But the overall
--- Content provided by FirstRanker.com ---
task goes on becoming more and more complicated with the inclusion of more
and more independent variables in our design.
--- Content provided by FirstRanker.com ---
Factorial designs are used mainly because of the two advantages -
--- Content provided by FirstRanker.com ---
(i) They provide equivalent accuracy (as happens in the case of experiments
with only one factor) with less labour and as such are source of economy. Using
--- Content provided by FirstRanker.com ---
factorial designs, we can determine the effects of two (in simple factorialdesign) or more (in case of complex factorial design) factors (or variables) in
one single experiment. (ii) They permit various other comparisons of interest.
--- Content provided by FirstRanker.com ---
For example, they give information about such effects which cannot be obtained
by treating one single factor at a time. The determination of interaction effects is
--- Content provided by FirstRanker.com ---
possible in case of factorial designs.Conclusion
--- Content provided by FirstRanker.com ---
There are several research designs and the researcher must decide in advance of
collection and analysis of data as to which design would prove to be more
--- Content provided by FirstRanker.com ---
appropriate for his research project. He must give due weight to various pointssuch as type of universe and it's nature, the objective of the study, the source list
or the sampling frame, desired standard accuracy and the like when taking a
--- Content provided by FirstRanker.com ---
decision in respect of the design for his research project.
--- Content provided by FirstRanker.com ---
SUMMARY
--- Content provided by FirstRanker.com ---
Experiment is the process of examining the truth of a statistical hypothesisrelated to some research problem. There are two types of experiments - absolute
and comparative. There are three types of research designs - research design for
--- Content provided by FirstRanker.com ---
descriptive and diagnostic research, research design for exploratory research
studies and research design for hypothesis testing. Prof. Fisher has laid three
--- Content provided by FirstRanker.com ---
principles of experimental design. They are Principle of Replication, Principle ofRandomization and Principle of Local control. There are different kinds of
experimental designs. Some of them are Informal experimental design, After
--- Content provided by FirstRanker.com ---
only with control design, Formal experimental design, Completely randomized
design, Randomized block design, Latin square design and Factorial design.
--- Content provided by FirstRanker.com ---
SELF ASSESMENT QUESTIONS (SAQs)
1.
--- Content provided by FirstRanker.com ---
Explain the meaning and types of experiment.
(Ref. introduction and types of research design next to introduction)
--- Content provided by FirstRanker.com ---
2.
Explain informal designs.
--- Content provided by FirstRanker.com ---
(Ref. i,ii,iii in informal experiment design portion.)
--- Content provided by FirstRanker.com ---
3. Explain formal experimental design and control.(Ref. i,ii,iii,iv in formal experiment design section)
--- Content provided by FirstRanker.com ---
4. Explain complex factorial design.
***
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT II
--- Content provided by FirstRanker.com ---
4. OBSERVATION
--- Content provided by FirstRanker.com ---
Lesson OutlineSteps in obMeaning and Characteristics of
observation
--- Content provided by FirstRanker.com ---
Types of observation
Stages of observation
--- Content provided by FirstRanker.com ---
servationProblems and
Merits
--- Content provided by FirstRanker.com ---
and
Demerits
--- Content provided by FirstRanker.com ---
Lesson Objectives
--- Content provided by FirstRanker.com ---
After reading this lesson you will be able to know
Meaning and types of observation
--- Content provided by FirstRanker.com ---
Stages through which observation passesSteps followed and the problems coming in observation
Merits and Demerits
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
IntroductionObservation is a method that employs vision as its main means of data
collection. It implies the use of eyes rather than of ears and the voice. It is
--- Content provided by FirstRanker.com ---
accurate watching and noting of phenomena as they occur with regard to the
cause and effect or mutual relations. It is watching other persons' behavior as it
--- Content provided by FirstRanker.com ---
actually happens without controlling it. For example, watching bondedlabourer's life, or treatment of widows and their drudgery at home, provide
graphic description of their social life and sufferings. Observation is also defined
--- Content provided by FirstRanker.com ---
as "a planned methodical watching that involves constraints to improve
--- Content provided by FirstRanker.com ---
accuracy".
CHARACTERISTICS OF OBSERVATION
--- Content provided by FirstRanker.com ---
Scientific observation differs from other methods of data collectionspecifically in four ways: (i) observation is always direct while other methods
could be direct or indirect; (ii) field observation takes place in a natural setting;
--- Content provided by FirstRanker.com ---
(iii) observation tends to be less structured; and (iv) it makes only the qualitative
(and not the quantitative) study which aims at discovering subjects' experiences
--- Content provided by FirstRanker.com ---
and how subjects make sense of them (phenomenology) or how subjectsunderstand their life (interpretivism).
Lofland (1955:101-113) has said that this method is more appropriate for
--- Content provided by FirstRanker.com ---
studying lifestyles or sub-cultures, practices, episodes, encounters, relationships,
groups, organizations, settlements and roles etc. Black and Champion
--- Content provided by FirstRanker.com ---
(1976:330) have given the following characteristics of observation:? Behavior is observed in natural surroundings.
? It enables understanding significant events affecting social relations of the
--- Content provided by FirstRanker.com ---
participants.? It determines reality from the perspective of observed person himself.
? It identifies regularities and recurrences in social life by comparing data in
--- Content provided by FirstRanker.com ---
our study with that of those in other studies.Besides, four other characteristics are:
--- Content provided by FirstRanker.com ---
? Observation involves some control pertaining to the observation and to the
--- Content provided by FirstRanker.com ---
means he uses to record data. However, such controls do not exist for thesetting or the subject population.
? It is focused on hypotheses-free inquiry.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
? It avoids manipulations in the independent variable i.e., one that is supposedto cause other variable(s) and is not caused by them.
? Recording is not selective.
--- Content provided by FirstRanker.com ---
Since at times, observation technique is indistinguishable from
experiment technique, it is necessary to distinguish the two.
--- Content provided by FirstRanker.com ---
(i) Observation involves few controls than the experiment technique.(ii) The behaviour observed in observation is natural, whereas in
experiment it is not always so.
--- Content provided by FirstRanker.com ---
(iii) The behavior observed in experiment is more molecular (of a
smaller unit), while one in observation is molar.
--- Content provided by FirstRanker.com ---
(iv) In observation, fewer subjects are watched for long periods of timein more varied circumstances than in experiment.
(v) Training required in observation study is directed more towards
--- Content provided by FirstRanker.com ---
sensitizing the observer to the flow of events, whereas training in experiments
serves to sharpen the judgment of the subject.
--- Content provided by FirstRanker.com ---
(vi) In observational study, the behavior observed is more diffused.Observational methods differ from one another along several variables or
dimensions.
--- Content provided by FirstRanker.com ---
***
--- Content provided by FirstRanker.com ---
UNIT ? III
--- Content provided by FirstRanker.com ---
STATISTICAL ANALYSIS
--- Content provided by FirstRanker.com ---
CONTENTS
--- Content provided by FirstRanker.com ---
1. Probability2. Probability distribution
2.1 Binomial distribution
--- Content provided by FirstRanker.com ---
2.2 Poisson distribution
2.3 Normal distribution
--- Content provided by FirstRanker.com ---
3. Testing of Hypothesis3.1 Small sample
3.2 Large sample test
--- Content provided by FirstRanker.com ---
4. 2 test
--- Content provided by FirstRanker.com ---
5. Index Number6. Analysis of Time Series
--- Content provided by FirstRanker.com ---
OBJECTIVES:
The objectives of the present chapter are:
--- Content provided by FirstRanker.com ---
i)To examine the utility of various statistical tools in decision making.
ii)
--- Content provided by FirstRanker.com ---
To inquire about the testing of a hypothesis
--- Content provided by FirstRanker.com ---
1. PROBABILITY
--- Content provided by FirstRanker.com ---
If an experiment is repeated under essentially homogeneous and similar
conditions, we will arrive at two types of conclusions. They are: the results are
--- Content provided by FirstRanker.com ---
unique and the outcome can be predictable and result is not unique but may be
one of the several possible outcomes. In this context, it is better to understand
--- Content provided by FirstRanker.com ---
various terms pertaining to probability before examining the probability theory.The main terms are explained as follows:
--- Content provided by FirstRanker.com ---
(i)
Random experiment
--- Content provided by FirstRanker.com ---
An experiment which can be repeated under the same conditions and theoutcome cannot be predicted under any circumstances is known as random
experiment. For example: An unbiased coin is tossed. Here we are not in a
--- Content provided by FirstRanker.com ---
position to predict whether head or tail is going to occur. Hence, this type of
experiment is known as random experiment.
--- Content provided by FirstRanker.com ---
(ii)
Sample Space
--- Content provided by FirstRanker.com ---
A set of possible outcomes of a random experiment is known as sample
space. For example, in the case of tossing of an unbiased coin twice, the
--- Content provided by FirstRanker.com ---
possible outcomes are HH, HT, TH and TT. This can be represented in a samplespace as S= (HH, HT, TH, TT).
--- Content provided by FirstRanker.com ---
(iii)
An event
--- Content provided by FirstRanker.com ---
Any possible outcomes of an experiment are known as an event. In thecase of tossing of an unbiased coin twice, HH is an event. An event can be
classified into two. They are: (a) Simple events, and (ii) compound events.
--- Content provided by FirstRanker.com ---
Simple event is an event which has only one sample point in the sample space.
--- Content provided by FirstRanker.com ---
Compound event is an event which has more than one sample point in the
sample space. In the case of tossing of an unbiased coin twice HH is a simple
--- Content provided by FirstRanker.com ---
event and TH and TT are the compound events.--- Content provided by FirstRanker.com ---
(iv)
Complementary event
--- Content provided by FirstRanker.com ---
A and A' are the complementary event if A' consists of all those samplepoint which is not included in A. For instance, an unbiased dice is thrown once.
The probability of an odd number turns up are complementary to an even
--- Content provided by FirstRanker.com ---
number turns up. Here, it is worth mentioning that the probability of sample
space is always is equal to one. Hence, the P (A') = 1 - P (A).
--- Content provided by FirstRanker.com ---
(v)
Mutually exclusive events
--- Content provided by FirstRanker.com ---
A and B are the two mutually exclusive events if the occurrence of A
precludes the occurrence of B. For example, in the case of tossing of an
--- Content provided by FirstRanker.com ---
unbiased coin once, the occurrence of head precludes the occurrence of tail.Hence, head and tail are the mutually exclusive event in the case of tossing of an
unbiased coin once. If A and B are mutually exclusive events, then the
--- Content provided by FirstRanker.com ---
probability of occurrence of A or B is equal to sum of their individual
probabilities. Symbolically, it can be presented as:
--- Content provided by FirstRanker.com ---
P (A U B) = P (A) + P (B)
--- Content provided by FirstRanker.com ---
If A and B is joint sets, then the addition theorem of probability can be
stated as:
--- Content provided by FirstRanker.com ---
P (A U B ) = P(A) + P(B) - P(AB)(vi)
--- Content provided by FirstRanker.com ---
Independent event
--- Content provided by FirstRanker.com ---
A and B are the two independent event if the occurrence of A does not
influence the occurrence of B. In the case of tossing of an unbiased coin twice,
--- Content provided by FirstRanker.com ---
the occurrence of head in the first toss does not influence the occurrence of heador tail in the toss. Hence, these two events are called independent events. In the
case of independent event, the multiplication theorem can be stated as the
--- Content provided by FirstRanker.com ---
probability of A and B is the product of their individual probabilities.
Symbolically, it can be presented as:-
--- Content provided by FirstRanker.com ---
P (A B) = P (A) * P (B)--- Content provided by FirstRanker.com ---
Addition theorem of Probability
--- Content provided by FirstRanker.com ---
Let A and B be the two mutually exclusive events, then the probability ofA or B is equal to the sum of their individual probabilities. (For detail refer
mutually exclusive events)
--- Content provided by FirstRanker.com ---
Multiplication theorem of Probability
--- Content provided by FirstRanker.com ---
Let A and B be the two independent events, then the probability of A andB is equal to the product of their individual probabilities. (For details refer
independent events)
--- Content provided by FirstRanker.com ---
Example: The odds that person X speaks the truth are 4:1 and the odds that Y
speaks the truth are 3:1. Find the probability that:-
--- Content provided by FirstRanker.com ---
(i)both of them speak the truth,
(ii)
--- Content provided by FirstRanker.com ---
any one of them speak the truth and
(iii)
--- Content provided by FirstRanker.com ---
truth may not be told.Solution: The probability of X speaks the truth = 1/5
The probability that X speaks lie = 4/5
--- Content provided by FirstRanker.com ---
The probability that Y speaks the truth = 1/4
--- Content provided by FirstRanker.com ---
The probability that Y speaks lie = ?
(i) Both of them speak truth = P(X) * P(Y) = 1/5 * 1/4 = 1/20
--- Content provided by FirstRanker.com ---
(independent event)(ii) any one of them speak truth = P(X) + P(Y) - P(X*Y)
= 1/5 + 1/4 - 1/5*1/4 = 8/20 = 2/5 (not mutually exclusive events)
--- Content provided by FirstRanker.com ---
(iii)
Truth may not be told
--- Content provided by FirstRanker.com ---
= 1 ? P(any one of them speak truth)( complementary event)= 1 ? 2/5 = 3/5.
--- Content provided by FirstRanker.com ---
2. PROBABILITY
DISTRIBUTION
--- Content provided by FirstRanker.com ---
If X is discrete random variable which takes the values of x1, x2,x3..... xnand the corresponding probabilities are p1, p2, ..........pn, then, X follows the
probability distribution. The two main properties of probability distribution are:
--- Content provided by FirstRanker.com ---
(i) P(Xi) is always greater than or equal to zero and less than or equal to one,
and (ii) the summation of probability distribution is always equal to one. For
--- Content provided by FirstRanker.com ---
example, tossing of an unbiased coin twice.Then the probability distribution is:
X (probability of obtaining head): 0 1 2
--- Content provided by FirstRanker.com ---
P(Xi) : ? ? ?
--- Content provided by FirstRanker.com ---
Expectation of probabilityLet X be the discrete random variable which takes the value of x1, x2,...... xn
then the respective probability is p1, p2, ............ pn, Then the expectation of
--- Content provided by FirstRanker.com ---
probability distribution is p1x1 + p2x2 + .............. + pnxn. In the above
example, the expectation of probability distribution is (0* ? +1*1/2+2*?) =1.
--- Content provided by FirstRanker.com ---
2.1 BINOMIAL
DISTRIBUTION
--- Content provided by FirstRanker.com ---
The binomial distribution also known as `Bernoulli Distribution' is
--- Content provided by FirstRanker.com ---
associated with the name of a Swiss mathematician, James Bernoulli who is also
known as Jacques or Jakon (1654 ? 1705). Binomial distribution is a probability
--- Content provided by FirstRanker.com ---
distribution expressing the probability of one set of dichotomous alternatives. Itcan be explained as follows:
(i)
--- Content provided by FirstRanker.com ---
If an experiment is repeated under the same conditions for a fixed
number of trials, say, n.
--- Content provided by FirstRanker.com ---
(ii)In each trial, there are only two possible outcomes of the experiment.
Let us define it as "success" or "failure". Then the sample space of possible
--- Content provided by FirstRanker.com ---
outcomes of each experiment is:
S = [failure, success]
--- Content provided by FirstRanker.com ---
(iii)The probability of a success denoted by p remains constant from trial to
trial and the probability of a failure denoted by q which is equal to (1 ? p).
--- Content provided by FirstRanker.com ---
(iv)
The trials are independent in nature i.e., the outcomes of any trial or
--- Content provided by FirstRanker.com ---
sequence of trials do not affect the outcomes of subsequent trials. Hence, theMultiplication theorem of probability can be applied for the occurrence of
success and failure. Thus, the probability of success or failure is p.q.
--- Content provided by FirstRanker.com ---
(v) Let us assume that we conduct an experiment in n times. Out of which x
times be the success and failure is (n-x) times. The occurrence of success or
--- Content provided by FirstRanker.com ---
failure in successive trials is mutually exclusive events. Hence, we can applyaddition theorem of probability.
(vi) Based on the above two theorems, the probability of success or failure is
--- Content provided by FirstRanker.com ---
P(X) = nCxpxqn-x
--- Content provided by FirstRanker.com ---
n !--------------- . px qn-x
x ! (n ? x) !
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Where P = Probability of success in a single trail, q = 1 ? p, n = Number of trials
and x = no. of successes in n trials.
--- Content provided by FirstRanker.com ---
Thus, for an event A with probability of occurrence p and non-
--- Content provided by FirstRanker.com ---
occurrence q, if n trials are made,
probability distribution of the number of
--- Content provided by FirstRanker.com ---
occurrences of A will be as set. If we want to obtain the probable frequencies ofthe various outcomes in N sets of n trials, the following expression shall be
used: N(p + q)n
--- Content provided by FirstRanker.com ---
N(p + q)n = Npn + nC1pn-1q + nC2pn-2q2 + ......+ nCrpn-rqr + ......qn.
The frequencies obtained by the above expansion are known as expected
--- Content provided by FirstRanker.com ---
or theoretical frequencies. On the other hand, the frequencies actually obtainedby making experiments are called actual or observed frequencies. Generally,
there is some difference between the observed and expected frequencies but the
--- Content provided by FirstRanker.com ---
difference becomes smaller and smaller as N increases.
--- Content provided by FirstRanker.com ---
Obtaining Coefficient of the Binomial Distribution:The following rules may be considered for obtaining coefficients from
the binomial expansion:
--- Content provided by FirstRanker.com ---
(i)
The first term is qn.,
--- Content provided by FirstRanker.com ---
(ii)The second term is nC1qn-1p,
(iii)
--- Content provided by FirstRanker.com ---
In each succeeding term the power of q is reduced by 1 and the power of
p is increased by 1.
--- Content provided by FirstRanker.com ---
(iv)The coefficient of any term is found by multiplying the coefficient of the
preceding term by the power of q in that preceding term, and dividing the
--- Content provided by FirstRanker.com ---
products so obtained by one more than the power of p in that proceeding
term.
--- Content provided by FirstRanker.com ---
Thus, when we expand (q + p)n, we will obtain the following:-(p + q)n = pn + nC1pn-1q + nC2pn-2q2 + ......+ nCrpn-rqr + ......qn.
--- Content provided by FirstRanker.com ---
Where, 1, nC1, nC2 ....... are called the binomial coefficient. Thus in the
expansion of (p + q)4 we will have (p + q)4 = p4 +4p3q +6p2q2 + 4p1q3 + q4 and
--- Content provided by FirstRanker.com ---
the coefficients will be 1, 4, 6, 4, 1.
--- Content provided by FirstRanker.com ---
From the above binomial expansion, the following general relationshipsshould be noted:
(i)
--- Content provided by FirstRanker.com ---
The number of terms in a binomial expansion is always n + 1,
(ii)
--- Content provided by FirstRanker.com ---
The exponents of p and q, for any single term, when added together,always sum to n.
(iii)
--- Content provided by FirstRanker.com ---
The exponents of p are n, (n ? 1), (n ? 2),.......1, 0, respectively and the
exponents of q are 0, 1, 2, ......(n ? 1), n, respectively.
--- Content provided by FirstRanker.com ---
(iv)The coefficients for the n + 1 terms of the distribution are always
symmetrical in nature.
--- Content provided by FirstRanker.com ---
Properties of Binomial Distribution
--- Content provided by FirstRanker.com ---
The main properties of Binomial Distribution are:-
(i)
--- Content provided by FirstRanker.com ---
The shape and location of binomial distribution changes as p changes for
a given n or as n changes for a given p. As p increases for a fixed n, the
--- Content provided by FirstRanker.com ---
binomial distribution shifts to the right.(ii) The mode of the binomial distribution is equal to the value of x which
has the largest probability. The mean and mode are equal if np is an integer.
--- Content provided by FirstRanker.com ---
(iii) As n increases for a fixed p, the binomial distribution moves to the right,
flattens and spreads out.
--- Content provided by FirstRanker.com ---
(iv) The mean of the binomial distribution is np and it increases as nincreases with p held constant. For larger n there are more possible outcomes of
a binomial experiment and the probability associated with any particular
--- Content provided by FirstRanker.com ---
outcome becomes smaller.
--- Content provided by FirstRanker.com ---
(v) If n is larger and if neither p nor q is too close to zero, the binomial
distribution can be closely approximated by a normal distribution with
--- Content provided by FirstRanker.com ---
standardized variable given by z = (X ? np) / npq.(vi) The various constants of binomial distribution are:
Mean
--- Content provided by FirstRanker.com ---
=
np
--- Content provided by FirstRanker.com ---
Standard Deviation=
--- Content provided by FirstRanker.com ---
npq
?1
--- Content provided by FirstRanker.com ---
= 0?2
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
npq?3
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
npq(q ? p)?4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
=3n2p2q2 + npq(1 ? 6pq).
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(q ? p)2
Skewness
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
=---------
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
npq--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1 ? 6pqKurtosis
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
3 + ------------ Content provided by FirstRanker.com ---
npq
--- Content provided by FirstRanker.com ---
Illustrations: A coin is tossed four times. What is the probability of obtaining
two or more heads?
--- Content provided by FirstRanker.com ---
Solution: When a coin is tossed the probabilities of head and tail in case of an
unbiased coin are equal, i.e., p = q = ?
--- Content provided by FirstRanker.com ---
The various possibilities for all the events are the terms of the expansion (q+p)4(p ? q)4 = p4 + 4p3q + 6p2q2 + 4p1q3 + q4
Therefore, the probability of obtaining 2 heads is
--- Content provided by FirstRanker.com ---
6p2q2 = 6 x (?)2(?)2 = 3/8
The probability of obtaining 3 heads is 6p3q1 = 4 x (?)3(?)1 = 1/4
--- Content provided by FirstRanker.com ---
The probability of obtaining 4 heads is (q)4 = (?)4 = 1/16--- Content provided by FirstRanker.com ---
Therefore, the probability of obtaining 2 or more heads is3 1 1 11
--- + --- + --- = -----
--- Content provided by FirstRanker.com ---
8 4 16 16
--- Content provided by FirstRanker.com ---
Illustration: Assuming that half the population is vegetarian so that the chanceof an individual being a vegetarian is ? and assuming that 100 investigations
can take sample of 10 individuals to verify whether they are vegetarians, how
--- Content provided by FirstRanker.com ---
many investigation would you expect to report that three people or less were
vegetarians?
--- Content provided by FirstRanker.com ---
Solution:n = 10, p, i.e., probability of an individual being vegetarian = ?.q =1 ? p= ?
Using binomial distribution, we have P(r) = ncr qn-rpr
--- Content provided by FirstRanker.com ---
Putting the various values, we have
1
--- Content provided by FirstRanker.com ---
10c r(?)r(?)10 ? r = 10cr = (?)10 = --------10cr1024
The probability that in a sample of 10, three or less people are vegetarian shall
--- Content provided by FirstRanker.com ---
be given by: P(0) + P(1) + P(2) + P(3)
1
--- Content provided by FirstRanker.com ---
= --------- [10c0 + 10c1 + 10c2 + 10c3]1024
--- Content provided by FirstRanker.com ---
1
--- Content provided by FirstRanker.com ---
176 11
--- Content provided by FirstRanker.com ---
= --------- [ 1 + 10 + 45 + 120] = -------- = -----
1024
--- Content provided by FirstRanker.com ---
1024 64Hence out of 1000 investigators, the number of investigators who will
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
11
--- Content provided by FirstRanker.com ---
report 3 or less vegetarians in a sample of 10 is 1000 x --- = 172.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
64
--- Content provided by FirstRanker.com ---
2.2 POISSON
DISTRIBUTION
--- Content provided by FirstRanker.com ---
Poisson distribution was derived in 1837 by a French Mathematician Simeon
D Poisson (1731 ? 1840). In binomial distribution, the values of p and q and n
--- Content provided by FirstRanker.com ---
are given. There is a certainty of the total number of events. But there are caseswhere p is very small and n is very large and such case is normally related to
Poisson distribution. For example, persons killed in road accidents, the number
--- Content provided by FirstRanker.com ---
of defective articles produced by a quality machine. Poisson distribution may be
obtained as a limiting case of binomial probability distribution, under the
--- Content provided by FirstRanker.com ---
following condition.(i) p, successes, approach zero (p 0)
(ii) np = m is finite.
--- Content provided by FirstRanker.com ---
The Poisson distribution of the probabilities of occurrence of various rare
--- Content provided by FirstRanker.com ---
events (successes) 0,1,2,.... are Given below:Number of success (X)
--- Content provided by FirstRanker.com ---
Probabilities p(X)
0
--- Content provided by FirstRanker.com ---
e-m1
me-m
--- Content provided by FirstRanker.com ---
m2e-m
--- Content provided by FirstRanker.com ---
2--------
--- Content provided by FirstRanker.com ---
2!
--- Content provided by FirstRanker.com ---
r
mre-m
--- Content provided by FirstRanker.com ---
--------
--- Content provided by FirstRanker.com ---
r!
--- Content provided by FirstRanker.com ---
n
--- Content provided by FirstRanker.com ---
mne-m
--- Content provided by FirstRanker.com ---
--------n!
--- Content provided by FirstRanker.com ---
Where, e = 2.718, and m = average number of occurrence of given distribution.
--- Content provided by FirstRanker.com ---
The Poisson distribution is a discrete distribution with a parameter m.The various constants are:
(i)
--- Content provided by FirstRanker.com ---
Mean
=
--- Content provided by FirstRanker.com ---
m=
p
--- Content provided by FirstRanker.com ---
(ii) Standard
Deviation
--- Content provided by FirstRanker.com ---
= m(iii) Skewness
--- Content provided by FirstRanker.com ---
1
= 1/m
--- Content provided by FirstRanker.com ---
(iv) Kurtosis,2
--- Content provided by FirstRanker.com ---
=
3 + 1/m
--- Content provided by FirstRanker.com ---
(v) Variance= m
--- Content provided by FirstRanker.com ---
Illustration: A book contains 100 misprints distributed randomly throughout its
--- Content provided by FirstRanker.com ---
100 pages. What is the probability that a page observed at random contains atleast two misprints? Assume Poisson Distribution.
Solution:
--- Content provided by FirstRanker.com ---
Total Number of misprints 100
m = ------------------------------- = ----- = 1
--- Content provided by FirstRanker.com ---
Total number of pages 100Probability that a page contains at least two misprints:
--- Content provided by FirstRanker.com ---
p(r2) = 1 ? [p(0) + p (1)]
mre-m
--- Content provided by FirstRanker.com ---
p(r) = ----------- Content provided by FirstRanker.com ---
r!10e-1 1 1
--- Content provided by FirstRanker.com ---
p(0) = ------ = e-1 = ---- = ---------
0! e 2.7183
--- Content provided by FirstRanker.com ---
11e-1 1 1
p(1) = ------ = e-1 = ---- = ---------
--- Content provided by FirstRanker.com ---
1! e 2.7183
--- Content provided by FirstRanker.com ---
1 1p(0) + p(1) = ----------- + ----------- = 0.736
2.718 2.718
--- Content provided by FirstRanker.com ---
P(r2) = 1 ? [p(0) + p (1)] = 1-0.736 = 0.264
--- Content provided by FirstRanker.com ---
Illustration: If the mean of a Poisson distribution is 16, find (1) S.D.(2) B1
(3) B2 (4) ?3 (5) ?4
--- Content provided by FirstRanker.com ---
Solution: m = 16
1. S.D.
--- Content provided by FirstRanker.com ---
=m = 16 = 4
2.
--- Content provided by FirstRanker.com ---
1 = 1/m = 1/16 = 0.625
3.
--- Content provided by FirstRanker.com ---
2 = 3 + 1/m = 3 + 0.625 = 3.06254.
?3 = m = 16
--- Content provided by FirstRanker.com ---
5.
?4 = m + 3m2 = 16 + 3(16)2 = 784
--- Content provided by FirstRanker.com ---
2.3 NORMAL
--- Content provided by FirstRanker.com ---
DISTRIBUTION
The normal distribution was first described by Abraham Demoivre (1667-1754)
--- Content provided by FirstRanker.com ---
as the limiting form of binomial model in 1733. Normal distribution was--- Content provided by FirstRanker.com ---
rediscovered by Gauss in 1809 and by Laplace in 1812. Both Gauss andLaplace were led to the distribution by their work on the theory of errors of
observations arising in physical measuring processes particularly in astronomy.
--- Content provided by FirstRanker.com ---
The probability function of a Normal Distribution is defined as:
1 -(x - ?)2 / 22
--- Content provided by FirstRanker.com ---
P(X) = ------------ e2
Where, X = Values of the continuous random variable, ? = Mean of the normal
--- Content provided by FirstRanker.com ---
random variable, e = 2.7183, = 3.1416
--- Content provided by FirstRanker.com ---
Relation between Binomial, Poisson and Normal DistributionsBinomial, Poisson and Normal distribution are closely related to one other.
When N is large while the probability P of the occurrence of an event is close to
--- Content provided by FirstRanker.com ---
zero so that q = (1-p) the binomial distribution is very closely approximated by
the Poisson distribution with m = np.
--- Content provided by FirstRanker.com ---
The Poisson distribution approaches a normal distribution with standardizedvariable (x ? m)/ m as m increases to infinity.
--- Content provided by FirstRanker.com ---
Normal Distribution and its properties
--- Content provided by FirstRanker.com ---
The important properties of the normal distribution are:-1. The normal curve is "bell shaped" and symmetrical in nature. The distribution
of the frequencies on either side of the maximum ordinate of the curve is similar
--- Content provided by FirstRanker.com ---
with each other.
2. The maximum ordinate of the normal curve is at x = ?. Hence the mean,
--- Content provided by FirstRanker.com ---
median and mode of the normal distribution coincide.3. It ranges between - to +
4. The value of the maximum ordinate is 1/ 2.
--- Content provided by FirstRanker.com ---
5. The points where the curve change from convex to concave or vice versa is at X
--- Content provided by FirstRanker.com ---
= ? ? .
6. The first and third quartiles are equidistant from median.
--- Content provided by FirstRanker.com ---
7. The area under the normal curve distribution are:a)
? ? 1 covers 68.27% area;
--- Content provided by FirstRanker.com ---
b)
? ? 2 covers 95.45% area.
--- Content provided by FirstRanker.com ---
c)? ? 3 covers 99.73% area.
68.27%
--- Content provided by FirstRanker.com ---
95.45%
99.73%
--- Content provided by FirstRanker.com ---
? - 36 ? - 26? - 16
? = 0
--- Content provided by FirstRanker.com ---
? + 16
? + 26
--- Content provided by FirstRanker.com ---
? + 36- 3
- 2
--- Content provided by FirstRanker.com ---
- 1
Z = 0
--- Content provided by FirstRanker.com ---
+ 1+ 2
+ 3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
8. When? = 0 and = 1, then the normal distribution will be a standard
normal curve. The probability function of standard normal curve is
--- Content provided by FirstRanker.com ---
1 -x2/2
P(X) = ------------ e
--- Content provided by FirstRanker.com ---
2The following table gives the area under the norm
--- Content provided by FirstRanker.com ---
ability curve for
al prob
--- Content provided by FirstRanker.com ---
some important value of Z.Distance from the mean ordinate in Are
--- Content provided by FirstRanker.com ---
a under the curve
Terms of ?
--- Content provided by FirstRanker.com ---
Z=
?
--- Content provided by FirstRanker.com ---
0.6745
0.50
--- Content provided by FirstRanker.com ---
?1
Z
--- Content provided by FirstRanker.com ---
=
.0
--- Content provided by FirstRanker.com ---
0.6826?
1
--- Content provided by FirstRanker.com ---
Z
=
--- Content provided by FirstRanker.com ---
.960.95
Z
--- Content provided by FirstRanker.com ---
=
?
--- Content provided by FirstRanker.com ---
2.000.9544
--- Content provided by FirstRanker.com ---
Z
=
--- Content provided by FirstRanker.com ---
?
2.58
--- Content provided by FirstRanker.com ---
0.99Z = ? 3.0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
0.9973
--- Content provided by FirstRanker.com ---
9.
--- Content provided by FirstRanker.com ---
All odd moments are equal to zero.10.
Skewness = 0 and Kurtosis = 3 in normal distribution.
--- Content provided by FirstRanker.com ---
Illustration: Find the probability that the standard normal value lies between 0
--- Content provided by FirstRanker.com ---
and 1.50.4332 (43.32%)
--- Content provided by FirstRanker.com ---
Z = 0
Z = 1.5
--- Content provided by FirstRanker.com ---
As the mean, Z = 0.
To find the area between Z = 0 and Z = 1.5, look the area between 0 and 1.5,
--- Content provided by FirstRanker.com ---
from the table. It is 0.4332 (shaded area)
Illustration: The results of a particular examination are given below in a
--- Content provided by FirstRanker.com ---
summary form:Result
--- Content provided by FirstRanker.com ---
Percentage of candidates
Passed with distinction
--- Content provided by FirstRanker.com ---
10
Passed
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
60
Failed
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
It is known that a candidate gets plucked if he obtains less than 40marks, out of 100 while he must
in
--- Content provided by FirstRanker.com ---
obta
at least 75 marks in order to pass with
--- Content provided by FirstRanker.com ---
distinction. Determine the mean and standard deviation of the distribution ofmarks assumi
a
--- Content provided by FirstRanker.com ---
ng this to be norm l.
Solution:
--- Content provided by FirstRanker.com ---
30% students get marks less tha 40.
n
--- Content provided by FirstRanker.com ---
40 ?
--- Content provided by FirstRanker.com ---
XZ = ---------- = -0.52 (from the ta l
b e)
--- Content provided by FirstRanker.com ---
30%
--- Content provided by FirstRanker.com ---
20%40%
10%
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
40 ? X = -0.52 -----------(i)
o
--- Content provided by FirstRanker.com ---
10% students get m re than 75
40% area
--- Content provided by FirstRanker.com ---
= 75 ? X = 1.28------------ (ii)
--- Content provided by FirstRanker.com ---
= 75 ? X = 1.28
--- Content provided by FirstRanker.com ---
tSubtrac (ii) from (i)
40 ? X = -0.52
--- Content provided by FirstRanker.com ---
75 ? X = 1.28
--------------------
--- Content provided by FirstRanker.com ---
-35 = -1.835 = 1.8
1.80 = 35
--- Content provided by FirstRanker.com ---
35
= ------- = 19.4
--- Content provided by FirstRanker.com ---
1.80Mean
--- Content provided by FirstRanker.com ---
40 ? X = -0.52 x (19.4)
--- Content provided by FirstRanker.com ---
-X = -40 ? 10.09 = 50.09
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Illustration: The scores made by candidate in a certain test are normally
distributed with mean 1000 and standard deviation 200. what per cent of
--- Content provided by FirstRanker.com ---
candidates receive scores (i) less than 800, (ii) between 800 and 1200?(the area under the curve between Z = 0 and Z = 1 is 0.34134).
Solution:
--- Content provided by FirstRanker.com ---
X = 1000; = 200
X ? X
--- Content provided by FirstRanker.com ---
Z = ------------- Content provided by FirstRanker.com ---
(i) For X = 800
800 ? 1000
--- Content provided by FirstRanker.com ---
Z = ------------- = -1200
--- Content provided by FirstRanker.com ---
Area between Z = -1 and Z = 0 is 0.34134
Area for Z = -1 = 0.5 ? 0.34134 = 0.15866
--- Content provided by FirstRanker.com ---
Therefore, the percentage = 0.15866 x 100 = 15.86%--- Content provided by FirstRanker.com ---
(ii)
en,
--- Content provided by FirstRanker.com ---
WhX = 1200,
--- Content provided by FirstRanker.com ---
1200 ? 1000
Z = -------------- = 1
--- Content provided by FirstRanker.com ---
200Area between Z = 0 and Z = 1 is 0.34134
Area between X = 400 to X = 600
--- Content provided by FirstRanker.com ---
i.e., Z = -1 and Z = 1 is 0.34134 + 0.34134 = 0.6826 = 68.26%
--- Content provided by FirstRanker.com ---
0.6826
0.1586
--- Content provided by FirstRanker.com ---
8001000
1200
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3.
--- Content provided by FirstRanker.com ---
TESTING OF HYPOTHESIS
--- Content provided by FirstRanker.com ---
3.1Test of Significance for Large Samples
The test of significance for the large samples can be explained by the following
--- Content provided by FirstRanker.com ---
assumptions:
(i)
--- Content provided by FirstRanker.com ---
The random sampling distribution of statistics is approximately normal.(ii)
Sampling values are sufficiently close to the population value and can be
--- Content provided by FirstRanker.com ---
used for the calculation of standard error of estimate.
--- Content provided by FirstRanker.com ---
1.The standard error of mean.
In the case of large samples, when we are testing the significance of statistic, the
--- Content provided by FirstRanker.com ---
concept of standard error is used. It measures only sampling errors. Sampling
errors are involved i
--- Content provided by FirstRanker.com ---
n estimating a population parameter from a sample, insteadof including all the essential information in the population.
--- Content provided by FirstRanker.com ---
(i)
when standard deviation of the population is known, the formula is
--- Content provided by FirstRanker.com ---
pS.E. X = ----
n
--- Content provided by FirstRanker.com ---
Where,
--- Content provided by FirstRanker.com ---
S.E.X = The standard error of the mean, p = Standard deviation of the
population, and n = Number of observations in the sample.
--- Content provided by FirstRanker.com ---
(ii)When standard deviation of population is not known, we have to use the
standard deviation of the sample in calculating standard error of mean. The
--- Content provided by FirstRanker.com ---
formula is
(Sample)
--- Content provided by FirstRanker.com ---
S.E. X = ------------n
--- Content provided by FirstRanker.com ---
Where,
= standard deviation of the sample, and n = sample size
--- Content provided by FirstRanker.com ---
Illustration: A sample of 100 students from Pondicherry University was taken
and their average was found to be 116 lbs with a standard deviation of 20 lbs.
--- Content provided by FirstRanker.com ---
Could the mean weight of students in the population be 125 pounds?
--- Content provided by FirstRanker.com ---
Solution:Let us take the hypothesis that there is no significant difference between the
sample mean and the hypothetical population mean.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
20 20S.E. X = ---- = -------- = -------- = 2
--- Content provided by FirstRanker.com ---
n 100 10
--- Content provided by FirstRanker.com ---
Difference 125 ? 116 9
-------------- = ------------- = ------- = 4.5
--- Content provided by FirstRanker.com ---
S.E.X 2 2
--- Content provided by FirstRanker.com ---
Since, the difference is more than 2.58 S.E.( %1
level) it could not have arisen
--- Content provided by FirstRanker.com ---
due to fluctuations of sampling. Hence the mean weight of students in the
population could not be 125 lbs.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3.2Test of Significance for Small Samples
If the sample size is less than 30, then those samples may be regarded as
--- Content provided by FirstRanker.com ---
small samples. As a rule, the methods and the theory of large samples are not
applicable to the small samples. The small samples are used in testing a given
--- Content provided by FirstRanker.com ---
yh pothesis, to find out the observed values, which could have arisen by
sampling fluctuations from some values given in advance. In a small sample,
--- Content provided by FirstRanker.com ---
the investigator's estimate will vary widely from sample to sample. An
inference drawn from a smaller sample result is less precise than the inference
--- Content provided by FirstRanker.com ---
drawn from a large sample result.t-distribution will be employed, when the sample size is 30 or less and
the population standard deviation is unknown.
--- Content provided by FirstRanker.com ---
The formula is
--- Content provided by FirstRanker.com ---
( X - ?)
t = ------- x n
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Where, = (X ? X)2/n ? 1
--- Content provided by FirstRanker.com ---
Illustratio :
n The following results are obtained from a sample of 20 boxes of
--- Content provided by FirstRanker.com ---
am ngoes:
Mean weight of contents = 490gms,
--- Content provided by FirstRanker.com ---
Standard deviation of the weight = 9 gms.
Could the sample come from a population having a mean of 500 gms?
--- Content provided by FirstRanker.com ---
Solution:
Let us take the hypothesis that ? = 510 gms.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
( X - ?)
t = ------- x n
--- Content provided by FirstRanker.com ---
X = 500; ? = 510; = 10; n = 20.
500 ? 510
--- Content provided by FirstRanker.com ---
t = ------------- x 20
10
--- Content provided by FirstRanker.com ---
Df = 20 ? 1 = 19 = (10/9) 20 = (10/9) x 4.47 = 44.7/9 = 4.96
Df = 19, t0.01 = 3.25
--- Content provided by FirstRanker.com ---
The computed value is less than the table value. Hence, our null hypothesis is
--- Content provided by FirstRanker.com ---
accepted.4. CHI-SQUARE
--- Content provided by FirstRanker.com ---
TEST
F, t and Z tests were based on the assumption that the samples were drawn from
--- Content provided by FirstRanker.com ---
nor am lly distributed populations. The testing procedure requires assumption
about the type of population or parameters, and these tests are known as
--- Content provided by FirstRanker.com ---
`parametric tests'.
--- Content provided by FirstRanker.com ---
There are many situations in which it is not possible to make any rigidassumption about the distribution of the population from which samples are
being drawn. This limitation has led to the development of a group of
--- Content provided by FirstRanker.com ---
alternative techniques known as n
a
--- Content provided by FirstRanker.com ---
on-p rametric tests. Chi-square test ofindependence and goodness of fit is a prominent example of the use of non-
parametric tests.
--- Content provided by FirstRanker.com ---
Though non-parametric theory developed as early as the middle of the
--- Content provided by FirstRanker.com ---
nineteenth century, it was only after 1945 that non-parametric tests came to be--- Content provided by FirstRanker.com ---
used widely in sociological and psychological research. The main reasons forthe increasing use of non-parametric tests in business research are:-
(i)
--- Content provided by FirstRanker.com ---
These statistical tests are distribution-free
(ii)
--- Content provided by FirstRanker.com ---
They are usually computationally easier to handle and understand thane
--- Content provided by FirstRanker.com ---
param tric tests; and
(iii)
--- Content provided by FirstRanker.com ---
They can be used with type of measurements that prohibit the use ofparametric tests.
--- Content provided by FirstRanker.com ---
The 2 test is one of the simplest and most widely used non-parametric
tests in statistical work. It is defined as:
--- Content provided by FirstRanker.com ---
(O ? E)2
--- Content provided by FirstRanker.com ---
2 = ------------
E
--- Content provided by FirstRanker.com ---
Where O = the observed frequencies, and E = the expected frequencies.Steps: The steps required to determine the value of 2are:
(i)
--- Content provided by FirstRanker.com ---
Calculate the expected frequencies. In general the expected frequency
for any cell can be calculated from
--- Content provided by FirstRanker.com ---
:the following equation
R X C
--- Content provided by FirstRanker.com ---
E = ------------
--- Content provided by FirstRanker.com ---
NWhere E = Expected frequency, R = row's total of the respective cell, C =
column's total of the respective cell and N = the total num
--- Content provided by FirstRanker.com ---
bservations.
ber of o
--- Content provided by FirstRanker.com ---
(ii)Take the difference between observed and expected frequencies and
obtain the squares of these differences. Symbolically, it can be represented as
--- Content provided by FirstRanker.com ---
(O ? E)2
--- Content provided by FirstRanker.com ---
(iii)Divide the values of (O ? E)2 obtained in step (ii) by the respective
expected frequency and obtain the total, which can be symbolically represented
--- Content provided by FirstRanker.com ---
by [(O ? E)2/E]. This gives the value of 2 which can range from zero to
--- Content provided by FirstRanker.com ---
infinity. If 2 is zero it means that the observed and expected frequencies
completely coincide. The greater the discrepancy between the observed and
--- Content provided by FirstRanker.com ---
expected frequencies, the greater shall be the value of 2.The computed value of 2 is compared with the table value of 2 for
--- Content provided by FirstRanker.com ---
given degrees of freedom at a certain specified level of significance. If at the
stated level, the calculated value of 2 is less than the table value, the difference
--- Content provided by FirstRanker.com ---
between theory and observation is not considered as significant.The following observation may be made with regard to the 2
distribution:-
--- Content provided by FirstRanker.com ---
(i)
--- Content provided by FirstRanker.com ---
The sum of the observed and expected frequencies is always zero.Symbolically,
) =
--- Content provided by FirstRanker.com ---
(O ? E
O - E = N ? N = 0
--- Content provided by FirstRanker.com ---
(ii)The 2 test depends only on the set of observed and expected frequencies
and on degrees of freedom v. It is a non-parametric test.
--- Content provided by FirstRanker.com ---
(iii)
2 distribution is a limiting approximation of the multinomial
--- Content provided by FirstRanker.com ---
distribution.(iv)
Even though 2 distribution is essentially a continuous distribution it can
--- Content provided by FirstRanker.com ---
be applied to discrete random variables whose frequencies can be counted and
tabulated with or without grouping.
--- Content provided by FirstRanker.com ---
The Chi-Square DistributionFor large sample sizes, the sampling distribution of 2 can be closely
approximated by a continuous curve known as the Chi-square distribution. The
--- Content provided by FirstRanker.com ---
probability function of 2 distribution is:
F(2) = C (2)(v/2 ? 1)e ? x2/2
--- Content provided by FirstRanker.com ---
Where e = 2.71828, v = number of degrees of freedom, C = a constantdepending only on v.
--- Content provided by FirstRanker.com ---
The 2 distribution has only one parameter, v, the number of degrees of
--- Content provided by FirstRanker.com ---
freedom. As in case of t-distribution there is a distribution for each different
number of degrees of freedom. For very small number of degrees of freedom,
--- Content provided by FirstRanker.com ---
the Chi-square distribution is severely skewed to the right. As the number ofdegrees of freedom increases, the curve rapi l
d y becomes more symmetrical. For
--- Content provided by FirstRanker.com ---
large values of v the Chi-square distribution is closely approximated by the
normal curve.
--- Content provided by FirstRanker.com ---
The following diagram gives 2 distribution for 1, 5 and 10 degrees offreedom:
F(x2)
--- Content provided by FirstRanker.com ---
v = 1
v = 5
--- Content provided by FirstRanker.com ---
v = 100
2
--- Content provided by FirstRanker.com ---
4
6
--- Content provided by FirstRanker.com ---
810
12
--- Content provided by FirstRanker.com ---
14
16 18
--- Content provided by FirstRanker.com ---
2022
2
--- Content provided by FirstRanker.com ---
2 Distribution
--- Content provided by FirstRanker.com ---
It is clear from the given diagram that as the degrees of freedom
increase, the curve becomes more and more symmetric. The Chi-square
--- Content provided by FirstRanker.com ---
distribution is a probability distribution and the total area under the curve in
each chi-square distribution is unity.
--- Content provided by FirstRanker.com ---
Properties of 2 distribution
--- Content provided by FirstRanker.com ---
The main Properties of 2 distribution are:-
--- Content provided by FirstRanker.com ---
(i)
the mean of the 2 distribution is equal to the number of degrees of freedom, i.e.,
--- Content provided by FirstRanker.com ---
X = v(ii)
the variance of the 2 distribution is twice the degrees of freedom, Variance =
--- Content provided by FirstRanker.com ---
2v
(iii)
--- Content provided by FirstRanker.com ---
?1 = 0,(iv)
?2 = 2v,
--- Content provided by FirstRanker.com ---
(v)
?3 = 8v,
--- Content provided by FirstRanker.com ---
(vi)?4 = 48v + 12v2.
2
--- Content provided by FirstRanker.com ---
?3 64v2 8
(vii) 1 = --- = ----- = --
--- Content provided by FirstRanker.com ---
2?2 8v3 v
--- Content provided by FirstRanker.com ---
?4 48v + 12v2
12
--- Content provided by FirstRanker.com ---
)
(v
--- Content provided by FirstRanker.com ---
1?3 = ---- = --------------- = 3 + ---
2
--- Content provided by FirstRanker.com ---
?24v2
--- Content provided by FirstRanker.com ---
v
--- Content provided by FirstRanker.com ---
The table values of 2are available only up to 30 degrees of freedom.
--- Content provided by FirstRanker.com ---
For degrees of freedom greater than 30, the distribution of 22 approximates
the normal distribution. For degrees of freedom greater than 30, the
--- Content provided by FirstRanker.com ---
approximation is acceptable close. The mean of the distribution 22 is 2v ? 1,and the standard deviation is equal to 1. Thus the application of the test is
simple, for deviation of 22 from 2v ? 1 may be interpreted as a normal
--- Content provided by FirstRanker.com ---
deviate with units standard deviation. That is,
Z = 22 - 2v ? 1
--- Content provided by FirstRanker.com ---
Alternative Method of Obtaining the Value of 2In a 2x2 table where the cell frequencies and marginal totals are as below:
--- Content provided by FirstRanker.com ---
a
--- Content provided by FirstRanker.com ---
b
--- Content provided by FirstRanker.com ---
(a+b)
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
c--- Content provided by FirstRanker.com ---
d
(c+d)
--- Content provided by FirstRanker.com ---
(a+c)
--- Content provided by FirstRanker.com ---
(b+d)
--- Content provided by FirstRanker.com ---
NN is the total frequency and ad the larger cross-product, the value of 2
--- Content provided by FirstRanker.com ---
can easily be obtained by the following formula:
N (ad ? bc)2
--- Content provided by FirstRanker.com ---
2 = --------------------------------- or(a + c) (b + d) (c + d) (a + b)
--- Content provided by FirstRanker.com ---
With Yate's corrections
--- Content provided by FirstRanker.com ---
N (ab ? bc - ?N)22
= -----------------------------------
--- Content provided by FirstRanker.com ---
(a + c) (b + d) (c + d) (a + b)
--- Content provided by FirstRanker.com ---
Conditions for applying 2 test:
--- Content provided by FirstRanker.com ---
The main conditions considered for employing the 2 test are:
(i) N must be to ensure the similarity between theoretically correct distribution and
--- Content provided by FirstRanker.com ---
our sampling distribution of 2.(ii) No theoretical cell frequency should be small when the expected frequencies are
--- Content provided by FirstRanker.com ---
too small. If it is so, then the value of 2 will be overestimated and will result in
too many rejections of the null hypothesis. To avoid making incorrect
--- Content provided by FirstRanker.com ---
inferences, a general rule is followed that expected frequency of less than 5 inone cell of a contingency table is too small to use. When the table contains
more than one cell with an expected frequency of less than 5 then add with the
--- Content provided by FirstRanker.com ---
preceding or succeeding frequency so that the resulting sum is 5 or more.
--- Content provided by FirstRanker.com ---
However, in doing so, we reduce the number of categories of data and will gain
less information from contingency table.
--- Content provided by FirstRanker.com ---
(iii) The constraints on the cell frequencies if any should be linear, i.e., they should
not involve square and higher powers of the frequencies such as O = E = N.
--- Content provided by FirstRanker.com ---
Uses of 2 test:
--- Content provided by FirstRanker.com ---
The main uses of 2 test are:
--- Content provided by FirstRanker.com ---
(i)
2 test as a test of independence. With the help of 2 test, we can find
--- Content provided by FirstRanker.com ---
out whether two or more attributes are associated or not. Let's assume that wehave N observations classified according to some attributes. We may ask
whether the attributes are related or independent. Thus, we can find out whether
--- Content provided by FirstRanker.com ---
there is any association between skin colour of husband and wife. To examine
the attributes that are associated, we formulate the null hypothesis that there is
--- Content provided by FirstRanker.com ---
no association against an alternative hypothesis and that there is an associationbetween the attributes under study. If the calculated value of 2 is less than the
table value at a certain level of significance, we say that the result of the
--- Content provided by FirstRanker.com ---
experiment provides no evidence for doubting the hypothesis. On the other
hand, if the calculated value of 2 is greater than the table value at a certain level
--- Content provided by FirstRanker.com ---
of significance, the results of the experiment do not support the hypothesis.(ii)
--- Content provided by FirstRanker.com ---
2 test as a test of goodness of fit. This is due to the f
le
--- Content provided by FirstRanker.com ---
act that it enab sus to ascertain how appropriately the theoretical distributions such as binomial,
Poisson, Normal, etc., fit empirical distributions. When an ideal frequency
--- Content provided by FirstRanker.com ---
curve whether normal or some other type is fitted to the data, we are interested
in finding out how well this curve fits with the observed facts. A test of the
--- Content provided by FirstRanker.com ---
concordance of the two can be made just by inspection, but such a test is
--- Content provided by FirstRanker.com ---
obviously inadequate. Precision can be secured by applying the 2 test.(iii)
--- Content provided by FirstRanker.com ---
2 test as a test of Homogeneity. The 2 test of homogeneity is an
extension of the chi-square test of independence. Tests of homogeneity are
--- Content provided by FirstRanker.com ---
designed to determine whether two or more independent random samples aredrawn from the same population or from different populations. Instead of one
sample as we use with independence pr
--- Content provided by FirstRanker.com ---
em
obl
--- Content provided by FirstRanker.com ---
we shall now have 2 or moresamples. For example, we may be interested in finding out whether or not
university students of various levels, i.e., mi
--- Content provided by FirstRanker.com ---
e
ddle and richer poor incom groups
--- Content provided by FirstRanker.com ---
are homogeneous in performination.
ance in the exam
--- Content provided by FirstRanker.com ---
Illustration: In an anti-diabetes campaign in a certain area, a particular
e
--- Content provided by FirstRanker.com ---
m dicine, say x was administered to 812 persons out of a total population of3248. The num er of
b
--- Content provided by FirstRanker.com ---
diabetes cases is shown below:
Treatment
--- Content provided by FirstRanker.com ---
Diabetes
No
--- Content provided by FirstRanker.com ---
Total
Diabetes
--- Content provided by FirstRanker.com ---
Medicinex
20
--- Content provided by FirstRanker.com ---
79
--- Content provided by FirstRanker.com ---
2812
No
--- Content provided by FirstRanker.com ---
Medicine
x 220
--- Content provided by FirstRanker.com ---
22162436
Total
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
24008
3248
--- Content provided by FirstRanker.com ---
Discuss the usefulness of medicine x in checking malaria.
--- Content provided by FirstRanker.com ---
Solution: Let us take the hypothesis that quinine is not effective in checking
diabetes. Applying 2 test :
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(A) X (B) 240 x 812
Expectation of (AB) = ------------ = ------------ = 60
--- Content provided by FirstRanker.com ---
N 3248
Or E1, i.e., expected frequency corresponding to fi
--- Content provided by FirstRanker.com ---
nd first columrst row a
n is 60.
--- Content provided by FirstRanker.com ---
the bale of expected frequencies shall be:
60 752 812
--- Content provided by FirstRanker.com ---
180 2256 2436
--- Content provided by FirstRanker.com ---
240 3008 3248--- Content provided by FirstRanker.com ---
O
--- Content provided by FirstRanker.com ---
E
--- Content provided by FirstRanker.com ---
(O ? E)2
--- Content provided by FirstRanker.com ---
(O ? E)2/E20
60
--- Content provided by FirstRanker.com ---
1600
26.667
--- Content provided by FirstRanker.com ---
220180
1600
--- Content provided by FirstRanker.com ---
8.889
792
--- Content provided by FirstRanker.com ---
7521600
2.218
--- Content provided by FirstRanker.com ---
2216
2256
--- Content provided by FirstRanker.com ---
16000.709
2
--- Content provided by FirstRanker.com ---
[(O ? E) /E] = 38.593
--- Content provided by FirstRanker.com ---
22
= [(O ? E) /E] = 38.593
--- Content provided by FirstRanker.com ---
v = (r ? 1) (c ? 1) = (2 ? 1) (2 ? 1) = 1
2
--- Content provided by FirstRanker.com ---
for v = 1, 0.05 = 3.84The calculated value of 2 is greater than the table value. The hypothesis is
--- Content provided by FirstRanker.com ---
rejected. Hence medicine x is useful in checking malaria.
--- Content provided by FirstRanker.com ---
Illustration: In an experiment on immunization of cattle from tuberculosis thefollowing results were obtained:
Affected Not
--- Content provided by FirstRanker.com ---
affected
Inoculated
--- Content provided by FirstRanker.com ---
10
--- Content provided by FirstRanker.com ---
20
--- Content provided by FirstRanker.com ---
Not inoculated15
--- Content provided by FirstRanker.com ---
5
--- Content provided by FirstRanker.com ---
Calculate 2 and discuss the effect of vaccine in controlling susceptibility to
tuberculosis (5% value of 2 for one degree of freedom = 3.84).
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Solution: Let us take the hypothesis that the vaccine is not effective in
controlling susceptibility to
--- Content provided by FirstRanker.com ---
is. Applyingtuberculos
2 test:
--- Content provided by FirstRanker.com ---
N(ad ? bc)2 50 (11x5 ? 20x15)2
--- Content provided by FirstRanker.com ---
2 = -------------------------- = ------------------------ = 8.3(a+b) (c+d)(a+c)(b+d) 30x20x25x25
--- Content provided by FirstRanker.com ---
Since the calculated value of 2 is greater than the table value the hypothesis is
not true. We, therefore, conclude the vaccine is effective in controlling
--- Content provided by FirstRanker.com ---
susceptibility to tuberculosis.5. INDEX
--- Content provided by FirstRanker.com ---
NUMBERS
An Index Number is used to measure the level of a certain phenomenon
--- Content provided by FirstRanker.com ---
as compared to the level of the sameenon at som
phenom
--- Content provided by FirstRanker.com ---
e standard period. An
Index Number is a statistical device for comparing the general level of
--- Content provided by FirstRanker.com ---
magnitude of a group of related variables in two or more situations. If we wantto compare the price level of 2004 with what it was in 2000, we may have to
look into a group of variables ? prices of rice, wheat, vegetables clothes, etc.
--- Content provided by FirstRanker.com ---
Hence, we will have one figure to indicate the changes of different commodities
as a whole and it is cal
--- Content provided by FirstRanker.com ---
bled an Index Num er.
Utility of Index Number:
--- Content provided by FirstRanker.com ---
The main uses of index numbers are:
--- Content provided by FirstRanker.com ---
(i)Index Numbers are particularly useful in measuring relative changes. Example
Changes in level of price, production, etc.
--- Content provided by FirstRanker.com ---
(ii) Index numbers are economic barometers. Various index numbers computed for
different purposes, like employment, trade, agriculture are of immense value in
--- Content provided by FirstRanker.com ---
dealing with different economic problems.--- Content provided by FirstRanker.com ---
(iii) Index numbers are useful to compute the standard of living. Index numbers maymeasure the cost of living of different classes and comparison across groups
becomes easier.
--- Content provided by FirstRanker.com ---
(iv) They help in form
policies.
--- Content provided by FirstRanker.com ---
ulatingFor instance increase or decrease in wages
require the study of the cost of living index numbers.
--- Content provided by FirstRanker.com ---
Steps in construction of Index Numbers:
--- Content provided by FirstRanker.com ---
The main steps involved in the construction of index numbers are:)
(i
--- Content provided by FirstRanker.com ---
Purpose. The researcher must clearly define the purpose for which the index
numbers are to be constructed. For example, cost of living index numbers of
--- Content provided by FirstRanker.com ---
workers in an industrial area and those of the workers of an agricultural area aredifferent in respect of requirement. So, it is very essential to define the purpose
of the index numbers.
--- Content provided by FirstRanker.com ---
i)
--- Content provided by FirstRanker.com ---
(iSelection of Base. The base period is important for the construction of index
numbers. When we select a base year, the year must be recent and normal. A
--- Content provided by FirstRanker.com ---
normal year is one which is free from economic and natural, social and
economic disturbance. Besides, when we are selecting the base period one of
--- Content provided by FirstRanker.com ---
the following criteria should be considered (a) Fixed base, (b) Average base, (b).
Chain Base
--- Content provided by FirstRanker.com ---
(iii) Selection of commodities. We should in
--- Content provided by FirstRanker.com ---
oclude important c mmodities and they
are represen
--- Content provided by FirstRanker.com ---
th
tative of
--- Content provided by FirstRanker.com ---
e defined purpose. For the purpose of finding the cost ofliving index number for lo
e
--- Content provided by FirstRanker.com ---
w incom groups, the selected items should be mostly
consumed by that group.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(iv) Sources of data. The price relating tobe
the thing to
--- Content provided by FirstRanker.com ---
measured must be
ollected. If we want to study the ch
--- Content provided by FirstRanker.com ---
canges in industrial production, we must
lect
--- Content provided by FirstRanker.com ---
col
ting to the pr
--- Content provided by FirstRanker.com ---
the prices relaoduction of various goods of factories.
--- Content provided by FirstRanker.com ---
(v) Weighting. All commodities are not equally important because different groups
of people will have different preferences on different commodities. For
--- Content provided by FirstRanker.com ---
instance, when the price of rice is doubled than the price of ice-cream, then thepeople suffer much, due
price of rice which is essential. Therefore, a
--- Content provided by FirstRanker.com ---
to hike in
ould be given for each
--- Content provided by FirstRanker.com ---
relative weight shcommodity based on its importance.
--- Content provided by FirstRanker.com ---
(vi) Choice of Formulae. The index number computed based on different formulas
roduce different results. Hence,
--- Content provided by FirstRanker.com ---
usually pthe problem is perhaps of greater
theoretical than practical importance. In general, choice of the formula to be
--- Content provided by FirstRanker.com ---
used depends upon the availabi
nature, purpose and scope of
--- Content provided by FirstRanker.com ---
lity of data and thethe study.
--- Content provided by FirstRanker.com ---
The various methods of construction of index number are:
--- Content provided by FirstRanker.com ---
1.Unweighted
(a)
--- Content provided by FirstRanker.com ---
Simple Aggregate
(b)
--- Content provided by FirstRanker.com ---
Simple average of price relative2.
--- Content provided by FirstRanker.com ---
Weighted
(a)
--- Content provided by FirstRanker.com ---
Weighted Aggregate(b)
Weighted average of price relative.
--- Content provided by FirstRanker.com ---
1.
--- Content provided by FirstRanker.com ---
Unweighted(a)
Simple Aggregate method.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The price of the different commodities of the current year is added to the total
and it is divided by the sum of the prices of the base year commodity and
--- Content provided by FirstRanker.com ---
multiplied by 100: symbolically,1
P x 100
--- Content provided by FirstRanker.com ---
P01 = -------------
P0
--- Content provided by FirstRanker.com ---
Where,01
P = Price index number for the current year with reference to the base year.
--- Content provided by FirstRanker.com ---
P1 = Aggregate of prices for the current year, and
P0 = Aggregate of prices for the base year.
--- Content provided by FirstRanker.com ---
(b)
--- Content provided by FirstRanker.com ---
Simple average of price relative method.Under this method, the price relative of each item is calculated separately and
then averaged. A price relative is the price of the current year expressed as a
--- Content provided by FirstRanker.com ---
percentage of the price of the base year:
P1 x 100
--- Content provided by FirstRanker.com ---
------------P0
P
--- Content provided by FirstRanker.com ---
P01 = ------------------ = -----
--- Content provided by FirstRanker.com ---
N
--- Content provided by FirstRanker.com ---
N
Where, N = Number of ite s
--- Content provided by FirstRanker.com ---
m , P = P1 x 100 / P0If we employ geometric ean in the pl
m
--- Content provided by FirstRanker.com ---
ace of the arithmetic mean the
n the
--- Content provided by FirstRanker.com ---
formula is--- Content provided by FirstRanker.com ---
P1 x 100
--- Content provided by FirstRanker.com ---
log ------------
--- Content provided by FirstRanker.com ---
P0 logP
P01 = antilog ------------------ = antilog -------
--- Content provided by FirstRanker.com ---
N
--- Content provided by FirstRanker.com ---
NIllustration: Compute a price index for the following by (a) simple aggregate
--- Content provided by FirstRanker.com ---
and (b) average of price relative method by using both arithmetic mean and
geometric mean:
--- Content provided by FirstRanker.com ---
Commodity
--- Content provided by FirstRanker.com ---
A B C D E F
rice
--- Content provided by FirstRanker.com ---
Pin
2000
--- Content provided by FirstRanker.com ---
(Rs.)
20 30 10 25 40 50
--- Content provided by FirstRanker.com ---
Pricein
2005
--- Content provided by FirstRanker.com ---
(Rs.)
15 35 45 55
--- Content provided by FirstRanker.com ---
25 30Solution: Ca
--- Content provided by FirstRanker.com ---
Price Index
lculation for
--- Content provided by FirstRanker.com ---
Commodity Pricein
Price
--- Content provided by FirstRanker.com ---
2000
in 2005 Price relative
--- Content provided by FirstRanker.com ---
log P--- Content provided by FirstRanker.com ---
P0
--- Content provided by FirstRanker.com ---
P1 P= P1/P0 x 100A
--- Content provided by FirstRanker.com ---
20
25
--- Content provided by FirstRanker.com ---
1252.0969
B
--- Content provided by FirstRanker.com ---
30
30
--- Content provided by FirstRanker.com ---
1002.0000
C
--- Content provided by FirstRanker.com ---
10
15
--- Content provided by FirstRanker.com ---
1502.1761
D
--- Content provided by FirstRanker.com ---
25
--- Content provided by FirstRanker.com ---
35140
2.1461
--- Content provided by FirstRanker.com ---
E
--- Content provided by FirstRanker.com ---
4045
112.5
--- Content provided by FirstRanker.com ---
2.0511
F
--- Content provided by FirstRanker.com ---
5055
--- Content provided by FirstRanker.com ---
110
2.0414
--- Content provided by FirstRanker.com ---
175 205737.5 12.5116
--- Content provided by FirstRanker.com ---
P1 x 100
--- Content provided by FirstRanker.com ---
(a) Simple Aggregative Index = -----------P0
--- Content provided by FirstRanker.com ---
P0 = 175, P1 = 205
205
--- Content provided by FirstRanker.com ---
= ----- x 100 = 117.143175
--- Content provided by FirstRanker.com ---
(b) (i) Arithmetic mean of Price
--- Content provided by FirstRanker.com ---
Relatives = P / N
P = 737.5, N = 6
--- Content provided by FirstRanker.com ---
= 737.5 / 6 = 122.92(ii) Geometric Mean of Price
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
logPRelative Index = Antilog ----------
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
N
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
12.5116= Antilog ----------- = Antilog 2.0853 = 121.7
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
6--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Weighted Index Numbersethod, prices them
Under this m
--- Content provided by FirstRanker.com ---
selves are weighted by quantities, i.e., p*q. Thus
physical quantities are used
--- Content provided by FirstRanker.com ---
ights. The diffeas we
rent methods of assigning
--- Content provided by FirstRanker.com ---
weights are:
(a)
--- Content provided by FirstRanker.com ---
'sLaspeyre
method,
--- Content provided by FirstRanker.com ---
(b)
Paasche's
--- Content provided by FirstRanker.com ---
method,
--- Content provided by FirstRanker.com ---
(c)
Bowley
--- Content provided by FirstRanker.com ---
ethod,Dorfish
m
--- Content provided by FirstRanker.com ---
(d)
--- Content provided by FirstRanker.com ---
Fisher's Ideal method,(e)
--- Content provided by FirstRanker.com ---
Marshall Edgworth method,
(f)
--- Content provided by FirstRanker.com ---
Kelley'sMethod
--- Content provided by FirstRanker.com ---
)
(a
--- Content provided by FirstRanker.com ---
Laspeyre's method.Under this me
uantitie
--- Content provided by FirstRanker.com ---
thod, the base year q
s are taken as weights: symbolically,
--- Content provided by FirstRanker.com ---
p1q 0--- Content provided by FirstRanker.com ---
P01(La) = --------- x 100--- Content provided by FirstRanker.com ---
p0q 0
--- Content provided by FirstRanker.com ---
(b)Paasche's method.
The current year quantities are taken as weights under Paasche's method:
--- Content provided by FirstRanker.com ---
symbolically,
p1q 1
--- Content provided by FirstRanker.com ---
P01(Pa) = -------- x 100--- Content provided by FirstRanker.com ---
p0q 1
--- Content provided by FirstRanker.com ---
(c)Bowley Dorfish method.
This is an index number got by the arithmetic mean of Laspeyre's and Paasche's
--- Content provided by FirstRanker.com ---
methods; symbolically
p
--- Content provided by FirstRanker.com ---
1q 0 p1q 1-------- + --------
--- Content provided by FirstRanker.com ---
p
--- Content provided by FirstRanker.com ---
0q 0 p0q 1 L + PP01(B) = ---------------------- x 100 = ------
--- Content provided by FirstRanker.com ---
2 2
Where, L = Laspeyre's method & P = Paasche's method.
--- Content provided by FirstRanker.com ---
(d)
Fisher's Ideal method.
--- Content provided by FirstRanker.com ---
Fisher's price index n m
u ber is given by the geometric mean of Laspeyre's and
--- Content provided by FirstRanker.com ---
Paasche's Index; symbolically,p
--- Content provided by FirstRanker.com ---
1q 0 p1q 1
P01(F) = L x P = -------- x -------- x 100
--- Content provided by FirstRanker.com ---
p
--- Content provided by FirstRanker.com ---
0q 0 p0q 1
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
)(e
d
--- Content provided by FirstRanker.com ---
Marshall E geworth Method
--- Content provided by FirstRanker.com ---
p1 (q 1 + q 0)0
P M
--- Content provided by FirstRanker.com ---
1( a) = -----------------
--- Content provided by FirstRanker.com ---
0
p (q 0 + q 1)
--- Content provided by FirstRanker.com ---
By removal of brackets,
--- Content provided by FirstRanker.com ---
p
1q 0 p1q 1
--- Content provided by FirstRanker.com ---
P01(Ma) = -------- + -------- x 100
--- Content provided by FirstRanker.com ---
p
--- Content provided by FirstRanker.com ---
0q 0
p0
--- Content provided by FirstRanker.com ---
1q
--- Content provided by FirstRanker.com ---
(f)
--- Content provided by FirstRanker.com ---
Kelley's method.--- Content provided by FirstRanker.com ---
p1q
--- Content provided by FirstRanker.com ---
P01(K) = ------- x 100
p0q
--- Content provided by FirstRanker.com ---
q = q 0 + q1 2
--- Content provided by FirstRanker.com ---
/--- Content provided by FirstRanker.com ---
Illustrations:
Calculate various weighted index number from
--- Content provided by FirstRanker.com ---
lowing data:the fol
--- Content provided by FirstRanker.com ---
ar
--- Content provided by FirstRanker.com ---
Base yeCurrent year
--- Content provided by FirstRanker.com ---
Kilo
--- Content provided by FirstRanker.com ---
Rate (Rs.)Kilo
--- Content provided by FirstRanker.com ---
Rate (Rs.)
Bread
--- Content provided by FirstRanker.com ---
103
10
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
4
--- Content provided by FirstRanker.com ---
Meat
20
--- Content provided by FirstRanker.com ---
1516
--- Content provided by FirstRanker.com ---
20
--- Content provided by FirstRanker.com ---
Tea2
--- Content provided by FirstRanker.com ---
20
--- Content provided by FirstRanker.com ---
330
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Solution:--- Content provided by FirstRanker.com ---
Base year
Current year
--- Content provided by FirstRanker.com ---
Kilo Rate Kilo Rate p1q 0
--- Content provided by FirstRanker.com ---
p0q 0p1q1
p0q1
--- Content provided by FirstRanker.com ---
(Rs.)
(Rs.)
--- Content provided by FirstRanker.com ---
Q 0p0
Q1
--- Content provided by FirstRanker.com ---
p1
Bread
--- Content provided by FirstRanker.com ---
103.00
10
--- Content provided by FirstRanker.com ---
4.00
40.00
--- Content provided by FirstRanker.com ---
30.0040.00
30.00
--- Content provided by FirstRanker.com ---
Meat
20
--- Content provided by FirstRanker.com ---
15.016
20.0
--- Content provided by FirstRanker.com ---
400.00
300.00
--- Content provided by FirstRanker.com ---
320.00 240.00Tea
2
--- Content provided by FirstRanker.com ---
0
3
--- Content provided by FirstRanker.com ---
060.00
40.00
--- Content provided by FirstRanker.com ---
90.00
60.00
--- Content provided by FirstRanker.com ---
20.030.0
0
--- Content provided by FirstRanker.com ---
0
Total
--- Content provided by FirstRanker.com ---
500.00370.00
450.00 330.00
--- Content provided by FirstRanker.com ---
(a) Laspeyre's
--- Content provided by FirstRanker.com ---
methodp1q 0 x 100 500 x 100
P01(La) = ----------------- = ---------------- = 135.1
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
p0q 0 370.00(b)
--- Content provided by FirstRanker.com ---
Paasche's method
p1q 1 x 100 450 x 100
--- Content provided by FirstRanker.com ---
P01(Pa) = ----------------- = -------------- = 136.4--- Content provided by FirstRanker.com ---
p0q 1 370
--- Content provided by FirstRanker.com ---
(c) Bowley'sMethod
p
--- Content provided by FirstRanker.com ---
1q 0 p1q 1
-------- + --------
--- Content provided by FirstRanker.com ---
p0q 0 p0q 1 L + P
--- Content provided by FirstRanker.com ---
P01( = ------------
B)
--- Content provided by FirstRanker.com ---
---------- x 100 = ------2 2
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
L + P 135.1 + 136.1
--- Content provided by FirstRanker.com ---
= ------- = ------------------ = 135.8
2 2
--- Content provided by FirstRanker.com ---
(d)
Fisher's ideal formula
--- Content provided by FirstRanker.com ---
p
1q 0 p1q 1
--- Content provided by FirstRanker.com ---
P01(F) = L x P = -------- x -------- x 100--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
p0q 0 p0q 1
--- Content provided by FirstRanker.com ---
= L x P = (135.1 x 136.1) = 135.7
--- Content provided by FirstRanker.com ---
(e)Marshall Edgeworth method
p
--- Content provided by FirstRanker.com ---
1q 0 p1q 1 500 + 450
P01(Ma) = -------- + -------- x 100 = --------------- x 100
--- Content provided by FirstRanker.com ---
p
0q 0 p0q 1 500 + 330
--- Content provided by FirstRanker.com ---
950 x 100
--- Content provided by FirstRanker.com ---
= --------------- = 1.14 x 100 = 114
830
--- Content provided by FirstRanker.com ---
6.
--- Content provided by FirstRanker.com ---
ANALYSIS OF TIME SERIESAn arrangement of statistical data in accordance with time of occurrence or in a
chronological order is called a time series. The numerical data which we get at
--- Content provided by FirstRanker.com ---
different points of time is known as time series. It plays an important role in
economics, statistics and commerce. For example, if we observe agricultural
--- Content provided by FirstRanker.com ---
production, sales, national income etc., over a period of time, say the last 3 or 5years, the set of observations is called time series. The analysis of time series is
--- Content provided by FirstRanker.com ---
done ma
of for
--- Content provided by FirstRanker.com ---
inly for the purpose
ecasts and for evaluating the past
--- Content provided by FirstRanker.com ---
performances.Utility of Time series.
--- Content provided by FirstRanker.com ---
The main uses of time series are:
(i)
--- Content provided by FirstRanker.com ---
It helps in understanding the past behaviour and estimating the futurebehaviour.
(ii)
--- Content provided by FirstRanker.com ---
It helps in planning and forecasting and is very essential for the business
and economics to prepare plans for the future.
--- Content provided by FirstRanker.com ---
(iii)Comparison between data of one period with that of another period is
possible.
--- Content provided by FirstRanker.com ---
(iv)
We can evaluate the progress in any field of economic and business
--- Content provided by FirstRanker.com ---
activity with the help of time series data.(v)
Seasonal, cyclical, secular trend of data is useful not only to economists
--- Content provided by FirstRanker.com ---
but also to the businessmen.
--- Content provided by FirstRanker.com ---
Components of time series:There are four basic types of variations, which are called the components or
elements of time series. They are:
--- Content provided by FirstRanker.com ---
1.
Secular Trend,
--- Content provided by FirstRanker.com ---
2.Seasonal variation,
3.
--- Content provided by FirstRanker.com ---
Cyclical fluctuations, and
4.
--- Content provided by FirstRanker.com ---
irregular or random fluctuations.1.
--- Content provided by FirstRanker.com ---
Secular trend
The general tendency of the time series data to increase or decrease or stagnate
--- Content provided by FirstRanker.com ---
during a long period of time is called the secular trend, also known as long-termtrend. This phenomenon is usually observed in most of the series relating to
--- Content provided by FirstRanker.com ---
Economics and Business. For instance, an upward tendency is usually observed
in time series relating to population, production, prices, income, money in
--- Content provided by FirstRanker.com ---
l
circu ation etc. while a downward tendency is noticed in the time series relating
--- Content provided by FirstRanker.com ---
to deaths, epidemics etc. due to an advancement in medical technology,improved medical facilities, better sanitation, etc. In a long term trend, there are
two types of trend. They are:
--- Content provided by FirstRanker.com ---
(i)
Linear ? Straight Line Trend, and
--- Content provided by FirstRanker.com ---
(ii)Non-Linear or Curvilinear Trend.
--- Content provided by FirstRanker.com ---
(i)
Linear or Straight Line Trend. When the values of time series are
--- Content provided by FirstRanker.com ---
plotted on a graph, then it is called the straight line trend or linear trend.(ii)
--- Content provided by FirstRanker.com ---
Non-linear or Curvilinear Trend. When we plot the time series values
on a graph and if it forms a curve or a non-linear one, then it is called Non-linear
--- Content provided by FirstRanker.com ---
or Curvilinear Trend.2.
--- Content provided by FirstRanker.com ---
Seasonal Variation
A variation which occurs weekly, monthly or quarterly is known as Seasonal
--- Content provided by FirstRanker.com ---
mayVariation. The seasonal variation
occur due to the following reasons:
--- Content provided by FirstRanker.com ---
)
(i
--- Content provided by FirstRanker.com ---
Climate and natural forces:The result of natural forces like climate is causing seasonal variation. For
example, umbrellas are sold more in rainy season (in winter season).
--- Content provided by FirstRanker.com ---
(ii)
Customs and habits:
--- Content provided by FirstRanker.com ---
Man-made conventions are the customs, habits, fashion, etc. There is a custom
f wearing new clothes, preparing sweets fo
--- Content provided by FirstRanker.com ---
o
r Deepavali, Christmas etc. At that
--- Content provided by FirstRanker.com ---
time there is more demand for clothes, sweets, etc.3.
Cyclical Variation:
--- Content provided by FirstRanker.com ---
According to Lincoln L. Chou, "Up and down movements are different from
--- Content provided by FirstRanker.com ---
seasonal fluctuations, in that they extend over longer period of time-usually two
or more years". Most of economic and business time series are influenced by
--- Content provided by FirstRanker.com ---
the wave-like changes of prosperity adepression. There is periodic up and
nd
--- Content provided by FirstRanker.com ---
down movement. This movement is known as cyclical variation. There are four
es
--- Content provided by FirstRanker.com ---
phashey are
in a business cycle. T
--- Content provided by FirstRanker.com ---
a) Prosperity (boom), b) recession, c)
depression, and d) recovery.
--- Content provided by FirstRanker.com ---
4.Irregular variation:
Irregular variations arise owing to unforeseen and unpredictable forces at
--- Content provided by FirstRanker.com ---
random and affect the data. These variations are not regular ones. These are
caused by war, flood, strike etc.
--- Content provided by FirstRanker.com ---
In the classical time series model, the elements of trend, cyclical and
seasonal variations are viewed resulting from systematic influences. These
--- Content provided by FirstRanker.com ---
influences led to gradual growth, decline or recurrent movements and irregular
movements and are considered to be erratic movement. Therefore, the residual
--- Content provided by FirstRanker.com ---
that remains after the elimination of systematic components is taken asrepresenting irregular fluctuations.
--- Content provided by FirstRanker.com ---
Measurement of Secular Trend
The time series analysis is absolutely essential for planning. It guides the
--- Content provided by FirstRanker.com ---
planners to achieve better results. The study of trend enables the planner toproject the plan in a better
he following
--- Content provided by FirstRanker.com ---
direction. T
are the four methods which
--- Content provided by FirstRanker.com ---
can be used for determining the trend.(i)
Free-hand or Graphic Method,
--- Content provided by FirstRanker.com ---
(ii)
Semi-average Method,
--- Content provided by FirstRanker.com ---
(iii)Moving Average Method, and
(iv)
--- Content provided by FirstRanker.com ---
Method of Least Squares.
--- Content provided by FirstRanker.com ---
(i)
--- Content provided by FirstRanker.com ---
Graphic or Free-hand Fitting Method:This is the easiest, simplest and the most flexible method of estimating secular
trend. In this method we must plot the original data on the graph. Draw a
--- Content provided by FirstRanker.com ---
smooth curve carefully which will show the direction of the trend. Here time is
shown on the horizontal axis and the value of the variables is shown on the
--- Content provided by FirstRanker.com ---
vertical axis.he fr
For fitting a trend line by t
--- Content provided by FirstRanker.com ---
ee-hand method, the following points
should be taken into consideration:
--- Content provided by FirstRanker.com ---
a)the curve should be smooth.
b)
--- Content provided by FirstRanker.com ---
Approximately there must be equal number of points above and below
the curve.
--- Content provided by FirstRanker.com ---
c)The total deviations of the data above the trend line must be the same as
ine.
--- Content provided by FirstRanker.com ---
the vertical deviations below the l
d)
--- Content provided by FirstRanker.com ---
The sum of the squares ofthe trend should
the vertical deviations from
--- Content provided by FirstRanker.com ---
be as small as possible.
i)
--- Content provided by FirstRanker.com ---
(iSemi-average Method:
In this me
--- Content provided by FirstRanker.com ---
ual parts and averages are
thod, the original data is divided into two eq
--- Content provided by FirstRanker.com ---
calculated for both the parts. These averages are called se im -averages. For
example, we can divide the 10 years, 1993 to 2002 into two equal parts; from
--- Content provided by FirstRanker.com ---
1993 to 1997 and 1998 to 2002. If the period is odd number of years, the value
of the middle year is omitted.
--- Content provided by FirstRanker.com ---
We can draw the line by a straight line by joining the two points of
average. By extending the line downward
--- Content provided by FirstRanker.com ---
we can predict the future
or upward,
--- Content provided by FirstRanker.com ---
values.(iii)
--- Content provided by FirstRanker.com ---
Moving Average Method:
In the moving average method, the averag
--- Content provided by FirstRanker.com ---
for a number of periods ise value
considered and placed at the centre of the time-span. It is calculated from
--- Content provided by FirstRanker.com ---
overlapping groups of successive time series data. It simplifies the analysis and
--- Content provided by FirstRanker.com ---
removes periodic variations; and the influence of the fluctuations is also
reduced. The formula for calculating 3 yearly moving averages is:
--- Content provided by FirstRanker.com ---
a + b + c b + c + d c + d + e----------- , ---------- , ------------
3 3 3
--- Content provided by FirstRanker.com ---
Steps for calculating odd number of years (3, 5, 7, 9)
--- Content provided by FirstRanker.com ---
If we want to calculate the three-yearly moving average, then:(i)
--- Content provided by FirstRanker.com ---
Compute the value of first three years (1, 2, 3) and place the three year
total against the middle year
--- Content provided by FirstRanker.com ---
(ii)Leave the first year's value and a
values of the next three years
--- Content provided by FirstRanker.com ---
dd up the
and place the three-year total against the middle year.
--- Content provided by FirstRanker.com ---
(iii)this process must be continued until the last year's value is taken for
calculating moving average.
--- Content provided by FirstRanker.com ---
(iv)
the three-yearly total must be divided by 3 and placed in the next
--- Content provided by FirstRanker.com ---
column. This is the trend value of moving average.Even period of moving average:
--- Content provided by FirstRanker.com ---
If the period of moving average is 4,6,8, it is an even number. The four-yearly
total cannot be placed against any year as the median 2.5 is between the second
--- Content provided by FirstRanker.com ---
and the third year. So the total should be placed in between the 2nd and 3rdyears.
(iv)
--- Content provided by FirstRanker.com ---
Method of least square:
By the method of least square
--- Content provided by FirstRanker.com ---
t, a straigh
be
--- Content provided by FirstRanker.com ---
line trend can
tim
--- Content provided by FirstRanker.com ---
fitted to the givene
series data. With this method, economi
--- Content provided by FirstRanker.com ---
e series data can be
c and business tim
--- Content provided by FirstRanker.com ---
fitted and can derive the results for the forecasting and prtren
ediction. The
--- Content provided by FirstRanker.com ---
d
--- Content provided by FirstRanker.com ---
line is called the line o b
f
--- Content provided by FirstRanker.com ---
est fe
it. Th straight line
--- Content provided by FirstRanker.com ---
e
tr nd
--- Content provided by FirstRanker.com ---
greeor the first de
parabola is represented by the mathematical equation.
--- Content provided by FirstRanker.com ---
Y = a + bX
Where, Y = required trend value, X = unit of time
--- Content provided by FirstRanker.com ---
a and b are constantsthe value of
or constants can be calculated by the following two
--- Content provided by FirstRanker.com ---
the unknown
normal equation.
--- Content provided by FirstRanker.com ---
Y = Na + bXYX = a
X + bX2
--- Content provided by FirstRanker.com ---
Where, N = the number of period
By solving the above two equation obtain the parameters of a and b.
--- Content provided by FirstRanker.com ---
Illustration: Calculation of Trend Values by the Method of Least Square
Year Sales
--- Content provided by FirstRanker.com ---
Deviation from 1988
X2
--- Content provided by FirstRanker.com ---
Y X XY
2000 100
--- Content provided by FirstRanker.com ---
-2
-200
--- Content provided by FirstRanker.com ---
42001 110
-1
--- Content provided by FirstRanker.com ---
-110
1
--- Content provided by FirstRanker.com ---
2200
130
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
02003 140
--- Content provided by FirstRanker.com ---
+1
+140
--- Content provided by FirstRanker.com ---
12004 150
+2
--- Content provided by FirstRanker.com ---
+300
4
--- Content provided by FirstRanker.com ---
N = 5 Y = 630 X = 0 xy = 130 X2=10
--- Content provided by FirstRanker.com ---
Since
X = 0
--- Content provided by FirstRanker.com ---
Y 400
--- Content provided by FirstRanker.com ---
a = ----- = ------ = 80
--- Content provided by FirstRanker.com ---
N 5--- Content provided by FirstRanker.com ---
XY 52
b = -------- = ----- = 5.2
--- Content provided by FirstRanker.com ---
X2 10
--- Content provided by FirstRanker.com ---
Hence, Y = 126 + 13XThe for
= 165
--- Content provided by FirstRanker.com ---
ecasted value for 2005 is Y = 126 + 13(3)
--- Content provided by FirstRanker.com ---
Questions:1.
Define probability and explain various concepts of probability.
--- Content provided by FirstRanker.com ---
2.
State and explain the addition and multiplication theorem of probability
--- Content provided by FirstRanker.com ---
with an example.3.
Define Binomial distribution. Explain its properties.
--- Content provided by FirstRanker.com ---
4.
What are the properties of Poisson distribution?
--- Content provided by FirstRanker.com ---
5.What are the salient features of Normal distribution?
6.
--- Content provided by FirstRanker.com ---
Explain the utility of normal distribution in statistical analysis.
7.
--- Content provided by FirstRanker.com ---
Explain how Poisson, binomial and normal distribution are related.8.
Distinguish between null and alternative hypothesis.
--- Content provided by FirstRanker.com ---
9.
How will you conduct test pertaining to comparison between sample
--- Content provided by FirstRanker.com ---
mean and population mean?10.
What are the properties of 2 distribution?
--- Content provided by FirstRanker.com ---
11.
What are the uses of 2 test?
--- Content provided by FirstRanker.com ---
12.Define Index Number. Explain its uses.
13.
--- Content provided by FirstRanker.com ---
What are the steps involved in the construction of index number?
14.
--- Content provided by FirstRanker.com ---
Explain any four weighted index number.15.
What are the components of time series?
--- Content provided by FirstRanker.com ---
16.
What do you mean by time series? State its utility.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
17.The probability of defective needle is 0.3 in a box, find (a) the mean and
standard deviation for the distribution of defective needles in a total of 1000
--- Content provided by FirstRanker.com ---
box, and (b) the moment coefficient of skewness and kurtosis of the distribution.
--- Content provided by FirstRanker.com ---
18.The incidence of a certain disease is such that on the average 10% of
orkers suffer from it. If 10 workers ar
--- Content provided by FirstRanker.com ---
w
e selected at random, find the probability
--- Content provided by FirstRanker.com ---
that (i) Exactly 4 workers suffer from the disease, (ii) not more than 2 workerssuffer from the disease.
--- Content provided by FirstRanker.com ---
19.
Out of 1000 families with 4 children each, what percentage would be
--- Content provided by FirstRanker.com ---
expected to have (a) 2 boys and 2 girls, (b) at least one boy, (c) no girls, and (d)at the most 2 girls. Assume equal probabilities for boys and girls.
--- Content provided by FirstRanker.com ---
20.
m
--- Content provided by FirstRanker.com ---
Aultiple-choice tes consi
t
--- Content provided by FirstRanker.com ---
sts of 8 questions with 3 answers to each
question (of which only one is correct). A student answers each question by
--- Content provided by FirstRanker.com ---
lingrol
balan
--- Content provided by FirstRanker.com ---
a
die
--- Content provided by FirstRanker.com ---
cedch
and
--- Content provided by FirstRanker.com ---
ecking the first answer if he gets 1 or 2, the second
answer if he gets 3 or
--- Content provided by FirstRanker.com ---
the4 and
third answer if he gets 5 or 6. To get a
--- Content provided by FirstRanker.com ---
distinction, the student must secure at least 75% correct answers. If there is no
negativ m
--- Content provided by FirstRanker.com ---
e arking, what is the probability that the student secures distinctions?21.
--- Content provided by FirstRanker.com ---
One fifth per cent of the blades produced by a blade manufacturing
factory turn out to be defective. The blades are supplied in packets of 10. Use
--- Content provided by FirstRanker.com ---
Poisson distribution to calculate the approximate number of packets containingve, one defective and two
no defecti
--- Content provided by FirstRanker.com ---
defective blades respectively in a
consignment of 100,000 packets.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
22.It is known from past experience that in a certain plant there are on the
average 4 industrial accidents per month. Find the probability that in a given
--- Content provided by FirstRanker.com ---
year there will be less than 4 accidents. Assume Poisson distribution.
--- Content provided by FirstRanker.com ---
23. Calculate Laspeyre's, Paasche's, Bowley's, Fisher's, MarshallEdgeworth index number from
ing
--- Content provided by FirstRanker.com ---
the follow
data:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Base year
--- Content provided by FirstRanker.com ---
Current yearPrice
--- Content provided by FirstRanker.com ---
Value
Price
--- Content provided by FirstRanker.com ---
ValueA
--- Content provided by FirstRanker.com ---
6
50
--- Content provided by FirstRanker.com ---
6
--- Content provided by FirstRanker.com ---
75
B
--- Content provided by FirstRanker.com ---
8
90
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
80
--- Content provided by FirstRanker.com ---
C
--- Content provided by FirstRanker.com ---
1280
--- Content provided by FirstRanker.com ---
15
100
--- Content provided by FirstRanker.com ---
D5
--- Content provided by FirstRanker.com ---
20
--- Content provided by FirstRanker.com ---
830
--- Content provided by FirstRanker.com ---
E
--- Content provided by FirstRanker.com ---
1060
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
75--- Content provided by FirstRanker.com ---
24.
From the data given below about the treatment of 500 patients suffering
--- Content provided by FirstRanker.com ---
from a disease, state whether the new treatment is superior to the conventionaltrea
e
--- Content provided by FirstRanker.com ---
tm nt:
Treatment
--- Content provided by FirstRanker.com ---
No.of
Patients
--- Content provided by FirstRanker.com ---
Favourable
Not
--- Content provided by FirstRanker.com ---
favourable TotalNew
--- Content provided by FirstRanker.com ---
250
40 290
--- Content provided by FirstRanker.com ---
Conventional160 50 210
Total
--- Content provided by FirstRanker.com ---
410
--- Content provided by FirstRanker.com ---
90 500(Given for degrees of freedom = 1, chi-square 5 per cent = 3.84)
--- Content provided by FirstRanker.com ---
25.
300 digits are chosen at random from a set of tables. The frequencies of
--- Content provided by FirstRanker.com ---
the digits are as follows:t
--- Content provided by FirstRanker.com ---
Digi
--- Content provided by FirstRanker.com ---
0 13 4
2
--- Content provided by FirstRanker.com ---
5
6
--- Content provided by FirstRanker.com ---
78
9
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Frequency
--- Content provided by FirstRanker.com ---
28 29 36 31 20 3535
30
--- Content provided by FirstRanker.com ---
31
25
--- Content provided by FirstRanker.com ---
Use 2 test to assert the correctness of the hypothesis that the digits were
distributed in equal numbers in the tables from which they were chosen.
--- Content provided by FirstRanker.com ---
(Given for degrees of freedom = 1, chi-square 5 per cent = 3.84)
26.
--- Content provided by FirstRanker.com ---
The number of defects per unit in a samf 165 units of a
ple o
--- Content provided by FirstRanker.com ---
manufactured product was found as follows:
Number
--- Content provided by FirstRanker.com ---
ofdefects:
0 1 2 3 4
--- Content provided by FirstRanker.com ---
Number of units : 107
46
--- Content provided by FirstRanker.com ---
101
1
--- Content provided by FirstRanker.com ---
Fit a Poisson distribution to the data and test for goodness
27.
--- Content provided by FirstRanker.com ---
Assume the mean height of soldiers to be 68 inches with a variance of9 inches. How many solders in a regime
u expect are over
--- Content provided by FirstRanker.com ---
nt of 1,000 would yo
70 inches tall?
--- Content provided by FirstRanker.com ---
28.The weekly wages of 5,000 workmen are normally distributed around a
mean of Rs.70 and with a standard deviation of Rs. 5. Estimate the number of
--- Content provided by FirstRanker.com ---
workers whose weekly wages will be:
(a)
--- Content provided by FirstRanker.com ---
between Rs. 70 and Rs. 72,(b)
between Rs. 69 and Rs. 72,
--- Content provided by FirstRanker.com ---
(c)
more than Rs. 75,
--- Content provided by FirstRanker.com ---
(d)less than Rs. 63, and
(e)
--- Content provided by FirstRanker.com ---
more than Rs. 80.
--- Content provided by FirstRanker.com ---
29.In a distribution exactly normal, 7% of the items are under 35 and 89%
are under 63. What are the mean and standard deviation of the distribution?
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
30.
Find the mean and standard deviation of a normal distribution of marks
--- Content provided by FirstRanker.com ---
in an examination where 58 percent of the candidates obtained marks below 75,four per cent got above 80 and the rest between 75 and 80.
31.
--- Content provided by FirstRanker.com ---
A sample of 1600 male students is found to have a mean height of 170
cms. Can it be reasonably regarded as a sample from a la
--- Content provided by FirstRanker.com ---
on withrge populati
mean height 173 cms and standard deviation 3.50 cms.
--- Content provided by FirstRanker.com ---
32.
Fit a trend line to the following data by the free-hand method, semi-
--- Content provided by FirstRanker.com ---
average method and moving average method.Year 1995 1996 1997 1998 1999 2000 2001
Sales 65 95 85 115 110 120 130
--- Content provided by FirstRanker.com ---
33.
The following table gives the sterling assets of the R.B.I. in crores of
--- Content provided by FirstRanker.com ---
rupees:(a)
Represent the data graphically
--- Content provided by FirstRanker.com ---
(b)
Fit a straight line trend
--- Content provided by FirstRanker.com ---
(c)Show the trend on the graph
--- Content provided by FirstRanker.com ---
Year: 1996-97 1997-98 1998-99 1999-00 2000-01 2001-02
Assets: 83 92 71 90 169 191
--- Content provided by FirstRanker.com ---
Also estimate the figures for 1996-97.
--- Content provided by FirstRanker.com ---
***
--- Content provided by FirstRanker.com ---
UNIT ? IV
--- Content provided by FirstRanker.com ---
STATISTICAL APPLICATIONS
--- Content provided by FirstRanker.com ---
A BRIEF INTRODUCTION TO STATISTICAL APPLICATIONS
--- Content provided by FirstRanker.com ---
A manager in a business organization ? whether in the top level, or themiddle level, or the bottom level - has to perform an important role of decision
making. For solving any organizational problem ? which most of the times
--- Content provided by FirstRanker.com ---
happens to be complex in nature -, he has to identify a set of alternatives,
evaluate them and choose the best alternative. The experience, expertise,
--- Content provided by FirstRanker.com ---
rationality and wisdom gained by the manager over a period of time willdefinitely stand in good stead in the evaluation of the alternatives available at his
disposal. He has to consider several factors, sometimes singly and sometimes
--- Content provided by FirstRanker.com ---
jointly, during the process of decision making. He has to deal with the data of
not only his organization but also of other competing organizations.
--- Content provided by FirstRanker.com ---
It would be a challenging situation for a manager when he has to face so
many variables operating simultaneously, something internal and something
--- Content provided by FirstRanker.com ---
external. Among them, he has to identify the important variables or the
dominating factors and he should be able to distinguish one factor from the
--- Content provided by FirstRanker.com ---
other. He should be able to find which factors have similar characteristics andwhich factors stand apart. He should be able to know which factors have an inter
play with each other and which factors remain independent. It would be
--- Content provided by FirstRanker.com ---
advantageous to him to know whether there is any clear pattern followed by the
variables under consideration. At times he may be required to have a good idea
--- Content provided by FirstRanker.com ---
of the values that the variables would assume in future occasions. The task of amanager becomes all the more difficult in view of the risks and uncertainties
--- Content provided by FirstRanker.com ---
surrounding the future events. It is imperative on the part of a manager to
understand the impact of various policies and programmes on the development
--- Content provided by FirstRanker.com ---
of the organization as well as the environment. Also he should be able to
understand the impact of several of the environmental factors on his
--- Content provided by FirstRanker.com ---
organization. Sometimes a manager has to take a single stage decision and attimes he is called for to take a multistage decision on the basis of various factors
operating in a situation.
--- Content provided by FirstRanker.com ---
Statistical analysis is a tool for a manager in the process of decision
making by means of the data on hand. All managerial activities involve an
--- Content provided by FirstRanker.com ---
analysis of data. Statistical approach would enable a manager to have a scientificguess of the future events also. Statistical methods are systematic and built by
several experts on firmly established theories and consequently they would
--- Content provided by FirstRanker.com ---
enable a manager to overcome the uncertainties associated with future
occasions. However, statistical tools have their shortcomings too. The
--- Content provided by FirstRanker.com ---
limitations do not reflect on the subject. Rather they shall be traced to themethods of data collection and recording of data. Even with highly sophisticated
statistical methods, one may not arrive at valid conclusions if the data collected
--- Content provided by FirstRanker.com ---
are devoid of representative character.
In any practical problem, one has to see whether the assumptions are
--- Content provided by FirstRanker.com ---
reasonable or not, whether the data represents a wide spectrum, whether the datais adequate, whether all the conditions for the statistical tests have been fulfilled,
etc. If one takes care of these aspects, it would be possible to arrive at better
--- Content provided by FirstRanker.com ---
alternatives and more reliable solutions, thereby avoiding future shocks. While it
is true that a statistical analysis, by itself, cannot solve all the problems faced by
--- Content provided by FirstRanker.com ---
an organization, it will definitely enable a manager to comprehend the groundrealities of the situation. It will for sure provide a foresight in the identification
of the crucial variables and the key areas so that he can locate a set of possible
--- Content provided by FirstRanker.com ---
solutions within his ambit. A manager has to have a proper blend of the
--- Content provided by FirstRanker.com ---
statistical theories and practical wisdom and he shall always strive for a holistic
approach to solve any organizational problem. A manager has to provide some
--- Content provided by FirstRanker.com ---
safe-guarding measures against the limitations of the statistical tools. In theprocess he will be able to draw valid inferences thereby providing a clue as to
the direction in which the organization shall move in future. He will be ably
--- Content provided by FirstRanker.com ---
guided by the statistical results in the formulation of appropriate strategies for
the organization. Further, he can prepare the organization to face the possible
--- Content provided by FirstRanker.com ---
problems of business fluctuations in future and minimize the risks with the helpof the early warning signals indicated by the relevant statistical tools.
A marketing manager of a company or a manager in a service
--- Content provided by FirstRanker.com ---
organization will have occasions to come across the general public and
consumers with several social and psychological variables which are difficult to
--- Content provided by FirstRanker.com ---
be measured and quantified.Depending on the situation and the requirement, a manager may have to
deal with the data of just one variable (univariate data), or data on two variables
--- Content provided by FirstRanker.com ---
(bivariate data) or data concerning several simultaneous variables (multivariate
data).
--- Content provided by FirstRanker.com ---
The unit on hand addresses itself to the role of a manager as a decisionmaker with the help of data available with him. Different statistical techniques
which are suitable for different requirements are presented in this unit in a
--- Content provided by FirstRanker.com ---
simple style. A manager shall know the strengths and weaknesses of various
statistical tools. He shall know which statistical tool would be the most
--- Content provided by FirstRanker.com ---
appropriate in a particular context so that the organization will derive themaximum benefit out of it.
--- Content provided by FirstRanker.com ---
The interpretation of the results from statistical analysis occupies an
important place. Statistics is concerned with the aggregates and not just the
--- Content provided by FirstRanker.com ---
individual data items or isolated measurements of certain variables. Therefore--- Content provided by FirstRanker.com ---
the conclusions from a statistical study will be valid for a majority of the objectsand normal situations only. There are always extreme cases in any problem and
ealt with separately. St
--- Content provided by FirstRanker.com ---
they have to be d
atistical tools will enable a manager to
--- Content provided by FirstRanker.com ---
identify such outliers (abnormal cases or extreme variables) in a problem. Amanager has to evaluate the statistical inferences, interpret them in the proper
context and apply them in
--- Content provided by FirstRanker.com ---
ons.
appropriate situati
--- Content provided by FirstRanker.com ---
While in an actual research problem, one has to handle a large quantumof data, it is not possible to treat such voluminous data by a beginner in the
subject. Keeping this point in mi
--- Content provided by FirstRanker.com ---
, any
nd
--- Content provided by FirstRanker.com ---
numerical example in the present unit isbased on a few data items only. It would be worthwhile to the budding managers
to make a start in solving statistical problems by practicing the ones furnished in
--- Content provided by FirstRanker.com ---
this unit.
The candid
--- Content provided by FirstRanker.com ---
are suates
ggested to use hand calculators for solving
--- Content provided by FirstRanker.com ---
statistical problems. There will be frequent occasions to use Statistical Tables of
F-values furnished in this unit. The candidates are suggested to have with them a
--- Content provided by FirstRanker.com ---
copy of the tables for easy, ready reference. The books and articles listed underthe references may be consulted for further study or applications of statistical
techniques in relevant research areas.
--- Content provided by FirstRanker.com ---
UNIT IV
--- Content provided by FirstRanker.com ---
RRE
1. CO
--- Content provided by FirstRanker.com ---
LATION AND REGRESSION ANALYSISLesson Outline
?
--- Content provided by FirstRanker.com ---
The concept of correlation
?
--- Content provided by FirstRanker.com ---
Determination of simple correlation coefficient?
Properties of correlation coefficient
--- Content provided by FirstRanker.com ---
?
The concept of rank correlation
--- Content provided by FirstRanker.com ---
?Determination of rank correlation coefficient
?
--- Content provided by FirstRanker.com ---
The concept of regression
?
--- Content provided by FirstRanker.com ---
The principle of least squares?
Normal equations
--- Content provided by FirstRanker.com ---
?
Determination of regression equations
--- Content provided by FirstRanker.com ---
Learning ObjectivesAfter reading this lesson you should be able to
-
--- Content provided by FirstRanker.com ---
understand the concept of correlation
-
--- Content provided by FirstRanker.com ---
calculate simple correlation coefficient-
understand the properties of correlation coefficient
--- Content provided by FirstRanker.com ---
-
understand the concept of rank correlation
--- Content provided by FirstRanker.com ---
-calculate rank correlation coefficient
-
--- Content provided by FirstRanker.com ---
resolve ties in ranks
- understand the concept of regression
--- Content provided by FirstRanker.com ---
- determine regression equations- understand the managerial applications of correlation and regression
--- Content provided by FirstRanker.com ---
SIMPLE COR E
R LATION
--- Content provided by FirstRanker.com ---
orrelation
C
--- Content provided by FirstRanker.com ---
Correlation means the average relationship between two or morevariables. When changes in the values of a variable affect the values of another
ariab
--- Content provided by FirstRanker.com ---
v
le, we say that there is a correlation between the two variables. The two
--- Content provided by FirstRanker.com ---
variables may move in the same direction or in opposite directions. Simplybecause of the presence of correlation
le
--- Content provided by FirstRanker.com ---
between two variab s, we cannot jump to
the conclusion that there is a cause-effect relationship between them.
--- Content provided by FirstRanker.com ---
Sometimes, it may be due to chance also.Simple correlation
We say that the correlation is simple if the comparison involves two variables
--- Content provided by FirstRanker.com ---
only.
S OF CORRELATION
--- Content provided by FirstRanker.com ---
TYPEPositive correlation
--- Content provided by FirstRanker.com ---
If two variables x and y move in the same direction, we say that there is a
positive correlation between them. In this case, when the value of one variable
--- Content provided by FirstRanker.com ---
increases, the value of the other variable also increases and when the value ofone variable decreases, the value of the other variable also decreases. Eg. The
age and height of a child.
--- Content provided by FirstRanker.com ---
Negative correlation
If two variables x and y move in opposite directions, we say that there is a
--- Content provided by FirstRanker.com ---
negative correlation between them. i.e., when the value of one variableincreases, the value of the other variable decreases and vice versa. Eg. The price
and demand of a normal good.
--- Content provided by FirstRanker.com ---
The following diagrams illustrate positive and negative correlations
--- Content provided by FirstRanker.com ---
between x and y.--- Content provided by FirstRanker.com ---
y
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
y
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
x
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Positive Correlation
--- Content provided by FirstRanker.com ---
Negative Correlation
--- Content provided by FirstRanker.com ---
Perfect positive correlationIf changes in two vari
les a
--- Content provided by FirstRanker.com ---
ab
re in the same direction and the changes are
--- Content provided by FirstRanker.com ---
in equal proportion, we say that there is a perfect positive correlation betweenthem.
Perfect negative correlation
--- Content provided by FirstRanker.com ---
If changes in two variables are in opposite directions and the absolute
values of changes are in equal proportion, we say that there is a perfect negative
--- Content provided by FirstRanker.com ---
correlation between them.y
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
y
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
x
--- Content provided by FirstRanker.com ---
x
Perfect Positive Correlation
--- Content provided by FirstRanker.com ---
Perfect Negative Correlation
--- Content provided by FirstRanker.com ---
Zero correlation
--- Content provided by FirstRanker.com ---
If there is no relationship between the two variables, then the variables aresaid to be independent. In this case the correlation between the two variables is
zero.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
y--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
x
--- Content provided by FirstRanker.com ---
Zero correlation
--- Content provided by FirstRanker.com ---
relationLinear cor
--- Content provided by FirstRanker.com ---
If the quantum of change in one variable always bears a constant ratio to
the quantum of change in the other variable, we say that the two variables have a
--- Content provided by FirstRanker.com ---
linear correlation between them.Coefficient of correlation
The coefficient of correlation between two variables X, Y is a measure of the
--- Content provided by FirstRanker.com ---
degree of association (i.e., strength of relationship) between them. The
coefficient of correlation is usually denoted by `r'.
--- Content provided by FirstRanker.com ---
Karl Pearson's Coefficient of Simple Correlation:Let N denote t
ber of pairs of obs
--- Content provided by FirstRanker.com ---
he num
ervations of two variables X and Y.
--- Content provided by FirstRanker.com ---
The correlation coefficient r between X and Y is defined byN XY - ( X ) (Y )
r =
--- Content provided by FirstRanker.com ---
N X -( X )2 N Y -(Y )2
--- Content provided by FirstRanker.com ---
22
This formula is suitable for solving problems with hand calculators. To apply
--- Content provided by FirstRanker.com ---
Y,
2
--- Content provided by FirstRanker.com ---
2this formula, we have to calculate X,
XY, X , Y .
--- Content provided by FirstRanker.com ---
Properties of Correlation Coefficient
--- Content provided by FirstRanker.com ---
Let r denote the correlation coefficient between two variables. r is interpreted
--- Content provided by FirstRanker.com ---
using the following properties:1.
The value of r ranges from ?
--- Content provided by FirstRanker.com ---
o 0.0 or fr
1.0 t
--- Content provided by FirstRanker.com ---
om 0.0 to 1.02.
A value of r = 1.0 indicates
--- Content provided by FirstRanker.com ---
there exis
that
--- Content provided by FirstRanker.com ---
rfect posits pe
tive correlation
--- Content provided by FirstRanker.com ---
between the two variables.
3.
--- Content provided by FirstRanker.com ---
A value of r = - 1.0 indicates that there exists perfect negative correlationbetween the two variables.
4.
--- Content provided by FirstRanker.com ---
A value r = 0.0 indicates zero correlation i.e., It shows that there is no
correlation at all between the two variables.
--- Content provided by FirstRanker.com ---
5.A positive value of r shows a positive correlation between the two
variables.
--- Content provided by FirstRanker.com ---
6.
A negative value of r shows a negative correlation between the two
--- Content provided by FirstRanker.com ---
variables.7.
A value of r = 0.9 and above indicates a very high degree of positive
--- Content provided by FirstRanker.com ---
correlation between the two variables.
8.
--- Content provided by FirstRanker.com ---
A value of - 0.9 r > - 1.0 shows a very high degree of negativee
correlation b tween the two variables.
--- Content provided by FirstRanker.com ---
9.
n
--- Content provided by FirstRanker.com ---
For a reaso ably high degree of positive correlation, we require r to befrom 0.75 to 1.0.
10.
--- Content provided by FirstRanker.com ---
A value of r from 0.6 to 0.75 may be taken as a moderate degree of
positive correlation.
--- Content provided by FirstRanker.com ---
Problem 1
The following are data on Advertising Expenditure (in Rupees Thousand) and
--- Content provided by FirstRanker.com ---
Sales (Rupees In lakhs) in a com any.
p
--- Content provided by FirstRanker.com ---
AdvertisingExpenditure :
18
--- Content provided by FirstRanker.com ---
19 20 21 22 23
Sales : 17 17
--- Content provided by FirstRanker.com ---
18 19 19 19--- Content provided by FirstRanker.com ---
Determine the correlation coefficient between them and interpret the result.Solution: We have N = 6. Calculate X, Y, XY, X2, Y2 as follows:
X
--- Content provided by FirstRanker.com ---
Y
XY
--- Content provided by FirstRanker.com ---
2X
2
--- Content provided by FirstRanker.com ---
Y
18
--- Content provided by FirstRanker.com ---
17306
324
--- Content provided by FirstRanker.com ---
289
19
--- Content provided by FirstRanker.com ---
17323
361
--- Content provided by FirstRanker.com ---
289
20
--- Content provided by FirstRanker.com ---
18360
400
--- Content provided by FirstRanker.com ---
324
21
--- Content provided by FirstRanker.com ---
19399
441
--- Content provided by FirstRanker.com ---
361
22
--- Content provided by FirstRanker.com ---
19418
484
--- Content provided by FirstRanker.com ---
361
23
--- Content provided by FirstRanker.com ---
19437
529
--- Content provided by FirstRanker.com ---
361
Total
--- Content provided by FirstRanker.com ---
:123109 2243 2539 1985
--- Content provided by FirstRanker.com ---
The correlation coefficient r bet
--- Content provided by FirstRanker.com ---
ween the two variables is calculated as of llows:
N XY - ( X )(Y )
--- Content provided by FirstRanker.com ---
r =
--- Content provided by FirstRanker.com ---
2N X - ( X )2
2
--- Content provided by FirstRanker.com ---
2
N Y - (Y )
--- Content provided by FirstRanker.com ---
6 ? 2243 -123?109r =
--- Content provided by FirstRanker.com ---
6 ? 2539 - (123)2 6?1985 - (109)2
= (13458 ? 13407) / {(15234- 15129) (11910- 11881)}
--- Content provided by FirstRanker.com ---
=51/{105 29} = 51/ (10.247 X 5.365) = 51/ 54.975 = 0.9277Interpretation
--- Content provided by FirstRanker.com ---
The value of r is 0.92. It shows that there is a high, positive correlation
between the two variables `Advertising Expenditure' and `Sales'. This
--- Content provided by FirstRanker.com ---
provides a basis to consider some functional relationship between them.Problem 2
--- Content provided by FirstRanker.com ---
Consider the following data on two variables X and Y.
X
--- Content provided by FirstRanker.com ---
: 1214
18 23
--- Content provided by FirstRanker.com ---
24
27
--- Content provided by FirstRanker.com ---
Y : 18 13
--- Content provided by FirstRanker.com ---
12 30 25 10Determine the correlation coefficient between the two variables and interpret the
result.
--- Content provided by FirstRanker.com ---
Solution:
--- Content provided by FirstRanker.com ---
eW have N = 6. Calculate X, Y, XY, X2, Y2 as follows:
X
--- Content provided by FirstRanker.com ---
Y
XY
--- Content provided by FirstRanker.com ---
2X
2
--- Content provided by FirstRanker.com ---
Y
12
--- Content provided by FirstRanker.com ---
18216
144
--- Content provided by FirstRanker.com ---
324
14
--- Content provided by FirstRanker.com ---
13182
6
--- Content provided by FirstRanker.com ---
19
--- Content provided by FirstRanker.com ---
16918
12
--- Content provided by FirstRanker.com ---
216
--- Content provided by FirstRanker.com ---
324
--- Content provided by FirstRanker.com ---
144
23
--- Content provided by FirstRanker.com ---
30690
--- Content provided by FirstRanker.com ---
529
--- Content provided by FirstRanker.com ---
900
24
--- Content provided by FirstRanker.com ---
25
--- Content provided by FirstRanker.com ---
600576
--- Content provided by FirstRanker.com ---
625
--- Content provided by FirstRanker.com ---
2710
--- Content provided by FirstRanker.com ---
270
--- Content provided by FirstRanker.com ---
729100
--- Content provided by FirstRanker.com ---
Total : 118 108
2174
--- Content provided by FirstRanker.com ---
24 89
2262
--- Content provided by FirstRanker.com ---
The correlation coefficient between the two variables is r =
--- Content provided by FirstRanker.com ---
{6 X 2174 ? (118 X 108)} / { (6 X 249 -8 1182) (
2262
--- Content provided by FirstRanker.com ---
6 X
- 1082) }
--- Content provided by FirstRanker.com ---
= (13044 ? 12744) / {(14988- 13924) (13572- 11664)}=300 / {1064 1908} = 300 / (32.62 X 43.68) = 300 / 1424.84 = 0.2105
re
--- Content provided by FirstRanker.com ---
Interp tation
--- Content provided by FirstRanker.com ---
The value of r is 0.21. Even though it is positive, the value of r is very less.Hence we conclude that there is no correlation between the two variables X
and Y. Consequently we cannot construct any functional relational
--- Content provided by FirstRanker.com ---
relationship between them.
--- Content provided by FirstRanker.com ---
Problem 3Consider the following data on supply and price. Determine the correlation
coefficient between the two variables and interpret the result.
--- Content provided by FirstRanker.com ---
Supply : 11 13 17 18 22 24 26 28
--- Content provided by FirstRanker.com ---
Price : 25 32 26 25 20 17 11 10
Determine the correlation coefficient between the two variables and interpret the
--- Content provided by FirstRanker.com ---
result.Solution:
--- Content provided by FirstRanker.com ---
We have N = 8. Take X = Supply and Y = Price.
Calculate X, Y, XY, X2, Y2 as follows:
--- Content provided by FirstRanker.com ---
XY
--- Content provided by FirstRanker.com ---
XY
X2
--- Content provided by FirstRanker.com ---
Y211
25
--- Content provided by FirstRanker.com ---
275
121
--- Content provided by FirstRanker.com ---
62513
32
--- Content provided by FirstRanker.com ---
416
169
--- Content provided by FirstRanker.com ---
102417
26
--- Content provided by FirstRanker.com ---
442
289
--- Content provided by FirstRanker.com ---
67618
25
--- Content provided by FirstRanker.com ---
450
324
--- Content provided by FirstRanker.com ---
62522
20
--- Content provided by FirstRanker.com ---
440
484
--- Content provided by FirstRanker.com ---
40024
17
--- Content provided by FirstRanker.com ---
408
576
--- Content provided by FirstRanker.com ---
28926
11
--- Content provided by FirstRanker.com ---
286
676
--- Content provided by FirstRanker.com ---
12128
10
--- Content provided by FirstRanker.com ---
280
784
--- Content provided by FirstRanker.com ---
100Total: 159
166
--- Content provided by FirstRanker.com ---
2997
3423
--- Content provided by FirstRanker.com ---
3860--- Content provided by FirstRanker.com ---
The correlation coefficient between the two variables is r =
{8 X 2997 ? (159 X 166)} / { (8 X 3423 - 1592) (8 X 3860 - 1662) }
--- Content provided by FirstRanker.com ---
= (23976 ? 26394) / {84- 2
(273
--- Content provided by FirstRanker.com ---
5281) (30880- 27566)}
= - 2418 / {2103 3
--- Content provided by FirstRanker.com ---
- 24314} =
86 X 5
--- Content provided by FirstRanker.com ---
18 / (45.
7.57)
--- Content provided by FirstRanker.com ---
= - 2418 / 2640.16 = - 0.9159Interpretation
--- Content provided by FirstRanker.com ---
The value of r is - 0.92. The negative sign in r shows that the two variables
--- Content provided by FirstRanker.com ---
move in opposite directions. The absolute value of r is 0.92 which is very
high. Therefore we conclude that there is high negative correlation between
--- Content provided by FirstRanker.com ---
the two variables `Supply' and `Price'.Problem 4
--- Content provided by FirstRanker.com ---
Consider the following data on income and savings in Rs. thousand.
Income : 50 51 52 55 56 58 60 62 65 66
--- Content provided by FirstRanker.com ---
Savings : 10 11 13 14 15 15 16 16 17 17Determine the correlation coefficient between the two variables and interpret the
result.
--- Content provided by FirstRanker.com ---
Solution:
--- Content provided by FirstRanker.com ---
We have N = 10. Take X = Income and Y = Savings.Calculate X, Y, XY, X2, Y2 as follows:
X
--- Content provided by FirstRanker.com ---
Y
--- Content provided by FirstRanker.com ---
2XY
X2
--- Content provided by FirstRanker.com ---
Y
50
--- Content provided by FirstRanker.com ---
10500
2500
--- Content provided by FirstRanker.com ---
100
--- Content provided by FirstRanker.com ---
5111
561
--- Content provided by FirstRanker.com ---
2601
121
--- Content provided by FirstRanker.com ---
5213
676
--- Content provided by FirstRanker.com ---
2704
169
--- Content provided by FirstRanker.com ---
5514
770
--- Content provided by FirstRanker.com ---
3025
196
--- Content provided by FirstRanker.com ---
5615
840
--- Content provided by FirstRanker.com ---
3136
225
--- Content provided by FirstRanker.com ---
5815
870
--- Content provided by FirstRanker.com ---
3364
225
--- Content provided by FirstRanker.com ---
6016
960
--- Content provided by FirstRanker.com ---
3600
256
--- Content provided by FirstRanker.com ---
6216
992
--- Content provided by FirstRanker.com ---
3844
256
--- Content provided by FirstRanker.com ---
6517
1105
--- Content provided by FirstRanker.com ---
4225
289
--- Content provided by FirstRanker.com ---
6617
1122
--- Content provided by FirstRanker.com ---
4356
289
--- Content provided by FirstRanker.com ---
Total: 575144
8396
--- Content provided by FirstRanker.com ---
33355
2126
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The correlation coefficient between the two variables is r ={10 X 8396 ? (575 X 144)} / {(10 X 33355 - 5752) (10 X 2126 - 1442)}
= (83960 ? 82800) / {(333550- 3 0
--- Content provided by FirstRanker.com ---
3 625) 2
( 126 -
--- Content provided by FirstRanker.com ---
0 20 37 6)
}
--- Content provided by FirstRanker.com ---
= 1160 / {2925 524} = 1160 / (54.08 X 22.89)
= 1160 / 1237.89 = 0.9371
--- Content provided by FirstRanker.com ---
Interpretation
The value of r is 0.93. The positive sign in r shows that the two variables
--- Content provided by FirstRanker.com ---
move in the same direction. The value of r is very high. Th
e
--- Content provided by FirstRanker.com ---
erefor weconclude that there is high positive correlation between the two variables
`Income' and `Savings'. As a result, e
--- Content provided by FirstRanker.com ---
w can construct a functional
relationshi betw
--- Content provided by FirstRanker.com ---
peen them.
--- Content provided by FirstRanker.com ---
RANK CORRELATION
Spearman's Rank Correlation Coefficient
--- Content provided by FirstRanker.com ---
If ranks can be assigned to pairs of observations for two variables X and Y, thenthe corre
e ranks is
--- Content provided by FirstRanker.com ---
lation between th
called the rank correlation coefficient. It
--- Content provided by FirstRanker.com ---
is usually denoted by the symbol (rho). It is given by the formula2
6 D
--- Content provided by FirstRanker.com ---
= 1-
--- Content provided by FirstRanker.com ---
3N - N
where D = i
--- Content provided by FirstRanker.com ---
d fference between t e
h corresponding rank
--- Content provided by FirstRanker.com ---
s of X and Y--- Content provided by FirstRanker.com ---
= R - R
X
--- Content provided by FirstRanker.com ---
Yand N is the total number of pairs of observations of X and Y.
--- Content provided by FirstRanker.com ---
Problem 5
Alpha Recruiting Agency short listed 10 candidates for final selection. They
--- Content provided by FirstRanker.com ---
were examined int
wri ten and oral communication skills. They were ranked as
--- Content provided by FirstRanker.com ---
follows:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Candidate's Serial No.1 2
3
--- Content provided by FirstRanker.com ---
4 5
6
--- Content provided by FirstRanker.com ---
78
9
--- Content provided by FirstRanker.com ---
10
Rank in written
--- Content provided by FirstRanker.com ---
8 72
10 3
--- Content provided by FirstRanker.com ---
5
1
--- Content provided by FirstRanker.com ---
96
4
--- Content provided by FirstRanker.com ---
communication
Rank in oral communication 10 7
--- Content provided by FirstRanker.com ---
26 5
4
--- Content provided by FirstRanker.com ---
1
9
--- Content provided by FirstRanker.com ---
83
--- Content provided by FirstRanker.com ---
Find out whether there is any correlation between th
w
--- Content provided by FirstRanker.com ---
e rn
itte
--- Content provided by FirstRanker.com ---
d
an
--- Content provided by FirstRanker.com ---
lora
communication skills of the o
--- Content provided by FirstRanker.com ---
sh
ist
--- Content provided by FirstRanker.com ---
rt la
ed c
--- Content provided by FirstRanker.com ---
a
ndid tes.
--- Content provided by FirstRanker.com ---
Solution:
Take X = Wr
--- Content provided by FirstRanker.com ---
n sk
itten communicatio
--- Content provided by FirstRanker.com ---
ill and Y = Oral communication skill.ANK OF X: R
--- Content provided by FirstRanker.com ---
R
1
--- Content provided by FirstRanker.com ---
RANK OF Y: R2D=R1- R2
D2
--- Content provided by FirstRanker.com ---
8
10
--- Content provided by FirstRanker.com ---
- 24
7
--- Content provided by FirstRanker.com ---
7
0
--- Content provided by FirstRanker.com ---
02
2
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
106
--- Content provided by FirstRanker.com ---
4
16
--- Content provided by FirstRanker.com ---
35
- 2
--- Content provided by FirstRanker.com ---
4
5
--- Content provided by FirstRanker.com ---
4
1
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
1
0
--- Content provided by FirstRanker.com ---
0
9
--- Content provided by FirstRanker.com ---
9
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
68
- 2
--- Content provided by FirstRanker.com ---
4
4
--- Content provided by FirstRanker.com ---
3
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
Total: 30We hav
= 10. The rank c
--- Content provided by FirstRanker.com ---
e N
orrelation coefficient is
--- Content provided by FirstRanker.com ---
= 1 - {6 D2/ (N3 ? N)} = 1 ? {6 x 30 / (1000 ? 10)} = 1 ? (180 / 990)
= 1 ? 0.18 = 0.82
--- Content provided by FirstRanker.com ---
Inference:
--- Content provided by FirstRanker.com ---
From the value of r, it is inferred that there is a high, positive rank correlation
between the written and oral communication skills of the short listed candidates.
--- Content provided by FirstRanker.com ---
Problem 6The following are the ranks obtained by 10 workers in ABC Company on the
basis of their length of service and efficiency.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Ranking as per service 12
3
--- Content provided by FirstRanker.com ---
4
5
--- Content provided by FirstRanker.com ---
6 78
9
--- Content provided by FirstRanker.com ---
10
Rank as per efficiency
--- Content provided by FirstRanker.com ---
23
6
--- Content provided by FirstRanker.com ---
5
1
--- Content provided by FirstRanker.com ---
10 79
8
--- Content provided by FirstRanker.com ---
4
--- Content provided by FirstRanker.com ---
Find out whether there is any correlation between the ranks obtained by theworkers as per the two criteria.
--- Content provided by FirstRanker.com ---
Solution:
Take X = Length of service and Y = Efficiency.
--- Content provided by FirstRanker.com ---
Rank of X: R1
Rank of Y: R2
--- Content provided by FirstRanker.com ---
D= R1- R2
D2
--- Content provided by FirstRanker.com ---
12
- 1
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
3- 1
1
--- Content provided by FirstRanker.com ---
3
6
--- Content provided by FirstRanker.com ---
- 39
4
--- Content provided by FirstRanker.com ---
5
- 1
--- Content provided by FirstRanker.com ---
15
1
--- Content provided by FirstRanker.com ---
4
16
--- Content provided by FirstRanker.com ---
610
- 4
--- Content provided by FirstRanker.com ---
16
7
--- Content provided by FirstRanker.com ---
70
0
--- Content provided by FirstRanker.com ---
8
9
--- Content provided by FirstRanker.com ---
- 11
9
--- Content provided by FirstRanker.com ---
8
1
--- Content provided by FirstRanker.com ---
1
10
--- Content provided by FirstRanker.com ---
4
6
--- Content provided by FirstRanker.com ---
36
--- Content provided by FirstRanker.com ---
Total
--- Content provided by FirstRanker.com ---
82
We have N = 10. The rank correlation coefficient is
--- Content provided by FirstRanker.com ---
= 1 - {6 D2 / (N3 ? N)} = 1 ? { 6 x 82 / (1000 ? 10) } = 1 ? (492 / 990)= 1
.497 =
--- Content provided by FirstRanker.com ---
? 0
503
--- Content provided by FirstRanker.com ---
0.--- Content provided by FirstRanker.com ---
Inference:
The rank correlation coefficient is not high.
--- Content provided by FirstRanker.com ---
Pro
(C
--- Content provided by FirstRanker.com ---
blem 7
onversion of s
--- Content provided by FirstRanker.com ---
res into ranksco
)
--- Content provided by FirstRanker.com ---
Calculate the rank correlation to determine the relationship between equity
shares and preference shares given by the fol
--- Content provided by FirstRanker.com ---
ing data onlow
their price.
--- Content provided by FirstRanker.com ---
Equity share
--- Content provided by FirstRanker.com ---
90.0 92.4 98.5 98.395.4 91.3 98.0
92.0
--- Content provided by FirstRanker.com ---
Preference share
76.0 74.2 75.0 77.4
--- Content provided by FirstRanker.com ---
78.3 78.8 73.276.5
--- Content provided by FirstRanker.com ---
Solution:
From the given data on share price, we have to find out the ranks for equity
--- Content provided by FirstRanker.com ---
shares and prares.
eference sh
--- Content provided by FirstRanker.com ---
Step 1. First, consider the equity shares and arrange them in descending order
--- Content provided by FirstRanker.com ---
of their price as 1,2,...,8. We have the following ranks:Equity share
98.5
--- Content provided by FirstRanker.com ---
98.3 98.0
95.4 92.4
--- Content provided by FirstRanker.com ---
92.091.3 90.0
Rank
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
34 5
6
--- Content provided by FirstRanker.com ---
7
8
--- Content provided by FirstRanker.com ---
Step 2. Next, take the preference shares and arrange them in descending order
of their price as 1,2,...,8. We obtain the following ranks:
--- Content provided by FirstRanker.com ---
Preference share
78.8
--- Content provided by FirstRanker.com ---
78.3 77.4 76.576.0 75.0
74.2
--- Content provided by FirstRanker.com ---
73.2
--- Content provided by FirstRanker.com ---
Rank
1
--- Content provided by FirstRanker.com ---
23
4
--- Content provided by FirstRanker.com ---
5
6
--- Content provided by FirstRanker.com ---
78
--- Content provided by FirstRanker.com ---
Step 3. Calculation of D2:
Fit the given data with the correct rank. Take X = Equity share and Y =
--- Content provided by FirstRanker.com ---
hPreference s are. We have the following table:
X
--- Content provided by FirstRanker.com ---
Y
Rank of X: R1
--- Content provided by FirstRanker.com ---
Rank of Y: R2D=R1- R2
D2
--- Content provided by FirstRanker.com ---
90.0 76.0
8
--- Content provided by FirstRanker.com ---
53
9
--- Content provided by FirstRanker.com ---
92.4 74.2
5
--- Content provided by FirstRanker.com ---
7-
2
--- Content provided by FirstRanker.com ---
4
98.5 75.0
--- Content provided by FirstRanker.com ---
16
-
--- Content provided by FirstRanker.com ---
5
25
--- Content provided by FirstRanker.com ---
98.3 77.42
3
--- Content provided by FirstRanker.com ---
-
1
--- Content provided by FirstRanker.com ---
195.4 78.3
4
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
491.3
78.8 7
--- Content provided by FirstRanker.com ---
1
6 36
--- Content provided by FirstRanker.com ---
98.0 73.23
8
--- Content provided by FirstRanker.com ---
-
5
--- Content provided by FirstRanker.com ---
2592.0
76.5
--- Content provided by FirstRanker.com ---
6
--- Content provided by FirstRanker.com ---
42
4
--- Content provided by FirstRanker.com ---
Total 108
--- Content provided by FirstRanker.com ---
Step 4. Calculationof :
e have
--- Content provided by FirstRanker.com ---
W
8. The
--- Content provided by FirstRanker.com ---
N =correlatio
rank
--- Content provided by FirstRanker.com ---
oefficient i
n c
--- Content provided by FirstRanker.com ---
s= 1 - { 6 D2
/ (N3 ? N)} = 1 ? { 6 x 108 / (512 ? 8) } = 1 ? (648 / 504)
--- Content provided by FirstRanker.com ---
= 1 ? 1.29 = - 0.29
--- Content provided by FirstRanker.com ---
Inference:From the value of , it is inferred that the equity shares and preference shares
under consideration are negatively correlate
--- Content provided by FirstRanker.com ---
d. However, the absolute value of
--- Content provided by FirstRanker.com ---
is 0.29 which is not even moderate.--- Content provided by FirstRanker.com ---
Problem 8
Three managers evaluate the performance
--- Content provided by FirstRanker.com ---
an organization
of 10 sales persons in
--- Content provided by FirstRanker.com ---
and award ranks to them as follows:Sales Person
1 2
--- Content provided by FirstRanker.com ---
3
4 5
--- Content provided by FirstRanker.com ---
67
8
--- Content provided by FirstRanker.com ---
9
10
--- Content provided by FirstRanker.com ---
Rank awarded by Manager I8 7
6
--- Content provided by FirstRanker.com ---
1 5
9
--- Content provided by FirstRanker.com ---
102
3
--- Content provided by FirstRanker.com ---
4
Rank awarded by Manager II
--- Content provided by FirstRanker.com ---
7 84
6 5
--- Content provided by FirstRanker.com ---
10
9
--- Content provided by FirstRanker.com ---
32
1
--- Content provided by FirstRanker.com ---
Rank awarded by
4 5
--- Content provided by FirstRanker.com ---
18 9
10
--- Content provided by FirstRanker.com ---
6
7
--- Content provided by FirstRanker.com ---
32
Manager III
--- Content provided by FirstRanker.com ---
Determine which two managers have the nearest approach in the evaluation of
the performance of the sales persons.
--- Content provided by FirstRanker.com ---
Solution:
Sales
--- Content provided by FirstRanker.com ---
Manager I Manager II
Manager III
--- Content provided by FirstRanker.com ---
(R1- R2) 2(R1 -R3) 2
(R2-R3) 2
--- Content provided by FirstRanker.com ---
Rank: R1
Rank: R
--- Content provided by FirstRanker.com ---
Person2
Rank: R3
--- Content provided by FirstRanker.com ---
1
8
--- Content provided by FirstRanker.com ---
74
1 16 9
--- Content provided by FirstRanker.com ---
2
7
--- Content provided by FirstRanker.com ---
85
1 4 9
--- Content provided by FirstRanker.com ---
3
6
--- Content provided by FirstRanker.com ---
41
4 25 9
--- Content provided by FirstRanker.com ---
4
1
--- Content provided by FirstRanker.com ---
68
25 49 4
--- Content provided by FirstRanker.com ---
5
5
--- Content provided by FirstRanker.com ---
59
0 16 16
--- Content provided by FirstRanker.com ---
6
9
--- Content provided by FirstRanker.com ---
1010
1 1 0
--- Content provided by FirstRanker.com ---
7
10
--- Content provided by FirstRanker.com ---
96
1 16 9
--- Content provided by FirstRanker.com ---
8
2
--- Content provided by FirstRanker.com ---
37
1 25 16
--- Content provided by FirstRanker.com ---
9
3
--- Content provided by FirstRanker.com ---
23
1 0 1
--- Content provided by FirstRanker.com ---
10
4
--- Content provided by FirstRanker.com ---
12
9 4 1
--- Content provided by FirstRanker.com ---
Total
44
--- Content provided by FirstRanker.com ---
156 74We have N = 10. The rank correlation coefficient between mangers I and II is
--- Content provided by FirstRanker.com ---
= 1 - { 6 D2 / (N3 ? N)} = 1 ? { 6 x 44 / (1000 ? 10) } = 1 ? (264 / 990)
--- Content provided by FirstRanker.com ---
= 1 ? 0.27 = 0.73The rank correlation coefficient between mangers I and III is
1 ? { 6 x 156 / (1000 ? 10) } = 1 ? (936 / 990) = 1 ? 0.95 = 0.05
--- Content provided by FirstRanker.com ---
The rank correlation coefficient between mangers II and III is
1 ? { 6 x 74 / (1000 ? 10) } = 1 ? (444 / 990) = 1 ? 0.44 = 0.56
--- Content provided by FirstRanker.com ---
Inference:
Comparing the 3 values of , it is inferred that Mangers I and II have the
--- Content provided by FirstRanker.com ---
nearest approach in the evaluation of the performance of the sales persons.
--- Content provided by FirstRanker.com ---
Repeated values: Resolving ties in ranks
--- Content provided by FirstRanker.com ---
When ranks are awarded to candidates, it is possible that certain
candidates obtain equal ranks. For example, if two or three, or four candidates
--- Content provided by FirstRanker.com ---
secure equal ranks, a procedure that can be followed to resolve the ties isdescribed below.
We follow the Average Rank Method. If there are n items, arrange
--- Content provided by FirstRanker.com ---
them in ascending order or descending order and give ranks 1, 2, 3, ..., n. Then
look at those items which have equal values. For such items, take the average
--- Content provided by FirstRanker.com ---
ranks.If there are two items with equal values, their ranks will be two
consecutive integers, say s and s + 1. Their average is { s + (s+1)} / 2. Assign
--- Content provided by FirstRanker.com ---
this rank to both items. Note that we allow ranks to be fractions also.
--- Content provided by FirstRanker.com ---
If there are three items with equal values, their ranks wre
ill be th e
--- Content provided by FirstRanker.com ---
consecutive integers, say s, s + 1 and s + 2. Their average is { s + (s+1) +
--- Content provided by FirstRanker.com ---
(s+2) } / 3 = (3s + 3) / 3 = s + 1. Assign this rank to all the three items. A similar
procedure is followed if four or more number of items has equal values.
--- Content provided by FirstRanker.com ---
Correction term for when ranks are tied
--- Content provided by FirstRanker.com ---
Consider the formula for rank correlation coefficient. We have
--- Content provided by FirstRanker.com ---
2= 1- 6 D
3
--- Content provided by FirstRanker.com ---
N - N
--- Content provided by FirstRanker.com ---
If there is a tie involving m items, we have to add3
m - m
--- Content provided by FirstRanker.com ---
12
to the term D2 in . We have to add as many terms like (m3 ? m) / 12 as there
--- Content provided by FirstRanker.com ---
are ties.Let us calculate the correction terms for certain values of m. These are provided
--- Content provided by FirstRanker.com ---
in the following table.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3
--- Content provided by FirstRanker.com ---
m - m=
Correction term
--- Content provided by FirstRanker.com ---
m
--- Content provided by FirstRanker.com ---
m3m3 ? m
12
--- Content provided by FirstRanker.com ---
2 8 6
0.5
--- Content provided by FirstRanker.com ---
3 27 242
4 64 60
--- Content provided by FirstRanker.com ---
5
5 125 120
--- Content provided by FirstRanker.com ---
10Illustrative examples:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
If there is a tie involving 2 items, then the correction term is 0.5
If there are 2 ties involving 2
--- Content provided by FirstRanker.com ---
sitem each, then the correction
is
--- Content provided by FirstRanker.com ---
term
0.5 + 0.5 = 1
--- Content provided by FirstRanker.com ---
there aIf
re 3 ties with 2 items each, then the correction term is
--- Content provided by FirstRanker.com ---
0.5 + 0.5 + 0.5 = 1.5
If there is a tie involving 3 items, then the correction term is 2
--- Content provided by FirstRanker.com ---
If there are 2 ties involving 3 items each, then the correction term is 2 + 2 = 4If there is a tie with 2 items and another tie with 3 items, then the correction
term is 0.5 + 2 = 2.5
--- Content provided by FirstRanker.com ---
If there are 2 ties with 2 items each and another tie with 3 items, then the
is 0.5 + 0.5 + 2 = 3
--- Content provided by FirstRanker.com ---
correction termProblem 9 : Resolving ties in ranks
The following are the details of ratings scored by two popular insurance
--- Content provided by FirstRanker.com ---
schemes. Determine the rank correlation coefficient between them.
--- Content provided by FirstRanker.com ---
Scheme I80 80 83 84 87 87 89 90
Scheme II
--- Content provided by FirstRanker.com ---
55 56 57 57 57 58 59 60
--- Content provided by FirstRanker.com ---
Solution:From the given values, we have to determine the ranks.
Step 1.
--- Content provided by FirstRanker.com ---
Scheme I in descending order and rank
Arrange the scores for Insurance
--- Content provided by FirstRanker.com ---
them as 1,2,3,...,8.Scheme I Score
--- Content provided by FirstRanker.com ---
90
89
--- Content provided by FirstRanker.com ---
8787
84
--- Content provided by FirstRanker.com ---
83
80
--- Content provided by FirstRanker.com ---
80Rank
1
--- Content provided by FirstRanker.com ---
2
3
--- Content provided by FirstRanker.com ---
45
6
--- Content provided by FirstRanker.com ---
7
8
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The score 87 appears twice. The corresponding ranks are 3, 4. Their average is(3 + 4) / 2 = 3.5. Assign this rank to the two equal scores in Scheme I.
--- Content provided by FirstRanker.com ---
The score 80 appears twice. The corresponding ranks are
Their average is
--- Content provided by FirstRanker.com ---
7, 8.(7 + 8) / 2 = 7.5. Assign this rank to the two equal scores in Scheme I.
--- Content provided by FirstRanker.com ---
The revised ranks for Insurance Scheme I are as follows:
--- Content provided by FirstRanker.com ---
Scheme I Score
90
--- Content provided by FirstRanker.com ---
89
87 87
--- Content provided by FirstRanker.com ---
8483
80
--- Content provided by FirstRanker.com ---
80
Rank
--- Content provided by FirstRanker.com ---
12
3.5 3.5
--- Content provided by FirstRanker.com ---
5
6
--- Content provided by FirstRanker.com ---
7.57.5
--- Content provided by FirstRanker.com ---
Step 2. Arrange the scores for Insurance Scheme II in descending order and
rank them as 1,2,3,...,8.
--- Content provided by FirstRanker.com ---
Scheme II Score
60
--- Content provided by FirstRanker.com ---
59
58
--- Content provided by FirstRanker.com ---
5757
57
--- Content provided by FirstRanker.com ---
56
55
--- Content provided by FirstRanker.com ---
Rank1 2
3 4 5 6 7 8
--- Content provided by FirstRanker.com ---
The score 57 appears thrice. The corresponding ranks are 4, 5, 6.
--- Content provided by FirstRanker.com ---
Their average is (4 + 5 + 6) / 3 = 15 / 3 = 5. Assign this rank to the three equalscores in Scheme II.
--- Content provided by FirstRanker.com ---
The revised ranks for Insurance Scheme II are as follows:
--- Content provided by FirstRanker.com ---
Scheme II Score60
59
--- Content provided by FirstRanker.com ---
58
57
--- Content provided by FirstRanker.com ---
5757
56
--- Content provided by FirstRanker.com ---
55
Rank
--- Content provided by FirstRanker.com ---
12
3
--- Content provided by FirstRanker.com ---
5
5
--- Content provided by FirstRanker.com ---
57
8
--- Content provided by FirstRanker.com ---
Step 3. Calculation of D2:
--- Content provided by FirstRanker.com ---
Assign the revised ranks to the given pairs of values and calculate D2 as follows:
--- Content provided by FirstRanker.com ---
Scheme I
--- Content provided by FirstRanker.com ---
Scheme IISche e I
m
--- Content provided by FirstRanker.com ---
Scheme II
D = R1- R2
--- Content provided by FirstRanker.com ---
D2Score
Score
--- Content provided by FirstRanker.com ---
Rank: R1
Rank: R2
--- Content provided by FirstRanker.com ---
8055
7.5
--- Content provided by FirstRanker.com ---
8
- 0.5
--- Content provided by FirstRanker.com ---
0.2580
56
--- Content provided by FirstRanker.com ---
7.5
7
--- Content provided by FirstRanker.com ---
0.50.25
83
--- Content provided by FirstRanker.com ---
57
6
--- Content provided by FirstRanker.com ---
51
1
--- Content provided by FirstRanker.com ---
84
57
--- Content provided by FirstRanker.com ---
55
0
--- Content provided by FirstRanker.com ---
0
87
--- Content provided by FirstRanker.com ---
573.5
5
--- Content provided by FirstRanker.com ---
- 1.5
2.25
--- Content provided by FirstRanker.com ---
8758
3.5
--- Content provided by FirstRanker.com ---
3
0.5
--- Content provided by FirstRanker.com ---
0.2589
59
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
00
90
--- Content provided by FirstRanker.com ---
60
1
--- Content provided by FirstRanker.com ---
10
0
--- Content provided by FirstRanker.com ---
Total 4
--- Content provided by FirstRanker.com ---
Step 4. Calculation of :We have N = 8.
Since there are 2 ties with 2 items each and another tie with 3 items, the
--- Content provided by FirstRanker.com ---
correction term is 0.5 + 0.5 + 2 .
The rank correlation coefficient is
--- Content provided by FirstRanker.com ---
= 1 - [{ 6 D2 + (1/2) + (1/2) +2 }/ (N3 ? N)}]= 1 ? { 6 (4.+0.5+0.5+2) / (512 ? 8) } = 1 ? (6 x 7 / 504) = 1 - ( 42/504 )
= 1 - 0.083 = 0.917
--- Content provided by FirstRanker.com ---
Inference:
--- Content provided by FirstRanker.com ---
It is inferred that the two insurance schemes are highly, positively correlated.
--- Content provided by FirstRanker.com ---
REGRESSION
--- Content provided by FirstRanker.com ---
In the pairs of observations, if there is a cause and effect relationship between
the variables X and Y, then the average relationship between these two variables
--- Content provided by FirstRanker.com ---
is called regression, which means "stepping back" or "return to the average".The linear relationship giving the best mean value of a variable corresponding to
the other variable is called a regression line or line of the best fit. The
--- Content provided by FirstRanker.com ---
regression of X on Y is different from the regression of Y on X. Thus, there are
two equations of regression and the two regression lines are given as follows:
--- Content provided by FirstRanker.com ---
Regression ofY on X: Y - Y = b ( X - X )
yx
--- Content provided by FirstRanker.com ---
Regression of X on Y: X - X = b (Y - Y )
--- Content provided by FirstRanker.com ---
xywhere X , Y are the means of X, Y respectively.
Result:
--- Content provided by FirstRanker.com ---
Let x, y denote the standard deviations of x, y respectively. We have the
fo
--- Content provided by FirstRanker.com ---
sult.llowing re
Y
--- Content provided by FirstRanker.com ---
X
b = r
--- Content provided by FirstRanker.com ---
and b = ryx
xy
--- Content provided by FirstRanker.com ---
X
Y
--- Content provided by FirstRanker.com ---
2
r = b b
--- Content provided by FirstRanker.com ---
and so r = b b
yx xy
--- Content provided by FirstRanker.com ---
yx xyResult:
--- Content provided by FirstRanker.com ---
The coefficient of correlation r between X and Y is the square root of the
product of the b values in the two regression equations. We can find r by this
--- Content provided by FirstRanker.com ---
way also.Application
--- Content provided by FirstRanker.com ---
e
The m thod of regression is very much useful for business forecasting.
--- Content provided by FirstRanker.com ---
PRINCIPLE OF LEAST SQUARESe
L t x, y be two variables under consideration. Out of them, let x be an
--- Content provided by FirstRanker.com ---
independent variable and let y be a dependent variable, depending on x. We
--- Content provided by FirstRanker.com ---
desire to build a functional relationship between them. For this purpose, the first
nd
--- Content provided by FirstRanker.com ---
aforemost requirement is that x, y have a high degree of correlation. If the
correlation coefficient between x and y is moderate or less, we shall o
--- Content provided by FirstRanker.com ---
n t go
ahead with the task of fitting a functional relationship between them.
--- Content provided by FirstRanker.com ---
Suppose there is a high degree of corr le ation (positive or negative)
between x and y. Suppose it is required to build a linear relations i
--- Content provided by FirstRanker.com ---
h p between
them i.e., we want a regression of y on x.
--- Content provided by FirstRanker.com ---
Geometrically speaking, if we plot the corresponding values of x and yin a 2-dimensional plane and join such points, we shall obtain a straight line.
However, hardly we can expect all the pairs (x, y) to lie on a straight line. We
--- Content provided by FirstRanker.com ---
can consider several straight lines which are, to some extent, near all the points
(x, y). Consider one line. An observation (x1, y1) may be either above the line of
--- Content provided by FirstRanker.com ---
consideration or below the line. Project this point on the x-axis. It will meet thestraight line at the point (x1, y1e). Here the theoretical value (or the expected
value) of the variable is y1e while the observed value is y1. When there is a
--- Content provided by FirstRanker.com ---
difference between the expected an
ved values, there appears an erro
--- Content provided by FirstRanker.com ---
d obserr.
This error is E1 = y1 ? ^y . This is positive if (x1, y1) is a point above the line and
--- Content provided by FirstRanker.com ---
1
negative if (x1, y1) is a point below the line. For the n pairs of observations, we
--- Content provided by FirstRanker.com ---
have the following n quantities of error:E1 = y1 ? ^y ,
1
--- Content provided by FirstRanker.com ---
E2 = y2 ? ^y ,
2
--- Content provided by FirstRanker.com ---
..
.
--- Content provided by FirstRanker.com ---
En = yn ? ^y .
--- Content provided by FirstRanker.com ---
nSome of these quantities are positive while the remaining ones are negative.
However, the squares of all these quantities are positive.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Y(X1, Y1)
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
e1
--- Content provided by FirstRanker.com ---
e2
--- Content provided by FirstRanker.com ---
(X2, Y2)
--- Content provided by FirstRanker.com ---
O
--- Content provided by FirstRanker.com ---
Xi.e., E2 = (y ? ^ 2
y ) 0, E2 = (y
--- Content provided by FirstRanker.com ---
2
?
--- Content provided by FirstRanker.com ---
2^ 2
1
--- Content provided by FirstRanker.com ---
1
y ) 0, ..., E2 = (y
--- Content provided by FirstRanker.com ---
n?
2
--- Content provided by FirstRanker.com ---
n
^y ) 0.
--- Content provided by FirstRanker.com ---
12
n
--- Content provided by FirstRanker.com ---
Hence the sum of squares of errors (SSE) = E21 + E22 + ... + E2n
= (y1 ? ^y 2
--- Content provided by FirstRanker.com ---
+ (y ? ^y 2+ ... + (y
--- Content provided by FirstRanker.com ---
^y 2
1 )
--- Content provided by FirstRanker.com ---
22 )
n ?
--- Content provided by FirstRanker.com ---
)
0.
--- Content provided by FirstRanker.com ---
nAmong all those straight lines which are somewhat near to the given
observations
--- Content provided by FirstRanker.com ---
(x1, y1), (x2, y2), ..., (xn , yn) , we consider that straight line as the ideal one for
which the SSE is the least. Since the ideal straight line giving regression of y on
--- Content provided by FirstRanker.com ---
x is based on this concept, we call this principle as the principle of leastsquares.
Normal equations
--- Content provided by FirstRanker.com ---
Suppose we have to fit a straight line to the n pairs of observations (x1, y1), (x2,
y2), ...,
--- Content provided by FirstRanker.com ---
(xn , yn). Suppose the equation of straight line finally comes asY = a + b X (1)
--- Content provided by FirstRanker.com ---
where a, b are constants to be determ ned
i
--- Content provided by FirstRanker.com ---
a. M t
m
--- Content provided by FirstRanker.com ---
he atically speaking, when we
require finding the equation of a straight lin
--- Content provided by FirstRanker.com ---
distine, two
ct points on the straight
--- Content provided by FirstRanker.com ---
line are sufficient. However, a different approach is followed here. We want to
--- Content provided by FirstRanker.com ---
include all the observations in our attempt to build a straight line. Then all the n
observed points (x, y) are required to satisfy
--- Content provided by FirstRanker.com ---
e rth
elation (1). Consider the
--- Content provided by FirstRanker.com ---
summation of all such terms. We get
y = (a + b x ) = (a .1 + b x ) = ( a.1) + ( b x ) = a ( 1 ) + b ( x).
--- Content provided by FirstRanker.com ---
i.e. y = an + b ( x) (2)To find two quantities a and b, we require two equations. We have
obtained one equation i.e., (2). We need one m re equation. For this purpose,
--- Content provided by FirstRanker.com ---
o
multiply both sides of (1) by x. We obtain
--- Content provided by FirstRanker.com ---
x y = ax + bx2 .Consider the summation of all such terms. We get
x y = (ax + bx2 ) = ( a x) + ( bx2)
--- Content provided by FirstRanker.com ---
i.e., x y = a ( x ) +
2
--- Content provided by FirstRanker.com ---
b ( x ) .............. (3)Equations (2) and (3) are referred to as the normal equations associated
with the regression of y on x. Solving these two equations, we obtain
--- Content provided by FirstRanker.com ---
2
X Y - X XY
--- Content provided by FirstRanker.com ---
a =n X - ( X)2
--- Content provided by FirstRanker.com ---
2
n X
--- Content provided by FirstRanker.com ---
Y - X Y
--- Content provided by FirstRanker.com ---
and b =
--- Content provided by FirstRanker.com ---
- (
--- Content provided by FirstRanker.com ---
X)2
2
--- Content provided by FirstRanker.com ---
nX
--- Content provided by FirstRanker.com ---
Note: For calculating the coefficient of correlation,
we require X, Y, XY, X2, Y2.
--- Content provided by FirstRanker.com ---
For calculating the regression of y on x, we require X, Y, XY, X2. Thus,tabular colu
e
--- Content provided by FirstRanker.com ---
mn is sam in both the cases with the difference that Y2 is also
required for the coefficient of correlation.
--- Content provided by FirstRanker.com ---
Next, if we consider the regression line of x on y, we get the equationX = a + b
Y. The expressions for the coefficients can be got by inter
--- Content provided by FirstRanker.com ---
n
cha ging th r
--- Content provided by FirstRanker.com ---
e oles ofX and Y in the previous discussion. Thus, we obtain
--- Content provided by FirstRanker.com ---
2
Y X - Y XY
--- Content provided by FirstRanker.com ---
a =
2
--- Content provided by FirstRanker.com ---
n2
Y
--- Content provided by FirstRanker.com ---
- (
--- Content provided by FirstRanker.com ---
Y)n X
--- Content provided by FirstRanker.com ---
Y - X Y
and b =
--- Content provided by FirstRanker.com ---
n Y - (Y)2
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
Problem 10Consider the following data on sales and profit.
X 5 6 7 8 9 10
--- Content provided by FirstRanker.com ---
11
Y 2 4 5 5 3
--- Content provided by FirstRanker.com ---
8
--- Content provided by FirstRanker.com ---
7
--- Content provided by FirstRanker.com ---
Determine the regression of profit on sales.Solution:
--- Content provided by FirstRanker.com ---
We have N = 7. Take X = Sales, =
Y
--- Content provided by FirstRanker.com ---
Profit.Calculate X, Y, XY, X2
o
--- Content provided by FirstRanker.com ---
as f llows:
X
--- Content provided by FirstRanker.com ---
Y
--- Content provided by FirstRanker.com ---
XY
X2
--- Content provided by FirstRanker.com ---
52
10
--- Content provided by FirstRanker.com ---
25
6
--- Content provided by FirstRanker.com ---
424
36
--- Content provided by FirstRanker.com ---
7
5
--- Content provided by FirstRanker.com ---
3549
8
--- Content provided by FirstRanker.com ---
5
40
--- Content provided by FirstRanker.com ---
649
3
--- Content provided by FirstRanker.com ---
27
81
--- Content provided by FirstRanker.com ---
108
80
--- Content provided by FirstRanker.com ---
100
11
--- Content provided by FirstRanker.com ---
777
121
--- Content provided by FirstRanker.com ---
Total: 56
34
--- Content provided by FirstRanker.com ---
293476
--- Content provided by FirstRanker.com ---
a = {( x2) ( y) ? ( x) ( x y)} / {n ( x2) ? ( x) 2}
= (476 x 34 ? 56 x 293) / ( 7 x 476 - 562 ) = (16184 ? 16408 ) / ( 3332 ? 3136 )
--- Content provided by FirstRanker.com ---
= - 224 / 196 = ? 1.1429
--- Content provided by FirstRanker.com ---
b = {n ( x y) ? ( x) ( y)} / {n ( x2) ? ( x) 2}
= (7 x 293 ? 56 x 34)/ 196 = (2051 ? 1904)/ 196 = 147 /196 = 0.75
--- Content provided by FirstRanker.com ---
The
is given by the equation
--- Content provided by FirstRanker.com ---
regression of Y on XY = a + b X
--- Content provided by FirstRanker.com ---
i.e., Y = ? 1.14 + 0.75 X
--- Content provided by FirstRanker.com ---
Problem 11
--- Content provided by FirstRanker.com ---
The
are the details of income
--- Content provided by FirstRanker.com ---
followingand expenditure of 10 households.
Income
--- Content provided by FirstRanker.com ---
40 70 50 60 80 50 90 40
--- Content provided by FirstRanker.com ---
60 60Expenditure 25 60 45 50 45 20 55 30 35 30
Determine the regression of expenditure on income n
--- Content provided by FirstRanker.com ---
a d estimate the expenditure
when the income is 65.
--- Content provided by FirstRanker.com ---
Solution:
e
--- Content provided by FirstRanker.com ---
W have N = 10. Take X = Income, Y = Expenditure
Calculate X, Y, XY, X2 as follows:
--- Content provided by FirstRanker.com ---
X Y XY X2
40 25 1000
--- Content provided by FirstRanker.com ---
1600
70 60 4200
--- Content provided by FirstRanker.com ---
490050 45 2250
2500
--- Content provided by FirstRanker.com ---
60 50 3000
3600
--- Content provided by FirstRanker.com ---
80 45 36006400
50 20 1000
--- Content provided by FirstRanker.com ---
2500
90 55 4950
--- Content provided by FirstRanker.com ---
810040 30 1200
1600
--- Content provided by FirstRanker.com ---
60 35 2100
3600
--- Content provided by FirstRanker.com ---
60 30 1800
--- Content provided by FirstRanker.com ---
3600Total: 600
395
--- Content provided by FirstRanker.com ---
25100
38400
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
a = {( x2) (y) ? ( x) ( x y)} /
2
--- Content provided by FirstRanker.com ---
{n ( x ) ? ( x) 2}
= ( 38400 x 395 - 600
--- Content provided by FirstRanker.com ---
100 ) /x 25
(10 x 38400 - 6002)
--- Content provided by FirstRanker.com ---
= (15168000 ? 15060000) / (384000 ? 360000) = 108000 / 24000 = 4.5
--- Content provided by FirstRanker.com ---
b = {n ( x y) ? ( x) ( y)} / {n ( x2) ? ( x) 2}= ( 10 x 25100 ? 600 x 395) / 24000 = (251000- 237000) / 24000
= 14000 / 24000 = 0.58
--- Content provided by FirstRanker.com ---
The
ion of Y on X is given by the equation
--- Content provided by FirstRanker.com ---
regressY = a + b X
--- Content provided by FirstRanker.com ---
i.e., Y = 4.5 + 0.583 X
To estimate the expenditure w en income is 65
--- Content provided by FirstRanker.com ---
h:
Take X = 65 in the above equation. Then we get
--- Content provided by FirstRanker.com ---
Y = 4.5 + 0.583 x 65 = 4.5 + 37.895 = 42.395 = 42 (approximately).
--- Content provided by FirstRanker.com ---
Problem 12Consider the following data on occupancy rate and profit of a hotel.
Occupancy rate 40
--- Content provided by FirstRanker.com ---
45
70
--- Content provided by FirstRanker.com ---
6070
75
--- Content provided by FirstRanker.com ---
70 80 95
90
--- Content provided by FirstRanker.com ---
Profit50 55 65 70 90 95 105 110 120 125
--- Content provided by FirstRanker.com ---
Determine the regressions of (i) profit on occupancy rate and
(ii) occupancy rate on profit.
--- Content provided by FirstRanker.com ---
Solution:
We have N = 10. Take X = Occupancy rate, Y = Profit.
--- Content provided by FirstRanker.com ---
Note that in Problems 10 and 11, we wanted only one regression line and so we
--- Content provided by FirstRanker.com ---
did not take Y2 . Now we require two regression lines. Therefore,
calculate X, Y, XY, X2, Y2.
--- Content provided by FirstRanker.com ---
X Y
XY
--- Content provided by FirstRanker.com ---
X2
Y2
--- Content provided by FirstRanker.com ---
40 502000
1600
--- Content provided by FirstRanker.com ---
2500
45 55
--- Content provided by FirstRanker.com ---
24752025
3025
--- Content provided by FirstRanker.com ---
70 65
4550
--- Content provided by FirstRanker.com ---
49004225
60 70
--- Content provided by FirstRanker.com ---
4200
3600
--- Content provided by FirstRanker.com ---
490070 90
6300
--- Content provided by FirstRanker.com ---
4900
8100
--- Content provided by FirstRanker.com ---
75 957125
5625
--- Content provided by FirstRanker.com ---
9025
70 105
--- Content provided by FirstRanker.com ---
73504900
11025
--- Content provided by FirstRanker.com ---
80 110
8800
--- Content provided by FirstRanker.com ---
640012100
95 120
--- Content provided by FirstRanker.com ---
11400
9025
--- Content provided by FirstRanker.com ---
1440090 125
11250
--- Content provided by FirstRanker.com ---
8100
15625
--- Content provided by FirstRanker.com ---
Total: 695885
65450
--- Content provided by FirstRanker.com ---
51075
84925
--- Content provided by FirstRanker.com ---
The regression line of Y on X:
Y = a + b X
--- Content provided by FirstRanker.com ---
where a ={( x2) ( y) ? ( x) ( x y)} / {n ( x2) ? ( x) 2}
and b ={n ( x y) ? ( x) ( y)} / {n ( x2) ? ( x) 2
--- Content provided by FirstRanker.com ---
}
We obtain
--- Content provided by FirstRanker.com ---
a = (51075 x 885 ? 695 x 65450) / (10x51075 - 6952)
= (45201375 ? 45487750)/ (510750 ? 483025)
--- Content provided by FirstRanker.com ---
= - 286375 / 27725 = - 10.329b = (10 x 65450 ? 695 x 885) / 27725
= (654500 ? 615075) / 27725 = 39425 / 27725 = 1.422
--- Content provided by FirstRanker.com ---
So, the regression equation is Y = - 1
--- Content provided by FirstRanker.com ---
0.329 + 1.422 X
--- Content provided by FirstRanker.com ---
Next, if we consider the regression line of X on Y,
get the equation X = a + b Y where
--- Content provided by FirstRanker.com ---
we
--- Content provided by FirstRanker.com ---
a = {( y2) ( x) ? ( y) ( x y)} / {n ( y2) ? ( y) 2}and b = {n ( x y) ? ( x) ( y)} / {n ( y2) ? ( y) 2}.
We get
--- Content provided by FirstRanker.com ---
a = (84925 x 695 ? 885 x 65450) / (10 x 84925 ? 8852)
= (59022875 ? 57923250) / ( 849250 ? 783225) = 1099625 / 66025 = 16.655,
--- Content provided by FirstRanker.com ---
b = (10 x 65450 ? 695 x 885) / 66025 = (654500 ? 615075) / 66025= 39425 / 66025 = 0.597
So, the regression equation is X = 16.655 + 0.597 Y
--- Content provided by FirstRanker.com ---
Note: For the data given in this problem, if we use the formula for r, we get
N XY - ( X ) (Y )
--- Content provided by FirstRanker.com ---
r =N X - ( X )2 N Y - (Y )2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
= (10 x 65450 ? 695 x 885) / { (10 x 51075 - 6952 ) (10 x 84925 - 8852 ) }= (654500 ? 615075) / ( 27725 66025 ) = 39425 / 166.508 x 256.95
= 39425 / 42784.23 = 0.9214
--- Content provided by FirstRanker.com ---
However, once we know the two b values, we can find the coefficient of
correlation r between X and Y as the square root of the product of the two b
--- Content provided by FirstRanker.com ---
values.Thus we obtain
r = (1.422 x 0.597) = 0.848934 = 0.9214.
--- Content provided by FirstRanker.com ---
Note that this agrees with the above value of r.
--- Content provided by FirstRanker.com ---
QUESTIONS1.
Explain the aim of `Correlation Analysis'.
--- Content provided by FirstRanker.com ---
2.
Distinguish between positive and negative correlation.
--- Content provided by FirstRanker.com ---
3.
--- Content provided by FirstRanker.com ---
State the formula for simplecoefficient.
correlation
--- Content provided by FirstRanker.com ---
4.
State the properties of the correlation coefficient.
--- Content provided by FirstRanker.com ---
5.nk correlation'?
What is `ra
--- Content provided by FirstRanker.com ---
Explain.
6.
--- Content provided by FirstRanker.com ---
kState the formula for ran correlation coefficient.
7.
--- Content provided by FirstRanker.com ---
Explain how to resolve ties while calculating ranks.
8.
--- Content provided by FirstRanker.com ---
Explain the concept of regression.9.
What is the principle of least squares? Explain.
--- Content provided by FirstRanker.com ---
10.
Explain normal equations in the context of regression analysis.
--- Content provided by FirstRanker.com ---
11.State the formulae for the constant term and coefficient in the regression
equation.
--- Content provided by FirstRanker.com ---
12.
State the relationship between the regression coefficient and correlation
--- Content provided by FirstRanker.com ---
coefficient.13.
Explain the managerial uses of Correlation Analysis and Regression
--- Content provided by FirstRanker.com ---
Analysis.
--- Content provided by FirstRanker.com ---
V
--- Content provided by FirstRanker.com ---
UNIT I2. ANALYSIS OF VARIANCE
Lesson Outline
--- Content provided by FirstRanker.com ---
?
Definition of ANOVA
--- Content provided by FirstRanker.com ---
?Assumptions of ANOVA
?
--- Content provided by FirstRanker.com ---
n of linear models
Classificatio
--- Content provided by FirstRanker.com ---
?ANOVA for one-way classified data
?
--- Content provided by FirstRanker.com ---
ANOVA table for one-way classified data
?
--- Content provided by FirstRanker.com ---
Null and Alternative Hypotheses?
Type I Error
--- Content provided by FirstRanker.com ---
?
Level of significance
--- Content provided by FirstRanker.com ---
?SS, MSS and Variance ratio
?
--- Content provided by FirstRanker.com ---
Calculation of F value
?
--- Content provided by FirstRanker.com ---
Table value of F?
Coding Method
--- Content provided by FirstRanker.com ---
?
Inference fro ANOVA table
--- Content provided by FirstRanker.com ---
m?
Managerial applications of ANOVA
--- Content provided by FirstRanker.com ---
Learning Objectives
After reading this lesson you should be able to
--- Content provided by FirstRanker.com ---
-understand the concept of ANOVA
-
--- Content provided by FirstRanker.com ---
formulate Null and Alternative Hypotheses
-
--- Content provided by FirstRanker.com ---
construct ANOVA table for one-way classified data-
calculate T, N and CF
--- Content provided by FirstRanker.com ---
-
calculate SS, df and MSS
--- Content provided by FirstRanker.com ---
-calculate F value
-
--- Content provided by FirstRanker.com ---
find the table value of F
-
--- Content provided by FirstRanker.com ---
draw inference from ANOVA-
apply coding met
--- Content provided by FirstRanker.com ---
-
understand the managerial
--- Content provided by FirstRanker.com ---
of ANOVAapplications
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
ANALYSIS OF VARIANCE (ANOVA)
Introduction
--- Content provided by FirstRanker.com ---
For managerial decision making, sometimes one has to carry out tests ofsignificance. The analysis of variance is an effective tool for this purpose. The
objective of the analysis of variance is to test the homogeneity of the means of
--- Content provided by FirstRanker.com ---
different samples.
Definition
--- Content provided by FirstRanker.com ---
According to R.A. Fisher, "Analysis of variance is the separation of varianceascribable to one group of causes from the variance ascr
ther groups".
--- Content provided by FirstRanker.com ---
ibable to o
OVA
--- Content provided by FirstRanker.com ---
Assumptions of ANThe technique of ANOVA is mainly used for the analysis and interpretation of
data obtained from experiments. This technique is based on three important
--- Content provided by FirstRanker.com ---
assumptions, namely
1.
--- Content provided by FirstRanker.com ---
The parent population is normal.2.
The error component is distributed normally with zero mean and
--- Content provided by FirstRanker.com ---
constant variance.
3.
--- Content provided by FirstRanker.com ---
The various effects are additive in nature.The technique of ANOVA essentially consists of partitioning the total variation
in an experiment into components of
--- Content provided by FirstRanker.com ---
hese
different sources of variation. T
--- Content provided by FirstRanker.com ---
sources of variations are due to controlled factors and uncontrolled factors.Since the variation in the sample data is characterized by means of many
components of variation, it can be symbolically represented in the mathematical
--- Content provided by FirstRanker.com ---
form called a linear model for the sample data.
Classification of models
--- Content provided by FirstRanker.com ---
Linear models for the sample data may broadly be classified into three types asfollows:
1.
--- Content provided by FirstRanker.com ---
Random effect model
2.
--- Content provided by FirstRanker.com ---
Fixed effect model3.
Mixed effect model
--- Content provided by FirstRanker.com ---
In any variance components model, the error component has always
--- Content provided by FirstRanker.com ---
random effects, since it occurs purely in a random manner. All other
components may be either mi
--- Content provided by FirstRanker.com ---
randomxed or
.
--- Content provided by FirstRanker.com ---
Random effect model
A model in which each of the factors has random effect (including error effect)
--- Content provided by FirstRanker.com ---
is called a random effect model or simply a random model.Fixed effect model
A model in which each of the factors has fixed effects, buy only the error effect
--- Content provided by FirstRanker.com ---
is random is called a fixed effect model or simply a fixed model.
Mixed effect model
--- Content provided by FirstRanker.com ---
A model in which some of the factors have fixed effects and some others haverandom effects is called a mixed effect model or simply a mixed model.
In what follows, we shall restrict ourselves to a fixed effect model.
--- Content provided by FirstRanker.com ---
In a fixed effect model, the main objective is to estimate the effects and
find the measure of variability among each of the factors and finally to find the
--- Content provided by FirstRanker.com ---
variability among the error effects.The ANOVA technique is mainly based on the linear model which
depends on the types of data used in the linear model. There are several types of
--- Content provided by FirstRanker.com ---
data in ANOVA, depending on the number of sources of variation namely,
One-way classified data,
--- Content provided by FirstRanker.com ---
Two-way classified data,...
m-way classified data.
--- Content provided by FirstRanker.com ---
One-way classified data
When the set of observations is distributed over different levels of a single
--- Content provided by FirstRanker.com ---
factor, then it gives one-way classified data.--- Content provided by FirstRanker.com ---
ANOVA for One-way classified dataLet y denote the jth observation corresponding to the ith level of factor A and
i j
--- Content provided by FirstRanker.com ---
Yij the corresponding random variate.
--- Content provided by FirstRanker.com ---
Define the linear model for the sample data obtained from theexperiment by the equation
i =1,2, .
--- Content provided by FirstRanker.com ---
.. , k
y = ? + a + e
--- Content provided by FirstRanker.com ---
ij
i
--- Content provided by FirstRanker.com ---
ij
j = 1, 2,..., n
--- Content provided by FirstRanker.com ---
iwhere ? represents the general mean effect which is fixed and which represents
the general condition of the experimental units, a denotes the fixed effect due
--- Content provided by FirstRanker.com ---
i
to ith level of the factor A (i=1,2,...,k) and hence the variation due to a
--- Content provided by FirstRanker.com ---
i(i=1,2,...,k) is said to be control.
The last component of the model e is the random variable. It is called the error
--- Content provided by FirstRanker.com ---
ij
component and it makes the Yij a random variate. The variation in e is due to
--- Content provided by FirstRanker.com ---
ijall the uncontrolled factors and e is independently, identically and normally
ij
--- Content provided by FirstRanker.com ---
distributed with mean zero and constant variance 2
.
--- Content provided by FirstRanker.com ---
For the realization of the random variate Yij, consider y defined by
ij
--- Content provided by FirstRanker.com ---
i =1,2,...,k
y = ? + a + e
--- Content provided by FirstRanker.com ---
iji
ij
--- Content provided by FirstRanker.com ---
j = 1, 2,..., n
--- Content provided by FirstRanker.com ---
iThe expected value of the general observation y in the experimental units is
ij
--- Content provided by FirstRanker.com ---
given by E( y ) = ? for all i = 1, 2,..., k
ij
--- Content provided by FirstRanker.com ---
iwith y = ? + e , where e is the random error effect due to uncontrolled factors
ij
--- Content provided by FirstRanker.com ---
i
ij
--- Content provided by FirstRanker.com ---
ij(i.e., due to chance only).
Here we may expect ? = ? for all i
--- Content provided by FirstRanker.com ---
= 1, 2,..., k , if there is no variation due to
i
--- Content provided by FirstRanker.com ---
control factors. If it is not the case, we have--- Content provided by FirstRanker.com ---
? ? for all i =1,2,...,ki
i e
--- Content provided by FirstRanker.com ---
. ., ? - ? 0 for all i = 1, 2,..., k
i
--- Content provided by FirstRanker.com ---
Suppose ? - ? a .
i
--- Content provided by FirstRanker.com ---
i
Then we have ? ? + a for all i = 1, 2,..., k
--- Content provided by FirstRanker.com ---
ii
On substitution for ? in the above equation, the linear model reduces to
--- Content provided by FirstRanker.com ---
i
i =1,2,...,k
--- Content provided by FirstRanker.com ---
y = ? + a + e(1)
ij
--- Content provided by FirstRanker.com ---
i
ij
--- Content provided by FirstRanker.com ---
j = 1, 2,..., ni
The objective of ANOVA is to test the null hypothesis
--- Content provided by FirstRanker.com ---
H : ? = ? for all i = 1, 2,..., k or H : a = 0 for all i = 1, 2,..., k . For carrying
o
--- Content provided by FirstRanker.com ---
io
i
--- Content provided by FirstRanker.com ---
out this test, we need to estimate the unknown parameters
? , a for all i =1,2,...,k by the principle of least squares. This can be done by
--- Content provided by FirstRanker.com ---
iminimizing the residual sum of squares defined by
E = e2ij
--- Content provided by FirstRanker.com ---
ij
2
--- Content provided by FirstRanker.com ---
=( y - ? - a ) ,
--- Content provided by FirstRanker.com ---
ij
i
--- Content provided by FirstRanker.com ---
ijusing (1). The normal equations can be obtained by partially differentiating E
with respect to ? and a for all i = 1, 2,..., k and equating the results to zero.
--- Content provided by FirstRanker.com ---
i
We obtain
--- Content provided by FirstRanker.com ---
G = N ? + n a--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(2)
i
--- Content provided by FirstRanker.com ---
i
i
--- Content provided by FirstRanker.com ---
and T?
i = ni
--- Content provided by FirstRanker.com ---
+ ni ai, i = 1,2,...,k (3)
where N = nk. We see that the number of variables (k+1) is more than the
--- Content provided by FirstRanker.com ---
number of independent equations (k). So, by the theorem on a system of linearequations, it follows that unique solution for this system is not possible.
--- Content provided by FirstRanker.com ---
However, by making the assumption that n a = 0 , we can get a
i
--- Content provided by FirstRanker.com ---
ii
unique solution for ? and ai (i = 1,2,...,k). Using this condition in equation (2),
--- Content provided by FirstRanker.com ---
we get
--- Content provided by FirstRanker.com ---
G = N ?
G
--- Content provided by FirstRanker.com ---
.
i .
--- Content provided by FirstRanker.com ---
e ? = N
Therefore the estimate of ? is given by ? G
--- Content provided by FirstRanker.com ---
? =(4)
--- Content provided by FirstRanker.com ---
N
Again from equation (2), we have
--- Content provided by FirstRanker.com ---
Ti = ? + a ini
T
--- Content provided by FirstRanker.com ---
,
i
--- Content provided by FirstRanker.com ---
Hence a =- ?
i
--- Content provided by FirstRanker.com ---
ni
Therefore, the estimate of a is given by
--- Content provided by FirstRanker.com ---
i? Ti
?
--- Content provided by FirstRanker.com ---
a =
- ?
--- Content provided by FirstRanker.com ---
ini
T
--- Content provided by FirstRanker.com ---
G
i.e., ?
--- Content provided by FirstRanker.com ---
ia =
-
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(5)
i
--- Content provided by FirstRanker.com ---
nN
i
--- Content provided by FirstRanker.com ---
Substituting the least square estimates of ?
? and ?
--- Content provided by FirstRanker.com ---
a in the residual sum ofi
squares, we get
--- Content provided by FirstRanker.com ---
?
2
--- Content provided by FirstRanker.com ---
E = (y - ? - a$i )ij
ij
--- Content provided by FirstRanker.com ---
After carrying out some calculations and using the normal equations (2) and (3)
we obtain
--- Content provided by FirstRanker.com ---
22
2
--- Content provided by FirstRanker.com ---
G
T
--- Content provided by FirstRanker.com ---
G2
E = y -
--- Content provided by FirstRanker.com ---
i
-
--- Content provided by FirstRanker.com ---
-(6)
--- Content provided by FirstRanker.com ---
ij
N
--- Content provided by FirstRanker.com ---
nij
i
--- Content provided by FirstRanker.com ---
N
i
--- Content provided by FirstRanker.com ---
The first ter in the RHS ofm
equation (6) is called the corrected total sum of
--- Content provided by FirstRanker.com ---
squares while
2
--- Content provided by FirstRanker.com ---
y is called the uncorrected total sum of squares.ij
ij
--- Content provided by FirstRanker.com ---
For measuring the variation due to treatment (controlled factor), we
--- Content provided by FirstRanker.com ---
consider the null hypothesis that all the treatment effects are equal. i.e.,--- Content provided by FirstRanker.com ---
H : ? = ? = ... = ? = ?o
1
--- Content provided by FirstRanker.com ---
2
k
--- Content provided by FirstRanker.com ---
.i .
e , H : ? = ? for all i = 1, 2,..., k
--- Content provided by FirstRanker.com ---
o
i
--- Content provided by FirstRanker.com ---
.
i .
--- Content provided by FirstRanker.com ---
e , H : ? - ? = 0 for all i = 1, 2,..., k
o
--- Content provided by FirstRanker.com ---
i.
i .
--- Content provided by FirstRanker.com ---
e , H : a = 0
o
--- Content provided by FirstRanker.com ---
iUnder H , the linear model reduces to
o
--- Content provided by FirstRanker.com ---
i =1,2,...,k
y = ? + e
--- Content provided by FirstRanker.com ---
ij
ij
--- Content provided by FirstRanker.com ---
j = 1, 2,..., n
i
--- Content provided by FirstRanker.com ---
Proceeding as before, we get the residual sum of squares for this hypotheticalmodel as
2
--- Content provided by FirstRanker.com ---
G
2
--- Content provided by FirstRanker.com ---
E = y ---- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(7)
1
--- Content provided by FirstRanker.com ---
ij
N
--- Content provided by FirstRanker.com ---
ijActually, E contains the variation due to both treatment and error. Therefore a
1
--- Content provided by FirstRanker.com ---
measure of variation due to treatment can be obtained by " E - E ". Using (6)
1
--- Content provided by FirstRanker.com ---
and (7), we getk
2
--- Content provided by FirstRanker.com ---
2
T
--- Content provided by FirstRanker.com ---
Gi
E - E =
--- Content provided by FirstRanker.com ---
-
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
(8)
1
--- Content provided by FirstRanker.com ---
=n
N
--- Content provided by FirstRanker.com ---
i 1
i
--- Content provided by FirstRanker.com ---
The expression in (8) is usually called the corrected treatment sum of squaresk
2
--- Content provided by FirstRanker.com ---
T
while the term
--- Content provided by FirstRanker.com ---
iis called uncorrected treatment sum of squares. Here it
=
--- Content provided by FirstRanker.com ---
n
i 1
--- Content provided by FirstRanker.com ---
i2
G
--- Content provided by FirstRanker.com ---
may be noted that
is a correction factor (Also called a correction term).
--- Content provided by FirstRanker.com ---
NSince E is based on N-k free observations, it has N - k degrees of freedom (df).
Similarly, since E is based on N -1 free observation, E has N -1 degrees of
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
freedom. So E - E has k -1 degrees of freedom.1
When actually the null hypothesis is true, if we reject it on the basis of
--- Content provided by FirstRanker.com ---
the estimated value in ou
tis
--- Content provided by FirstRanker.com ---
r statical analysis, we will be committing Type ? I
error
--- Content provided by FirstRanker.com ---
o
. The probability f r committing this error is referre
--- Content provided by FirstRanker.com ---
ed to as th level of
--- Content provided by FirstRanker.com ---
significance, denoted by . The testing of the null hypothesis H may be
o
--- Content provided by FirstRanker.com ---
carried out by F test. For given , we have
Trss
--- Content provided by FirstRanker.com ---
TrMSSdF
F =
--- Content provided by FirstRanker.com ---
=
: F
--- Content provided by FirstRanker.com ---
.k 1,
- N -k
--- Content provided by FirstRanker.com ---
EMSS
Ess dF
--- Content provided by FirstRanker.com ---
i.e., It follows F distribution with degrees of freedom k-1 and N-k.All these values are represented in the form of a table called ANOVA table,
furnished below.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
ANOVA Table for one-way classified dataSource of
Degrees of
--- Content provided by FirstRanker.com ---
u
S m of Squares
--- Content provided by FirstRanker.com ---
Mean SquaresVariance ratio
Variation
--- Content provided by FirstRanker.com ---
freedom
(SS)
--- Content provided by FirstRanker.com ---
(MS)F
Between the
--- Content provided by FirstRanker.com ---
E - E = Q
M
--- Content provided by FirstRanker.com ---
level of the1
T
--- Content provided by FirstRanker.com ---
T
Q
--- Content provided by FirstRanker.com ---
F =:
T
--- Content provided by FirstRanker.com ---
k-1
k
--- Content provided by FirstRanker.com ---
22
T
--- Content provided by FirstRanker.com ---
G
T
--- Content provided by FirstRanker.com ---
M =M
--- Content provided by FirstRanker.com ---
i
--- Content provided by FirstRanker.com ---
TE
factor
--- Content provided by FirstRanker.com ---
-
k -1
--- Content provided by FirstRanker.com ---
nN
F
--- Content provided by FirstRanker.com ---
i
i
--- Content provided by FirstRanker.com ---
k 1,- N -k
(Treatment)
--- Content provided by FirstRanker.com ---
Q :
E
--- Content provided by FirstRanker.com ---
Within the level N-kQ
-
--- Content provided by FirstRanker.com ---
E
By subtraction
--- Content provided by FirstRanker.com ---
M =E
--- Content provided by FirstRanker.com ---
of factor (Error)
--- Content provided by FirstRanker.com ---
N - k--- Content provided by FirstRanker.com ---
2
G
--- Content provided by FirstRanker.com ---
TotalN-1
Q = y -
--- Content provided by FirstRanker.com ---
- -
--- Content provided by FirstRanker.com ---
ijN
ij
--- Content provided by FirstRanker.com ---
tio
--- Content provided by FirstRanker.com ---
Variance raThe variance ratio is the ratio of
er variance to the s
--- Content provided by FirstRanker.com ---
the great
maller variance. It is
--- Content provided by FirstRanker.com ---
also called the F-coefficient. We haveF = Greater variance / Smaller variance.
--- Content provided by FirstRanker.com ---
We refer to the table of F values at a desired level of significance . In general,
is taken to be 5 %. The table value is referred to as the theoretical value or the
--- Content provided by FirstRanker.com ---
expected value. The calculated value is referred to as the observed value.
Inference
--- Content provided by FirstRanker.com ---
If the observed value of F is less than the expected value of F (i.e., Fo < Fe) forthe given level of significance , then the null hypothesis H is accepted. In
o
--- Content provided by FirstRanker.com ---
this case, we conclude that there is no significant difference between the
treatment effects.
--- Content provided by FirstRanker.com ---
On the other hand, if the observed value of F is greater than the expected valueof F (i.e., F > F ) for the given level of significance , then the null hypothesis
o
--- Content provided by FirstRanker.com ---
e
H is rejected. In this case, we conclude that all the treatment effects are not
--- Content provided by FirstRanker.com ---
oequal.
Note: If the calculated value of F and the table value of F are equal, we can try
--- Content provided by FirstRanker.com ---
some other value of .
Problem 1
--- Content provided by FirstRanker.com ---
The f llowing are tho
e details of sales effected by three sales persons in three
--- Content provided by FirstRanker.com ---
door-to-door campaigns.
Sales person
--- Content provided by FirstRanker.com ---
Sales in door ? t ? do
oor campaign
--- Content provided by FirstRanker.com ---
A
8
--- Content provided by FirstRanker.com ---
95
10
--- Content provided by FirstRanker.com ---
B
7
--- Content provided by FirstRanker.com ---
66
9
--- Content provided by FirstRanker.com ---
C
6
--- Content provided by FirstRanker.com ---
67
5
--- Content provided by FirstRanker.com ---
Construct an ANOVA table and find out whether there is any significant
difference in the performance of the sales persons.
--- Content provided by FirstRanker.com ---
Solution:Method I (Direct method) :
--- Content provided by FirstRanker.com ---
A =8+9+5+10 = 32
B = 7+6+6+9 = 28
--- Content provided by FirstRanker.com ---
C = 6+6+7+5 = 24
32
--- Content provided by FirstRanker.com ---
Sample mean f ro A : A =
= 8
--- Content provided by FirstRanker.com ---
4
28
--- Content provided by FirstRanker.com ---
Sample mean of r B : B =
= 7
--- Content provided by FirstRanker.com ---
4
24
--- Content provided by FirstRanker.com ---
Sample mean f ro C : C =
= 6
--- Content provided by FirstRanker.com ---
4
Total number of sample item = N
--- Content provided by FirstRanker.com ---
so. of items for A + No. of items o
f r B + No.
--- Content provided by FirstRanker.com ---
of items for C
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
4 + 4 + 4
=
--- Content provided by FirstRanker.com ---
= 1232 + 28 + 24
84
--- Content provided by FirstRanker.com ---
Mean of all the samples X =
=
--- Content provided by FirstRanker.com ---
= 712
12
--- Content provided by FirstRanker.com ---
Sum of squares of deviations for A:
A
--- Content provided by FirstRanker.com ---
A - A = A - 8( - )2
A
--- Content provided by FirstRanker.com ---
A
8
--- Content provided by FirstRanker.com ---
00
9
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
5-3
9
--- Content provided by FirstRanker.com ---
10
2
--- Content provided by FirstRanker.com ---
4--- Content provided by FirstRanker.com ---
14
--- Content provided by FirstRanker.com ---
Sum of squares of devia ionst
for B:
--- Content provided by FirstRanker.com ---
B
- B = B -
--- Content provided by FirstRanker.com ---
(B - B)2B
7
--- Content provided by FirstRanker.com ---
7
0
--- Content provided by FirstRanker.com ---
06
-1
--- Content provided by FirstRanker.com ---
1
6
--- Content provided by FirstRanker.com ---
-11
--- Content provided by FirstRanker.com ---
2
9
--- Content provided by FirstRanker.com ---
4
6
--- Content provided by FirstRanker.com ---
Sum of squares of deviations for C:
C
--- Content provided by FirstRanker.com ---
C - C = C - 6
( - )2
--- Content provided by FirstRanker.com ---
CC
6
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
60
0
--- Content provided by FirstRanker.com ---
7
1
--- Content provided by FirstRanker.com ---
15
-1
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
Sum of squares of deviations within2
2
--- Content provided by FirstRanker.com ---
2
varieties = ( A - A) + (B - B) + (C - C )
--- Content provided by FirstRanker.com ---
= 14 + 6 + 2
--- Content provided by FirstRanker.com ---
= 22
--- Content provided by FirstRanker.com ---
Sum of squares of deviations for total variance:Sales person
Sales
--- Content provided by FirstRanker.com ---
Sales - X = Sales ? 7
(Sales - )2
--- Content provided by FirstRanker.com ---
7--- Content provided by FirstRanker.com ---
A8
1
--- Content provided by FirstRanker.com ---
1
A
--- Content provided by FirstRanker.com ---
92
4
--- Content provided by FirstRanker.com ---
A
5
--- Content provided by FirstRanker.com ---
- 24
A
--- Content provided by FirstRanker.com ---
10
3
--- Content provided by FirstRanker.com ---
9B
7
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
B6
- 1
--- Content provided by FirstRanker.com ---
1
B
--- Content provided by FirstRanker.com ---
6- 1
1
--- Content provided by FirstRanker.com ---
B
9
--- Content provided by FirstRanker.com ---
24
C
--- Content provided by FirstRanker.com ---
6
- 1
--- Content provided by FirstRanker.com ---
1C
6
--- Content provided by FirstRanker.com ---
- 1
1
--- Content provided by FirstRanker.com ---
C7
0
--- Content provided by FirstRanker.com ---
0
C
--- Content provided by FirstRanker.com ---
52
4
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
ANOVA Table
Source of variati n
--- Content provided by FirstRanker.com ---
o
Degrees of freedom
--- Content provided by FirstRanker.com ---
Sum of squares ofVariance
deviations
--- Content provided by FirstRanker.com ---
8
Between varieties
--- Content provided by FirstRanker.com ---
3 ? 1 = 28
= 4
--- Content provided by FirstRanker.com ---
2
22
--- Content provided by FirstRanker.com ---
Within varieties12 ? 3 = 9
22
--- Content provided by FirstRanker.com ---
= 2.44
--- Content provided by FirstRanker.com ---
9Total
12 ? 1 = 11
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
Calculation of F value:
Greater Variance
--- Content provided by FirstRanker.com ---
F =
= 4.00 = 1.6393
--- Content provided by FirstRanker.com ---
Smaller Variance2.44
Degrees of freedom for greater variance (df = 2
--- Content provided by FirstRanker.com ---
1 )
Degrees of freedom for smaller variance (df = 9
--- Content provided by FirstRanker.com ---
2 )--- Content provided by FirstRanker.com ---
Let us take the level of significance as 5%The table value o
= 4
--- Content provided by FirstRanker.com ---
f F
.26
--- Content provided by FirstRanker.com ---
Inference:
The calcula d
--- Content provided by FirstRanker.com ---
te value of F is less tha the
n
--- Content provided by FirstRanker.com ---
table value of F. Therefore, the nullhypothesis is accepted. It is concluded that there is no significant difference in
the performance of the sales persons, at 5% level of significance.
--- Content provided by FirstRanker.com ---
Method II (Short cut method):
--- Content provided by FirstRanker.com ---
A = 32, B = 28, C = 24.
--- Content provided by FirstRanker.com ---
T= Sum of all the sample items
--- Content provided by FirstRanker.com ---
= A+ B + C= 32 + 28 + 24
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
= 84
N = Total number of te
--- Content provided by FirstRanker.com ---
i ms in all the samples = 4 + 4 + 4 =12--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
T84
Correction Factor =
--- Content provided by FirstRanker.com ---
=
= 588
--- Content provided by FirstRanker.com ---
N12
Calculate the su
--- Content provided by FirstRanker.com ---
m of squares of t e
h observed v lu
--- Content provided by FirstRanker.com ---
a es as follows:Sales Person
X
--- Content provided by FirstRanker.com ---
X2
--- Content provided by FirstRanker.com ---
A
8
--- Content provided by FirstRanker.com ---
64A
9
--- Content provided by FirstRanker.com ---
81
A
--- Content provided by FirstRanker.com ---
525
A
--- Content provided by FirstRanker.com ---
10
100
--- Content provided by FirstRanker.com ---
B7
49
--- Content provided by FirstRanker.com ---
B
6
--- Content provided by FirstRanker.com ---
36B
6
--- Content provided by FirstRanker.com ---
36
B
--- Content provided by FirstRanker.com ---
981
C
--- Content provided by FirstRanker.com ---
6
36
--- Content provided by FirstRanker.com ---
C6
36
--- Content provided by FirstRanker.com ---
C
7
--- Content provided by FirstRanker.com ---
49C
5
--- Content provided by FirstRanker.com ---
25
--- Content provided by FirstRanker.com ---
618
--- Content provided by FirstRanker.com ---
2
Sum of squares of deviations for total variance = X - correction factor
--- Content provided by FirstRanker.com ---
= 618 ? 588 = 30.Sum of squares of deviations for variance between samples
--- Content provided by FirstRanker.com ---
( A)2 (B)2 (C)2
=
--- Content provided by FirstRanker.com ---
++
- CF
--- Content provided by FirstRanker.com ---
N
N
--- Content provided by FirstRanker.com ---
N1
2
--- Content provided by FirstRanker.com ---
3
2
--- Content provided by FirstRanker.com ---
22
32
--- Content provided by FirstRanker.com ---
28
24
--- Content provided by FirstRanker.com ---
=+
+
--- Content provided by FirstRanker.com ---
- 588
4
--- Content provided by FirstRanker.com ---
44
1024
--- Content provided by FirstRanker.com ---
784
576
--- Content provided by FirstRanker.com ---
=+
+
--- Content provided by FirstRanker.com ---
- 588
--- Content provided by FirstRanker.com ---
44
4
--- Content provided by FirstRanker.com ---
= 256 +196 +144 - 588
= 8
ANOVA Table
--- Content provided by FirstRanker.com ---
Source of
Degrees of
--- Content provided by FirstRanker.com ---
Sum of squares ofVariance
--- Content provided by FirstRanker.com ---
variation
Freed
--- Content provided by FirstRanker.com ---
om deviations
--- Content provided by FirstRanker.com ---
Between varieties3-1 = 2
8
--- Content provided by FirstRanker.com ---
8 = 4
2
--- Content provided by FirstRanker.com ---
Within varieties12 ? 3 = 9
22
--- Content provided by FirstRanker.com ---
22 = 2.44
9
--- Content provided by FirstRanker.com ---
Total12 ? 1 = 11
30
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
It is to be noted that the ANOVA tables in the methods I and II are one and thesame. For the further steps of calculation of F value and drawing inference,
refer to method I.
--- Content provided by FirstRanker.com ---
Problem 2
The following are the details of plinth areas of ownership apartment flats offered
--- Content provided by FirstRanker.com ---
y 3b
housing companies A,B,C. Use analysis of variance to determine whe
--- Content provided by FirstRanker.com ---
ther
--- Content provided by FirstRanker.com ---
there is any significant difference in the plinth areas of the apartment flats.Housing Company
Plinth
--- Content provided by FirstRanker.com ---
f
area o apartment flats
--- Content provided by FirstRanker.com ---
A1500
1430
--- Content provided by FirstRanker.com ---
1550
1450
--- Content provided by FirstRanker.com ---
B1450
1550
--- Content provided by FirstRanker.com ---
1600
1480
--- Content provided by FirstRanker.com ---
C1550
1420
--- Content provided by FirstRanker.com ---
1450
1430
--- Content provided by FirstRanker.com ---
Use analysis of variance to determine whether there is any significant difference
in the plinth areas of the apartment's flats.
--- Content provided by FirstRanker.com ---
Note: As the given figures are large, working with them will be difficult.
Therefore, we use the following facts:
--- Content provided by FirstRanker.com ---
i.Variance ratio is independent of the change of origin.
ii.
--- Content provided by FirstRanker.com ---
Variance ratio is independent of the change of scale.
In the problem under consideration, the number
--- Content provided by FirstRanker.com ---
s vary from 1420 to 1600. Sowe follow a method a
c lled the coding method. First, let us subtract 1400 from
--- Content provided by FirstRanker.com ---
each item. We get the following transformed data:
--- Content provided by FirstRanker.com ---
Compa
r
--- Content provided by FirstRanker.com ---
ny T ansformedmea ur
s ement
--- Content provided by FirstRanker.com ---
A
100
--- Content provided by FirstRanker.com ---
30150
50
--- Content provided by FirstRanker.com ---
B
50
--- Content provided by FirstRanker.com ---
150100
80
--- Content provided by FirstRanker.com ---
C
150
--- Content provided by FirstRanker.com ---
2050
30
--- Content provided by FirstRanker.com ---
Next, divide each entry by 10.
--- Content provided by FirstRanker.com ---
The transformed data are given below.Compan
a
--- Content provided by FirstRanker.com ---
y Tr nsf m
or ed
--- Content provided by FirstRanker.com ---
measurementA
10
--- Content provided by FirstRanker.com ---
3
15
--- Content provided by FirstRanker.com ---
5B
5
--- Content provided by FirstRanker.com ---
15
10
--- Content provided by FirstRanker.com ---
8C
15
--- Content provided by FirstRanker.com ---
2
5
--- Content provided by FirstRanker.com ---
3We work with these transformed data. We have
A=10+3+15+5=33
--- Content provided by FirstRanker.com ---
B =5+15+10+8=38
C=15+2+5+3=25
--- Content provided by FirstRanker.com ---
T = A+B +C
= 33 + 38 + 25
--- Content provided by FirstRanker.com ---
= 96N = Total number of items in all the samples = 4 + 4 + 4 = 12
2
--- Content provided by FirstRanker.com ---
T
962
--- Content provided by FirstRanker.com ---
Correction Factor ==
= 768
--- Content provided by FirstRanker.com ---
N
12
--- Content provided by FirstRanker.com ---
Calculate the sum of squares of the bo served values as follows:
Compa y
--- Content provided by FirstRanker.com ---
n X
X2
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
A10
--- Content provided by FirstRanker.com ---
100
A
--- Content provided by FirstRanker.com ---
39
A
--- Content provided by FirstRanker.com ---
15
225
--- Content provided by FirstRanker.com ---
A5
25
--- Content provided by FirstRanker.com ---
B
5
--- Content provided by FirstRanker.com ---
25B
15
--- Content provided by FirstRanker.com ---
225
B
--- Content provided by FirstRanker.com ---
10100
B
--- Content provided by FirstRanker.com ---
8
64
--- Content provided by FirstRanker.com ---
C15
225
--- Content provided by FirstRanker.com ---
C
2
--- Content provided by FirstRanker.com ---
4C
--- Content provided by FirstRanker.com ---
5
25
--- Content provided by FirstRanker.com ---
C3
--- Content provided by FirstRanker.com ---
9
--- Content provided by FirstRanker.com ---
1036
--- Content provided by FirstRanker.com ---
Sum of squares of deviations for total variance =
2
--- Content provided by FirstRanker.com ---
X - correction factor--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
= 1036 ? 768 = 268
--- Content provided by FirstRanker.com ---
u
S m of squares of deviations for variance between samples
--- Content provided by FirstRanker.com ---
( A)2 (B)2 (C)2=
+
--- Content provided by FirstRanker.com ---
+
- CF
--- Content provided by FirstRanker.com ---
NN
N
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
32
2
--- Content provided by FirstRanker.com ---
2
33
--- Content provided by FirstRanker.com ---
3825
=
--- Content provided by FirstRanker.com ---
+
+
--- Content provided by FirstRanker.com ---
- 7684
4
--- Content provided by FirstRanker.com ---
4
1089
--- Content provided by FirstRanker.com ---
1444=
+
--- Content provided by FirstRanker.com ---
+ 625
--- Content provided by FirstRanker.com ---
- 7684
4
--- Content provided by FirstRanker.com ---
4
= 272.25 + 361+156.25 - 768
= 789.5 - 768
--- Content provided by FirstRanker.com ---
= 21.5ANOVA Table
Source of variation
--- Content provided by FirstRanker.com ---
Degrees of Freedom Sum of squar s
e
--- Content provided by FirstRanker.com ---
Variance--- Content provided by FirstRanker.com ---
of deviationsBetween varieties
3-1 = 2
--- Content provided by FirstRanker.com ---
21.5
21.5
--- Content provided by FirstRanker.com ---
= 10.75
2
--- Content provided by FirstRanker.com ---
Within varieties
12 ? 3 = 9
--- Content provided by FirstRanker.com ---
264.524.65 = 27.38
9
--- Content provided by FirstRanker.com ---
Total
12 ? 1 = 11
--- Content provided by FirstRanker.com ---
268Calculation of F value:
--- Content provided by FirstRanker.com ---
Greater Variance
27.38
--- Content provided by FirstRanker.com ---
F ==
= 2.5470
--- Content provided by FirstRanker.com ---
Smaller Variance
10.75
--- Content provided by FirstRanker.com ---
Degrees of freedom for greater variance (df = 91 )
Degrees of freedom for smaller va
--- Content provided by FirstRanker.com ---
riance (df = 2
--- Content provided by FirstRanker.com ---
2 )The table value of F at 5% level of significance = 19.
--- Content provided by FirstRanker.com ---
38
--- Content provided by FirstRanker.com ---
Inference:Since the calculated value of F is less than the t b
a le value of F, the null
--- Content provided by FirstRanker.com ---
hypothesis is accepted and it is concluded that there is no significant difference
in the plinth areas of ownership apartment flats off e
--- Content provided by FirstRanker.com ---
er d by the three companies,at 5% level of significance.
--- Content provided by FirstRanker.com ---
Problem 3
A finance manager has collected the following information on the performance
--- Content provided by FirstRanker.com ---
of three financial schemes.o
S urce of variation
--- Content provided by FirstRanker.com ---
Degrees of Freedom Sum of sq a
u res of deviations
--- Content provided by FirstRanker.com ---
Treatments5 15
Residual 2
--- Content provided by FirstRanker.com ---
25
Total (corrected)
--- Content provided by FirstRanker.com ---
740
--- Content provided by FirstRanker.com ---
Interpret the information obtained by him.
--- Content provided by FirstRanker.com ---
Note: `Treatments' means `Bet een
w
--- Content provided by FirstRanker.com ---
varieties'.`Residual' means `Within varieties' or `Error'.
--- Content provided by FirstRanker.com ---
Solution:
Number of sc e
--- Content provided by FirstRanker.com ---
h mes = 3 (since 3 ? 1 = 2)Total number of sample items = 8 (since 8 ? 1 = 7)
Let us calculate the variance.
--- Content provided by FirstRanker.com ---
15
Variance between varieties =
--- Content provided by FirstRanker.com ---
= 7.52
25
--- Content provided by FirstRanker.com ---
Variance between varieties =
= 5
--- Content provided by FirstRanker.com ---
5Greater Variance
F =
--- Content provided by FirstRanker.com ---
= 7.5 = 1.5
Smaller Variance
--- Content provided by FirstRanker.com ---
5Degrees of freedom for greater variance (df = 2
1 )
--- Content provided by FirstRanker.com ---
Degrees of freedom for smaller variance (df = 5
2 )
--- Content provided by FirstRanker.com ---
The total value of F at 5% level of significance = 5.79Inference:
--- Content provided by FirstRanker.com ---
Since the calculated value of F is less than the table value of F, we accept the
null-hypothesis and conclude that there is no significant difference in the
--- Content provided by FirstRanker.com ---
performance of the three financial schemes.QUESTIONS
--- Content provided by FirstRanker.com ---
1. Define analysis of variance.
2.
--- Content provided by FirstRanker.com ---
State the assumptions in analysis of variance.3.
Explain the classification of linear m
--- Content provided by FirstRanker.com ---
e
od ls for the sample data.
--- Content provided by FirstRanker.com ---
4.Explain ANOVA Table.
5.
--- Content provided by FirstRanker.com ---
Ex lain
p
--- Content provided by FirstRanker.com ---
how inference is drawn from ANOVA Table.6.
--- Content provided by FirstRanker.com ---
Explain the managerial applications of analysis of variance.
--- Content provided by FirstRanker.com ---
UNIT IV
--- Content provided by FirstRanker.com ---
NS OF EXPE3. DESIG
RIMENTS
--- Content provided by FirstRanker.com ---
Lesson Outline
--- Content provided by FirstRanker.com ---
?Definition of design of experiments
?
--- Content provided by FirstRanker.com ---
Key concepts in the design of experiments
?
--- Content provided by FirstRanker.com ---
Steps in the design of experiments?
Replication, Randomization and Blocking
--- Content provided by FirstRanker.com ---
?
Lay out of an experimental design
--- Content provided by FirstRanker.com ---
?Data Allocation Table
?
--- Content provided by FirstRanker.com ---
Completely Randomized Design
?
--- Content provided by FirstRanker.com ---
ANOVA tablD
e for CR
--- Content provided by FirstRanker.com ---
?
Working rule for an example
--- Content provided by FirstRanker.com ---
?Randomized Block Design
?
--- Content provided by FirstRanker.com ---
ANOVA table for RBD
?
--- Content provided by FirstRanker.com ---
Latin Square Design?
ANOVA table for LSD
--- Content provided by FirstRanker.com ---
?
Managerial applications of experimental designs
--- Content provided by FirstRanker.com ---
Learning ObjectivesAfter reading this lesson you should be able to
--- Content provided by FirstRanker.com ---
- understand the definition of design of experiments- understand the key concepts in the design of experiments
- understand the steps in the design of experiments
und
--- Content provided by FirstRanker.com ---
-
erim
--- Content provided by FirstRanker.com ---
erstand the lay out of an expental design
und
--- Content provided by FirstRanker.com ---
-
le
--- Content provided by FirstRanker.com ---
erstand a data allocation tab- construct ANOVA table for CRD
- draw inference from ANOVA table for CRD
- construct ANOVA tabl
--- Content provided by FirstRanker.com ---
BD
e for R
--- Content provided by FirstRanker.com ---
- draw inference fromor RBD
ANOVA table f
--- Content provided by FirstRanker.com ---
- construct ANOVA table for LSD
- draw inference from ANOVA tabl
LS
--- Content provided by FirstRanker.com ---
e for
-
--- Content provided by FirstRanker.com ---
sunderstand the working rules for solving problem
- understand the managerial applications of experimental designs
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
DESIGN OF EXPERIMENTS
--- Content provided by FirstRanker.com ---
. FUNI
DAMENTALS OF DESIGNS
--- Content provided by FirstRanker.com ---
Introduction
The theory of design of experiments was originally developed for
--- Content provided by FirstRanker.com ---
agriculture. For example, to determine whicm
h fertilizer would give
--- Content provided by FirstRanker.com ---
ore yield of
certain crop, from
--- Content provided by FirstRanker.com ---
aamong a set of fertilizers. Nowadays the design of
experiments finds its application in the area of management also. While carrying
--- Content provided by FirstRanker.com ---
ut research for m
o
--- Content provided by FirstRanker.com ---
anagerial decision making, one may go for descriptiveresearch or experimental research. The advantage of experimental research is
at it can
--- Content provided by FirstRanker.com ---
th
be used to establish the cause-effect relationship between the
--- Content provided by FirstRanker.com ---
n.variables under consideratio
Such a relationship is called a causal
--- Content provided by FirstRanker.com ---
relationship.
An experiment may be carried out with a control group or without a
--- Content provided by FirstRanker.com ---
control group, depending on the resources available and the nature of thesubjects involved in the experiment. The researcher has to select different
subjects, put them into several groups and administer treatments to the subjects
--- Content provided by FirstRanker.com ---
. It would be advisa
within each group
--- Content provided by FirstRanker.com ---
ble to include a control group whereverpossible so as to increase the level of validity of the inference drawn from the
experim
--- Content provided by FirstRanker.com ---
ent.
--- Content provided by FirstRanker.com ---
Definition of the design of experiments
perim
--- Content provided by FirstRanker.com ---
The design of ex
ents is the logical construction of the experiment
--- Content provided by FirstRanker.com ---
h a wwit
in
--- Content provided by FirstRanker.com ---
ell-defined level of uncerta ty involved in the inference drawn.
Key concepts in the design of experiments
--- Content provided by FirstRanker.com ---
The design of experiments centers around the following three keyconcepts:
--- Content provided by FirstRanker.com ---
(1) Treatments
--- Content provided by FirstRanker.com ---
(2) Factors
(3) Levels of a treatment factor
--- Content provided by FirstRanker.com ---
Types of experiments
There are two types of experiments, namely absolute experiment and
--- Content provided by FirstRanker.com ---
comparative experiment. In an absolute experiment, one takes into account the
absolute value of a certain characteristic. As distinct from this, a comparative
--- Content provided by FirstRanker.com ---
experiment seeks to compare the effect of two or more objects on somecharacteristic of the population under examination. For example, one may think
of the following situations:
--- Content provided by FirstRanker.com ---
* Comparison of the effect of different fertilizers on a certain crop
* Comparison of the effect of different medicines on a disease
--- Content provided by FirstRanker.com ---
* Comparison of different marketing straom
tegies for the pr
--- Content provided by FirstRanker.com ---
otion of a product
* Comp
--- Content provided by FirstRanker.com ---
achines inarison of different m
the production of a certain product
--- Content provided by FirstRanker.com ---
* Comparison of different met
mobilization
--- Content provided by FirstRanker.com ---
hods of resource--- Content provided by FirstRanker.com ---
Steps in the Design of Experiments
--- Content provided by FirstRanker.com ---
The design of experiments consists of the following steps:1.
Statement of the objectives
--- Content provided by FirstRanker.com ---
2.
Formulation of the statistical hypotheses
--- Content provided by FirstRanker.com ---
3.Choice of the treatments
4.
--- Content provided by FirstRanker.com ---
Choice of the experimental sites
5.
--- Content provided by FirstRanker.com ---
Replication and levels of variation6.
Choice of the experimental blocks, if necessary
--- Content provided by FirstRanker.com ---
7.
haracteristics of the plots undertaken for the experim
--- Content provided by FirstRanker.com ---
Cents
--- Content provided by FirstRanker.com ---
8.
--- Content provided by FirstRanker.com ---
Assignment of treatments to various units
9.
--- Content provided by FirstRanker.com ---
Recording of data10.
Statistical analysis of data
--- Content provided by FirstRanker.com ---
Basic designs
--- Content provided by FirstRanker.com ---
The following are the basic desiin sta
gns
--- Content provided by FirstRanker.com ---
tistical analysis:
--- Content provided by FirstRanker.com ---
1. Completely Randomized Design (CRD)2. Randomized Block Design (RBD)
--- Content provided by FirstRanker.com ---
n (LSD)
--- Content provided by FirstRanker.com ---
3. Latin Square Desigm
--- Content provided by FirstRanker.com ---
Other designs can also be used for drawing inferences fro experiments.
However, they are quite complex and we shall confine ourselves to the above
--- Content provided by FirstRanker.com ---
three designs.Basic principles
--- Content provided by FirstRanker.com ---
The design of experiments is mainly based on the following three basic
principles:
--- Content provided by FirstRanker.com ---
.
1
--- Content provided by FirstRanker.com ---
Replication
2.
--- Content provided by FirstRanker.com ---
Randomization3.
Blocking or Local Control.
--- Content provided by FirstRanker.com ---
Replication means the repetition of each treatment a certain
number of times. This will help in reducing the effect due to a possible extreme
--- Content provided by FirstRanker.com ---
situation (outlier) arising out of a singlem
treat ent. Thus replication will reduce
--- Content provided by FirstRanker.com ---
the experimental error. Homogeneity is possible only within a replication.
--- Content provided by FirstRanker.com ---
Randomization means
cation of the treatm
--- Content provided by FirstRanker.com ---
alloents to different
units in a random way. i.e., all the units will have equal chance of allotment of
--- Content provided by FirstRanker.com ---
treatments. But, w at
h treatment is a tual
--- Content provided by FirstRanker.com ---
cly allotted to a unit will depend on pure
--- Content provided by FirstRanker.com ---
chance only.
--- Content provided by FirstRanker.com ---
The basic design is Completely Randomized Design (CRD). In thisesign, the first two principles namely
d
--- Content provided by FirstRanker.com ---
replication and randomization are used.
There is no necessity of blocking in CRD, because the entire area of experiment
--- Content provided by FirstRanker.com ---
is assumed to be homogeneous. If it is not so,en it becom
th
--- Content provided by FirstRanker.com ---
es necessary to
us experi
--- Content provided by FirstRanker.com ---
subdivide the non-homogeneomental area into homogeneous sub-
groups such that each subgroup has almost the same level of attribute. The
--- Content provided by FirstRanker.com ---
iding the experim
technique of subdiv
--- Content provided by FirstRanker.com ---
ental area into groups is called as blockingor local control and such subgroups are called as Blocks. The RBD and LSD
RD is not a bock design.
--- Content provided by FirstRanker.com ---
are bock designs. However, C
--- Content provided by FirstRanker.com ---
II. Completely Randomized Design (CRD)This design is useful to compare several treatments in an experiment. For
example, suppose there are three training institutes each offering a distinct
--- Content provided by FirstRanker.com ---
training programm
nd
--- Content provided by FirstRanker.com ---
e to sales persons aa manager wants to know which of the
three training programmes would be highly rewarding for his business
--- Content provided by FirstRanker.com ---
organization. One option for him would be the comparison of the means of the
samples taken two at a time. However, comparison of the sample means may not
--- Content provided by FirstRanker.com ---
yield accurate results when more than two samples are involved in theexperiment. Because of this reason, the m nager m
a
--- Content provided by FirstRanker.com ---
ay opt for a completely
ized design. In this design, all
--- Content provided by FirstRanker.com ---
randomare taken for sim
the samples
--- Content provided by FirstRanker.com ---
ultaneous
co side
--- Content provided by FirstRanker.com ---
nration and they are examined by m
a single s
--- Content provided by FirstRanker.com ---
eans of
tatistical test.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
For the application of this design, the first and foremost condition is thatthe experime
rea shoul
--- Content provided by FirstRanker.com ---
ntal a
d be homogeneous in the particular attribute about
--- Content provided by FirstRanker.com ---
which the experiment is carried out. Forlustra
the purpose of il
--- Content provided by FirstRanker.com ---
tion, we consider
an example with 3 treatments denoted by A, B, C. A lay out is a pictorial
--- Content provided by FirstRanker.com ---
representation of assignment of treatments to various experimeT
ntal areas.
--- Content provided by FirstRanker.com ---
he
--- Content provided by FirstRanker.com ---
ple design has the following lay out.exam
Experimental area
--- Content provided by FirstRanker.com ---
B A B
A A C
--- Content provided by FirstRanker.com ---
C B AData on treatments
--- Content provided by FirstRanker.com ---
Suppose there are 3 treatments A, B, C and each treatment is used a
certain number of times as illustrated in the following example:
--- Content provided by FirstRanker.com ---
TREATMENT
E
--- Content provided by FirstRanker.com ---
NO. OF TIMES TH
TREATMENT IS APPLIED
--- Content provided by FirstRanker.com ---
A 4B 3
C 2
--- Content provided by FirstRanker.com ---
Collect the results on the data arising out of the application of these treatments.
--- Content provided by FirstRanker.com ---
Suppe results
ose th
--- Content provided by FirstRanker.com ---
e attri
on th
--- Content provided by FirstRanker.com ---
bute pertaio treatm
ning t
--- Content provided by FirstRanker.com ---
ent A are 38, 36, 35 and
40. Suppose the results pertaining to treatment B are 26, 30 and 28. Suppose the
--- Content provided by FirstRanker.com ---
results pertaining to treent C ar
atm
--- Content provided by FirstRanker.com ---
e 30 and 28. Using these values, a `Data
Allocation Table' is constructed as follows:
--- Content provided by FirstRanker.com ---
Treatmeata Allocation
nt D
--- Content provided by FirstRanker.com ---
A 38 36 35 30
--- Content provided by FirstRanker.com ---
B 26 30 28
--- Content provided by FirstRanker.com ---
C 30 28
--- Content provided by FirstRanker.com ---
The sums of the values for the 3 treatments are denoted by T1, T2 and T3,
respectively. For the above example data, we obtain
--- Content provided by FirstRanker.com ---
T1 = 38 + 36 + 35 + 30 = 139,T2 = 26 + 30 + 28 = 84 and
T3 = 30 + 8
--- Content provided by FirstRanker.com ---
2 = 58.
--- Content provided by FirstRanker.com ---
Statistical Analysis of CRDAs already mentioned, the experimental units in a CRD are taken in a
single group with the condition that the units forming the group must be
--- Content provided by FirstRanker.com ---
ogeneous as far as po
hom
--- Content provided by FirstRanker.com ---
ssible. Suppose there are k treatments in anexperiment. Let the ith treatment be replicated n times. Then the total number
i
--- Content provided by FirstRanker.com ---
k
of experimental units in the design is n + n + ... + n + ... + n = n = N .
--- Content provided by FirstRanker.com ---
12
i
--- Content provided by FirstRanker.com ---
k
i
--- Content provided by FirstRanker.com ---
i=1The treatments are allocated at random to all the units in the experimental area.
This design provides a one-way classified data with different levels of a single
--- Content provided by FirstRanker.com ---
factor called treatments. The linear model for CRD is defined by the relation
i =1,2,...,k
--- Content provided by FirstRanker.com ---
y = ? + a + eij
--- Content provided by FirstRanker.com ---
i
ij
--- Content provided by FirstRanker.com ---
j = 1, 2,..., ni
where y is the jth observation of the ith treatment,
--- Content provided by FirstRanker.com ---
ij
?
--- Content provided by FirstRanker.com ---
is the general mean effect which is fixed,a is the fixed effect due to ith treatment and
i
--- Content provided by FirstRanker.com ---
e is the random error effect which is distributed normally with zero mean and
--- Content provided by FirstRanker.com ---
ij
constant variance.
--- Content provided by FirstRanker.com ---
Let y = G be the Grand total of all the observations.ij
ij
--- Content provided by FirstRanker.com ---
In y , fix i and vary j. Then the sum gives the ith treatment total, denoted by
ij
--- Content provided by FirstRanker.com ---
T . i.e., y = T (i=1,2,...,k).i
ij
--- Content provided by FirstRanker.com ---
i
j
--- Content provided by FirstRanker.com ---
Apply the ANOVA for one-way classified data and compute the totalsum of squares (TSS) and treatment sum of squares (TrSS) as follows:
2
--- Content provided by FirstRanker.com ---
G
2
--- Content provided by FirstRanker.com ---
TSS = y -= Q
ij
--- Content provided by FirstRanker.com ---
N
ij
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
2
T
--- Content provided by FirstRanker.com ---
Gi
TrSS =
--- Content provided by FirstRanker.com ---
-
= QT
--- Content provided by FirstRanker.com ---
nN
i
--- Content provided by FirstRanker.com ---
i
G2/N is called the correction factor or the correction term.
--- Content provided by FirstRanker.com ---
The error sum of squares (ESS) can be obtained by subtraction. All these valuesare represented in the form of an ANOVA Table provided below.
--- Content provided by FirstRanker.com ---
ANOVA Table for CRD
Degrees of
--- Content provided by FirstRanker.com ---
Source ofSum of Squares
Mean Sum of Variance ratio
--- Content provided by FirstRanker.com ---
Freedom
Variation
--- Content provided by FirstRanker.com ---
(SS)Squares (MSS)
F
--- Content provided by FirstRanker.com ---
(df)
M
--- Content provided by FirstRanker.com ---
2T
T
--- Content provided by FirstRanker.com ---
G
Q
--- Content provided by FirstRanker.com ---
F =:
T
--- Content provided by FirstRanker.com ---
Treatments k?
1
--- Content provided by FirstRanker.com ---
iQ =
-
--- Content provided by FirstRanker.com ---
T
M =
--- Content provided by FirstRanker.com ---
M
--- Content provided by FirstRanker.com ---
T
E
--- Content provided by FirstRanker.com ---
NN
T
--- Content provided by FirstRanker.com ---
-
i
--- Content provided by FirstRanker.com ---
k 1i
Fk 1,
--- Content provided by FirstRanker.com ---
- N -k
Q :
--- Content provided by FirstRanker.com ---
EQ
Error N?
--- Content provided by FirstRanker.com ---
k
E
--- Content provided by FirstRanker.com ---
M =-
--- Content provided by FirstRanker.com ---
E
By subtraction
--- Content provided by FirstRanker.com ---
N - k2
G
--- Content provided by FirstRanker.com ---
Total N?
1
--- Content provided by FirstRanker.com ---
2Q = y -
--- Content provided by FirstRanker.com ---
- -
ij
--- Content provided by FirstRanker.com ---
Nij
--- Content provided by FirstRanker.com ---
Application of ANOVA
Objective of ANOVA:
--- Content provided by FirstRanker.com ---
We apply ANOVA to find out whether there is any significant difference
in the performance of the treatments. We formulate the following null
--- Content provided by FirstRanker.com ---
hypothesis:H0: There is no significant difference in the performance of the
treatments.
--- Content provided by FirstRanker.com ---
The null hypothesis has to be tested against the following alternative
hypothesis:
--- Content provided by FirstRanker.com ---
H1: There is a significant difference in the performance of the treatments.We have to decide whether the null hypothesis has to be accepted or rejected at
a desired level of significance ().
--- Content provided by FirstRanker.com ---
Inference
--- Content provided by FirstRanker.com ---
If the observed value of F is less than the expected value of F, i.e., Fo <F
e, then the null-hypothesis H is accepted for a given level of significance (
--- Content provided by FirstRanker.com ---
)
o
--- Content provided by FirstRanker.com ---
and we conclude that the effects due to various treatments do not differsignificantly.
--- Content provided by FirstRanker.com ---
If the observed value of F is greater than the expected value of F,
i.e., F > F , then the null-hypothesis H is rejected for a given level of
--- Content provided by FirstRanker.com ---
oo
significance ( ) and we conclude that the effects due to various treatments
--- Content provided by FirstRanker.com ---
differ significantly.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Working rule for an example:
--- Content provided by FirstRanker.com ---
We have to consider three quantities G, N and the Correction Factor
--- Content provided by FirstRanker.com ---
(denoted by CF) defined as follows:G = Sum of the values for all the treatments,
--- Content provided by FirstRanker.com ---
N = The sum of the number of times each treatment is applied
--- Content provided by FirstRanker.com ---
The correction factor CF = G2 / N.Let us consider an example of CRD. Suppose there are 3 treatments A, B, C.
Suppose the number of times the treatment is applied is n1 in the case of A, n2 for
--- Content provided by FirstRanker.com ---
B and n3 for C. The sums of the values for the 3 treatments are denoted by T1, T2
and T3. With these notations, we have
--- Content provided by FirstRanker.com ---
N = n1 + n2 + n3,G = T1 +T2 +T3,
CF = G2/N = ( T1 +T2 +T3 )2 / (n1 + n2 + n3).
--- Content provided by FirstRanker.com ---
Define the following quantities:
TSS = Sum of the squares of the observed values ? Correction Factor
--- Content provided by FirstRanker.com ---
T2
2
--- Content provided by FirstRanker.com ---
2
r SS = ( T1
--- Content provided by FirstRanker.com ---
/ n1 + T2 / n2 + T3 / n3 ) ? Correction FactorESS = TSS ? Tr SS
--- Content provided by FirstRanker.com ---
Calculation of the Degrees of Freedom (df):
The df for treatments = No. of treatments ? 1.
--- Content provided by FirstRanker.com ---
The df for the total = Total no. of times all the treatments have been applied ? 1= N ? 1 = n1 + n2 + n3 ? 1.
The df for the Error = (Total no. of times all the treatments have been applied -
--- Content provided by FirstRanker.com ---
No. of treatments) ? 2.
We have the following ANOVA table for this example.
--- Content provided by FirstRanker.com ---
ANOVA Table for CRD
--- Content provided by FirstRanker.com ---
Source of
Degrees of
--- Content provided by FirstRanker.com ---
SS MSS
Variance
--- Content provided by FirstRanker.com ---
ratiovariation
freedom
--- Content provided by FirstRanker.com ---
F
Treatment
--- Content provided by FirstRanker.com ---
3? 1 = 2Tr SS
Tr SS / df =
--- Content provided by FirstRanker.com ---
Tr SS / 2
--- Content provided by FirstRanker.com ---
Error8? 2 = 6
ESS
--- Content provided by FirstRanker.com ---
ESS / df =
--- Content provided by FirstRanker.com ---
ESS / 6Total
9? 1 = 8
--- Content provided by FirstRanker.com ---
TSS
--- Content provided by FirstRanker.com ---
After these steps, carry out the Analysis of Variance and draw the inference.
--- Content provided by FirstRanker.com ---
Problem 1
--- Content provided by FirstRanker.com ---
Examine the CRD with the following Data Allocation Table and determinewhether or not the treatments differ significantly.
--- Content provided by FirstRanker.com ---
Treatment Data Allocation
A 28 36 32 34
--- Content provided by FirstRanker.com ---
B 40 38 36
--- Content provided by FirstRanker.com ---
C 32 34
--- Content provided by FirstRanker.com ---
Solution:The treatments in the design are A, B and C.
We have
--- Content provided by FirstRanker.com ---
n1 = The number of times A is applied = 4,
n2 = The number of times B is applied = 3,
--- Content provided by FirstRanker.com ---
n3 = The number of times C is applied = 2.--- Content provided by FirstRanker.com ---
N = n1 + n2 + n3 = 4 + 3 + 2 = 9.The sums of the values for the 3 treatments are denoted by T1, T2 and T3,
--- Content provided by FirstRanker.com ---
respectively.
--- Content provided by FirstRanker.com ---
For the given data on experimental values, we obtainT1 = 28 + 36 + 32 + 34 = 130,
T2 = 40 + 38 + 36 = 114 and
--- Content provided by FirstRanker.com ---
T3 = 32 + 34 = 66.
G = T1 + T2 + T3 = 130 + 114 + 66 = 310.
--- Content provided by FirstRanker.com ---
The correction factor = G2/N = 3102/9 = 10677.8
y2 ij = 282 + 362 + 322 + 342 + 402 + 382 + 362 + 322 + 342
--- Content provided by FirstRanker.com ---
= 784 + 1296 + 1024 + 1156 + 1600 + 1444 + 1296 + 1024 + 1156
= 10780
--- Content provided by FirstRanker.com ---
(T2i /n i ) = 1302 / 4 + 1142 / 3 + 662 / 2= 16900 / 4 + 12996 / 3 + 4356 / 2 = 4225 + 4332 + 2178 = 10735
--- Content provided by FirstRanker.com ---
The total sum of squares (TSS) and treatment sum of squares (TrSS) are
calculated as follows:
--- Content provided by FirstRanker.com ---
TSS = y2 ij ? CF = 10780 ? 10677.8 = 102.2TrSS = T2i /n i ? CF = 10735 ? 10677.8 = 57.2
ESS = TSS ? TrSS
--- Content provided by FirstRanker.com ---
We apply ANOVA to find out whether there is any significant difference in the
--- Content provided by FirstRanker.com ---
performance of the treatments. We formulate the following null hypothesis:H0: There is no significant difference in the performance of the
treatments.
--- Content provided by FirstRanker.com ---
The null hypothesis has to be tested against the following alternative
--- Content provided by FirstRanker.com ---
hypothesis:
H1: There is a significant difference in the performance of the treatments.
--- Content provided by FirstRanker.com ---
We have to decide whether the null hypothesis has to be accepted or rejected ata desired level of significance ().
ANOVA Table for CRD
--- Content provided by FirstRanker.com ---
Source of
Degrees of
--- Content provided by FirstRanker.com ---
SSMSS = SS/DF
Variance ratio
--- Content provided by FirstRanker.com ---
variation
freedom
--- Content provided by FirstRanker.com ---
FTreatment
3? 1 = 2
--- Content provided by FirstRanker.com ---
57.2
57.2 / 2 = 28.6
--- Content provided by FirstRanker.com ---
28.6 / 7.5 = 3.81Error
8? 2 = 6
--- Content provided by FirstRanker.com ---
45.0
45 / 6 = 7.5
--- Content provided by FirstRanker.com ---
Total
9? 1 = 8
--- Content provided by FirstRanker.com ---
102.2
--- Content provided by FirstRanker.com ---
In the table, first enter the values of SS for `Total' and `Treatment'. From Total,
--- Content provided by FirstRanker.com ---
subtract Treatment to obtain SS for `Error'.
i.e., ESS = TSS ? TrSS = 102.2 ? 57.2 = 45.0
--- Content provided by FirstRanker.com ---
Calculation of F value: F = Greater variance / Smaller variance = 28.6 / 7.5
= 3.81
--- Content provided by FirstRanker.com ---
Degrees of freedom for greater variance (df1) = 2
Degrees of freedom for smaller variance (df2) = 6
--- Content provided by FirstRanker.com ---
Table value of F at 5% level of significance = 5.14Inference:
--- Content provided by FirstRanker.com ---
Since the calculated value of F is less than the table value of F, the null
hypothesis is accepted and it is concluded that there is no significant difference
--- Content provided by FirstRanker.com ---
in the treatments A, B and C, at 5% level of significance.III. Randomized Block Design (RBD)
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
In CRD, note that the site is not split into blocks. An improvement of
CRD can be obtained by providing the blocking (local control) measure in the
--- Content provided by FirstRanker.com ---
experimental design. One such design is Randomized Block Design (RBD). In ablock design, the site is split into different blocks such that each block is
homogeneous in itself, with respect to the particular attribute under experiment.
--- Content provided by FirstRanker.com ---
The result from a RBD will be better than that from a CRD. While we use one-
way ANOVA in CRD, we use two-way ANOVA in RBD.
--- Content provided by FirstRanker.com ---
Example of the lay out of RBD:Experimental area
--- Content provided by FirstRanker.com ---
Treatment Block 1
Block 2
--- Content provided by FirstRanker.com ---
Block 3A 19 16 17
B 16 17 20
--- Content provided by FirstRanker.com ---
C 23 24 22
--- Content provided by FirstRanker.com ---
This is an example of a RBD with 3 treatments and 3 blocks.--- Content provided by FirstRanker.com ---
Statistical Analysis of RBD
--- Content provided by FirstRanker.com ---
Suppose there are k treatments each replicated r times. Then the totalnumber of experimental units is rk. These units are rearranged into r groups
(Blocks) of size k. The local control measure is adopted in this design in order to
--- Content provided by FirstRanker.com ---
make the units of each group to be homogeneous. The group units in these
blocks are known as plots or cells. The k treatments are allocated at random in
--- Content provided by FirstRanker.com ---
the k plots of each of the blocks selected randomly one by one. This type of--- Content provided by FirstRanker.com ---
homogeneous grouping of experimental units and random allocation oftreatments to randomly selected blocks are two main features of RBD.
The technique of ANOVA for two-way classified data is applicable to an
--- Content provided by FirstRanker.com ---
experiment with RBD lay out. The data collected from the experiment is
classified according to the levels of two factors namely treatments and blocks.
--- Content provided by FirstRanker.com ---
The linear model for RBD is defined by the relationi =1, 2,..., k
y = ? + a + b + e
--- Content provided by FirstRanker.com ---
ij
--- Content provided by FirstRanker.com ---
ij
ij
--- Content provided by FirstRanker.com ---
j =1, 2,..., r
where y is the observation corresponding to ith treatment and jth block,
--- Content provided by FirstRanker.com ---
ij?
is the general mean effect which is fixed,
--- Content provided by FirstRanker.com ---
a is the fixed effect due to ith treatment,
i
--- Content provided by FirstRanker.com ---
b is the fixed effect due to jth block andj
e is the random error effect which is distributed normally with zero mean and
--- Content provided by FirstRanker.com ---
ij
constant variance.
--- Content provided by FirstRanker.com ---
Applying the method of ANOVA for two-way classified data, the sum ofsquares due to treatments, blocks and error can be obtained.
Let y = G be the Grand total of all the rk observations.
--- Content provided by FirstRanker.com ---
ij
ij
--- Content provided by FirstRanker.com ---
In y , fix i and vary j. Then the sum gives the ith treatment total, denoted byij
T . i.e., y = T (i=1,2,...,k).
--- Content provided by FirstRanker.com ---
i
ij
--- Content provided by FirstRanker.com ---
ij
In y , fix j and vary i. Then the sum gives the jth block total, denoted by
--- Content provided by FirstRanker.com ---
ij
Bj . i.e., y = Bj (j=1,2,...,r).
--- Content provided by FirstRanker.com ---
ijj
2
--- Content provided by FirstRanker.com ---
G
e
--- Content provided by FirstRanker.com ---
W takeas the correction factor. The number of treatments is k and the
rk
--- Content provided by FirstRanker.com ---
numb
s
--- Content provided by FirstRanker.com ---
er of blocks is r. Various sum of squares are computed as follows.--- Content provided by FirstRanker.com ---
2G
2
--- Content provided by FirstRanker.com ---
TSS = y -
= Q
--- Content provided by FirstRanker.com ---
ijrk
ij
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
TG
i
--- Content provided by FirstRanker.com ---
TrSS =
-
--- Content provided by FirstRanker.com ---
= Q ,T
r
--- Content provided by FirstRanker.com ---
rk
i
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
Bj
G
--- Content provided by FirstRanker.com ---
BSS =-
= Q ,
--- Content provided by FirstRanker.com ---
B
k
--- Content provided by FirstRanker.com ---
rkj
ESS = Q - Q - Q = Q
--- Content provided by FirstRanker.com ---
T
B
--- Content provided by FirstRanker.com ---
EAll these values are represented in the form of an ANOVA table provided
--- Content provided by FirstRanker.com ---
below.
ANOVA Table for RBD
--- Content provided by FirstRanker.com ---
VarianceSource of
Degrees of Sum of Squares Mean Sum of
--- Content provided by FirstRanker.com ---
ratio
Variation
--- Content provided by FirstRanker.com ---
Freedom(SS)
Squares (MSS)
--- Content provided by FirstRanker.com ---
F
MT
--- Content provided by FirstRanker.com ---
22
F =
--- Content provided by FirstRanker.com ---
:
T
--- Content provided by FirstRanker.com ---
GQ
T
--- Content provided by FirstRanker.com ---
i
Q =
--- Content provided by FirstRanker.com ---
-T
--- Content provided by FirstRanker.com ---
M =
--- Content provided by FirstRanker.com ---
MTreatments
k ? 1
--- Content provided by FirstRanker.com ---
E
--- Content provided by FirstRanker.com ---
Tr
rk
--- Content provided by FirstRanker.com ---
T
k -1
--- Content provided by FirstRanker.com ---
iFk 1,
- (k 1
--- Content provided by FirstRanker.com ---
- )(r 1
- )
--- Content provided by FirstRanker.com ---
22
B
--- Content provided by FirstRanker.com ---
G
M
--- Content provided by FirstRanker.com ---
Blocksr ? 1
j
--- Content provided by FirstRanker.com ---
Q =
-
--- Content provided by FirstRanker.com ---
B
F =
--- Content provided by FirstRanker.com ---
:
B
--- Content provided by FirstRanker.com ---
QB
k
--- Content provided by FirstRanker.com ---
rk
B
--- Content provided by FirstRanker.com ---
=M
j
--- Content provided by FirstRanker.com ---
M
--- Content provided by FirstRanker.com ---
B
E
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
r -1F
--- Content provided by FirstRanker.com ---
r 1,
- (k 1
--- Content provided by FirstRanker.com ---
- )(r 1- )
(k ? 1)(r ?
--- Content provided by FirstRanker.com ---
Q :
E
--- Content provided by FirstRanker.com ---
QError
E
--- Content provided by FirstRanker.com ---
M =
--- Content provided by FirstRanker.com ---
E1)
--- Content provided by FirstRanker.com ---
-
-
--- Content provided by FirstRanker.com ---
By subtraction(k 1)(r 1)
2
--- Content provided by FirstRanker.com ---
G
Total
--- Content provided by FirstRanker.com ---
(rk ? 1)2
Q = y -
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
ij
rk
--- Content provided by FirstRanker.com ---
ij
--- Content provided by FirstRanker.com ---
We have to find out whether there is any significant difference in theperformance of the treatments. Also we can determine whether there is any
--- Content provided by FirstRanker.com ---
significant difference in the performance of different blocks. We formulate the
following two null hypotheses:
--- Content provided by FirstRanker.com ---
Null hypothesis-1
H01: There is no significant difference in the performance of the treatments.
--- Content provided by FirstRanker.com ---
Null hypothesis-2
H02: There is no significant difference in the performance of the blocks.
--- Content provided by FirstRanker.com ---
Each null hypothesis has to be tested against the alternative hypothesis. Even
--- Content provided by FirstRanker.com ---
though there are two null hypotheses, the important one is the null hypothesison the treatments. We have to decide whether to accept or reject the null
hypothesis on the treatments at a desired level of significance ().
--- Content provided by FirstRanker.com ---
Inference
If the observed value of F is less than the expected value of F, i.e., Fo <
--- Content provided by FirstRanker.com ---
Fe, then the null-hypothesis H is accepted for a given level of significance (
)
--- Content provided by FirstRanker.com ---
o
and we conclude that the effects due to various treatments do not differ
--- Content provided by FirstRanker.com ---
significantly.If the observed value of F is greater than the expected value of F, i.e.,
--- Content provided by FirstRanker.com ---
F > F then the null-hypothesis H is rejected for a given level of significance
o
--- Content provided by FirstRanker.com ---
o( ) and we conclude that the effects due to various treatments differ
significantly.
--- Content provided by FirstRanker.com ---
Similarly, the blocks' effects may also be tested, if necessary.
Working rule for an example:
--- Content provided by FirstRanker.com ---
Consider the following example:
Treatment Block 1
--- Content provided by FirstRanker.com ---
Block 2
Block 3
--- Content provided by FirstRanker.com ---
Block 4A
72 68 70 56
--- Content provided by FirstRanker.com ---
B
--- Content provided by FirstRanker.com ---
55 60 62 55
C
--- Content provided by FirstRanker.com ---
65 70 70 60In this case, we have
--- Content provided by FirstRanker.com ---
T1 = 72 + 68 + 70 + 56 = 266,
T2 = 55 + 60 + 62 + 55 = 232,
--- Content provided by FirstRanker.com ---
T3 = 65 + 70 + 70 + 60 = 265,T1 + T2 + T3 = 266 + 232 + 265 = 763.
B1 = 72 + 55 + 65 = 192,
--- Content provided by FirstRanker.com ---
B2 = 68 + 60 + 70 = 198,
B3 = 70 + 62 + 70 = 202,
--- Content provided by FirstRanker.com ---
B4 = 56 + 55 + 60 = 171,B1 + B2 + B3 + B4 = 192 + 198 + 202 + 171 = 763.
For easy reference, let us take the number of treatments as t and the number of
--- Content provided by FirstRanker.com ---
blocks as b. Then we have t = 3 and b = 4.
Calculate Tr SS and BSS as follows:
--- Content provided by FirstRanker.com ---
T2
2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
r SS = ( T1/ b + T2 / b +T3 / b + T4 / b ) ? Correction Factor
BSS = ( B 2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
21
/ t + B2 / t + B3 / + B3 / t ) ? Correction Factor
--- Content provided by FirstRanker.com ---
After these steps, carry out the Analysis of Variance and draw the inference.
--- Content provided by FirstRanker.com ---
Problem 2Analyse the following RBD and determine whether or not the treatments differ
significantly.
--- Content provided by FirstRanker.com ---
Experimental area
Treatment Block 1
--- Content provided by FirstRanker.com ---
Block 2Block 3
A 9 5 7
--- Content provided by FirstRanker.com ---
B 6 8 5
--- Content provided by FirstRanker.com ---
C 4 5 8
--- Content provided by FirstRanker.com ---
Solution:The treatments in the design are A, B and C. There are 3 blocks namely, Block
1, Block 2 and Block 3.
--- Content provided by FirstRanker.com ---
We have
n1 = the number of times A is applied = 3,
--- Content provided by FirstRanker.com ---
n2 = the number of times B is applied = 3,n3 = the number of times C is applied = 3.
N = n1 + n2 + n3 = 3 + 3 + 3 = 9.
--- Content provided by FirstRanker.com ---
The sums of the values for the 3 treatments are denoted by T1, T2 and T3,
respectively.
--- Content provided by FirstRanker.com ---
For the given data on experimental values, we obtainT1 = 9 + 5 + 7 = 21,
T2 = 6 + 8 + 5 = 19,
--- Content provided by FirstRanker.com ---
T3 = 4 + 5 + 8 = 17,
T1 + T2 + T3 = 21 + 19 + 17 = 57.
--- Content provided by FirstRanker.com ---
B1 = 9 + 6 + 4 = 19,B2 = 5 + 8 + 5 = 18,
B3 = 7 + 5 + 8 = 20,
--- Content provided by FirstRanker.com ---
B1 +B2 +B3 = 19 + 18 + 20 = 57.
G = T1 + T2 + T3 = 57.
--- Content provided by FirstRanker.com ---
The correction factor = G2/N = 572 / 9 = 3249 / 9 = 361y2 ij = 92 + 52 + 72 + 62 + 82 + 52 + 42 + 52 + 82
= 81 + 25 + 49 + 36 + 64 + 25 + 16 + 25 + 64 = 385
--- Content provided by FirstRanker.com ---
No. of blocks = b = 3
No. of treatments = t = 3
--- Content provided by FirstRanker.com ---
( T2i / b ) = 212 / 3 + 192 / 3 + 172 / 3--- Content provided by FirstRanker.com ---
= 441 / 3 + 361 / 3 + 289 / 3 = 147 + 120.3 + 96.3 = 363.6( B2j / t ) = 192 / 3 + 182 / 3 + 202 / 3
= 361 / 3 + 324 / 3 + 400 / 3 = 120.3 + 108 + 13.3 = 361.6
--- Content provided by FirstRanker.com ---
The total sum of squares (TSS), treatment sum of squares (TrSS) and block sum
of squares (BSS) are calculated as follows:
--- Content provided by FirstRanker.com ---
TSS = y2 ij ? CF = 385 ? 361 = 24TrSS = (T2i /b) ? CF = 363.6 ? 361 = 2.6
--- Content provided by FirstRanker.com ---
BSS = (B2j /t) ? CF = 361.6 ? 361 = 0.6
--- Content provided by FirstRanker.com ---
ESS = TSS ? TrSS ? BSS = 24 ? 2.6 ? 0.6 = 24 ? 3.2 = 20.8We apply ANOVA to find out whether there is any significant difference
--- Content provided by FirstRanker.com ---
in the performance of the treatments. We formulate the following null
hypothesis:
--- Content provided by FirstRanker.com ---
H0: There is no significant difference in the performance of thetreatments.
The null hypothesis has to be tested against the following alternative
--- Content provided by FirstRanker.com ---
hypothesis:
H1: There is a significant difference in the performance of the treatments.
--- Content provided by FirstRanker.com ---
We have to decide whether the null hypothesis has to be accepted or rejected ata desired level of significance ().
ANOVA Table for RBD
--- Content provided by FirstRanker.com ---
Source of
Degrees of
--- Content provided by FirstRanker.com ---
SSMSS = SS/DF
Variance ratio
--- Content provided by FirstRanker.com ---
variation
freedom
--- Content provided by FirstRanker.com ---
FTreatment
3? 1 = 2
--- Content provided by FirstRanker.com ---
2.6
2.6 / 2 = 1.3
--- Content provided by FirstRanker.com ---
5.2 / 1.3 = 4.0Block
3? 1 = 2
--- Content provided by FirstRanker.com ---
0.6
0.6 / 2 = 0.3
--- Content provided by FirstRanker.com ---
5.2 / 0.3 = 17.3--- Content provided by FirstRanker.com ---
Error8? 4 = 4
20.8
--- Content provided by FirstRanker.com ---
20.8 / 4 = 5.2
--- Content provided by FirstRanker.com ---
Total9? 1 = 8
24.0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
In the table, first enter the values of SS for `Total', `Treatment' and `Block'.
From Total, subtract (Treatment + Block) to obtain SS for `Error'.
--- Content provided by FirstRanker.com ---
i.e., ESS = 24.0 - 3.2 = 20.8
Calculation of F value: We consider `Treatment'.
--- Content provided by FirstRanker.com ---
F = Greater variance / Smaller variance = 5.2 / 1.3 = 4Degrees of freedom for greater variance (df1) = 4
Degrees of freedom for smaller variance (df2) = 2
--- Content provided by FirstRanker.com ---
Table value of F at 5% level of significance = 19.25
--- Content provided by FirstRanker.com ---
Inference:Since the calculated value of F for the treatments is less than the table
value of F, the null hypothesis is accepted and it is concluded that there is no
--- Content provided by FirstRanker.com ---
significant difference in the treatments A, B and C at 5% level of significance.
Note: If required, by using the same table, we can also test whether there is any
--- Content provided by FirstRanker.com ---
significant difference in the blocks, at 5% level of significance.--- Content provided by FirstRanker.com ---
IV. Latin Square Design (LSD)
--- Content provided by FirstRanker.com ---
It was pointed out earlier that RBD is an improvement of CRD, sinceRBD provides an error control measure for the elimination of block variation. In
RBD, the source of variation is eliminated in only one direction, namely block
--- Content provided by FirstRanker.com ---
wise. This idea can be further generalized to improve RBD by eliminating more
sources of variation. One such design with a provision for elimination of two
--- Content provided by FirstRanker.com ---
sources of variation is `Latin Square Design'. The result from LSD will be better
--- Content provided by FirstRanker.com ---
than that from a RBD.Suppose there are n treatments each replicated n times. Then the total
--- Content provided by FirstRanker.com ---
number of experimental units is
2
--- Content provided by FirstRanker.com ---
n ? n = n . Let p ? q denote the factors whosevariations are to be eliminated from the experimental error. Then both the
factors P and Q should be related to the variable under study. In that case, these
--- Content provided by FirstRanker.com ---
two factors are control factors of variation.
--- Content provided by FirstRanker.com ---
Therefore, the total number of level combinations of the two factors
--- Content provided by FirstRanker.com ---
is
2
--- Content provided by FirstRanker.com ---
n ? n = n . Now the experimental units are so chosen that each unit containsb
different level com inations of these two factors. Further the 2
--- Content provided by FirstRanker.com ---
n experimental
units are arranged in the form of an n? n array so that there are n rows and n
--- Content provided by FirstRanker.com ---
columns of the 2n units. Then each unit belongs to different row-column
combination. i.e., the two factors P and Q become the rows and columns of the
--- Content provided by FirstRanker.com ---
design. Though it is not necessary that the two factors P and Q should always be
called as rows and columns, it has become a convention to define LSD by means
--- Content provided by FirstRanker.com ---
of two factors, namely rows and columns.--- Content provided by FirstRanker.com ---
After the experimental units are obtained, the n treatments are
--- Content provided by FirstRanker.com ---
allocated to the 2n units such that each treatment occurs once and only once in
each row and each column. This ensures that each treatment is replicated n
--- Content provided by FirstRanker.com ---
times. If a two-way table is formed with the levels of the factor P (rows) and the
levels of the factor Q (columns), then the n treatments should be allocated to the
--- Content provided by FirstRanker.com ---
2n units such that each treatment occurs once and only once in each level of the
factor P and each level of the factor Q. Such an arrangement is called a Latin
--- Content provided by FirstRanker.com ---
Square Design of order n? n .
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Example of lay out of LSDExample 1:
--- Content provided by FirstRanker.com ---
Experimental area
A B C
--- Content provided by FirstRanker.com ---
B C AC A B
--- Content provided by FirstRanker.com ---
In this design, the first row consists of the experiments A, B, C, in this
order. The second row is got by a cyclic permutation of the first row elements.
--- Content provided by FirstRanker.com ---
The third row is got by a cyclic permutation of the second row elements.Example 2:
Experimental area
--- Content provided by FirstRanker.com ---
A B C
C A B
--- Content provided by FirstRanker.com ---
B C AIn this design, the first row consists of the experiments A, B, C in this
--- Content provided by FirstRanker.com ---
order. The third row is got by a cyclic permutation of the first row elements. The
second row is got by a cyclic permutation of the third row elements.
--- Content provided by FirstRanker.com ---
Example 3:
--- Content provided by FirstRanker.com ---
Experimental area
A B C D
--- Content provided by FirstRanker.com ---
B C D AC D A B
--- Content provided by FirstRanker.com ---
D A B C
--- Content provided by FirstRanker.com ---
In this design, the first row consists of the experiments A, B, C, D in this
order. The second row is got by a cyclic permutation of the first row elements.
--- Content provided by FirstRanker.com ---
The third row is got by a cyclic permutation of the second row elements. Thefourth row is got by a cyclic permutation of the third row elements.
--- Content provided by FirstRanker.com ---
Example 4:
--- Content provided by FirstRanker.com ---
Suppose there are 5 treatments denoted by A, B, C, D, E. Then thefollowing arrangement of the treatments is a Latin Square Design of order 5? 5 .
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Factor Q (Column)Column
Q
--- Content provided by FirstRanker.com ---
Q
Q
--- Content provided by FirstRanker.com ---
Q2
3
--- Content provided by FirstRanker.com ---
4
5
--- Content provided by FirstRanker.com ---
QRow .
1
--- Content provided by FirstRanker.com ---
P
1
--- Content provided by FirstRanker.com ---
A B C D EP
2
--- Content provided by FirstRanker.com ---
B C D E A
P
--- Content provided by FirstRanker.com ---
3C D E A B
Factor P
--- Content provided by FirstRanker.com ---
P
4
--- Content provided by FirstRanker.com ---
D E A B CP
5
--- Content provided by FirstRanker.com ---
E A B C D
--- Content provided by FirstRanker.com ---
Note that every treatment appears in each row and column exactly once.In the lay out of LSD, apart from indicating the treatment, the
--- Content provided by FirstRanker.com ---
experimental value also has to be mentioned in each cell.
--- Content provided by FirstRanker.com ---
Statistical Analysis of LSD
--- Content provided by FirstRanker.com ---
In LSD, we have to consider three factors namely rows, columns and
--- Content provided by FirstRanker.com ---
treatments. Therefore, the data collected from this design must be analyzed as a
three-way classified data. For this purpose, actually there must be 3
--- Content provided by FirstRanker.com ---
nobservations, since there are three factors each with n-levels. However, because
of the particular allocation of the treatment to each cell, there is only one
--- Content provided by FirstRanker.com ---
observation per cell, instead of n-observations per cell, according to a three-way
classified data. Consequently, there is no interaction between any of the factors
--- Content provided by FirstRanker.com ---
namely rows, columns and treatments. Hence the appropriate linear model forLSD is defined by the relation
--- Content provided by FirstRanker.com ---
y
= ? + r + c + t + e (i, j,k =1,2,...,n
--- Content provided by FirstRanker.com ---
ijki
j
--- Content provided by FirstRanker.com ---
k
ijk
--- Content provided by FirstRanker.com ---
)where y is the general observation corresponding to ith row, jth column and kth
ijk
--- Content provided by FirstRanker.com ---
treatment,
?
--- Content provided by FirstRanker.com ---
is the general mean effect which is fixed,r is the fixed effect due to ith row,
i
--- Content provided by FirstRanker.com ---
c is the fixed effect due to jth column,
j
--- Content provided by FirstRanker.com ---
t is the fixed effect due to kth treatment andk
e is the random error effect which is distributed normally with zero mean and
--- Content provided by FirstRanker.com ---
ijk
constant variance.
--- Content provided by FirstRanker.com ---
Application of ANOVA:
--- Content provided by FirstRanker.com ---
The analysis here is similar to the analysis of two-way classified data.
--- Content provided by FirstRanker.com ---
First of all, the data is arranged in a row-column table. Let y denoteij
the observation corresponding to ith row and jth column in the table.
--- Content provided by FirstRanker.com ---
In y , fix i and vary j. Then the sum gives the ith row total, denoted by Ri .
--- Content provided by FirstRanker.com ---
ij
i.e., y = R (i=1,2,...,n)
--- Content provided by FirstRanker.com ---
.
ij
--- Content provided by FirstRanker.com ---
i
j
--- Content provided by FirstRanker.com ---
In y , fix j and vary i. Then the sum gives the jth column total, denoted byij
Cj . i.e., y = Cj (j=1,2,...n).
--- Content provided by FirstRanker.com ---
ij
j
--- Content provided by FirstRanker.com ---
Let--- Content provided by FirstRanker.com ---
T = kth treatment total (k=1,2,...,n).
k
--- Content provided by FirstRanker.com ---
We have R = C = T = G which is the Grand total of all thei
j
--- Content provided by FirstRanker.com ---
k
i
--- Content provided by FirstRanker.com ---
jk
2
--- Content provided by FirstRanker.com ---
G
2
--- Content provided by FirstRanker.com ---
n observations. The correction factor CF is defined by CF =where
N
--- Content provided by FirstRanker.com ---
2
N = n is the total number of observations. We have
--- Content provided by FirstRanker.com ---
y = G .
ij
--- Content provided by FirstRanker.com ---
ij
Various sums of squares are computed through the CF as follows:
--- Content provided by FirstRanker.com ---
2G
2
--- Content provided by FirstRanker.com ---
2
TSS = y -
--- Content provided by FirstRanker.com ---
which has (n -1) dFij
2
--- Content provided by FirstRanker.com ---
n
ij
--- Content provided by FirstRanker.com ---
22
R
--- Content provided by FirstRanker.com ---
G
i
--- Content provided by FirstRanker.com ---
RSS =-
which has (n -1) dF
--- Content provided by FirstRanker.com ---
2
n
--- Content provided by FirstRanker.com ---
ni
2
--- Content provided by FirstRanker.com ---
2
C j
--- Content provided by FirstRanker.com ---
GCSS =
-
--- Content provided by FirstRanker.com ---
which has (n -1) dF
2
--- Content provided by FirstRanker.com ---
nn
j
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
TG
k
--- Content provided by FirstRanker.com ---
TSS =
-
--- Content provided by FirstRanker.com ---
which has (n -1) dF2
n
--- Content provided by FirstRanker.com ---
n
k
--- Content provided by FirstRanker.com ---
ESS = TSS - RSS - CSS - TrSS--- Content provided by FirstRanker.com ---
which has (n-1)(n-2) dF.
--- Content provided by FirstRanker.com ---
All these values are represented in the form of an ANOVA Table below.
--- Content provided by FirstRanker.com ---
ANOVA Table for n? n Latin Square Design
--- Content provided by FirstRanker.com ---
Degrees
Variance
--- Content provided by FirstRanker.com ---
Source ofSum of Squares
Mean Sum of
--- Content provided by FirstRanker.com ---
of
ratio
--- Content provided by FirstRanker.com ---
Variation(SS)
Squares (MSS)
--- Content provided by FirstRanker.com ---
Freedom
F
--- Content provided by FirstRanker.com ---
M R2
2
--- Content provided by FirstRanker.com ---
F =
:
--- Content provided by FirstRanker.com ---
RG
Q
--- Content provided by FirstRanker.com ---
R
Rows (n-1)
--- Content provided by FirstRanker.com ---
iQ =
-
--- Content provided by FirstRanker.com ---
R
M =
--- Content provided by FirstRanker.com ---
M
--- Content provided by FirstRanker.com ---
E
--- Content provided by FirstRanker.com ---
R2
n
--- Content provided by FirstRanker.com ---
n
R
--- Content provided by FirstRanker.com ---
n -1i
Fn 1,
--- Content provided by FirstRanker.com ---
- (n 1
- )(n-2)
--- Content provided by FirstRanker.com ---
MColumns
(n-1)
--- Content provided by FirstRanker.com ---
C
2
--- Content provided by FirstRanker.com ---
2F
C
--- Content provided by FirstRanker.com ---
=
:
--- Content provided by FirstRanker.com ---
GQ
C
--- Content provided by FirstRanker.com ---
j
Q =
--- Content provided by FirstRanker.com ---
-c
M =
--- Content provided by FirstRanker.com ---
M E
--- Content provided by FirstRanker.com ---
C
2
--- Content provided by FirstRanker.com ---
c
--- Content provided by FirstRanker.com ---
n
n
--- Content provided by FirstRanker.com ---
n -1
j
--- Content provided by FirstRanker.com ---
Fn 1,- (n 1
- )(n-2)
--- Content provided by FirstRanker.com ---
M
Treatments
--- Content provided by FirstRanker.com ---
(n-1)T
2
--- Content provided by FirstRanker.com ---
2
F =
--- Content provided by FirstRanker.com ---
:T
G
--- Content provided by FirstRanker.com ---
Q
T
--- Content provided by FirstRanker.com ---
kQ =
-
--- Content provided by FirstRanker.com ---
T
M =
--- Content provided by FirstRanker.com ---
M E
--- Content provided by FirstRanker.com ---
T
2
--- Content provided by FirstRanker.com ---
T--- Content provided by FirstRanker.com ---
n
n
--- Content provided by FirstRanker.com ---
n -1k
Fn 1,
--- Content provided by FirstRanker.com ---
- (n 1
- )(n-2)
--- Content provided by FirstRanker.com ---
Q :E
Q
--- Content provided by FirstRanker.com ---
Error (n-1)
(n-2)
--- Content provided by FirstRanker.com ---
EM =
--- Content provided by FirstRanker.com ---
E
-
--- Content provided by FirstRanker.com ---
-By subtraction
--- Content provided by FirstRanker.com ---
(n 1)(n 2)
2
--- Content provided by FirstRanker.com ---
GTotal
2
--- Content provided by FirstRanker.com ---
(n - )
2
--- Content provided by FirstRanker.com ---
1Q = y -
ij
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2n
ij
--- Content provided by FirstRanker.com ---
The following hypotheses are formed:
--- Content provided by FirstRanker.com ---
Null hypothesis-1H01: There is no significant difference in the performance of the
treatments.
--- Content provided by FirstRanker.com ---
Null hypothesis-2
--- Content provided by FirstRanker.com ---
H02: There is no significant difference in the performance of the rows.--- Content provided by FirstRanker.com ---
Null hypothesis-3
H03: There is no significant difference in the performance of the
--- Content provided by FirstRanker.com ---
columns.
--- Content provided by FirstRanker.com ---
Each null hypothesis has to be tested against the alternative hypothesis.Even though there are three null hypotheses, the important one is the null
hypothesis on the treatments. We have to decide whether to accept or reject the
--- Content provided by FirstRanker.com ---
null hypothesis on the treatments at a desired level of significance ().
--- Content provided by FirstRanker.com ---
InferenceIf the observed value of F is less than the expected value of F, i.e., Fo
--- Content provided by FirstRanker.com ---
< Fe, for a given level of significance , then the null hypothesis of equal
treatment effect is accepted. Otherwise, it is rejected.
--- Content provided by FirstRanker.com ---
Problem 3
Examine the following experimental values on the output due to four
--- Content provided by FirstRanker.com ---
different training methods A, B, C and D for sales persons and find out whether
there is any significant difference in the training methods.
--- Content provided by FirstRanker.com ---
AB
C
--- Content provided by FirstRanker.com ---
D
28
--- Content provided by FirstRanker.com ---
2032
28
--- Content provided by FirstRanker.com ---
B
C
--- Content provided by FirstRanker.com ---
DA
36
--- Content provided by FirstRanker.com ---
30
28
--- Content provided by FirstRanker.com ---
20C
D
--- Content provided by FirstRanker.com ---
A
B
--- Content provided by FirstRanker.com ---
2530
22
--- Content provided by FirstRanker.com ---
35
D
--- Content provided by FirstRanker.com ---
AB
C
--- Content provided by FirstRanker.com ---
30
26
--- Content provided by FirstRanker.com ---
3628
--- Content provided by FirstRanker.com ---
Solution:
--- Content provided by FirstRanker.com ---
In this design, there are 4 treatments A, B, C and D. In the lay out of the
design, each treatment appears exactly once in each row as well as each column.
--- Content provided by FirstRanker.com ---
Therefore this design is LSD. The name of the treatment and the observed valueunder that treatment are specified together in each cell.
R1 = first row elements = 28 + 20 + 32 + 28 = 108
--- Content provided by FirstRanker.com ---
R2 = second row elements = 36 + 30 + 28 + 20 = 114
R3 = third row elements = 25 + 30 + 22 + 35 = 112
--- Content provided by FirstRanker.com ---
R4 = fourth row elements = 30 + 26 + 36 + 28 = 120C1 = first column elements = 28 + 36 + 25 + 30 = 119
C2 = second column elements = 20 + 30 + 30 + 26 = 106
--- Content provided by FirstRanker.com ---
C3 = third column elements = 32 + 28 + 22 + 36 = 118
C4 = fourth column elements = 28 + 20 + 35 + 28 = 111
--- Content provided by FirstRanker.com ---
From the given table, rewrite the experimental values for each treatment
separately as follows:
--- Content provided by FirstRanker.com ---
Treatment
A
--- Content provided by FirstRanker.com ---
BC
D
--- Content provided by FirstRanker.com ---
28
20
--- Content provided by FirstRanker.com ---
3228
20
--- Content provided by FirstRanker.com ---
36
30
--- Content provided by FirstRanker.com ---
2822
35
--- Content provided by FirstRanker.com ---
25
30
--- Content provided by FirstRanker.com ---
2636
28
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
T1 = A = 28 + 20 + 22 + 26 = 96T2 = B = 20 + 36 + 35 + 36 = 127
T3 = C = 32 + 30 + 25 + 28 = 115
--- Content provided by FirstRanker.com ---
T4 = D =
0
--- Content provided by FirstRanker.com ---
28 + 28 + 31
+ 30 = 1 6
--- Content provided by FirstRanker.com ---
G = T1
--- Content provided by FirstRanker.com ---
2
+ T + T3 + T3
--- Content provided by FirstRanker.com ---
+= 96
1
--- Content provided by FirstRanker.com ---
127 + 1 5 + 116 = 454
n = No. of treatments = 4
--- Content provided by FirstRanker.com ---
N = n2 = 16Correction Factor = G2/N = 4542 / 16 = 206116 / 16 = 12882.25
--- Content provided by FirstRanker.com ---
The total sum of squares (TSS), Row sum of squares (RSS), Column sum of
--- Content provided by FirstRanker.com ---
squares (CSS) and Treatment sum of squares (TrSS) are calculated as follows:TSS = y2 ij ? Correction Factor
RSS = ( R 2
--- Content provided by FirstRanker.com ---
i
/ n ) ? Correction Factor
--- Content provided by FirstRanker.com ---
CSS = (C 2j
/ nj ) ? Correction Factor
--- Content provided by FirstRanker.com ---
TrSS = (T2k /n) ? Correction Factor
y2 ij =282 +202+322+282+362+302 +282 +202 +252 +302 +222 +352 +302 +262
--- Content provided by FirstRanker.com ---
+362+282=784+400+1024+784+1296+900+784+400+625+900+484+1225+900+676+12
--- Content provided by FirstRanker.com ---
96+784
=13262
--- Content provided by FirstRanker.com ---
TSS = y2 ij ? CF = 13262 ? 12882.25 = 379.75RSS = R 2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
21
/ 4 + R2 / 4 + R3 / 4 + R4 / 4 ? CF
--- Content provided by FirstRanker.com ---
= 108 2
2
--- Content provided by FirstRanker.com ---
22
--- Content provided by FirstRanker.com ---
/ 4 + 114 / 4 + 112 / 4 + 120 / 4 ? 12882.25
= 11664 / 4 + 12996 / 4 + 12544 / 4 + 14400 / 4 ? 12882.25
--- Content provided by FirstRanker.com ---
= 2916 + 3249 + 3136 + 3600 ? 12882.25 = 12901 ? 12882.25 = 18.75--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
CSS = C 2
--- Content provided by FirstRanker.com ---
22
2
--- Content provided by FirstRanker.com ---
1
/ 4 + C2 / 4 + C3 / 4 + C4 / 4 ? CF
--- Content provided by FirstRanker.com ---
= 119 22
2
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
/ 4 + 106 / 4 + 118 / 4 + 111 / 4 ? 12882.25= 14161 / 4 + 11236 / 4 + 13924 / 4 + 12321/ 4 ? 12882.25
= 3540.25 + 2809 + 3481 + 3080.25 ? 12882.25 = 12910.5 ?12882.25
--- Content provided by FirstRanker.com ---
= 28.25
--- Content provided by FirstRanker.com ---
T2
2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
rSS = T1/ 4 + T2 / 4 + T3 / 4 + T4 / 4 ? CF
= 96 2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
/ 4 + 127 / 4 + 115 / 4 + 1162 / 4 ? 12882.25
= 9216 / 4 + 16129 / 4 + 13225 / 4 +13456/ 4 ? 12882.25
--- Content provided by FirstRanker.com ---
= 2304 + 4032.25+ 3306.25 + 3364 ? 12882.25 = 13006.5 ? 12882.25
= 124.25
--- Content provided by FirstRanker.com ---
ESS = Error sum of squares = TSS ? RSS ? CSS ? TrSS= 379.75 ? (18.75 + 28.25 + 124.25 ) = 379.75 ?171.25 = 208.50
--- Content provided by FirstRanker.com ---
We apply ANOVA to find out whether there is any significant difference in the
performance of the treatments. We formulate the following null hypothesis:
--- Content provided by FirstRanker.com ---
H0: There is no significant difference in the training methods.The null hypothesis has to be tested against the following alternative
hypothesis:
--- Content provided by FirstRanker.com ---
H1: There is a significant difference in the training methods.
--- Content provided by FirstRanker.com ---
We have to decide whether the null hypothesis has to be accepted or rejected ata desired level of significance ().
We have the following ANOVA Table.
--- Content provided by FirstRanker.com ---
ANOVA Table for LSD
--- Content provided by FirstRanker.com ---
Source ofDegrees of
Sum of Mean Sum of
--- Content provided by FirstRanker.com ---
Variance ratio
Variation
--- Content provided by FirstRanker.com ---
FreedomSquares Squares (MSS)
F
--- Content provided by FirstRanker.com ---
(SS)
--- Content provided by FirstRanker.com ---
Row
4 ? 1 = 3
--- Content provided by FirstRanker.com ---
18.7518.75 / 3 = 6.25 34.75 / 6.25 = 5.56
Column
--- Content provided by FirstRanker.com ---
4 ? 1 = 3
28.25
--- Content provided by FirstRanker.com ---
28.25 / 3 = 9.42 34.75 / 9.42 = 3.69Treatment
4 ? 1 = 3
--- Content provided by FirstRanker.com ---
124.25 124.25 / 3 = 41.42 41.42 / 34.75 = 1.19
Error
--- Content provided by FirstRanker.com ---
3 x 2 = 6208.50
208.50 / 6 = 34.75
--- Content provided by FirstRanker.com ---
Total
16 ? 1 = 15
--- Content provided by FirstRanker.com ---
379.75--- Content provided by FirstRanker.com ---
Calculation of F value: We consider `Treatment'.
--- Content provided by FirstRanker.com ---
F = Greater variance / Smaller variance = 41.42 / 34.75 = 1.19Degrees of freedom for greater variance (df1) = 3
Degrees of freedom for smaller variance (df2) = 6
--- Content provided by FirstRanker.com ---
Table value of F at 5% level of significance = 4.76
--- Content provided by FirstRanker.com ---
Inference:Since the calculated value of F for the treatments is less than the table
value of F, the null hypothesis is accepted and it is concluded that there is no
--- Content provided by FirstRanker.com ---
significant difference in the training methods A, B, C and D, at 5% level of
significance.
--- Content provided by FirstRanker.com ---
Problem 4Examine the following production values got from four different
machines A, B, C and D and determine whether there is any significant
--- Content provided by FirstRanker.com ---
difference in the machines.
A
--- Content provided by FirstRanker.com ---
DC
B
--- Content provided by FirstRanker.com ---
131
129
--- Content provided by FirstRanker.com ---
126126
C
--- Content provided by FirstRanker.com ---
B
A
--- Content provided by FirstRanker.com ---
D125
125
--- Content provided by FirstRanker.com ---
127
124
--- Content provided by FirstRanker.com ---
D C B A--- Content provided by FirstRanker.com ---
125 120 123 126B
A
--- Content provided by FirstRanker.com ---
D
C
--- Content provided by FirstRanker.com ---
123126
127
--- Content provided by FirstRanker.com ---
121
--- Content provided by FirstRanker.com ---
Solution :In this design, there are 4 treatments A, B, C and D. In the lay out of the
design, each treatment appears exactly once in each row as well as each column.
--- Content provided by FirstRanker.com ---
Therefore this design is LSD.
Since the entries in the design are large, we will follow the coding method.
--- Content provided by FirstRanker.com ---
Subtract 120 from each entry. We get the following LSD.A
D
--- Content provided by FirstRanker.com ---
C
B
--- Content provided by FirstRanker.com ---
119
6
--- Content provided by FirstRanker.com ---
6
C
--- Content provided by FirstRanker.com ---
BA
D
--- Content provided by FirstRanker.com ---
5
5
--- Content provided by FirstRanker.com ---
74
D
--- Content provided by FirstRanker.com ---
C
B
--- Content provided by FirstRanker.com ---
A5
0
--- Content provided by FirstRanker.com ---
3
6
--- Content provided by FirstRanker.com ---
BA
D
--- Content provided by FirstRanker.com ---
C
3
--- Content provided by FirstRanker.com ---
67
1
--- Content provided by FirstRanker.com ---
R1 = first row elements = 11 + 9 + 6 + 6 = 32
--- Content provided by FirstRanker.com ---
R2 = second row elements = 5 + 5 + 7 + 4 = 21R3 = third row elements = 5 + 0 + 3 + 6 = 14
R4 = fourth row elements = 3 + 6 + 7 + 1 = 17
--- Content provided by FirstRanker.com ---
C1 = first column elements = 11 + 5 + 5 + 3 = 24
C2 = second column elements = 9+ 5 + 0 + 6 = 20
--- Content provided by FirstRanker.com ---
C3 = third column elements = 6 + 7 + 3 + 7 = 23C4 = fourth column elements = 6 + 4 + 6 + 1 = 17
--- Content provided by FirstRanker.com ---
From the given table, rewrite the experimental values for each treatment
--- Content provided by FirstRanker.com ---
separately as follows:
--- Content provided by FirstRanker.com ---
TreatmentA
B
--- Content provided by FirstRanker.com ---
C
D
--- Content provided by FirstRanker.com ---
116
6
--- Content provided by FirstRanker.com ---
9
7
--- Content provided by FirstRanker.com ---
55
4
--- Content provided by FirstRanker.com ---
6
3
--- Content provided by FirstRanker.com ---
05
6
--- Content provided by FirstRanker.com ---
3
1
--- Content provided by FirstRanker.com ---
7T1 = A = 11 +7 + 6 + 6 = 30
--- Content provided by FirstRanker.com ---
T2 = B = 6 +5 + 3 + 3 = 17
T3 = C = 6 + 5 + 0 + 1 = 12
--- Content provided by FirstRanker.com ---
T4 = D = 9 + 4 + 5 + 7 = 25G = T1 + T2 + T3 + T3 = 30 + 17 + 12 + 25 = 84
n = No. of treatments = 4
--- Content provided by FirstRanker.com ---
N = n2 = 16
--- Content provided by FirstRanker.com ---
Correction Factor = G2/N = 842 / 16 = 7056 / 16 = 441y2 ij =112 +92+62+62+52+52 +72 +42 +52 +02 +32 +62 +32 +62 +72+12
=121+81+36+36+25+25+49+16+25+0+9+36+9+36+49+1 = 554
--- Content provided by FirstRanker.com ---
The total sum of squares (TSS), Row sum of squares (RSS), Column sum of
--- Content provided by FirstRanker.com ---
squares (CSS) and Treatment sum of squares (TrSS) are calculated as follows:TSS = y2 ij ? CF = 554 ? 441 = 113
--- Content provided by FirstRanker.com ---
RSS = R 2
2
--- Content provided by FirstRanker.com ---
22
1
--- Content provided by FirstRanker.com ---
/ 4 + R2 / 4 + R3 / 4 + R4 / 4 ? CF
--- Content provided by FirstRanker.com ---
= 32 2
2
--- Content provided by FirstRanker.com ---
22
--- Content provided by FirstRanker.com ---
/ 4 + 21 / 4 + 14 / 4 + 17 / 4 ? 441
= 1024 / 4 + 441 / 4 + 196 / 4 + 289 / 4 ? 441
--- Content provided by FirstRanker.com ---
= 256 + 110.25 + 49 + 72.25 ? 441 = 487.5 ? 441 = 46.5CSS = C 2
2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
1/ 4 + C2 / 4 + C3 / 4 + C4 / 4 ? CF
= 24 2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
2/ 4 + 20 / 4 + 23 / 4 + 17 / 4 ? 441
--- Content provided by FirstRanker.com ---
= 576/ 4 + 400 / 4 + 529 / 4 + 289 / 4 ? 441
= 144 + 100 + 132.25 + 72.25 ? 441 = 448.5 ? 441 = 7.5
--- Content provided by FirstRanker.com ---
T2
2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
rSS = T1/ 4 + T2 / 4 + T3 / 4 + T4 / 4 ? CF
= 30 2
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
/ 4 + 17 / 4 + 12 / 4 + 252 / 4 ? 441
= 900 / 4 + 289 / 4 + 144 / 4 + 625 / 4 ? 441
--- Content provided by FirstRanker.com ---
= 225 + 72.25+ 36 + 156.25 ? 441 = 489.5 ? 441 = 48.5
ESS = TSS ? RSS ? CSS ? TrSS
--- Content provided by FirstRanker.com ---
= 113 ? (46.5 + 7.5 + 48.5 ) = 113 ?102.5 = 10.5We formulate the following null hypothesis:
--- Content provided by FirstRanker.com ---
H0: There is no significant difference in the performance of the
machines.
--- Content provided by FirstRanker.com ---
The null hypothesis has to be tested against the following alternativehypothesis:
H1: There is a significant difference in the performance of the machines.
--- Content provided by FirstRanker.com ---
We have to decide whether the null hypothesis has to be accepted or rejected at
a desired level of significance ().
--- Content provided by FirstRanker.com ---
We have the following ANOVA Table.ANOVA Table for LSD
Sum of
--- Content provided by FirstRanker.com ---
Variance ratio
Source of
--- Content provided by FirstRanker.com ---
Degrees ofMean Sum of
Squares
--- Content provided by FirstRanker.com ---
F
Variation
--- Content provided by FirstRanker.com ---
FreedomSquares (MSS)
(SS)
--- Content provided by FirstRanker.com ---
Row
4 ? 1 = 3
--- Content provided by FirstRanker.com ---
46.546.5 / 3 = 15.50 15.50 / 1.75 =
--- Content provided by FirstRanker.com ---
8.857
Column
--- Content provided by FirstRanker.com ---
4 ? 1 = 3
7.5
--- Content provided by FirstRanker.com ---
7.5 / 3 = 2.502.50 / 1.75 =
1.429
--- Content provided by FirstRanker.com ---
Treatment
4 ? 1 = 3
--- Content provided by FirstRanker.com ---
48.548.5 / 3 = 16.17
16.17 / 1.75 =
--- Content provided by FirstRanker.com ---
9.240
Error
--- Content provided by FirstRanker.com ---
3 x 2 = 610.5
10.5 / 6 = 1.75
--- Content provided by FirstRanker.com ---
Total
--- Content provided by FirstRanker.com ---
16 ? 1 = 15113.0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Calculation of F value: We consider `Treatment'.F = Greater variance / Smaller variance = 16.17 / 1.75 = 9.240
Degrees of freedom for greater variance (df1) = 3
--- Content provided by FirstRanker.com ---
Degrees of freedom for smaller variance (df2) = 6
Table value of F at 5% level of significance = 4.76
--- Content provided by FirstRanker.com ---
Inference:
Since the calculated value of F for the treatments is greater than the table
--- Content provided by FirstRanker.com ---
value of F, the null hypothesis is rejected and the alternative hypothesis is
accepted. It is concluded that there is a significant difference in the performance
--- Content provided by FirstRanker.com ---
of the machines A, B, C and D at 5% level of significance.Problem 5
--- Content provided by FirstRanker.com ---
The financial manager of a company obtained the following details on
the LSD concerning the resources mobilized through 4 different schemes.
--- Content provided by FirstRanker.com ---
Source ofDegrees of
SS
--- Content provided by FirstRanker.com ---
Variation
Freedom
--- Content provided by FirstRanker.com ---
Row3
270
--- Content provided by FirstRanker.com ---
Column
3
--- Content provided by FirstRanker.com ---
150--- Content provided by FirstRanker.com ---
Treatment3
1380
--- Content provided by FirstRanker.com ---
Error
6
--- Content provided by FirstRanker.com ---
156Total
15
--- Content provided by FirstRanker.com ---
1956
--- Content provided by FirstRanker.com ---
Examine the data and find out whether there is any significant difference in theschemes.
--- Content provided by FirstRanker.com ---
Solution :
ANOVA Table for LSD
--- Content provided by FirstRanker.com ---
Sum ofVariance ratio
Source of
--- Content provided by FirstRanker.com ---
Degrees of
Mean Sum of
--- Content provided by FirstRanker.com ---
SquaresF
Variation
--- Content provided by FirstRanker.com ---
Freedom
Squares (MSS)
--- Content provided by FirstRanker.com ---
(SS)Row
3
--- Content provided by FirstRanker.com ---
270
270 / 3 = 90
--- Content provided by FirstRanker.com ---
90 / 26 = 3.462Column
3
--- Content provided by FirstRanker.com ---
150
150 / 3 = 50
--- Content provided by FirstRanker.com ---
50 / 26 = 1.923Treatment
3
--- Content provided by FirstRanker.com ---
1380
1380 / 3 = 460
--- Content provided by FirstRanker.com ---
460 / 26 = 17.692Error
6
--- Content provided by FirstRanker.com ---
156
156 / 6 = 26
--- Content provided by FirstRanker.com ---
Total
15
--- Content provided by FirstRanker.com ---
1956
--- Content provided by FirstRanker.com ---
Null hypothesis:
--- Content provided by FirstRanker.com ---
H0: There is no significant difference in the performance of the schemes.
Alternative hypothesis:
--- Content provided by FirstRanker.com ---
H1: There is a significant difference in the performance of the schemes.Calculation of F value: We consider `Treatment'.
--- Content provided by FirstRanker.com ---
F = Greater variance / Smaller variance = 460 / 26 = 17.692
Degrees of freedom for greater variance (df1) = 3
--- Content provided by FirstRanker.com ---
Degrees of freedom for smaller variance (df2) = 6Table value of F at 5% level of significance = 4.76
--- Content provided by FirstRanker.com ---
Inference:
--- Content provided by FirstRanker.com ---
Since the calculated value of F for the treatments is greater than the table
value of F, the null hypothesis is rejected and the alternative hypothesis is
--- Content provided by FirstRanker.com ---
accepted. It is concluded that there is a significant difference in the financialschemes A, B, C and D, at 5% level of significance.
--- Content provided by FirstRanker.com ---
QUESTIONS
--- Content provided by FirstRanker.com ---
1.What is an experimental design? Explain.
2.
--- Content provided by FirstRanker.com ---
Explain the key concepts in experimental design.
3.
--- Content provided by FirstRanker.com ---
Explain the steps in experimental design.4.
Explain the terms Replication, Randomization and Local Control.
--- Content provided by FirstRanker.com ---
5.
What is meant by the lay out of an experimental design? Explain with an
--- Content provided by FirstRanker.com ---
example.6.
What is a data allocation table? Give an example.
--- Content provided by FirstRanker.com ---
7.
Describe a Completely Randomized Design.
--- Content provided by FirstRanker.com ---
8.Describe a Randomized Block Design.
9.
--- Content provided by FirstRanker.com ---
Describe a Latin Square Design.
10.
--- Content provided by FirstRanker.com ---
Explain the construction of a lay out of a Latin Square Design.11.
Explain the managerial application of an experimental design.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT IV
--- Content provided by FirstRanker.com ---
4. PARTIAL AND MULTIPLE CORRELATIONLesson Outline
?
--- Content provided by FirstRanker.com ---
The concept of partial correlation
?
--- Content provided by FirstRanker.com ---
The concept of multiple correlationLearning Objectives
After reading this lesson you should be able to
--- Content provided by FirstRanker.com ---
-
determine partial correlation coefficient
--- Content provided by FirstRanker.com ---
-determine multiple correlation coefficient
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
I. PARTIAL CORRELATION
--- Content provided by FirstRanker.com ---
Simple correlation is a measure of the relationship between a dependentvariable and another independent variable. For example, if the performance of a
sales person depends only on the training that he has received, then the
--- Content provided by FirstRanker.com ---
relationship between the training and the sales performance is measured by the
simple correlation coefficient r. However, a dependent variable may depend on
--- Content provided by FirstRanker.com ---
several variables. For example, the yarn produced in a factory may depend onthe efficiency of the machine, the quality of cotton, the efficiency of workers,
etc. It becomes necessary to have a measure of relationship in such complex
--- Content provided by FirstRanker.com ---
situations. Partial correlation is used for this purpose. The technique of partial
correlation proves useful when one has to develop a model with 3 to 5 variables.
--- Content provided by FirstRanker.com ---
Suppose Y is a dependent variable, depending on n other variables X1,
X2, ..., Xn.. Partial correlation is a measure of the relationship between Y and
--- Content provided by FirstRanker.com ---
any one of the variables X1, X2,...,Xn, as if the other variables have been
eliminated from the situation.
--- Content provided by FirstRanker.com ---
The partial correlation coefficient is defined in terms of simplecorrelation coefficients as follows:
Let r12. 3 denote the correlation of X1 and X2 by eliminating the effect of X3.
--- Content provided by FirstRanker.com ---
Let r12 be the simple correlation coefficient between X1 and X2.
Let r13 be the simple correlation coefficient between X1 and X3.
--- Content provided by FirstRanker.com ---
Let r23 be the simple correlation coefficient between X2 and X3.Then we have
--- Content provided by FirstRanker.com ---
r12 - r13 r23
r12.3 =
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
(1- r 13) (1- r 23)
--- Content provided by FirstRanker.com ---
r13 - r12 r32
--- Content provided by FirstRanker.com ---
Similarly, r13.2 =2
--- Content provided by FirstRanker.com ---
2
(1- r 12) (1- r 32)
--- Content provided by FirstRanker.com ---
r23 - r 21 r13
and r32.1 =
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
2(1- r 21) (1- r 13)
--- Content provided by FirstRanker.com ---
Problem 1
Given that r12 = 0.6, r13 = 0.58, r23 = 0.70 determine the partial correlation
--- Content provided by FirstRanker.com ---
coefficient r12.3Solution:
--- Content provided by FirstRanker.com ---
We have
0.6 0
--- Content provided by FirstRanker.com ---
- .58 0x .70
=
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
2(1 (
- 0.58) ) (1 (
--- Content provided by FirstRanker.com ---
- 0.70) )
0.6 0
--- Content provided by FirstRanker.com ---
- .406=
--- Content provided by FirstRanker.com ---
(1 0
- .3364)(1 0
--- Content provided by FirstRanker.com ---
- .49)0.194
=
--- Content provided by FirstRanker.com ---
0.6636x 0.51
--- Content provided by FirstRanker.com ---
0.194
=
--- Content provided by FirstRanker.com ---
0.194
=
--- Content provided by FirstRanker.com ---
=
0.8146 0
--- Content provided by FirstRanker.com ---
x .7141 0.5817 0.3335
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Problem 2
--- Content provided by FirstRanker.com ---
If r12 = 0.75, r13 = 0.80, r23 = 0.70, find the partial correlation coefficient r13.2
--- Content provided by FirstRanker.com ---
Solution:We have
--- Content provided by FirstRanker.com ---
r13 - r12 r32
--- Content provided by FirstRanker.com ---
r13.2 =2
--- Content provided by FirstRanker.com ---
2
(1 - r 12) (1 - r 32)
--- Content provided by FirstRanker.com ---
0.8 - 0.75X 0.70=
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
(1- (0.75) ) (1- (0.70) )0.8 - 0.525
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
(1- 0.5625) (1- 0.49)
0.275
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
(0.4375) (0.51)0.275
0.275
--- Content provided by FirstRanker.com ---
=
=
--- Content provided by FirstRanker.com ---
= 0.58230.6614 X 0.7141
0.4723
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
II. MULTIPLE CORRELATIONWhen the value of a variable is influenced by another variable, the
--- Content provided by FirstRanker.com ---
relationship between them is a simple correlation. In a real life situation, a
variable may be influenced by many other variables. For example, the sales
--- Content provided by FirstRanker.com ---
achieved for a product may depend on the income of the consumers, the price,the quality of the product, sales promotion techniques, the channels of
distribution, etc. In this case, we have to consider the joint influence of several
--- Content provided by FirstRanker.com ---
independent variables on the dependent variable. Multiple correlations arise in
--- Content provided by FirstRanker.com ---
this context.
Suppose Y is a dependent variable, which is influenced by n other
--- Content provided by FirstRanker.com ---
variables X1, X2, ...,Xn. The multiple correlation is a measure of the relationshipbetween Y and X1, X2,..., Xn considered together.
The multiple correlation coefficients are denoted by the letter R. The
--- Content provided by FirstRanker.com ---
dependent variable is denoted by X1. The independent variables are denoted by
X2, X3, X4,..., etc.
--- Content provided by FirstRanker.com ---
Meaning of Notations:
R1.23 denotes the multiple correlation of the dependent variable X1 with two
--- Content provided by FirstRanker.com ---
independent variables X2 and X3 . It is a measure of the relationship that X1 has
with X2 and X3 .
--- Content provided by FirstRanker.com ---
R2.13 is the multiple correlation of the dependent variable X2 with two
independent variables X1 and X3.
--- Content provided by FirstRanker.com ---
R3.12 is the multiple correlation of the dependent variable X3 with two
independent variables X1 and X2.
--- Content provided by FirstRanker.com ---
R1.234 is the multiple correlation of the dependent variable X1 with threeindependent variables X2 , X3 and X4.
Coefficient of Multiple Linear Correlations
--- Content provided by FirstRanker.com ---
The coefficient of multiple linear correlation is given in terms of the partial
--- Content provided by FirstRanker.com ---
correlation coefficients as follows:2
2
--- Content provided by FirstRanker.com ---
r 12 + r 13 - 2 12
r 13
--- Content provided by FirstRanker.com ---
r 23r
1.
--- Content provided by FirstRanker.com ---
R 23 =
--- Content provided by FirstRanker.com ---
21 - r 23
2
--- Content provided by FirstRanker.com ---
2
r 2 1 + r 23 - 2 21
--- Content provided by FirstRanker.com ---
r 23r 13
r
--- Content provided by FirstRanker.com ---
R2.13 =
--- Content provided by FirstRanker.com ---
21 - r 13
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
r
--- Content provided by FirstRanker.com ---
31 + r32 - 2 3
--- Content provided by FirstRanker.com ---
r 1 32
r
--- Content provided by FirstRanker.com ---
12r
R3.12 =
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
1 - r 12Properties of the coefficient of multiple linear correlations:
--- Content provided by FirstRanker.com ---
1.
--- Content provided by FirstRanker.com ---
The coefficient of multiple linear correlations R is a non-negativequantity. It varies between 0 and 1.
--- Content provided by FirstRanker.com ---
2.
R1.23 = R1.32
--- Content provided by FirstRanker.com ---
R2.13 = R2.31R3.12 = R3.21, etc.
--- Content provided by FirstRanker.com ---
3.
R1.23 |r12|,
--- Content provided by FirstRanker.com ---
R1.32 |r13|, etc.--- Content provided by FirstRanker.com ---
Problem 3
--- Content provided by FirstRanker.com ---
If the simple correlation coefficients have the values r12 = 0.6, r13 = 0.65,r23 = 0.8, find the multiple correlation coefficient R1.23
--- Content provided by FirstRanker.com ---
Solution:
2
--- Content provided by FirstRanker.com ---
2r 12 + r 13 - 2 12
r 13
--- Content provided by FirstRanker.com ---
r 23
r
--- Content provided by FirstRanker.com ---
We have 1.R 23 =
--- Content provided by FirstRanker.com ---
2
1 - r 23
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
(0.6) + (0.65) - 2x0.6x0.65x0.8
=
--- Content provided by FirstRanker.com ---
2
1 - (0.8)
--- Content provided by FirstRanker.com ---
0.36+ 0.4225- 0.624
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
1 - 0.64
0.7825- 0.624
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
0.360.1585
=
--- Content provided by FirstRanker.com ---
0.36
--- Content provided by FirstRanker.com ---
= 0.4403 = 0.6636--- Content provided by FirstRanker.com ---
Problem 4
--- Content provided by FirstRanker.com ---
Given that r21 = 0.7, r23 = 0.85 and r13 = 0.75, determine R2.13
--- Content provided by FirstRanker.com ---
Solution:
2
--- Content provided by FirstRanker.com ---
2
r 2 1 + r 23 - 2 21
--- Content provided by FirstRanker.com ---
r 23r 13
r
--- Content provided by FirstRanker.com ---
We have R2.13 =
--- Content provided by FirstRanker.com ---
21 - r 13
2
--- Content provided by FirstRanker.com ---
2
(0.7) + (0.85) - 2 x0.7x0.85x0.75
--- Content provided by FirstRanker.com ---
=2
--- Content provided by FirstRanker.com ---
1 - (0.75)
0.49+ 0.7225- 0.8925
--- Content provided by FirstRanker.com ---
=1 - 0.5625
--- Content provided by FirstRanker.com ---
1.2125- 0.8925
=
--- Content provided by FirstRanker.com ---
0.4375
0.32
--- Content provided by FirstRanker.com ---
=
= 0.7314 =0.8552
--- Content provided by FirstRanker.com ---
0.4375QUESTIONS
--- Content provided by FirstRanker.com ---
1.
Explain partial correlation.
--- Content provided by FirstRanker.com ---
2.Explain multiple correlations.
3.
--- Content provided by FirstRanker.com ---
State the properties of the coefficient of multiple linear correlations.
--- Content provided by FirstRanker.com ---
UNIT IV
--- Content provided by FirstRanker.com ---
5. DISCRIMINATE ANALYSISLesson Outline
?
--- Content provided by FirstRanker.com ---
An overview of Matrix Theory
?
--- Content provided by FirstRanker.com ---
The objective of Discriminate Analysis?
The concept of Discriminant Function
--- Content provided by FirstRanker.com ---
?
Determination of Discriminant Function
--- Content provided by FirstRanker.com ---
?Pooled covariance matrix
Learning Objectives
--- Content provided by FirstRanker.com ---
After reading this lesson you should be able to
-
--- Content provided by FirstRanker.com ---
understand the basic concepts in Matrix Theory- understand the objective of Discriminate Analysis
-
--- Content provided by FirstRanker.com ---
understand Discriminant Function
-
--- Content provided by FirstRanker.com ---
calculate the Discriminant Function--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
PART ? I: AN OVERVIEW OF MATRIX THEORYFirst, let us have an overview of matrix theory required for discriminate
analysis.
--- Content provided by FirstRanker.com ---
A matrix is a rectangular or square array of numbers. The matrix
--- Content provided by FirstRanker.com ---
a
a
--- Content provided by FirstRanker.com ---
a
11
--- Content provided by FirstRanker.com ---
121n
a
--- Content provided by FirstRanker.com ---
a
a
--- Content provided by FirstRanker.com ---
2122
2n
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
aa
a
--- Content provided by FirstRanker.com ---
1
m
--- Content provided by FirstRanker.com ---
m2mn
--- Content provided by FirstRanker.com ---
is a rectangular matrix with m rows and n columns. We say that it is a matrix of
type m? n . A matrix with n rows and n columns is called a square matrix. We
--- Content provided by FirstRanker.com ---
say that it is a matrix of type n? n .A matrix with just one row is called a row matrix or a row vector.
Eg:
--- Content provided by FirstRanker.com ---
(a a
a
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
n )
A matrix with just one column is called a column matrix or a column vector.
--- Content provided by FirstRanker.com ---
b
1
--- Content provided by FirstRanker.com ---
b
--- Content provided by FirstRanker.com ---
Eg:2
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
bm
--- Content provided by FirstRanker.com ---
A matrix in which all the entries are zero is called a zero matrix.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Addition of two matrices is accomplished by the addition of the numbers
in the corresponding places in the two matrices. Thus we have
--- Content provided by FirstRanker.com ---
a
a b
--- Content provided by FirstRanker.com ---
ba + b
a + b
--- Content provided by FirstRanker.com ---
11
12
--- Content provided by FirstRanker.com ---
1112
11
--- Content provided by FirstRanker.com ---
11
12
--- Content provided by FirstRanker.com ---
12+
=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
a
a
--- Content provided by FirstRanker.com ---
b
b
--- Content provided by FirstRanker.com ---
a + ba + b
21
--- Content provided by FirstRanker.com ---
22
21
--- Content provided by FirstRanker.com ---
2221
21
--- Content provided by FirstRanker.com ---
22
22
--- Content provided by FirstRanker.com ---
Multiplication of a matrix by a scalar is accomplished by multiplying each
element in the matrix by that scalar. Thus we have
--- Content provided by FirstRanker.com ---
a
a
--- Content provided by FirstRanker.com ---
kaka
11
--- Content provided by FirstRanker.com ---
12
11
--- Content provided by FirstRanker.com ---
12k
=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
aa
ka
--- Content provided by FirstRanker.com ---
ka
21
--- Content provided by FirstRanker.com ---
2221
22
--- Content provided by FirstRanker.com ---
k (a
a
--- Content provided by FirstRanker.com ---
a= ka ka
ka
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
n )( 1
2
--- Content provided by FirstRanker.com ---
n )
b kb
--- Content provided by FirstRanker.com ---
11
--- Content provided by FirstRanker.com ---
b
kb
--- Content provided by FirstRanker.com ---
22
k
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
b
kb
--- Content provided by FirstRanker.com ---
mm
--- Content provided by FirstRanker.com ---
When a matrix A of type m? n and a matrix B of type n ? p are multiplied, we
obtain a matrix C of type m ? p . To get the element in the ith row, jth column of
--- Content provided by FirstRanker.com ---
C, consider the elements of the ith row in A and the elements in the jth column ofB, multiply the corresponding elements and take the sum. Thus, we have
--- Content provided by FirstRanker.com ---
a
a b
--- Content provided by FirstRanker.com ---
ba b + a b
a b + a b
--- Content provided by FirstRanker.com ---
11
12
--- Content provided by FirstRanker.com ---
1112
11 11
--- Content provided by FirstRanker.com ---
12 21
11 12
--- Content provided by FirstRanker.com ---
12 22=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
aa
b
--- Content provided by FirstRanker.com ---
b
a b + a b
--- Content provided by FirstRanker.com ---
a b + a b21
22 21
--- Content provided by FirstRanker.com ---
22
21 11
--- Content provided by FirstRanker.com ---
22 2121 12
22 22
--- Content provided by FirstRanker.com ---
1 0
--- Content provided by FirstRanker.com ---
The matrix I =is called the identity matrix of order 2. Similarly we can
0 1
--- Content provided by FirstRanker.com ---
consider identity ma
s
--- Content provided by FirstRanker.com ---
trice of higher order. The identity matrix has the followingproperty: If the ma
A and I are of type
--- Content provided by FirstRanker.com ---
trices
n? n , then A I = I A = A.
--- Content provided by FirstRanker.com ---
a b
--- Content provided by FirstRanker.com ---
Consider a square matrix of order 2. Denote it by A =. The
c d
--- Content provided by FirstRanker.com ---
a
b
--- Content provided by FirstRanker.com ---
determinant of A = det A == ad ? bc. If it is zero, we say that A is a
c
--- Content provided by FirstRanker.com ---
d
singular matrix. If it is not zero, we say that A is a non-singular matrix. When
--- Content provided by FirstRanker.com ---
ad - bc 0 , A has a multiplicative inverse, denoted by1
A- with the property
--- Content provided by FirstRanker.com ---
that
1
--- Content provided by FirstRanker.com ---
-1
AA
--- Content provided by FirstRanker.com ---
A-
=
--- Content provided by FirstRanker.com ---
A = I .We have
-
--- Content provided by FirstRanker.com ---
-
1
--- Content provided by FirstRanker.com ---
db
1
--- Content provided by FirstRanker.com ---
A =
--- Content provided by FirstRanker.com ---
det A -ca
Note that
--- Content provided by FirstRanker.com ---
1
a b d
--- Content provided by FirstRanker.com ---
b- 1 0
=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
ad - bc c d -c
a 0 1
--- Content provided by FirstRanker.com ---
A symmetric matrix is the one in which the first row and first column are
--- Content provided by FirstRanker.com ---
identical; the second row and second column are identical; and so on.Example:
a h g
--- Content provided by FirstRanker.com ---
a b
and
--- Content provided by FirstRanker.com ---
hb
f
--- Content provided by FirstRanker.com ---
b d
g f
--- Content provided by FirstRanker.com ---
care similar matrices.
--- Content provided by FirstRanker.com ---
PART ? II: DISCRIMINATE ANALYSIS
The objective of discriminate analysis
--- Content provided by FirstRanker.com ---
The objective of discriminate analysis (also known as discriminantanalysis) is to separate a population (or samples from the population) into two
distinct groups or two distinct conditionalities. After such a separation is made,
--- Content provided by FirstRanker.com ---
we should be able to discriminate one group against the other. In other words, if
--- Content provided by FirstRanker.com ---
some sample data is given, it should be possible for us to say with certainty
whether that sample data has come from the first group or the second group. For
--- Content provided by FirstRanker.com ---
this purpose, a function called `Discriminant function' is constructed. It is alinear function and it is used to describe the differences between two groups.
--- Content provided by FirstRanker.com ---
It is to be noted that the concept of discriminant function is applicable
--- Content provided by FirstRanker.com ---
when there are more than 2 distinct groups also. However, we restrict ourselvesto a situation of two distinct groups only. The discriminant function is the linear
combination of the observations from the two groups which minimizes the
--- Content provided by FirstRanker.com ---
distance between the mean vectors of the two groups after some transformation
of the vectors. Suppose we consider 2 variables both taking values under two
--- Content provided by FirstRanker.com ---
different conditions denoted by condition I and condition II. Suppose there arem samples for each variable under condition I and n samples for each variable
under condition II.
--- Content provided by FirstRanker.com ---
Let the values of the samples be as follows:
--- Content provided by FirstRanker.com ---
Condition I
Condition II
--- Content provided by FirstRanker.com ---
Variable 1
Variable 2
--- Content provided by FirstRanker.com ---
Variable 1Variable 2
p
--- Content provided by FirstRanker.com ---
q
1
--- Content provided by FirstRanker.com ---
11
1
--- Content provided by FirstRanker.com ---
p
q
--- Content provided by FirstRanker.com ---
22
2
--- Content provided by FirstRanker.com ---
2
M
--- Content provided by FirstRanker.com ---
MM
M
--- Content provided by FirstRanker.com ---
p
q
--- Content provided by FirstRanker.com ---
mm
n
--- Content provided by FirstRanker.com ---
n
--- Content provided by FirstRanker.com ---
Determine the means of the samples for the two variables under the twoconditions.
--- Content provided by FirstRanker.com ---
Let
p be the mean of the values of variable 1 under condition I.
--- Content provided by FirstRanker.com ---
Letq be the mean of the values of variable 2 under condition I.
--- Content provided by FirstRanker.com ---
Let
be the mean of the values of variable 1 under condition II.
--- Content provided by FirstRanker.com ---
Let be the mean of the values of variable 2 under condition II.
Let y , y denote the column vectors whose entries are the mean values
--- Content provided by FirstRanker.com ---
12
under conditions I, II respectively.
--- Content provided by FirstRanker.com ---
p
--- Content provided by FirstRanker.com ---
i.e.,y =
,
--- Content provided by FirstRanker.com ---
y =
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
q
--- Content provided by FirstRanker.com ---
( p - )
Calculate the column vector y - y =
--- Content provided by FirstRanker.com ---
. The pooled covariance matrix
1
--- Content provided by FirstRanker.com ---
2(q - )
S is obtained as follows:
--- Content provided by FirstRanker.com ---
m(
p -p +
--- Content provided by FirstRanker.com ---
-
p -p q -q + - -
--- Content provided by FirstRanker.com ---
i) n ( j )
m
--- Content provided by FirstRanker.com ---
n
2
--- Content provided by FirstRanker.com ---
2( i )( i )
( j )( j )
--- Content provided by FirstRanker.com ---
1
i 1
--- Content provided by FirstRanker.com ---
=j 1
=
--- Content provided by FirstRanker.com ---
i 1
=
--- Content provided by FirstRanker.com ---
j 1=
S
--- Content provided by FirstRanker.com ---
= m+n-2 m
(
p -p)(q -q) n
--- Content provided by FirstRanker.com ---
+ (
--- Content provided by FirstRanker.com ---
- )( -)m (
q -q) n
--- Content provided by FirstRanker.com ---
+ -
i
--- Content provided by FirstRanker.com ---
ij
j
--- Content provided by FirstRanker.com ---
i
( j )2
--- Content provided by FirstRanker.com ---
2i 1
--- Content provided by FirstRanker.com ---
=j 1
=
--- Content provided by FirstRanker.com ---
i 1
=
--- Content provided by FirstRanker.com ---
j 1=
--- Content provided by FirstRanker.com ---
a b
1
--- Content provided by FirstRanker.com ---
db
-
--- Content provided by FirstRanker.com ---
Note that the inverse of the matrix
is
--- Content provided by FirstRanker.com ---
, providedc d
ad - bc -c
--- Content provided by FirstRanker.com ---
a
ad - bc 0 .
--- Content provided by FirstRanker.com ---
Calculate the inverse of the matrix S. Denote it by1
S - . Find the matrix product
--- Content provided by FirstRanker.com ---
-1
S ( y - y ) . The result is a column vector order 2. Denote it by and the
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
entries by and ? . Then = ?
--- Content provided by FirstRanker.com ---
Fisher's discriminant function Z is obtained asZ = y + ? y .
1
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
Application:--- Content provided by FirstRanker.com ---
Given an observation of the attributes, we can use the discriminant functionto decide whether it arose from condition I or condition II.
Problem
--- Content provided by FirstRanker.com ---
A tourism manager adopts two different strategies. Under each strategy, the
number of tourists and the profits earned (in thousands of rupees) are as
--- Content provided by FirstRanker.com ---
recorded below.Strategy I
No. of tourists
--- Content provided by FirstRanker.com ---
Profit earned
30
--- Content provided by FirstRanker.com ---
6032
64
--- Content provided by FirstRanker.com ---
30
65
--- Content provided by FirstRanker.com ---
3861
40
--- Content provided by FirstRanker.com ---
65
--- Content provided by FirstRanker.com ---
Strategy IINo. of tourists
Profit earned
--- Content provided by FirstRanker.com ---
38
55
--- Content provided by FirstRanker.com ---
4061
37
--- Content provided by FirstRanker.com ---
57
36
--- Content provided by FirstRanker.com ---
5546
58
--- Content provided by FirstRanker.com ---
41
61
--- Content provided by FirstRanker.com ---
4259
--- Content provided by FirstRanker.com ---
Construct Fisher's discriminant function and examine whether the strategies
provide an effective tool of discrimination of the tourist operations.
--- Content provided by FirstRanker.com ---
Solution:
--- Content provided by FirstRanker.com ---
The given values are plotted in a graph. One point belonging to Strategy
I seems to be an outlier as it is closer to the points of Strategy II. The other
--- Content provided by FirstRanker.com ---
points seem to fall in two clusters. We shall examine this phenomenon by means
of Fisher's discriminant function.
--- Content provided by FirstRanker.com ---
We have38 55
1
--- Content provided by FirstRanker.com ---
1
--- Content provided by FirstRanker.com ---
p30
q 60
--- Content provided by FirstRanker.com ---
40
61
--- Content provided by FirstRanker.com ---
1--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
p
32
--- Content provided by FirstRanker.com ---
q
64
--- Content provided by FirstRanker.com ---
37 572
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
33
--- Content provided by FirstRanker.com ---
p = 30 , q = 65 ,
= 36 , = 55
--- Content provided by FirstRanker.com ---
3--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
34
--- Content provided by FirstRanker.com ---
4
--- Content provided by FirstRanker.com ---
p
38
--- Content provided by FirstRanker.com ---
q
61
--- Content provided by FirstRanker.com ---
46 584
--- Content provided by FirstRanker.com ---
4
5
--- Content provided by FirstRanker.com ---
5p
--- Content provided by FirstRanker.com ---
40
--- Content provided by FirstRanker.com ---
q 6541
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
615
--- Content provided by FirstRanker.com ---
5
6
--- Content provided by FirstRanker.com ---
6--- Content provided by FirstRanker.com ---
42
--- Content provided by FirstRanker.com ---
59
--- Content provided by FirstRanker.com ---
7
--- Content provided by FirstRanker.com ---
7The means of the above 4 columns are obtained as
170
--- Content provided by FirstRanker.com ---
315
280
--- Content provided by FirstRanker.com ---
406p =
= 34, q =
--- Content provided by FirstRanker.com ---
= 63, =
= 40, =
--- Content provided by FirstRanker.com ---
= 585
5
--- Content provided by FirstRanker.com ---
7
7
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
y
= column vector containing the mean values under strategy I
--- Content provided by FirstRanker.com ---
1
p 34
--- Content provided by FirstRanker.com ---
=
=
--- Content provided by FirstRanker.com ---
q 63
--- Content provided by FirstRanker.com ---
y= column vector containing the mean values under strategy II
2
--- Content provided by FirstRanker.com ---
40
--- Content provided by FirstRanker.com ---
==
--- Content provided by FirstRanker.com ---
58
--- Content provided by FirstRanker.com ---
Therefore we get34 40 6
-
--- Content provided by FirstRanker.com ---
y - y =
-
--- Content provided by FirstRanker.com ---
=1
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
63 58 5--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Calculation of p - p , q - q etc.,
i
--- Content provided by FirstRanker.com ---
ip - p
q - q
--- Content provided by FirstRanker.com ---
i
i
--- Content provided by FirstRanker.com ---
P q( p - p)2 ( p - p )(q - q )
i
--- Content provided by FirstRanker.com ---
i
(q - q
--- Content provided by FirstRanker.com ---
i)2
i
--- Content provided by FirstRanker.com ---
= p - 34 = q - 63
30
--- Content provided by FirstRanker.com ---
60- 4
- 3
--- Content provided by FirstRanker.com ---
16
12
--- Content provided by FirstRanker.com ---
932
64
--- Content provided by FirstRanker.com ---
- 2
1
--- Content provided by FirstRanker.com ---
4- 2
1
--- Content provided by FirstRanker.com ---
30
65
--- Content provided by FirstRanker.com ---
- 42
16
--- Content provided by FirstRanker.com ---
- 8
4
--- Content provided by FirstRanker.com ---
3861
4
--- Content provided by FirstRanker.com ---
- 2
16
--- Content provided by FirstRanker.com ---
- 84
40
--- Content provided by FirstRanker.com ---
65
6
--- Content provided by FirstRanker.com ---
236
12
--- Content provided by FirstRanker.com ---
4
--- Content provided by FirstRanker.com ---
886
22
--- Content provided by FirstRanker.com ---
Calculation of - , - , etc.,
--- Content provided by FirstRanker.com ---
jj
-
--- Content provided by FirstRanker.com ---
-
j
--- Content provided by FirstRanker.com ---
j--- Content provided by FirstRanker.com ---
( - )2 ( - )( - )
j
--- Content provided by FirstRanker.com ---
j( -
j
--- Content provided by FirstRanker.com ---
)2
j
--- Content provided by FirstRanker.com ---
= - 40 = - 5838
55
--- Content provided by FirstRanker.com ---
- 2
- 3
--- Content provided by FirstRanker.com ---
46
9
--- Content provided by FirstRanker.com ---
40
61
--- Content provided by FirstRanker.com ---
03
0
--- Content provided by FirstRanker.com ---
0
9
--- Content provided by FirstRanker.com ---
3757
- 3
--- Content provided by FirstRanker.com ---
- 1
9
--- Content provided by FirstRanker.com ---
31
36
--- Content provided by FirstRanker.com ---
55
- 4
--- Content provided by FirstRanker.com ---
- 316
12
--- Content provided by FirstRanker.com ---
9
46
--- Content provided by FirstRanker.com ---
586
0
--- Content provided by FirstRanker.com ---
36
0
--- Content provided by FirstRanker.com ---
041
61
--- Content provided by FirstRanker.com ---
1
3
--- Content provided by FirstRanker.com ---
13
9
--- Content provided by FirstRanker.com ---
42
59
--- Content provided by FirstRanker.com ---
21
4
--- Content provided by FirstRanker.com ---
2
1
--- Content provided by FirstRanker.com ---
70
26
--- Content provided by FirstRanker.com ---
38
--- Content provided by FirstRanker.com ---
5
--- Content provided by FirstRanker.com ---
( p - p + - = 88 + 70 = 158i
)
--- Content provided by FirstRanker.com ---
7 ( j )2
2
--- Content provided by FirstRanker.com ---
i 1=
j 1
--- Content provided by FirstRanker.com ---
=
5
--- Content provided by FirstRanker.com ---
( p - p q -q + - - = 6 + 26 = 32i
)( i
--- Content provided by FirstRanker.com ---
) 7 ( j
)( j )
--- Content provided by FirstRanker.com ---
i 1=
j 1
--- Content provided by FirstRanker.com ---
=
5
--- Content provided by FirstRanker.com ---
(q -q + - = 22 + 38 = 60i
)
--- Content provided by FirstRanker.com ---
7 ( j )2
2
--- Content provided by FirstRanker.com ---
i 1=
j 1
--- Content provided by FirstRanker.com ---
=
m + n ? 2 = 5 + 7 ? 2 = 10.
--- Content provided by FirstRanker.com ---
The pooled covariance matrix
1 158
--- Content provided by FirstRanker.com ---
32
15.8
--- Content provided by FirstRanker.com ---
3.2S =
=
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
10 32 60 3.26
det S = 94.8 ? 10.24 = 84.56
--- Content provided by FirstRanker.com ---
1
6
--- Content provided by FirstRanker.com ---
3- .2 0.071
0.038
--- Content provided by FirstRanker.com ---
-
1
--- Content provided by FirstRanker.com ---
S - ==
--- Content provided by FirstRanker.com ---
84.56 3.2
--- Content provided by FirstRanker.com ---
-15.8 0
- .038 0.187
--- Content provided by FirstRanker.com ---
1
--- Content provided by FirstRanker.com ---
== S-
--- Content provided by FirstRanker.com ---
( y - y
1
--- Content provided by FirstRanker.com ---
2 )?
--- Content provided by FirstRanker.com ---
0.071 -0.038 -6 0.616
-
--- Content provided by FirstRanker.com ---
==
--- Content provided by FirstRanker.com ---
0.038
-
--- Content provided by FirstRanker.com ---
0.187 51
.163
--- Content provided by FirstRanker.com ---
Fisher's discriminant function is obtained as
--- Content provided by FirstRanker.com ---
Z = y + ? y1
2
--- Content provided by FirstRanker.com ---
= 0.616
--- Content provided by FirstRanker.com ---
-y +1.161y
1
--- Content provided by FirstRanker.com ---
2
where y denotes the number of tourists and y is the profit earned
--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
Inference
--- Content provided by FirstRanker.com ---
We evaluate the discriminant function for the data given in the problem.
--- Content provided by FirstRanker.com ---
Strategy I
--- Content provided by FirstRanker.com ---
No. of tourists
Profit earned
--- Content provided by FirstRanker.com ---
Z(y1)
(y2)
--- Content provided by FirstRanker.com ---
30
60
--- Content provided by FirstRanker.com ---
51.332
64
--- Content provided by FirstRanker.com ---
54.72
30
--- Content provided by FirstRanker.com ---
6557.12
38
--- Content provided by FirstRanker.com ---
61
47.54
--- Content provided by FirstRanker.com ---
4065
50.96
--- Content provided by FirstRanker.com ---
Strategy II
No. of tourists
--- Content provided by FirstRanker.com ---
Profit earnedZ
(y1)
--- Content provided by FirstRanker.com ---
(y2)
38
--- Content provided by FirstRanker.com ---
5540.56
40
--- Content provided by FirstRanker.com ---
61
46.30
--- Content provided by FirstRanker.com ---
3757
43.50
--- Content provided by FirstRanker.com ---
36
55
--- Content provided by FirstRanker.com ---
41.7946
58
--- Content provided by FirstRanker.com ---
39.12
41
--- Content provided by FirstRanker.com ---
6145.69
42
--- Content provided by FirstRanker.com ---
59
42.75
--- Content provided by FirstRanker.com ---
By referring to the projected values of the discriminant function, it is seen that
the discrimination function is able to separate the two strategies.
--- Content provided by FirstRanker.com ---
QUESTIONS
--- Content provided by FirstRanker.com ---
1.
Explain the objective of discriminate analysis.
--- Content provided by FirstRanker.com ---
2.
Briefly describe how discriminate analysis is carried out.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT IV6. CLUSTER ANALYSIS
Lesson Outline
--- Content provided by FirstRanker.com ---
?
The objective of cluster analysis
--- Content provided by FirstRanker.com ---
?Cluster analysis for qualitative data
?
--- Content provided by FirstRanker.com ---
Resemblance matrix
?
--- Content provided by FirstRanker.com ---
Simple matching coefficient?
Pessimistic, moderate, optimistic estimates of similarity
--- Content provided by FirstRanker.com ---
?
Object-attribute incidence matrix
--- Content provided by FirstRanker.com ---
?Matching coefficient matrix
?
--- Content provided by FirstRanker.com ---
Cluster analysis for quantitative data
?
--- Content provided by FirstRanker.com ---
Hierarchical cluster analysis?
Euclidean distance matrix
--- Content provided by FirstRanker.com ---
?
Dendogram
--- Content provided by FirstRanker.com ---
Learning ObjectivesAfter reading this lesson you should be able to
- understand the objective of cluster analysis
--- Content provided by FirstRanker.com ---
- perform cluster analysis for qualitative data
- perform cluster analysis for quantitative data
--- Content provided by FirstRanker.com ---
- understand resemblance matrix- determine simple matching coefficient
- understand the properties of simple matching coefficient
--- Content provided by FirstRanker.com ---
- determine pessimistic, moderate, optimistic estimates of similarity
- understand object-attribute incidence matrix
--- Content provided by FirstRanker.com ---
- understand matching coefficient matrix- find out Euclidean distance matrix
- construct Dendogram
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
THE OBJECTIVE OF CLUSTER ANALYSIS
--- Content provided by FirstRanker.com ---
A cluster means a group of objects which remain together as far as acertain characteristic is concerned. When several objects are examined
systematically, the cluster analysis seeks to put similar objects in the same
--- Content provided by FirstRanker.com ---
cluster and dissimilar objects in different clusters so that each object will be
allotted to one and only one cluster. Thus, it is a method for estimation of
--- Content provided by FirstRanker.com ---
similarities among multivariate data. Similarity or dissimilarity is concernedwith a certain attribute like magnitude, direction, shape, distance, colour, smell,
taste, performance, etc.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Thus, it is to be seen that objects with similar description are pooled together toform a single cluster and objects with dissimilar properties will contribute to
distinct clusters. For this purpose, given a set of objects, one has to determine
--- Content provided by FirstRanker.com ---
which objects in that set are similar and which objects are dissimilar.
--- Content provided by FirstRanker.com ---
Method of Cluster AnalysisCluster analysis is a complex task. However, we can have a broad outline of
this analysis. One has to carry out the following steps:
--- Content provided by FirstRanker.com ---
1.
Identify the objects that are required to be put in different clusters.
--- Content provided by FirstRanker.com ---
2.
Prepare a list of attributes possessed by the objects under consideration.
--- Content provided by FirstRanker.com ---
If they are too many, identify the important ones with the help of experts.
--- Content provided by FirstRanker.com ---
3.Identify the common attributes possessed by two or more objects.
--- Content provided by FirstRanker.com ---
4.
--- Content provided by FirstRanker.com ---
Find out the attributes which are present in one object and absent in other
objects.
--- Content provided by FirstRanker.com ---
5.
Evolve a measure of similarity or dissimilarity. In other words, evolve a
--- Content provided by FirstRanker.com ---
measure of "togetherness" or "standing apart".
--- Content provided by FirstRanker.com ---
6.Apply a standard algorithm to separate the objects into different clusters.
--- Content provided by FirstRanker.com ---
Applications of Cluster Analysis
--- Content provided by FirstRanker.com ---
The concept of cluster analysis has applications in a variety of areas. A fewexamples are listed below:
--- Content provided by FirstRanker.com ---
1. A marketing manager can use it to find out which brands of products are
perceived to be similar by the consumers.
--- Content provided by FirstRanker.com ---
2. A doctor can apply this method to find out which diseases follow the same
pattern of occurrence.
--- Content provided by FirstRanker.com ---
3. An agriculturist may use it to determine which parts of his land are similar as
--- Content provided by FirstRanker.com ---
regards the cultivating crop.4. Once a set of objects have been put in different clusters, the top level
--- Content provided by FirstRanker.com ---
management can take a policy decision as to which cluster has to be paid more
attention and which cluster needs less attention, etc. Thus it will help the
--- Content provided by FirstRanker.com ---
management in the decision on market segmentation.In short, cluster analysis finds applications in so many contexts.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
I. Method of Cluster Analysis for Qualitative Data
We consider a case of binary attributes. They have two states, namely
--- Content provided by FirstRanker.com ---
present or absent. Suppose we have to evolve a measure of resemblancebetween two objects P and Q. Suppose we take into consideration certain pre-
determined attributes. If a certain attribute is present in an object, we will
--- Content provided by FirstRanker.com ---
indicate it by 1 and if that attribute is absent we indicate it by 0. Count the
number of attributes which are present in both the objects, which are absent in
--- Content provided by FirstRanker.com ---
both the objects and which are present in one object but not in the other. Weuse the following notations.
a
--- Content provided by FirstRanker.com ---
=
Number of attributes present in both P and Q,
--- Content provided by FirstRanker.com ---
b=
Number of attributes present in P but not in Q,
--- Content provided by FirstRanker.com ---
c
=
--- Content provided by FirstRanker.com ---
Number of attributes present in Q but not in P,d
=
--- Content provided by FirstRanker.com ---
Number of attributes absent in both P and Q.
Among these quantities, a and d are counts for matched pairs of attributes while
--- Content provided by FirstRanker.com ---
b and c are counts for unmatched pairs of attributes.Resemblance matrix of two objects
--- Content provided by FirstRanker.com ---
The resemblance matrix of two objects P and Q consists of the values a, b, c, d
as its entries. It is shown below.
--- Content provided by FirstRanker.com ---
Q
1
--- Content provided by FirstRanker.com ---
0
--- Content provided by FirstRanker.com ---
P 1a
b
--- Content provided by FirstRanker.com ---
0
--- Content provided by FirstRanker.com ---
cd
--- Content provided by FirstRanker.com ---
Simple matching coefficient
--- Content provided by FirstRanker.com ---
We consider a similarity coefficient called simple matching coefficient
C(P,Q), defined as the ratio of the matched pairs of attributes to the total number
--- Content provided by FirstRanker.com ---
a + dof attributes. i.e., C ( P,Q) =
--- Content provided by FirstRanker.com ---
a + b + c + d
Properties of simple matching coefficient
--- Content provided by FirstRanker.com ---
1.The denominator in C(P,Q) shows that the simple matching coefficient
gives equal weight for the unmatched pairs of attributes as well as the matched
--- Content provided by FirstRanker.com ---
pairs.
--- Content provided by FirstRanker.com ---
2.The minimum value of C(P,Q) is 0.
--- Content provided by FirstRanker.com ---
3.
The maximum value of C(P,Q) is 1.
--- Content provided by FirstRanker.com ---
4.
A value of C(P,Q) = 1 indicates perfect similarity between the objects P
--- Content provided by FirstRanker.com ---
and Q. This occurs when there are no unmatched pairs of attributes. i.e., b = c =
0.
--- Content provided by FirstRanker.com ---
5.
A value of C(P,Q) = 0 indicates maximum dissimilarity between the
--- Content provided by FirstRanker.com ---
objects P and Q. This occurs when there are no matched pairs of attributes. i.e., a
= d = 0.
--- Content provided by FirstRanker.com ---
6.
C(P,Q) = C(Q,P).
--- Content provided by FirstRanker.com ---
7.
--- Content provided by FirstRanker.com ---
Using C(P,Q), we can estimate the percentage of similarity between Pand Q.
--- Content provided by FirstRanker.com ---
8. C(P,P) = 1 since b = c = 0.
--- Content provided by FirstRanker.com ---
Illustrative Problem 1
--- Content provided by FirstRanker.com ---
A tourist is interested in evaluating two tourist spots P, Q with regard to
their similarity and dissimilarity. He considers 10 attributes of the tourist spots
--- Content provided by FirstRanker.com ---
and collects the following data matrix:Attribute
--- Content provided by FirstRanker.com ---
Tourist Spot 1
Tourist Spot 2
--- Content provided by FirstRanker.com ---
11
1
--- Content provided by FirstRanker.com ---
2
0
--- Content provided by FirstRanker.com ---
03
1
--- Content provided by FirstRanker.com ---
1
4
--- Content provided by FirstRanker.com ---
00
5
--- Content provided by FirstRanker.com ---
0
1
--- Content provided by FirstRanker.com ---
61
1
--- Content provided by FirstRanker.com ---
7
1
--- Content provided by FirstRanker.com ---
18
1
--- Content provided by FirstRanker.com ---
1
9
--- Content provided by FirstRanker.com ---
10
10
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
Determine whether the two tourist spots are similar or not.
Solution:
--- Content provided by FirstRanker.com ---
We obtain the following resemblance matrix.
--- Content provided by FirstRanker.com ---
Q1
--- Content provided by FirstRanker.com ---
0
--- Content provided by FirstRanker.com ---
P1
--- Content provided by FirstRanker.com ---
a = 6
b = 1
--- Content provided by FirstRanker.com ---
0c = 1
d = 2
--- Content provided by FirstRanker.com ---
We obtain the similarity coefficient as
--- Content provided by FirstRanker.com ---
(
--- Content provided by FirstRanker.com ---
a + dC P,Q) = a +b +c + d
6 + 2
--- Content provided by FirstRanker.com ---
=
--- Content provided by FirstRanker.com ---
6 +1+1+ 28
=
--- Content provided by FirstRanker.com ---
= 0.8
10
--- Content provided by FirstRanker.com ---
Inference
It is estimated that there is 80% similarity between the two tourist spots P and Q.
--- Content provided by FirstRanker.com ---
Matching coefficient with correction term
The correction term in the matching coefficient can be defined in several
--- Content provided by FirstRanker.com ---
ways. We consider two specific approaches.(a) Rogers and Tanimoto coefficient of matching
By giving double weight for unmatched pairs of attributes, the matching
--- Content provided by FirstRanker.com ---
coefficient with correction term is defined as
(
--- Content provided by FirstRanker.com ---
a + dC P,Q) =
.
--- Content provided by FirstRanker.com ---
a + d + 2(b + c)
Perfect similarity between P and Q occurs when b = c = 0. In this case, C(P,Q)
--- Content provided by FirstRanker.com ---
= 1.Maximum dissimilarity between P and Q occurs when a = d = 0. In this case,
C(P,Q) = 0.
--- Content provided by FirstRanker.com ---
(b) Sokal and Sneath coefficient of matching
--- Content provided by FirstRanker.com ---
By giving double weight for matched pairs of attributes, the matchingcoefficient with correction term is defined as
2(a + d )
--- Content provided by FirstRanker.com ---
C(P, Q) =
.
--- Content provided by FirstRanker.com ---
2(a + d ) + b + cPerfect similarity between P and Q occurs when b = c = 0. In this case, C(P,Q) =
1.
--- Content provided by FirstRanker.com ---
Maximum dissimilarity between P and Q occurs when a = d = 0. In this case,
--- Content provided by FirstRanker.com ---
C(P,Q) = 0.
--- Content provided by FirstRanker.com ---
ExampleIf we adopt Rogers and Tanimoto principle in the above problem, we get
--- Content provided by FirstRanker.com ---
6 + 2
8
--- Content provided by FirstRanker.com ---
C(P, Q) ==
= 0.67 .
--- Content provided by FirstRanker.com ---
6 + 2 + 2(1+1)
12
--- Content provided by FirstRanker.com ---
So the estimate of similarity between P and Q is 67%
If we adopt Sokal and Sneath principle in the above example, we get
--- Content provided by FirstRanker.com ---
2(6 + 2)
--- Content provided by FirstRanker.com ---
16C(P, Q) =
=
--- Content provided by FirstRanker.com ---
= 0.89.
2(6 + 2) +1+1
--- Content provided by FirstRanker.com ---
18Thus, the similarity between P and Q is estimated as 89%
--- Content provided by FirstRanker.com ---
Comparison of the three coefficients of similarity:
One can verify the following relation:
--- Content provided by FirstRanker.com ---
a + da + d
+
--- Content provided by FirstRanker.com ---
2(a d )
--- Content provided by FirstRanker.com ---
.a + d + 2(b + c)
a + b + c + d
--- Content provided by FirstRanker.com ---
2(a + d ) + b + c
--- Content provided by FirstRanker.com ---
i.e., Rogers-Tanimoto Coefficient Simple matching Coefficient Sokal-Sneath Coefficient.
It is observed that Rogers and Tanimoto principle provides a pessimistic
--- Content provided by FirstRanker.com ---
estimate of similarity. On the other hand, Sokal and Sneath principle gives an
optimistic estimate of similarity. The simple matching coefficient (without any
--- Content provided by FirstRanker.com ---
correction term) gives a moderate estimate of similarity.Clustering through object-attribute incidence matrix
--- Content provided by FirstRanker.com ---
Consider a set of objects. Enumerate the attributes of the objects. Not all
--- Content provided by FirstRanker.com ---
the attributes will be present in all the objects. The object-attribute incidencematrix consists of the entries 0 and 1. If a certain attribute is present in an
object, the corresponding place in the matrix is marked by 1; otherwise it is
--- Content provided by FirstRanker.com ---
marked by 0. This matrix is useful in separating the objects into clusters.
--- Content provided by FirstRanker.com ---
Illustrative Problem 2An expert of fashion designs identifies six fashions and five important
attributes of fashions. He obtains the following object-attribute incidence
--- Content provided by FirstRanker.com ---
matrix.
Object
--- Content provided by FirstRanker.com ---
12 3 4 5
6
--- Content provided by FirstRanker.com ---
1 0 0 0 0 1
--- Content provided by FirstRanker.com ---
Attribute 10 0 0 1 1 0
--- Content provided by FirstRanker.com ---
2
--- Content provided by FirstRanker.com ---
0 1 0 0 1 0--- Content provided by FirstRanker.com ---
3
--- Content provided by FirstRanker.com ---
4
0 1 0 1 0 0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
51 0 1 0 0 1
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Separate the objects into two clusters.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Solution:--- Content provided by FirstRanker.com ---
Method I: By examination of the entries in the object-attribute incidence
matrix
--- Content provided by FirstRanker.com ---
Denote the 6 objects by O , O , O , O , O , O and the 5 attributes
1
--- Content provided by FirstRanker.com ---
23
4
--- Content provided by FirstRanker.com ---
5
6
--- Content provided by FirstRanker.com ---
by A , A , A , A , A .1
2
--- Content provided by FirstRanker.com ---
3
4
--- Content provided by FirstRanker.com ---
5Consider the object O . Attributes A and A are present in object O and the
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
51
other 3 attributes are absent in it. Compare other objects with object O and
--- Content provided by FirstRanker.com ---
1
find which object possesses similar attributes. For this, consider the colum s of
--- Content provided by FirstRanker.com ---
nthe matrix. It is noticed that columns 1 and 6 in the matrix are identical. i.e.,
Attributes A and A are present in both the objects O and O . All the other
--- Content provided by FirstRanker.com ---
1
5
--- Content provided by FirstRanker.com ---
16
attributes are absent in both the objects. So the objects O and O can be put in
--- Content provided by FirstRanker.com ---
1
6
--- Content provided by FirstRanker.com ---
a cluster. Denote this cluster by{O ,O .1
6}
--- Content provided by FirstRanker.com ---
The remaining objects are O ,O ,O , O . Consider the columns 2,3,4,5 in
--- Content provided by FirstRanker.com ---
23
4
--- Content provided by FirstRanker.com ---
5
the matrix. No other column is id
--- Content provided by FirstRanker.com ---
mn 2. The objectentical to colu
O possesses
--- Content provided by FirstRanker.com ---
2
the attributes A and A . Identify other objects which possess at least one of
--- Content provided by FirstRanker.com ---
34
these attributes. Objects O possess attribute A . So put the objects O and O
--- Content provided by FirstRanker.com ---
4
4
--- Content provided by FirstRanker.com ---
24
in a cluster. Denote this cluster by{O ,O .
--- Content provided by FirstRanker.com ---
2
4}
--- Content provided by FirstRanker.com ---
The remaining objects are O and O . The object O possesses only the
3
--- Content provided by FirstRanker.com ---
5
3
--- Content provided by FirstRanker.com ---
attribute A and the same is possessed by objects O and O . So the object O5
1
--- Content provided by FirstRanker.com ---
6
3
--- Content provided by FirstRanker.com ---
is closer to the cluster {O ,O rather than the cluster{O ,O . So enlarge the2
4}
--- Content provided by FirstRanker.com ---
1
6}
--- Content provided by FirstRanker.com ---
cluster {O ,O by including the object O . Thus we get the cluster{O ,O ,O .1
6
--- Content provided by FirstRanker.com ---
3}
1
--- Content provided by FirstRanker.com ---
6}3
--- Content provided by FirstRanker.com ---
The remaining object is O . It possesses attributes A and A . These
5
--- Content provided by FirstRanker.com ---
23
attributes are absent in the objects O , O , O . Attribute A in present in object
--- Content provided by FirstRanker.com ---
1
6
--- Content provided by FirstRanker.com ---
33
--- Content provided by FirstRanker.com ---
O and attribute A is present in object O . So enlarge the cluster {O ,O by
2
--- Content provided by FirstRanker.com ---
4}
2
--- Content provided by FirstRanker.com ---
24
including the object O . In this way we get the cluster{O ,O ,O .
--- Content provided by FirstRanker.com ---
2
4
--- Content provided by FirstRanker.com ---
5}5
--- Content provided by FirstRanker.com ---
Result: Thus we obtain the following two clusters.
Cluster I: {O ,O ,O and
--- Content provided by FirstRanker.com ---
13
6}
--- Content provided by FirstRanker.com ---
Cluster II: {O ,O ,O .
2
--- Content provided by FirstRanker.com ---
45}
The attributes present in cluster I are absent in cluster II and vice verse.
--- Content provided by FirstRanker.com ---
Method II: Application of simple matching coefficient
--- Content provided by FirstRanker.com ---
Calculate the matching coefficients of pairs of distinct objects. Since there are 6objects, we have (6 x 5) / 2 = 15 such pairs. Tabulate the results as follows:
--- Content provided by FirstRanker.com ---
Counts of matched and unmatched pairs of attributes
--- Content provided by FirstRanker.com ---
Ordered pairsa
b
--- Content provided by FirstRanker.com ---
c
D
--- Content provided by FirstRanker.com ---
Simple matching coefficientof objects
= (a+b)/(a+b+c+d)
--- Content provided by FirstRanker.com ---
O ,
O
--- Content provided by FirstRanker.com ---
02
2
--- Content provided by FirstRanker.com ---
1
0.2
--- Content provided by FirstRanker.com ---
12
O , O
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
03
0.8
--- Content provided by FirstRanker.com ---
1
3
--- Content provided by FirstRanker.com ---
O , O0
2
--- Content provided by FirstRanker.com ---
2
1
--- Content provided by FirstRanker.com ---
0.21
4
--- Content provided by FirstRanker.com ---
O , O
0
--- Content provided by FirstRanker.com ---
22
1
--- Content provided by FirstRanker.com ---
0.2
1
--- Content provided by FirstRanker.com ---
5O , O
2
--- Content provided by FirstRanker.com ---
0
0
--- Content provided by FirstRanker.com ---
31.0
1
--- Content provided by FirstRanker.com ---
6
O ,
--- Content provided by FirstRanker.com ---
O0
2
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
0.42
3
--- Content provided by FirstRanker.com ---
O , O
1
--- Content provided by FirstRanker.com ---
11
2
--- Content provided by FirstRanker.com ---
0.6
2
--- Content provided by FirstRanker.com ---
4O ,
O
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
12
0.6
--- Content provided by FirstRanker.com ---
2
5
--- Content provided by FirstRanker.com ---
O ,
--- Content provided by FirstRanker.com ---
O0
1
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
0.42
6
--- Content provided by FirstRanker.com ---
O , O
0
--- Content provided by FirstRanker.com ---
12
2
--- Content provided by FirstRanker.com ---
0.4
3
--- Content provided by FirstRanker.com ---
4O , O
0
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
20.4
3
--- Content provided by FirstRanker.com ---
5
O , O
--- Content provided by FirstRanker.com ---
10
1
--- Content provided by FirstRanker.com ---
3
0.8
--- Content provided by FirstRanker.com ---
36
O , O
--- Content provided by FirstRanker.com ---
1
1
--- Content provided by FirstRanker.com ---
12
0.6
--- Content provided by FirstRanker.com ---
4
5
--- Content provided by FirstRanker.com ---
O , O0
2
--- Content provided by FirstRanker.com ---
2
1
--- Content provided by FirstRanker.com ---
0.24
6
--- Content provided by FirstRanker.com ---
O , O
0
--- Content provided by FirstRanker.com ---
22
1
--- Content provided by FirstRanker.com ---
0.2
5
--- Content provided by FirstRanker.com ---
6We form the matching coefficient matrix for the objects under
--- Content provided by FirstRanker.com ---
consideration by entering the simple matching coefficients against the pairs of
objects. It is a symmetric matrix since C(P,Q) = C(Q,P). In the present problem,
--- Content provided by FirstRanker.com ---
we get the following matrix.Object
1
--- Content provided by FirstRanker.com ---
2 3 4 5
6
--- Content provided by FirstRanker.com ---
1 0.2 0.8 0.2 0.2 1
Object 1
--- Content provided by FirstRanker.com ---
0.2 1 0.4 0.6 0.6 0.4
--- Content provided by FirstRanker.com ---
2
0.8 0.4 1 0.4 0.4 0.8
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
30.2 0.6 0.4 1 0.6 0.2
--- Content provided by FirstRanker.com ---
4
--- Content provided by FirstRanker.com ---
0.2 0.6 0.4 0.6 1 0.2--- Content provided by FirstRanker.com ---
5
1 0.4 0.8 0.2 0.2 1
--- Content provided by FirstRanker.com ---
6--- Content provided by FirstRanker.com ---
Consider the matching coefficients of pairs of distinct objects. Here there are 15
such pairs. The maximum among them is 1 = C( O , O ). Thus O and O have
--- Content provided by FirstRanker.com ---
16
1
--- Content provided by FirstRanker.com ---
6
the maximum similarity. Therefore, they can be put in a cluster. The next
--- Content provided by FirstRanker.com ---
maximum matching coefficient is 0.8 possessed by the pairs ( O , O ) and
--- Content provided by FirstRanker.com ---
13
( O , O ). Therefore the objects O , O , O can be clubbed together. The next
--- Content provided by FirstRanker.com ---
3
6
--- Content provided by FirstRanker.com ---
13
6
--- Content provided by FirstRanker.com ---
maximum matching coefficient is 0.6 possessed by the pairs
( O , O ), ( O , O ) and ( O , O ). So the objects O , O , O can be considered
--- Content provided by FirstRanker.com ---
24
2
--- Content provided by FirstRanker.com ---
5
4
--- Content provided by FirstRanker.com ---
52
4
--- Content provided by FirstRanker.com ---
5
together. Since we have exhausted all the objects, the process is now complete.
--- Content provided by FirstRanker.com ---
Result: Thus we have arrived at Cluster I: {O ,O ,O and Cluster II:1
3
--- Content provided by FirstRanker.com ---
6}
{O ,O ,O .
--- Content provided by FirstRanker.com ---
24
5}
--- Content provided by FirstRanker.com ---
II. Method of Cluster Analysis for Quantitative Data Hierarchical Cluster
--- Content provided by FirstRanker.com ---
AnalysisThe aim of the hierarchical cluster analysis is to put the given objects
--- Content provided by FirstRanker.com ---
into various clusters and to arrange the clusters in a hierarchical order. A cluster
will consist of similar objects. Dissimilar objects will be put into different
--- Content provided by FirstRanker.com ---
clusters. The clusters so formed will be arranged such that two clusters whichcontain somewhat similar objects will be grouped together. Two clusters which
contain extremely dissimilar objects will stand apart in the hierarchical order.
--- Content provided by FirstRanker.com ---
Steps in hierarchical cluster analysis
--- Content provided by FirstRanker.com ---
The hierarchical cluster analysis comprises of the following steps.
1.
--- Content provided by FirstRanker.com ---
Collect the necessary data in a matrix form. The columns in the matrix
denote the objects taken for examination and the rows denote the attributes that
--- Content provided by FirstRanker.com ---
describe the objects. This matrix is called the data matrix.2.
--- Content provided by FirstRanker.com ---
Standardize the data matrix.
--- Content provided by FirstRanker.com ---
3.
--- Content provided by FirstRanker.com ---
Use the data matrix or the standardized data matrix to determine thevalues of "resemblance coefficient". It is measure of similarities among pairs of
objects.
--- Content provided by FirstRanker.com ---
4.
--- Content provided by FirstRanker.com ---
By means of the values of the resemblance coefficient, construct adiagram called a dendogram. It is a tree-like structure. A tree will exhibit the
different clusters into which the given set of objects is decomposed. The tree
--- Content provided by FirstRanker.com ---
will indicate the hierarchy of similarities among different pairs of objects. This
is the reason for calling the method as hierarchical cluster analysis.
--- Content provided by FirstRanker.com ---
Illustrative problem 3A marketing manager wishes to examine the sales performance of 4 sales
persons P,Q,R,S in his division by means of cluster analysis. Records indicating
--- Content provided by FirstRanker.com ---
their performance in the past 6 months are collected in the following table.
Unit: Rs. In lakhs
--- Content provided by FirstRanker.com ---
Sales PerformanceMonth
P Q R S
--- Content provided by FirstRanker.com ---
January
20
--- Content provided by FirstRanker.com ---
2225
23
--- Content provided by FirstRanker.com ---
February
22
--- Content provided by FirstRanker.com ---
2327
24
--- Content provided by FirstRanker.com ---
March
24
--- Content provided by FirstRanker.com ---
2428
25
--- Content provided by FirstRanker.com ---
April
19
--- Content provided by FirstRanker.com ---
2122
20
--- Content provided by FirstRanker.com ---
May
20
--- Content provided by FirstRanker.com ---
2224
21
--- Content provided by FirstRanker.com ---
June
21
--- Content provided by FirstRanker.com ---
2325
24
--- Content provided by FirstRanker.com ---
Help the manager in arranging the sales persons in a hierarchical order
--- Content provided by FirstRanker.com ---
according to their sales performance.Solution:
First we construct a Euclidean distance matrix. This matrix is formed
--- Content provided by FirstRanker.com ---
by entering the Euclidean distances against the pairs of objects. In our context,
--- Content provided by FirstRanker.com ---
Euclidean distance does not refer to any geographical distance. It is a relative
measure of the performance of two sales persons over the given period of time.
--- Content provided by FirstRanker.com ---
It will indicate which two sales persons are similar in their performance andwhich two sales persons are extremely different in their performance.
--- Content provided by FirstRanker.com ---
Assume that there are n data values for each sales person. Denote the
sales data of two persons by vectors P and Q as follows:
--- Content provided by FirstRanker.com ---
P = ( X , X ,..., X1
2
--- Content provided by FirstRanker.com ---
n )
Q = (Y ,Y ,...,Y
--- Content provided by FirstRanker.com ---
12
n )
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Then the Euclidean distance between them is denoted by d(P,Q) and isdefined by the following relation:
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
2d(P,Q) =
( X -Y + X -Y +...+ X -Y
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
11 )
( 2 2)
--- Content provided by FirstRanker.com ---
( n n )
--- Content provided by FirstRanker.com ---
(1)Note that d(P,P) = 0 and d(Q,P) = d(P,Q). In the problem under consideration, n
= 6. For the 4 sales persons P,Q,R,S, we have to calculate the 6 quantities
--- Content provided by FirstRanker.com ---
d(P,Q), d(P,R), d(P,S), d(Q,R), d(Q,S), d(R,S). We have
P = (20, 22, 24,19, 20, 2 )
--- Content provided by FirstRanker.com ---
1Q = (22, 23, 24, 21, 22, 23)
R = (25, 27, 28, 22, 24, 25)
--- Content provided by FirstRanker.com ---
S = (23, 24, 25, 20, 21, 24)
Using formula (1), calculate the Euclidean distances. We obtain
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2d (P,Q) = (20 - 22)2 + (22 - 23)2 + (24 - 24)2 + (19 - 2 )2
1 + (20 - 22)2 + (21- 23)
--- Content provided by FirstRanker.com ---
= ( 2
- )2 + (- )2
--- Content provided by FirstRanker.com ---
1 + (0)2 + ( 2- )2 + (-2)2 + (-2)2
= 4 +1+ 0 + 4 + 4 + 4
--- Content provided by FirstRanker.com ---
= 17= 4.1
--- Content provided by FirstRanker.com ---
correct to 1 place of decimals. Next we get2
d (P, R) = (20 - 25)2 + (22 - 27)2 + (24 - 28)2 + (19 - 22)2 + (20 - 24)2 + (21- 25)
--- Content provided by FirstRanker.com ---
= ( 5
- )2 + ( 5
--- Content provided by FirstRanker.com ---
- )2 + ( 4- )2 + ( 3
- )2 + (-4)2 + (-4)2
--- Content provided by FirstRanker.com ---
= 25 + 25 +16 + 9 +16 +16
= 107
=10.3
--- Content provided by FirstRanker.com ---
2
d P S = (
--- Content provided by FirstRanker.com ---
- )2 + ( - )2 + ( - )2 + ( - )2 + ( - )2
( , )
--- Content provided by FirstRanker.com ---
20 2322 24
24 25
--- Content provided by FirstRanker.com ---
19 20
20 21 + (21- 24)
--- Content provided by FirstRanker.com ---
= 9 + 4 +1+1+1+ 9= 25
= 5
--- Content provided by FirstRanker.com ---
2
d Q R = (
--- Content provided by FirstRanker.com ---
- )2 + ( - )2 + ( - )2 + ( - )2 + ( - )2( , )
22 25
--- Content provided by FirstRanker.com ---
23 27
24 28
--- Content provided by FirstRanker.com ---
21 2222 24
+ (23- 25)
--- Content provided by FirstRanker.com ---
= 9 +16 +16 +1+ 4 + 4
= 50
= 7.1
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
d Q S = (- )2 + ( - )2 + ( - )2 + ( - )2 + ( - )2 + ( - )2
( , )
--- Content provided by FirstRanker.com ---
22 23
23 24
--- Content provided by FirstRanker.com ---
24 2521 20
22 21
--- Content provided by FirstRanker.com ---
23 24
= 1+1+1+1+1+1
= 6
--- Content provided by FirstRanker.com ---
= 2.42
--- Content provided by FirstRanker.com ---
d R S = (
- )2 + ( - )2 + ( - )2 + ( - )2 + ( - )2
--- Content provided by FirstRanker.com ---
( , )25 23
27 24
--- Content provided by FirstRanker.com ---
28 25
22 20
--- Content provided by FirstRanker.com ---
24 21 + (25 - 24)= 4 + 9 + 9 + 4 + 9 +1
= 36
= 6
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The following Euclidean distance matrix is got for the sales personsP,Q,R and S.
P
--- Content provided by FirstRanker.com ---
Q
R
--- Content provided by FirstRanker.com ---
SP -
4.1 10.3
--- Content provided by FirstRanker.com ---
5
Q 4.1
--- Content provided by FirstRanker.com ---
-7.1
2.4
--- Content provided by FirstRanker.com ---
R 10.3
--- Content provided by FirstRanker.com ---
7.1-
6
--- Content provided by FirstRanker.com ---
S 5
2.4
--- Content provided by FirstRanker.com ---
6-
--- Content provided by FirstRanker.com ---
Determination of Dendogram:
--- Content provided by FirstRanker.com ---
We adopt a procedure called single linkage clustering method (SLINK).This is based on the concept of nearest neighbours.
--- Content provided by FirstRanker.com ---
Consider the distance between different persons. They are d(P,Q),
d(P,R), d(P,S), d(Q,R), d(Q,S), d(R,S). i.e., 4.1, 10.3, 5, 7.1, 2.4, 6
--- Content provided by FirstRanker.com ---
The minimum among them is 2.4 = d(Q,S). Thus Q and S are the nearestneighbours. Therefore, Q and S are selected to form a cluster at the first level,
denoted by {Q,S}. Next, we have to add another object to the list {Q,S}. The
--- Content provided by FirstRanker.com ---
remaining elements are P and R. We have to decide whether P should be added
--- Content provided by FirstRanker.com ---
to the list {Q,S} or R should be added. So we have to determine which amongP, R is nearer to the set {Q,S}. We consider the quantities
d ((Q, S ), P) = Minimum d (Q, P), d (S, P)
--- Content provided by FirstRanker.com ---
= Minimum [4.1,5] = 4.1
--- Content provided by FirstRanker.com ---
d ((Q, S ), R) = Minimum d (Q, R), d (S, R)= Minimum [7.1,6] = 6
--- Content provided by FirstRanker.com ---
Among these two quantities, we find Minimum [d((Q,S),P), d((Q,S),R)] =
Minimum [4.1,6] = 4.1 = d((Q,S),P).
--- Content provided by FirstRanker.com ---
Therefore, P is nearer to the cluster {Q,S} rather than R. Consequently Pis attached with the set {Q,S} and so we obtain the cluster {{Q,S}, P}. This is
the cluster at the second level. If there are other objects remaining, we have to
--- Content provided by FirstRanker.com ---
repeat the above procedure. In the present case, there is only one object
remaining i.e., R. We add R to the cluster ((Q,S),P) to form the cluster at the
--- Content provided by FirstRanker.com ---
third level. We note thatd ((Q, S ), P), R = Minimum d (Q, R), d (S, R), d (P, R)
--- Content provided by FirstRanker.com ---
= Minimum [7.1,6,10. ]
3 = 6
--- Content provided by FirstRanker.com ---
Using these values, we obtain the following diagram:--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Dendogram
--- Content provided by FirstRanker.com ---
Inference
--- Content provided by FirstRanker.com ---
It is seen that sales persons Q, S are similar in their performance over the
given period of time. The next sales person somewhat similar to them is P. The
--- Content provided by FirstRanker.com ---
sales person R stands apart.--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
QUESTIONS1.
--- Content provided by FirstRanker.com ---
Explain the objective of cluster analysis.
--- Content provided by FirstRanker.com ---
2.Briefly describe how cluster analysis is carried out.
3.
--- Content provided by FirstRanker.com ---
State the properties of simple matching coefficient.
4.
--- Content provided by FirstRanker.com ---
Describe the methods of obtaining pessimistic, moderate and optimisticestimates of the similarity between two objects.
5.
--- Content provided by FirstRanker.com ---
Explain object-attribute incidence matrix.
6.
--- Content provided by FirstRanker.com ---
Explain matching coefficient matrix.7.
What are the steps in hierarchical cluster analysis?
--- Content provided by FirstRanker.com ---
8.
What is Euclidean distance matrix? Explain.
--- Content provided by FirstRanker.com ---
9.What is a dendogram? Explain.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT IV
--- Content provided by FirstRanker.com ---
7. FACTOR ANALYSIS AND CONJOINT ANALYSISLesson Outline
--- Content provided by FirstRanker.com ---
?
Factor Analysis
--- Content provided by FirstRanker.com ---
?Conjoint Analysis
?
--- Content provided by FirstRanker.com ---
Steps in Development of Conjoint Analysis
?
--- Content provided by FirstRanker.com ---
Applications of Conjoint Analysis?
Advantages and disadvantages of Conjoint Analysis
--- Content provided by FirstRanker.com ---
?
Illustrative problems
--- Content provided by FirstRanker.com ---
?Multi-factor evaluation approach in Conjoint Analysis
?
--- Content provided by FirstRanker.com ---
Two-factor evaluation approach in Conjoint Analysis
Learning Objectives
--- Content provided by FirstRanker.com ---
After reading this lesson you should be able to- understand the concept of Factor Analysis
- understand the managerial applications of Factor Analysis
--- Content provided by FirstRanker.com ---
- understand the concept of Conjoint Analysis
- apply rating scale technique in Conjoint Analysis
--- Content provided by FirstRanker.com ---
- apply ranking method in Conjoint Analysis- apply mini-max scaling method in Conjoint Analysis
- understand Multi-factor evaluation approach
--- Content provided by FirstRanker.com ---
- understand Two-factor evaluation approach
- understand the managerial applications of Conjoint Analysis
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
PART I - FACTOR ANALYSIS
--- Content provided by FirstRanker.com ---
In a real life situation, several variables are operating. Some variables
--- Content provided by FirstRanker.com ---
may be highly correlated among themselves. For example, if manager of arestaurant has to analyse six attributes of a new product. He undertakes a sample
survey and finds out the responses of potential consumers. He obtains the
--- Content provided by FirstRanker.com ---
following attribute correlation matrix.
Attribute
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
34
5
--- Content provided by FirstRanker.com ---
6
Attribute
--- Content provided by FirstRanker.com ---
1 1.00 0.05 0.10 0.95 0.20 0.022 0.05 1.00 0.15 0.10 0.60 0.85
3 0.10 0.15 1.00 0.50 0.55 0.10
--- Content provided by FirstRanker.com ---
4 0.95 0.10 0.50 1.00 0.12 0.08
5 0.20 0.60 0.55 0.12 1.00 0.80
--- Content provided by FirstRanker.com ---
6 0.02 0.85 0.10 0.08 0.80 1.00Attribute Correlation Matrix
--- Content provided by FirstRanker.com ---
We try to group the attributes by their correlations. The high correlation
--- Content provided by FirstRanker.com ---
values are observed for the following attributes:Attributes 1, 4 with a very high correlation coefficient of 0.95.
--- Content provided by FirstRanker.com ---
Attributes 2, 4 with a high correlation coefficient of 0.85.
--- Content provided by FirstRanker.com ---
Attributes 3, 4 with a high correlation coefficient of 0.85.--- Content provided by FirstRanker.com ---
As a result, it is seen that not all the attributes are independent. The attributes
--- Content provided by FirstRanker.com ---
1 and 4 have mutual influence on each other while the attributes 2, 5 and 6 have
mutual influence among themselves. As far as attribute 3 is concerned, it has
--- Content provided by FirstRanker.com ---
little correlation with the attributes 1, 2 and 6. Even with the other attributes 4and 5, its correlation is not high. However, we can say that attribute 3 is
somewhat closer to the variables 4 and 5 rather than the attributes 1, 2 and 6.
--- Content provided by FirstRanker.com ---
Thus, from the given list of 6 attributes, it is possible to find out 2 or 3 common
factors as follows:
--- Content provided by FirstRanker.com ---
I.
1) The common features of the attributes 1,3,4 will give a factor
--- Content provided by FirstRanker.com ---
2) The common features of the attributes 2, 5, 6 will give a factor
--- Content provided by FirstRanker.com ---
orII.
--- Content provided by FirstRanker.com ---
1) The common features of the attributes 1,4 will give a factor
2) The common features of the attributes 2,5,6 will give a factor
--- Content provided by FirstRanker.com ---
3) The attribute 3 can be considered to be an independent factorThe factor analysis is a multivariate method. It is a statistical technique
--- Content provided by FirstRanker.com ---
to identify the underlying factors among a large number of interdependent
variables. It seeks to extract common factor variances from a given set of
--- Content provided by FirstRanker.com ---
observations. It splits a number of attributes or variables into a smaller group ofuncorrelated factors. It determines which variables belong together. This method
is suitable for the cases with a number of variables having a high degree of
--- Content provided by FirstRanker.com ---
correlation.
--- Content provided by FirstRanker.com ---
In the above example, we would like to filter down the attributes 1, 4
--- Content provided by FirstRanker.com ---
into a single attribute. Also we would like to do the same for the attributes 2, 5,6. If a set of attributes (variables) A1, A2, ..., Ak filter down to an attribute Ai
(1 i k), we say that these attributes are loaded on the factor Ai or saturated
--- Content provided by FirstRanker.com ---
with the factor Ai. Sometimes, more than one factor also may be identified.
--- Content provided by FirstRanker.com ---
Basic concepts in factor analysis
--- Content provided by FirstRanker.com ---
The following are the key concepts on which factor analysis is based.
--- Content provided by FirstRanker.com ---
Factor: A factor plays a fundamental role among a set of attributes or variables.These variables can be filtered down to the factor. A factor represents the
combined effect of a set of attributes. Either there may be one such factor or
--- Content provided by FirstRanker.com ---
several such factors in a real life problem based on the complexity of the
situation and the number of variables operating.
--- Content provided by FirstRanker.com ---
Factor loading: A factor loading is a value that explains how closely the
variables are related to the factor. It is the correlation between the factor and the
--- Content provided by FirstRanker.com ---
variable. While interpreting a factor, the absolute value of the factor is taken into
account.
--- Content provided by FirstRanker.com ---
Communality: It is a measure of how much each variable is accounted for bythe underlying factors together. It is the sum of the squares of the loadings of the
variable on the common factors. If A,B,C,... are the factors, then the
--- Content provided by FirstRanker.com ---
communality of a variable is computed using the relation
h2 = ( The factor loading of the variable with respect to factor A)2 +
--- Content provided by FirstRanker.com ---
( The factor loading of the variable with respect to factor B)2 +( The factor loading of the variable with respect to factor C)2 + .....
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Eigen value: The sum of the squared values of factor loadings pertaining to a
factor is called an Eigen value. It is a measure of the relative importance of each
--- Content provided by FirstRanker.com ---
factor under consideration.Total Sum of Squares (TSS)
--- Content provided by FirstRanker.com ---
It is the sum of the Eigen values of all the factors.
--- Content provided by FirstRanker.com ---
Application of Factor Analysis:
--- Content provided by FirstRanker.com ---
1. Model building for new product development:
--- Content provided by FirstRanker.com ---
As pointed out earlier, a real life situation is highly complex and it
consists of several variables. A model for the real life situation can be built by
--- Content provided by FirstRanker.com ---
incorporating as many features of the situation as possible. But then, with a
multitude of features, it is very difficult to build such a highly idealistic model.
--- Content provided by FirstRanker.com ---
A practical way is to identify the important variables and incorporate them in themodel. Factor analysis seeks to identify those variables which are highly
correlated among themselves and find a common factor which can be taken as a
--- Content provided by FirstRanker.com ---
representative of those variables. Based on the factor loading, some of variables
can be merged together to give a common factor and then a model can be built
--- Content provided by FirstRanker.com ---
by incorporating such factors. Identification of the most common features of aproduct preferred by the consumers will be helpful in the development of new
products.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2. Model building for consumers:
--- Content provided by FirstRanker.com ---
Another application of factor analysis is to carry out a similar exercisefor the respondents instead of the variables themselves. Using the factor loading,
the respondents in a research survey can be sorted out into various groups in
--- Content provided by FirstRanker.com ---
such a way that the respondents in a group have more or less homogeneous
opinions on the topics of the survey. Thus a model can be constructed on the
--- Content provided by FirstRanker.com ---
groups of consumers. The results emanating from such an exercise will guidethe management in evolving appropriate strategies towards market
segmentation.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
PART II - CONJOINT ANALYSIS
Introduction
--- Content provided by FirstRanker.com ---
Everything in the world is undergoing a change. There is a proverb
--- Content provided by FirstRanker.com ---
saying that "the old order changes, yielding place to new". Due to rapidadvancement in science and technology, there is fast communication across the
world. Consequently, the whole world has shrunk into something like a village
--- Content provided by FirstRanker.com ---
and thus now-a-days one speaks of the "global village". Under the present set-
up, one can purchase any product of his choice from whatever part of the world
--- Content provided by FirstRanker.com ---
it may be available. Because of this reason, what was a seller's market a fewyears back has transformed into a buyer's market now.
--- Content provided by FirstRanker.com ---
In a seller's market of yesterday, the manufacturer or the seller could
pass on a product according to his own perceptions and prescriptions. In the
--- Content provided by FirstRanker.com ---
buyer's market of today, a buyer decides what he should purchase, what shouldbe the quality of the product, how much to purchase, where to purchase, when to
--- Content provided by FirstRanker.com ---
purchase, at what cost to purchase, from whom to purchase, etc. A manager is
perplexed at the way a consumer takes a decision on the purchase of a product.
--- Content provided by FirstRanker.com ---
In this background, conjoint analysis is an effective tool to understand a buyer's
preferences for a good or service.
--- Content provided by FirstRanker.com ---
Meaning of Conjoint AnalysisA product or service has several attributes. By an attribute, we mean a
characteristic, a property, a feature, a quality, a specification or an aspect. A
--- Content provided by FirstRanker.com ---
buyer's decision to purchase a good or service is based on not just one attribute
but a combination of several attributes. i.e., he is concerned with a join of
--- Content provided by FirstRanker.com ---
attributes.Therefore, finding out the consumer's preferences for individual
--- Content provided by FirstRanker.com ---
attributes of a product or service may not yield accurate results for a marketing
research problem. In view of this fact, conjoint analysis seeks to find out the
--- Content provided by FirstRanker.com ---
consumer's preferences for a `join of attributes', i.e., a combination of severalattributes.
--- Content provided by FirstRanker.com ---
Let us consider an example. Suppose a consumer desires to purchase a
wrist watch. He would take into consideration several attributes of a wrist
--- Content provided by FirstRanker.com ---
watch, namely the configuration details such as mechanism, size, dial,appearance, colour and other particulars such as strap, price, durability,
warranty, after-sales service, etc. If a consumer is asked what the important
--- Content provided by FirstRanker.com ---
aspect among the above list is, he would reply that all attributes are important
for him and so a manager cannot arrive at a decision on the design of a wrist
--- Content provided by FirstRanker.com ---
watch. Conjoint analysis assumes that the buyer will base his decision not onjust the individual attributes of the product but rather he would consider various
combinations of the attributes, such as
--- Content provided by FirstRanker.com ---
`mechanism, colour, price, after-sales service',
--- Content provided by FirstRanker.com ---
or `dial, colour, durability, warranty',--- Content provided by FirstRanker.com ---
or `dial, appearance, price, durability', etc.
--- Content provided by FirstRanker.com ---
This analysis would enable a manager in his decision making process in the
identification of some of the preferred combinations of the features of the
--- Content provided by FirstRanker.com ---
product.The rank correlation method seeks to assess the consumer's preferences
for individual attributes. In contrast, the conjoint analysis seeks to assess the
--- Content provided by FirstRanker.com ---
consumer's preferences for combinations (or groups) of attributes of a product
or a service. This method is also called an `unfolding technique' because
--- Content provided by FirstRanker.com ---
preferences on groups of attributes unfold from the rankings expressed by theconsumers. Another name for this method is `multi-attribute compositional
model' because it deals with combinations of attributes.
--- Content provided by FirstRanker.com ---
Steps in the Development of Conjoint Analysis
--- Content provided by FirstRanker.com ---
The development of conjoint analysis comprises of the following steps:1.
Collect a list of the attributes (features) of a product or a service.
--- Content provided by FirstRanker.com ---
2.
For each attribute, fix a certain number of points or marks. The more the
--- Content provided by FirstRanker.com ---
number of points for an attribute, the more serious the consumers' concern onthat attribute.
3.
--- Content provided by FirstRanker.com ---
Select a list of combinations of various attributes.
4.
--- Content provided by FirstRanker.com ---
Decide a mode of presentation of the attributes to the respondents of thestudy i.e., whether it should be in written form, or oral form, or a pictorial
representation etc.
--- Content provided by FirstRanker.com ---
5.
Inform the combinations of the attributes to the prospective customers.
--- Content provided by FirstRanker.com ---
6.Request the respondents to rank the combinations, or to rate them on a
suitable scale, or to choose between two different combinations at a time.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
7.
Decide a procedure to aggregate the responses from the consumers. Any
--- Content provided by FirstRanker.com ---
one of the following procedures may be adopted:(i). Go by the individual responses of the consumers.
--- Content provided by FirstRanker.com ---
(ii). Put all the responses together and construct a single utility
function.
--- Content provided by FirstRanker.com ---
(iii). Split the responses into a certain number of segments such thatwithin each segment, the preferences would be similar.
--- Content provided by FirstRanker.com ---
8.
Choose the appropriate technique to analyze the data collected from the
--- Content provided by FirstRanker.com ---
respondents.9.
--- Content provided by FirstRanker.com ---
Identify the most preferred combination of attributes.
--- Content provided by FirstRanker.com ---
10.Incorporate the result in designing a new product, construction of an
advertisement copy, etc.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Applications of Conjoint Analysis1.
An idea of consumer's preferences for combinations of attributes will be
--- Content provided by FirstRanker.com ---
useful in designing new products or modification of an existing product.
--- Content provided by FirstRanker.com ---
2.A forecast of the profits to be earned by a product or a service.
--- Content provided by FirstRanker.com ---
3.
A forecast of the market share for the company's product.
--- Content provided by FirstRanker.com ---
4.
A forecast of the shift in brand loyalty of the consumers.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
5.
A forecast of differences in responses of various segments of the
--- Content provided by FirstRanker.com ---
product.6.
Formulation of marketing strategies for the promotion of the product.
--- Content provided by FirstRanker.com ---
7.
Evaluation of the impact of alternative advertising strategies.
--- Content provided by FirstRanker.com ---
8.A forecast of the consumers' reaction to pricing policies.
9.
--- Content provided by FirstRanker.com ---
A forecast of the consumers' reaction on the channels of distribution.
10.
--- Content provided by FirstRanker.com ---
Evolving an appropriate marketing mix.11.
Even though the technique of conjoint analysis was developed for the
--- Content provided by FirstRanker.com ---
formulation of corporate strategy, this method can be used to have a
comprehensive knowledge of a wide range of areas such as family decision
--- Content provided by FirstRanker.com ---
making process, pharmaceuticals, tourism development, public transport system,etc.
Advantages of Conjoint Analysis
--- Content provided by FirstRanker.com ---
1.
The analysis can be carried out on physical variables.
--- Content provided by FirstRanker.com ---
2.
Preferences by different individuals can be measured and pooled
--- Content provided by FirstRanker.com ---
together to arrive at a decision.
--- Content provided by FirstRanker.com ---
Disadvantages of Conjoint Analysis1.
When more and more attributes of a product are included in the study,
--- Content provided by FirstRanker.com ---
the number of combinations of attributes also increases, rendering the study
highly difficult. Consequently, only a few selected attributes can be included in
--- Content provided by FirstRanker.com ---
the study.2.
Gathering of information from the respondents will be a tough job.
--- Content provided by FirstRanker.com ---
3.
Whenever novel combinations of attributes are included, the respondents
--- Content provided by FirstRanker.com ---
will have difficulty in capturing such combinations.4.
The psychological measurements of the respondents may not be
--- Content provided by FirstRanker.com ---
accurate.
--- Content provided by FirstRanker.com ---
In spite of the above stated disadvantages, conjoint analysis offers more
--- Content provided by FirstRanker.com ---
scope to the researchers in identifying the consumers' preferences for groups ofattributes.
Illustrative Problem 1 : Application of Rating Scale Technique
--- Content provided by FirstRanker.com ---
A wrist watch manufacturer desires to find out the combinations of attributes
that a consumer would be interested in. After considering several attributes, the
--- Content provided by FirstRanker.com ---
manufacturer identifies the following combinations of attributes for carrying outmarketing research.
Combination ? I
--- Content provided by FirstRanker.com ---
Mechanism, colour, price, after-scales service
Combination ? II
--- Content provided by FirstRanker.com ---
Dial, colour, durability, warrantyCombination ? III
--- Content provided by FirstRanker.com ---
Dial, appearance, price, durability
Combination ? IV
--- Content provided by FirstRanker.com ---
Mechanism, dial, price, warranty12 respondents are asked to rate the 4 combinations on the following 3-point
--- Content provided by FirstRanker.com ---
rating scale.
Scale ? 1
--- Content provided by FirstRanker.com ---
:Less important
Scale ? 2
--- Content provided by FirstRanker.com ---
:
Somewhat important
--- Content provided by FirstRanker.com ---
Scale ? 3:
Very important
--- Content provided by FirstRanker.com ---
Their responses are given in the following table:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Rating of CombinationRespondent Combination I Combination
Combination
--- Content provided by FirstRanker.com ---
Combination
No.
--- Content provided by FirstRanker.com ---
IIIII
IV
--- Content provided by FirstRanker.com ---
1
Less
--- Content provided by FirstRanker.com ---
SomewhatVery
Somewhat
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
importantimportant
2
--- Content provided by FirstRanker.com ---
Somewhat
Very
--- Content provided by FirstRanker.com ---
LessSomewhat
important
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
important3
Somewhat
--- Content provided by FirstRanker.com ---
Less
Somewhat
--- Content provided by FirstRanker.com ---
Very importantimportant
important
--- Content provided by FirstRanker.com ---
important
4
--- Content provided by FirstRanker.com ---
LessLess
Very
--- Content provided by FirstRanker.com ---
Somewhat
important
--- Content provided by FirstRanker.com ---
importantimportant
important
--- Content provided by FirstRanker.com ---
5
Somewhat
--- Content provided by FirstRanker.com ---
VeryVery
Less important
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
important6
Somewhat
--- Content provided by FirstRanker.com ---
Very
Somewhat
--- Content provided by FirstRanker.com ---
Less importantimportant
important
--- Content provided by FirstRanker.com ---
important
7
--- Content provided by FirstRanker.com ---
SomewhatLess
Very
--- Content provided by FirstRanker.com ---
Less important
important
--- Content provided by FirstRanker.com ---
importantimportant
8
--- Content provided by FirstRanker.com ---
Very
Somewhat
--- Content provided by FirstRanker.com ---
LessSomewhat
important
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
important9
Very
--- Content provided by FirstRanker.com ---
Less
Somewhat
--- Content provided by FirstRanker.com ---
Somewhatimportant
important
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
10 SomewhatVery
Less
--- Content provided by FirstRanker.com ---
Somewhat
important
--- Content provided by FirstRanker.com ---
importantimportant
important
--- Content provided by FirstRanker.com ---
11 Very Somewhat
Very
--- Content provided by FirstRanker.com ---
Somewhatimportant
important
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
12 Very LessVery
Somewhat
--- Content provided by FirstRanker.com ---
important
important
--- Content provided by FirstRanker.com ---
importantimportant
--- Content provided by FirstRanker.com ---
Determine the most important and the least important combinations of the
--- Content provided by FirstRanker.com ---
attributes.
Solution:
--- Content provided by FirstRanker.com ---
Let us assign scores to the scales as follows:
Sl. No.
--- Content provided by FirstRanker.com ---
Scale
Score
--- Content provided by FirstRanker.com ---
1Less important
1
--- Content provided by FirstRanker.com ---
2
Somewhat important
--- Content provided by FirstRanker.com ---
33
Very important
--- Content provided by FirstRanker.com ---
5
--- Content provided by FirstRanker.com ---
The scores for the four combinations are calculated as follows:
Score for
--- Content provided by FirstRanker.com ---
No. of
Combination Response
--- Content provided by FirstRanker.com ---
Total ScoreResponse
Respondents
--- Content provided by FirstRanker.com ---
Less important
1
--- Content provided by FirstRanker.com ---
21 X 2 = 2
Somewhat
--- Content provided by FirstRanker.com ---
I
3
--- Content provided by FirstRanker.com ---
63 X 6 = 18
important
--- Content provided by FirstRanker.com ---
5
4
--- Content provided by FirstRanker.com ---
5 X 4 = 20Very important
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1240
Less important
--- Content provided by FirstRanker.com ---
1
5
--- Content provided by FirstRanker.com ---
1 X 5 = 5Somewhat
II
--- Content provided by FirstRanker.com ---
3
3
--- Content provided by FirstRanker.com ---
3 X 3 = 9important
5
--- Content provided by FirstRanker.com ---
4
5 X 4 = 20
--- Content provided by FirstRanker.com ---
Very important--- Content provided by FirstRanker.com ---
12
--- Content provided by FirstRanker.com ---
34Less important
1
--- Content provided by FirstRanker.com ---
3
1 X 3 = 3
--- Content provided by FirstRanker.com ---
SomewhatIII
3
--- Content provided by FirstRanker.com ---
3
3 X 3 = 9
--- Content provided by FirstRanker.com ---
important5
6
--- Content provided by FirstRanker.com ---
5 X 6 = 30
Very important
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
12
42
--- Content provided by FirstRanker.com ---
Less important
--- Content provided by FirstRanker.com ---
13
1 X 3 = 3
--- Content provided by FirstRanker.com ---
Somewhat
IV
--- Content provided by FirstRanker.com ---
38
3 X 8 = 24
--- Content provided by FirstRanker.com ---
important
5
--- Content provided by FirstRanker.com ---
15 X 1 = 5
Very important
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
12
32
--- Content provided by FirstRanker.com ---
Let us tabulate the scores earned by the four combinations as follows:
--- Content provided by FirstRanker.com ---
Combination Totalscores
I
--- Content provided by FirstRanker.com ---
40
II
--- Content provided by FirstRanker.com ---
34III
42
--- Content provided by FirstRanker.com ---
IV
32
--- Content provided by FirstRanker.com ---
Inference:
It is concluded that the consumers consider combination III as the most
--- Content provided by FirstRanker.com ---
important and combination IV as the least important.
Note: For illustrating the concepts involved, we have taken up 12 respondents in
--- Content provided by FirstRanker.com ---
the above problem. In actual research work, we should take a large number ofrespondents, say 200 or 100. In any case, the number of respondents shall not
be less than 30.
--- Content provided by FirstRanker.com ---
Illustrative Problem 2:
Application of Ranking Method
--- Content provided by FirstRanker.com ---
A marketing manager selects four combinations of features of a productfor study. The following are the ranks awarded by 10 respondents. Rank one
means the most important and rank 4 means the least important.
--- Content provided by FirstRanker.com ---
Respondent
Rank Awarded
--- Content provided by FirstRanker.com ---
No.Combination
--- Content provided by FirstRanker.com ---
I
Combination Combination Combination
--- Content provided by FirstRanker.com ---
II III IV
--- Content provided by FirstRanker.com ---
12
1
--- Content provided by FirstRanker.com ---
3
4
--- Content provided by FirstRanker.com ---
21
4
--- Content provided by FirstRanker.com ---
2
3
--- Content provided by FirstRanker.com ---
31
2
--- Content provided by FirstRanker.com ---
3
4
--- Content provided by FirstRanker.com ---
43
2
--- Content provided by FirstRanker.com ---
4
1
--- Content provided by FirstRanker.com ---
54
1
--- Content provided by FirstRanker.com ---
2
3
--- Content provided by FirstRanker.com ---
61
2
--- Content provided by FirstRanker.com ---
3
4
--- Content provided by FirstRanker.com ---
74
3
--- Content provided by FirstRanker.com ---
2
1
--- Content provided by FirstRanker.com ---
83
1
--- Content provided by FirstRanker.com ---
2
4
--- Content provided by FirstRanker.com ---
93
1
--- Content provided by FirstRanker.com ---
4
2
--- Content provided by FirstRanker.com ---
104
1
--- Content provided by FirstRanker.com ---
2
3
--- Content provided by FirstRanker.com ---
Determine the most important and the least important combinations of the
features of the product.
--- Content provided by FirstRanker.com ---
Solution:
--- Content provided by FirstRanker.com ---
Let us assign scores to the ranks as follows:
Rank Score
--- Content provided by FirstRanker.com ---
1
10
--- Content provided by FirstRanker.com ---
28
3
--- Content provided by FirstRanker.com ---
6
4
--- Content provided by FirstRanker.com ---
4--- Content provided by FirstRanker.com ---
The scores for the 4 combinations are calculated as follows:
Score for
--- Content provided by FirstRanker.com ---
No. of
Combination Rank
--- Content provided by FirstRanker.com ---
Total Scorerank
Respondents
--- Content provided by FirstRanker.com ---
1
10
--- Content provided by FirstRanker.com ---
310 X 3 = 30
2
--- Content provided by FirstRanker.com ---
8
1
--- Content provided by FirstRanker.com ---
8 X 1= 8I
3
--- Content provided by FirstRanker.com ---
6
3
--- Content provided by FirstRanker.com ---
6 X 3 = 184
4
--- Content provided by FirstRanker.com ---
3
4 X 3 = 12
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
10
68
--- Content provided by FirstRanker.com ---
110
5
--- Content provided by FirstRanker.com ---
10 X 5 = 50
2
--- Content provided by FirstRanker.com ---
83
8 X 3 = 24
--- Content provided by FirstRanker.com ---
II
3
--- Content provided by FirstRanker.com ---
61
6 X 1 = 6
--- Content provided by FirstRanker.com ---
4
4
--- Content provided by FirstRanker.com ---
14 X 1 = 4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1084
1
--- Content provided by FirstRanker.com ---
10
Nil
--- Content provided by FirstRanker.com ---
--2
8
--- Content provided by FirstRanker.com ---
5
8 X 5 = 40
--- Content provided by FirstRanker.com ---
III3
6
--- Content provided by FirstRanker.com ---
3
6 X 3 = 18
--- Content provided by FirstRanker.com ---
44
2
--- Content provided by FirstRanker.com ---
4 X 2 = 8
--- Content provided by FirstRanker.com ---
10
--- Content provided by FirstRanker.com ---
66
1
--- Content provided by FirstRanker.com ---
102
10 X 2 = 20
--- Content provided by FirstRanker.com ---
2
8
--- Content provided by FirstRanker.com ---
18 X 1 = 8
IV
--- Content provided by FirstRanker.com ---
3
6
--- Content provided by FirstRanker.com ---
36 X 3 = 18
4
--- Content provided by FirstRanker.com ---
4
4
--- Content provided by FirstRanker.com ---
4 X 4 = 16--- Content provided by FirstRanker.com ---
10
--- Content provided by FirstRanker.com ---
62The final scores for the 4 combinations are as follows:
--- Content provided by FirstRanker.com ---
Combination Score
I
--- Content provided by FirstRanker.com ---
68II
84
--- Content provided by FirstRanker.com ---
III
--- Content provided by FirstRanker.com ---
66
IV
--- Content provided by FirstRanker.com ---
62Inference:
--- Content provided by FirstRanker.com ---
It is seen that combination II is the most preferred one by the consumers
and combination IV is the least preferred one.
--- Content provided by FirstRanker.com ---
Illustrative Problem 3:
--- Content provided by FirstRanker.com ---
Application of Mini-Max Scaling Method
An insurance manager chooses 5 combinations of attributes of a social
--- Content provided by FirstRanker.com ---
security plan for analysis. He requests 10 respondents to indicate theirperceptions on the importance of the combinations by awarding the minimum
score and the maximum score for each combination in the range of 0 to 100.
--- Content provided by FirstRanker.com ---
The details of the responses are given below. Help the manager in the
identification of the most important and the least important combinations of the
--- Content provided by FirstRanker.com ---
attributes of the social security plan.Combination Combination Combination Combination Combination
--- Content provided by FirstRanker.com ---
Respondent
I
--- Content provided by FirstRanker.com ---
IIIII
IV
--- Content provided by FirstRanker.com ---
V
Number
--- Content provided by FirstRanker.com ---
Min Max Min Max Min Max Min Max Min Max1
30
--- Content provided by FirstRanker.com ---
60
45
--- Content provided by FirstRanker.com ---
8550
70
--- Content provided by FirstRanker.com ---
40
75
--- Content provided by FirstRanker.com ---
5080
2
--- Content provided by FirstRanker.com ---
35
65
--- Content provided by FirstRanker.com ---
5080
50
--- Content provided by FirstRanker.com ---
80
35
--- Content provided by FirstRanker.com ---
7540
75
--- Content provided by FirstRanker.com ---
3
40
--- Content provided by FirstRanker.com ---
7035
80
--- Content provided by FirstRanker.com ---
60
80
--- Content provided by FirstRanker.com ---
4070
50
--- Content provided by FirstRanker.com ---
80
4
--- Content provided by FirstRanker.com ---
4080
40
--- Content provided by FirstRanker.com ---
80
60
--- Content provided by FirstRanker.com ---
8550
75
--- Content provided by FirstRanker.com ---
60
80
--- Content provided by FirstRanker.com ---
530
75
--- Content provided by FirstRanker.com ---
50
80
--- Content provided by FirstRanker.com ---
6075
60
--- Content provided by FirstRanker.com ---
75
60
--- Content provided by FirstRanker.com ---
856
35
--- Content provided by FirstRanker.com ---
70
35
--- Content provided by FirstRanker.com ---
8550
80
--- Content provided by FirstRanker.com ---
40
80
--- Content provided by FirstRanker.com ---
4080
7
--- Content provided by FirstRanker.com ---
40
80
--- Content provided by FirstRanker.com ---
4075
45
--- Content provided by FirstRanker.com ---
75
50
--- Content provided by FirstRanker.com ---
7040
80
--- Content provided by FirstRanker.com ---
8
--- Content provided by FirstRanker.com ---
30
80
--- Content provided by FirstRanker.com ---
4075
50
--- Content provided by FirstRanker.com ---
80
50
--- Content provided by FirstRanker.com ---
7060
80
--- Content provided by FirstRanker.com ---
9
45
--- Content provided by FirstRanker.com ---
7545
75
--- Content provided by FirstRanker.com ---
50
80
--- Content provided by FirstRanker.com ---
5080
50
--- Content provided by FirstRanker.com ---
80
10
--- Content provided by FirstRanker.com ---
5575
40
--- Content provided by FirstRanker.com ---
85
35
--- Content provided by FirstRanker.com ---
7545
80
--- Content provided by FirstRanker.com ---
40
80
--- Content provided by FirstRanker.com ---
Solution:
For each combination, consider the minimum score and the maximum
--- Content provided by FirstRanker.com ---
score separately and calculate the average in each case.
Combination Combination Combination Combination Combination
--- Content provided by FirstRanker.com ---
I
II
--- Content provided by FirstRanker.com ---
III
IV
--- Content provided by FirstRanker.com ---
VMin Max Min Max Min Max Min Max Min Max
Total
--- Content provided by FirstRanker.com ---
380 730 420
800
--- Content provided by FirstRanker.com ---
510 780 460 750 490 800Average 38
73
--- Content provided by FirstRanker.com ---
42
80
--- Content provided by FirstRanker.com ---
5178
46
--- Content provided by FirstRanker.com ---
75
49
--- Content provided by FirstRanker.com ---
80Consider the mean values obtained for the minimum and maximum of each
--- Content provided by FirstRanker.com ---
combination and calculate the range for each combination as
--- Content provided by FirstRanker.com ---
Range = Maximum value ? Minimum valueThe measure of importance for each combination is calculated as follows:
--- Content provided by FirstRanker.com ---
Measure of Importance for a combination of attributes
--- Content provided by FirstRanker.com ---
Range for that combination=
?100
--- Content provided by FirstRanker.com ---
Sum of the ranges for all the combinations
--- Content provided by FirstRanker.com ---
Tabulate the results as follows:Measure of
Combination
--- Content provided by FirstRanker.com ---
Max. Value
Min. Value
--- Content provided by FirstRanker.com ---
RangeImportance
--- Content provided by FirstRanker.com ---
I
73
--- Content provided by FirstRanker.com ---
38
35
--- Content provided by FirstRanker.com ---
21.875II
80
--- Content provided by FirstRanker.com ---
42
38
--- Content provided by FirstRanker.com ---
23.750III
78
--- Content provided by FirstRanker.com ---
51
27
--- Content provided by FirstRanker.com ---
16.875IV
75
--- Content provided by FirstRanker.com ---
46
29
--- Content provided by FirstRanker.com ---
18.125V
80
--- Content provided by FirstRanker.com ---
49
31
--- Content provided by FirstRanker.com ---
19.375Sum of the ranges
160
--- Content provided by FirstRanker.com ---
100.000
--- Content provided by FirstRanker.com ---
Inference:It is concluded that combination II is the most important one and
combination III is the least important one.
--- Content provided by FirstRanker.com ---
APPROACHES FOR CONJOINT ANALYSIS
--- Content provided by FirstRanker.com ---
The following two approaches are available for conjoint analysis:i.
--- Content provided by FirstRanker.com ---
Multi-factor evaluation approach
ii.
--- Content provided by FirstRanker.com ---
Two-factor evaluation approachMULTI-FACTOR EVALUATION APPROACH IN CONJOINT
--- Content provided by FirstRanker.com ---
ANALYSIS
Suppose a researcher has to analyze n factors. It is possible that each factor can
--- Content provided by FirstRanker.com ---
assume a value in different levels.Product Profile
A product profile is a description of all the factors under consideration, with any
--- Content provided by FirstRanker.com ---
one level for each factor.
Suppose, for example, there are 3 factors with the levels given below.
--- Content provided by FirstRanker.com ---
Factor1 : 3
levels
--- Content provided by FirstRanker.com ---
Factor
2 : 2
--- Content provided by FirstRanker.com ---
levelsFactor
3 : 4
--- Content provided by FirstRanker.com ---
levels
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Then we have 3? 2? 4 = 24 product profiles. For each respondent in theresearch survey, we have to provide 24 data sheets such that each data sheet
contains a distinct profile. In each profile, the respondent is requested to
--- Content provided by FirstRanker.com ---
indicate his preference for that profile in a rating scale of 0 to 10. A rating of 10
indicates that the respondent's preference for that profile is the highest and a
--- Content provided by FirstRanker.com ---
rating of 0 means that he is not all interested in the product with that profile.Example: Consider the product `Refrigerator' with the following factors and
levels:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Factor 1:
capacity of 180 liters; 200 liters; 230 liters
--- Content provided by FirstRanker.com ---
Factor
2 : number
--- Content provided by FirstRanker.com ---
of doors: either 1 or 2Factor
3 : Price
--- Content provided by FirstRanker.com ---
:
Rs. 9000; Rs. 10,000; Rs. 12,000
--- Content provided by FirstRanker.com ---
Sample profile of the product
--- Content provided by FirstRanker.com ---
Profile Number :
Capacity
--- Content provided by FirstRanker.com ---
:200 liters
--- Content provided by FirstRanker.com ---
Number of Doors :
1
--- Content provided by FirstRanker.com ---
Price
--- Content provided by FirstRanker.com ---
:
Rs. 10,000
--- Content provided by FirstRanker.com ---
Rating of Respondent:
(in the scale of 0 to 10)
--- Content provided by FirstRanker.com ---
Steps in Multi-factor Evaluation Approach:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.
--- Content provided by FirstRanker.com ---
Identify the factors or features of a product to be analyzed. If they are toomany, select the important ones by discussion with experts.
2.
--- Content provided by FirstRanker.com ---
Find out the levels for each factor selected in Step 1.
--- Content provided by FirstRanker.com ---
3.
--- Content provided by FirstRanker.com ---
Design all possible product profiles. If there are n factors with levels L1,L2,...Ln respectively, then the total number of profiles = L1L2...Ln.
4.
--- Content provided by FirstRanker.com ---
Select the scaling technique to be adopted for multi-factor evaluation
approach (rating scale or ranking method).
--- Content provided by FirstRanker.com ---
5.Select the list of respondents using the standard sampling technique.
6.
--- Content provided by FirstRanker.com ---
Request each respondent to give his rating scale for all the profiles of the
product. Another way of collecting the responses is to request each respondent
--- Content provided by FirstRanker.com ---
to award ranks to all the profiles: i.e., rank 1 for the best profile, rank 2 for thenext best profile etc.
7.
--- Content provided by FirstRanker.com ---
For each factor profile, collect all the responses from all the participating
respondents in the survey work.
--- Content provided by FirstRanker.com ---
With the rating scale awarded by the respondents, find out the score secured byeach profile.
8.
--- Content provided by FirstRanker.com ---
Tabulate the results in Step 8. Select the profile with the highest score.
This is the most preferred profile.
--- Content provided by FirstRanker.com ---
9.Implement the most preferred profile in the design of a new product.
--- Content provided by FirstRanker.com ---
TWO-FACTOR EVALUATION APPROACH IN CONJOINT ANALYSIS
When several factors with different levels for each factor have to be
--- Content provided by FirstRanker.com ---
analyzed, the respondents will have difficulty in evaluating all the profiles in themulti-factor evaluation approach. Because of this reason, two-factor evaluation
approach is widely used in conjoint analysis.
--- Content provided by FirstRanker.com ---
Suppose there are several factors to be analyzed with different levels of
--- Content provided by FirstRanker.com ---
values for each factor, then we consider any two factors at a time with theirlevels of values. For each such case, we have a data sheet called a two-factor
table. If there are n factors, then the number of such data sheets
--- Content provided by FirstRanker.com ---
n n(n -1)
is
--- Content provided by FirstRanker.com ---
=.
--- Content provided by FirstRanker.com ---
2
2
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Let us consider the example of `Refrigerator' described in the multi-factor approach. For the two factors (i) capacity and (ii) price, we have the
following data sheet.
--- Content provided by FirstRanker.com ---
Data Sheet (Two Factor Table) No:
Factor: Price of refrigerator
--- Content provided by FirstRanker.com ---
Factor: CapacityPrice
of Refrigerator
--- Content provided by FirstRanker.com ---
Rs. 9,000
Rs. 10,000
--- Content provided by FirstRanker.com ---
Rs. 12,000180 liters
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
200 liters--- Content provided by FirstRanker.com ---
230 liters
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
In this case, the data sheet is a matrix of 3 rows and 3 columns.
--- Content provided by FirstRanker.com ---
Therefore, there are 3? 3 = 9 places in the matrix. The respondent has to awardranks from 1 to 9 in the cells of the matrix. A rank of 1 means the respondent
has the maximum preference for that entry and a rank of 9 means he has the
--- Content provided by FirstRanker.com ---
least preference for that entry. Compared to multi-factor evaluation approach,
the respondents will find it easy to respond to two-factor evaluation approach
--- Content provided by FirstRanker.com ---
since only two factors are considered at a time.Steps in two-factor evaluation approach:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Identify the factors or features of a product to be analyzed.
--- Content provided by FirstRanker.com ---
1.
Find out the levels for each factor selected in Step 1.
--- Content provided by FirstRanker.com ---
2.Consider all possible pairs of factors. If there are n factors, then the
n n(n -1)
--- Content provided by FirstRanker.com ---
number of pairs is
=
--- Content provided by FirstRanker.com ---
. For each pair of factors, prepare a two-factor
2
--- Content provided by FirstRanker.com ---
2
table, indicating all the levels for the two factors. If L1 and L2 are the respective
--- Content provided by FirstRanker.com ---
levels for the two factors, then the number of cells in the corresponding table is
--- Content provided by FirstRanker.com ---
L1L2.3.
--- Content provided by FirstRanker.com ---
Select the list of respondents using the standard sampling technique.
4.
--- Content provided by FirstRanker.com ---
Request each respondent to award ranks for the cells in each two-factortable. i.e., rank 1 for the best cell, rank 2 for the next best cell, etc.
5.
--- Content provided by FirstRanker.com ---
For each two-factor table, collect all the responses from all the
participating respondents in the survey work.
--- Content provided by FirstRanker.com ---
6.With the ranks awarded by the respondents, find out the score secured by
each cell in each two-factor table.
--- Content provided by FirstRanker.com ---
7.
Tabulate the results in Step 7. Select the cell with the highest score.
--- Content provided by FirstRanker.com ---
Identify the two factors and their corresponding levels.8.
Implement the most preferred combination of the factors and their levels
--- Content provided by FirstRanker.com ---
in the design of a new product.
Application:
--- Content provided by FirstRanker.com ---
The two factor approach is useful when a manager goes for marketsegmentation to promote his product. The approach will enable the top level
management to evolve a policy decision as to which segment of the market has
--- Content provided by FirstRanker.com ---
to be concentrated more in order to maximize the profit from the product under
consideration.
--- Content provided by FirstRanker.com ---
QUESTIONS
--- Content provided by FirstRanker.com ---
1.
Explain the purpose of `Factor Analysis'.
--- Content provided by FirstRanker.com ---
2.What is the objective of `Conjoint Analysis'? Explain.
3.
--- Content provided by FirstRanker.com ---
State the steps in the development of conjoint analysis.
4.
--- Content provided by FirstRanker.com ---
State the applications of conjoint analysis.5.
Enumerate the advantages and disadvantages of conjoint analysis.
--- Content provided by FirstRanker.com ---
6.
--- Content provided by FirstRanker.com ---
What is a `product profile'? Explain.
7.
--- Content provided by FirstRanker.com ---
What are the steps in multi-factor evaluation approach in conjointanalysis?
8.
--- Content provided by FirstRanker.com ---
What is a `two-factor table'? Explain.
9.
--- Content provided by FirstRanker.com ---
Explain two-factor evaluation approach in conjoint analysis.REFERENCES
Green, P.E. and Srinivasan, V., Conjoint Analysis in Consumer Research: Issues
--- Content provided by FirstRanker.com ---
and Outlook, Journal of Consumer Research, 5, 1978, 103 ? 123.
Green, P.E., Carrol, J. and Goldberg, A General Approach to Product Design
--- Content provided by FirstRanker.com ---
Optimization via Conjoint Analysis, Journal of Marketing, 43, 1981, 17 ? 35.Johnson, R.A. and Wichern, D.W., Applied Multivariate Statistical Analysis,
Pearson Education, Delhi, 2005.
--- Content provided by FirstRanker.com ---
Kanji, G.K., 100 Statistical Tests, Sage Publications, New Delhi, 1994.
Kothari, C.R., Quantitative Techniques, Vikas Publishing House Private Ltd.,
--- Content provided by FirstRanker.com ---
New Delhi, 1997.Marrison, D.F., Multivariate Statistical Methods, McGraw Hill, New York,
1986.
--- Content provided by FirstRanker.com ---
Panneerselvam, R., Research Methodology, Prentice Hall of India, New Delhi,
2004.
--- Content provided by FirstRanker.com ---
Rencher, A.V., Methods of Multivariate Analysis, Wiley Inter-science, SecondEdition, New Jersey, 2002.
Romesburg, H.C., Cluster Analysis for Researchers, Lifetime Learning
--- Content provided by FirstRanker.com ---
Publications, Belmont, California, 1984.
--- Content provided by FirstRanker.com ---
Statistical Table-1: F-values at 1% level of significance
--- Content provided by FirstRanker.com ---
df1: degrees of freedom for greater variance
--- Content provided by FirstRanker.com ---
df2: degrees of freedom for smaller variance
--- Content provided by FirstRanker.com ---
df2/df11 2 3 4 5 6 7 8 9 10
1
--- Content provided by FirstRanker.com ---
4052.1 4999.5 5403.3 5624.5 5763.6 5858.9 5928.3 5981.0 6022.4 6055.8
2
--- Content provided by FirstRanker.com ---
98.5 99.0 99.1 99.2 99.2 99.3 99.3 99.3 99.3 99.33
34.1 30.8 29.4 28.7 28.2 27.9 27.6 27.4 27.3 27.2
--- Content provided by FirstRanker.com ---
4
21.1 18.0 16.6 15.9 15.5 15.2 14.9 14.7 14.6 14.5
--- Content provided by FirstRanker.com ---
516.2 13.2 12.0 11.3 10.9 10.6 10.4 10.2 10.1 10.0
6
--- Content provided by FirstRanker.com ---
13.7
10.9 9.7 9.1 8.7 8.4 8.2 8.1 7.9 7.8
--- Content provided by FirstRanker.com ---
712.2 9.5 8.4 7.8 7.4 7.1 6.9 6.8 6.7 6.6
8
--- Content provided by FirstRanker.com ---
11.2 8.6 7.5 7.0 6.6 6.3 6.1 6.0 5.9 5.8
9
--- Content provided by FirstRanker.com ---
10.5 8.0 6.9 6.4 6.0 5.8 5.6 5.4 5.3 5.210
10.0 7.5 6.5 5.9 5.6 5.3 5.2 5.0 4.9 4.8
--- Content provided by FirstRanker.com ---
11
9.6 7.2 6.2 5.6 5.3 5.0 4.8 4.7 4.6 4.5
--- Content provided by FirstRanker.com ---
129.3 6.9 5.9 5.4 5.0 4.8 4.6 4.4 4.3 4.2
13
--- Content provided by FirstRanker.com ---
9.0 6.7 5.7 5.2 4.8 4.6 4.4 4.3 4.1 4.1
14
--- Content provided by FirstRanker.com ---
8.8 6.5 5.5 5.0 4.6 4.4 4.2 4.1 4.0 3.915
8.6 6.3 5.4 4.8 4.5 4.3 4.1 4.0 3.8 3.8
--- Content provided by FirstRanker.com ---
16
8.5 6.2 5.2 4.7 4.4 4.2 4.0 3.8 3.7 3.6
--- Content provided by FirstRanker.com ---
178.4 6.1 5.1 4.6 4.3 4.1 3.9 3.7 3.6 3.5
18
--- Content provided by FirstRanker.com ---
8.2 6.0 5.0 4.5 4.2 4.0 3.8 3.7 3.5 3.5
19
--- Content provided by FirstRanker.com ---
8.1 5.9 5.0 4.5 4.1 3.9 3.7 3.6 3.5 3.420
8.0 5.8 4.9 4.4 4.1 3.8 3.6 3.5 3.4 3.3
--- Content provided by FirstRanker.com ---
21
8.0 5.7 4.8 4.3 4.0 3.8 3.6 3.5 3.3 3.3
--- Content provided by FirstRanker.com ---
227.9 5.7 4.8 4.3 3.9 3.7 3.5 3.4 3.3 3.2
23
--- Content provided by FirstRanker.com ---
7.8 5.6 4.7 4.2 3.9 3.7 3.5 3.4 3.2 3.2
--- Content provided by FirstRanker.com ---
24
7.8 5.6 4.7 4.2 3.8 3.6 3.4 3.3 3.2 3.1
--- Content provided by FirstRanker.com ---
257.7 5.5 4.6 4.1 3.8 3.6 3.4 3.3 3.2 3.1
26
--- Content provided by FirstRanker.com ---
7.7 5.5 4.6 4.1 3.8 3.5 3.4 3.2 3.1 3.0
27
--- Content provided by FirstRanker.com ---
7.6 5.4 4.6 4.1 3.7 3.5 3.3 3.2 3.1 3.028
7.6 5.4 4.5 4.0 3.7 3.5 3.3 3.2 3.1 3.0
--- Content provided by FirstRanker.com ---
29
7.5 5.4 4.5 4.0 3.7 3.4 3.3 3.1 3.0 3.0
--- Content provided by FirstRanker.com ---
307.5 5.3 4.5 4.0 3.6 3.4 3.3 3.1 3.0 2.9
--- Content provided by FirstRanker.com ---
Statistical Table-2: F-values at 2.5% level of significance
--- Content provided by FirstRanker.com ---
df1: degrees of freedom for greater variancedf2: degrees of freedom for smaller variance
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
df2/df11
2
--- Content provided by FirstRanker.com ---
3
4
--- Content provided by FirstRanker.com ---
56
7
--- Content provided by FirstRanker.com ---
8
9
--- Content provided by FirstRanker.com ---
101
647.7 799.5 864.1 899.5 921.8 937.1 948.2 956.6
--- Content provided by FirstRanker.com ---
963.2 968.6
--- Content provided by FirstRanker.com ---
238.5 39.0 39.1 39.2 39.2 39.3 39.3 39.3 39.3 39.3
3
--- Content provided by FirstRanker.com ---
17.4 16.0 15.4 15.1 14.8 14.7 14.6 14.5 14.4 14.4
4
--- Content provided by FirstRanker.com ---
12.210.6 9.9 9.6 9.3 9.1 9.0 8.9 8.9 8.8
5
--- Content provided by FirstRanker.com ---
10.0 8.4 7.7 7.3 7.1 6.9 6.8 6.7 6.6 6.6
6
--- Content provided by FirstRanker.com ---
8.8 7.2 6.5 6.2 5.9 5.8 5.6 5.5 5.5 5.47
8.0 6.5 5.8 5.5 5.2 5.1 4.9 4.8 4.8 4.7
--- Content provided by FirstRanker.com ---
8
7.5 6.0 5.4 5.0 4.8 4.6 4.5 4.4 4.3 4.2
--- Content provided by FirstRanker.com ---
97.2 5.7 5.0 4.7 4.4 4.3 4.1 4.1 4.0 3.9
10
--- Content provided by FirstRanker.com ---
6.9 5.4 4.8 4.4 4.2 4.0 3.9 3.8 3.7 3.7
11
--- Content provided by FirstRanker.com ---
6.7 5.2 4.6 4.2 4.0 3.8 3.7 3.6 3.5 3.512
6.5 5.0 4.4 4.1 3.8 3.7 3.6 3.5 3.4 3.3
--- Content provided by FirstRanker.com ---
13
6.4 4.9 4.3 3.9 3.7 3.6 3.4 3.3 3.3 3.2
--- Content provided by FirstRanker.com ---
146.2 4.8 4.2 3.8 3.6 3.5 3.3 3.2 3.2 3.1
15
--- Content provided by FirstRanker.com ---
6.1 4.7 4.1 3.8 3.5 3.4 3.2 3.1 3.1 3.0
--- Content provided by FirstRanker.com ---
16
6.1 4.6 4.0 3.7 3.5 3.3 3.2 3.1 3.0 2.9
--- Content provided by FirstRanker.com ---
176.0 4.6 4.0 3.6 3.4 3.2 3.1 3.0 2.9 2.9
18
--- Content provided by FirstRanker.com ---
5.9 4.5 3.9 3.6 3.3 3.2 3.0 3.0 2.9 2.8
19
--- Content provided by FirstRanker.com ---
5.9 4.5 3.9 3.5 3.3 3.1 3.0 2.9 2.8 2.820
5.8 4.4 3.8 3.5 3.2 3.1 3.0 2.9 2.8 2.7
--- Content provided by FirstRanker.com ---
21
5.8 4.4 3.8 3.4 3.2 3.0 2.9 2.8 2.7 2.7
--- Content provided by FirstRanker.com ---
22
5.7 4.3 3.7 3.4 3.2 3.0 2.9 2.8 2.7
--- Content provided by FirstRanker.com ---
2.7
--- Content provided by FirstRanker.com ---
235.7 4.3 3.7 3.4 3.1 3.0 2.9 2.8 2.7
2.6
--- Content provided by FirstRanker.com ---
24
--- Content provided by FirstRanker.com ---
5.7 4.3 3.7 3.3 3.1 2.9 2.8 2.7 2.72.6
--- Content provided by FirstRanker.com ---
25
5.6 4.2 3.6 3.3 3.1 2.9 2.8 2.7 2.6
--- Content provided by FirstRanker.com ---
2.626
--- Content provided by FirstRanker.com ---
5.6 4.2 3.6 3.3 3.1 2.9 2.8 2.7 2.6
2.5
--- Content provided by FirstRanker.com ---
27
5.6 4.2 3.6 3.3 3.0 2.9 2.8 2.7 2.6
--- Content provided by FirstRanker.com ---
2.5
28
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.6
--- Content provided by FirstRanker.com ---
5.64.2
3.6
--- Content provided by FirstRanker.com ---
3.2
3.0
--- Content provided by FirstRanker.com ---
2.92.7
2.6
--- Content provided by FirstRanker.com ---
2.5
--- Content provided by FirstRanker.com ---
295.5 4.2 3.6 3.2 3.0 2.8 2.7 2.6 2.5
2.5
--- Content provided by FirstRanker.com ---
30
--- Content provided by FirstRanker.com ---
5.5 4.1 3.5 3.2 3.0 2.8 2.7 2.6 2.52.5
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Statistical Table-3: F-values at 5% level of significancedf1: degrees of freedom for greater variance
--- Content provided by FirstRanker.com ---
df2: degrees of freedom for smaller variance
--- Content provided by FirstRanker.com ---
df2/df1
--- Content provided by FirstRanker.com ---
1
2
--- Content provided by FirstRanker.com ---
3 45
6
--- Content provided by FirstRanker.com ---
7
8
--- Content provided by FirstRanker.com ---
910
1
--- Content provided by FirstRanker.com ---
161.4 199.5 215.7 224.5 230.1 233.9 236.7 238.8 240.5 241.8
2
--- Content provided by FirstRanker.com ---
18.5 19.0 19.1 19.2 19.2 19.3 19.319.3
19.3
--- Content provided by FirstRanker.com ---
19.3
3
--- Content provided by FirstRanker.com ---
10.1 9.5 9.2 9.1 9.0 8.98.8 8.8 8.8 8.7
4
--- Content provided by FirstRanker.com ---
7.7 6.9 6.5 6.3 6.2 6.1
6.0 6.0 5.9 5.9
--- Content provided by FirstRanker.com ---
56.6 5.7 5.4 5.1 5.0 4.9
4.8 4.8 4.7 4.7
--- Content provided by FirstRanker.com ---
6
5.9 5.1 4.7 4.5 4.3 4.2
--- Content provided by FirstRanker.com ---
4.2 4.1 4.0 4.07
5.5 4.7 4.3 4.1 3.9 3.8
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3.7 3.7 3.6 3.6
8
--- Content provided by FirstRanker.com ---
5.3 4.4 4.0
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
3.8 3.6 3.5
--- Content provided by FirstRanker.com ---
3.5 3.4 3.3 3.39
5.1 4.2 3.8 3.6 3.4 3.3
--- Content provided by FirstRanker.com ---
3.2 3.2 3.1 3.1
10
--- Content provided by FirstRanker.com ---
4.9 4.1 3.7 3.4 3.3 3.23.1 3.0 3.0 2.9
11
--- Content provided by FirstRanker.com ---
4.8 3.9 3.5 3.3 3.2 3.0
3.0 2.9 2.8 2.8
--- Content provided by FirstRanker.com ---
124.7 3.8 3.4 3.2 3.1 2.9
2.9 2.8 2.7 2.7
--- Content provided by FirstRanker.com ---
13
4.6 3.8 3.4 3.1 3.0 2.9
--- Content provided by FirstRanker.com ---
2.8 2.7 2.7 2.614
4.6 3.7 3.3 3.1 2.9 2.8
--- Content provided by FirstRanker.com ---
2.7 2.6 2.6 2.6
15
--- Content provided by FirstRanker.com ---
4.5 3.6 3.2 3.0 2.9 2.72.7 2.6 2.5 2.5
16
--- Content provided by FirstRanker.com ---
4.4 3.6 3.2 3.0 2.8 2.7
2.6 2.5 2.5 2.4
--- Content provided by FirstRanker.com ---
174.4 3.5 3.1 2.9 2.8 2.6
2.6 2.5 2.4 2.4
--- Content provided by FirstRanker.com ---
18
4.4 3.5 3.1 2.9 2.7 2.6
--- Content provided by FirstRanker.com ---
2.5 2.5 2.4 2.419
4.3 3.5 3.1 2.8 2.7 2.6
--- Content provided by FirstRanker.com ---
2.5 2.4 2.4 2.3
20
--- Content provided by FirstRanker.com ---
4.3 3.4 3.0 2.8 2.7 2.52.5 2.4 2.3 2.3
21
--- Content provided by FirstRanker.com ---
4.3
3.4
--- Content provided by FirstRanker.com ---
3.02.8
2.6
--- Content provided by FirstRanker.com ---
2.5 2.4
2.4
--- Content provided by FirstRanker.com ---
2.32.3
22
--- Content provided by FirstRanker.com ---
4.3 3.4 3.0 2.8 2.6 2.5
2.4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.4 2.3 2.323
4.2 3.4 3.0 2.7 2.6 2.5
--- Content provided by FirstRanker.com ---
2.4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.3 2.3 2.2
--- Content provided by FirstRanker.com ---
244.2 3.4 3.0 2.7 2.6 2.5
2.4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.3 2.3 2.2
25
--- Content provided by FirstRanker.com ---
4.2 3.3 2.9 2.7 2.6 2.42.4
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.3 2.2 2.2
--- Content provided by FirstRanker.com ---
26
4.2 3.3 2.9 2.7 2.5 2.4
--- Content provided by FirstRanker.com ---
2.3--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.3 2.2 2.2
27
--- Content provided by FirstRanker.com ---
4.2 3.3 2.9 2.7 2.5 2.4
2.3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.3 2.2 2.228
4.1 3.3 2.9 2.7 2.5 2.4
--- Content provided by FirstRanker.com ---
2.3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.2 2.2 2.1
--- Content provided by FirstRanker.com ---
294.1 3.3 2.9 2.7 2.5 2.4
2.3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.2 2.2 2.1
30
--- Content provided by FirstRanker.com ---
4.1 3.3 2.9 2.6 2.5 2.42.3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
2.2 2.2 2.1
--- Content provided by FirstRanker.com ---
Statistical Table-4: F-values at 10% level of significance
--- Content provided by FirstRanker.com ---
df1: degrees of freedom for greater variance
--- Content provided by FirstRanker.com ---
df2: degrees of freedom for smaller variance
--- Content provided by FirstRanker.com ---
df2/df1 1 2 3 4
5
--- Content provided by FirstRanker.com ---
67
8
--- Content provided by FirstRanker.com ---
9 10
1
--- Content provided by FirstRanker.com ---
39.849.5
53.5 55.8 57.2 58.2 58.9 59.4 59.8 60.1
--- Content provided by FirstRanker.com ---
2
8.5
--- Content provided by FirstRanker.com ---
9.09.1 9.2 9.2 9.3 9.3 9.3 9.3 9.3
3
--- Content provided by FirstRanker.com ---
5.5
5.4
--- Content provided by FirstRanker.com ---
5.3 5.3 5.3 5.2 5.2 5.2 5.2 5.24
4.5
--- Content provided by FirstRanker.com ---
4.3
4.1 4.1 4.0 4.0 3.9 3.9 3.9 3.9
--- Content provided by FirstRanker.com ---
54.0
3.7
--- Content provided by FirstRanker.com ---
3.6 3.5 3.4 3.4 3.3 3.3 3.3 3.2
6
--- Content provided by FirstRanker.com ---
3.73.4
3.2 3.1 3.1 3.0 3.0 2.9 2.9 2.9
--- Content provided by FirstRanker.com ---
7
3.5
--- Content provided by FirstRanker.com ---
3.23.0 2.9 2.8 2.8 2.7 2.7 2.7 2.7
8
--- Content provided by FirstRanker.com ---
3.4
3.1
--- Content provided by FirstRanker.com ---
2.9 2.8 2.7 2.6 2.6 2.5 2.5 2.59
3.3
--- Content provided by FirstRanker.com ---
3.0
2.8 2.6 2.6 2.5 2.5 2.4 2.4 2.4
--- Content provided by FirstRanker.com ---
103.2
2.9
--- Content provided by FirstRanker.com ---
2.7 2.6 2.5 2.4 2.4 2.3 2.3 2.3
11
--- Content provided by FirstRanker.com ---
3.22.8
2.6 2.5 2.4 2.3 2.3 2.3 2.2 2.2
--- Content provided by FirstRanker.com ---
12
3.1
--- Content provided by FirstRanker.com ---
2.82.6 2.4 2.3 2.3 2.2 2.2 2.2 2.1
13
--- Content provided by FirstRanker.com ---
3.1
2.7
--- Content provided by FirstRanker.com ---
2.5 2.4 2.3 2.2 2.2 2.1 2.1 2.114
3.1
--- Content provided by FirstRanker.com ---
2.7
2.5 2.3 2.3 2.2 2.1 2.1 2.1 2.0
--- Content provided by FirstRanker.com ---
153.0
2.6
--- Content provided by FirstRanker.com ---
2.4 2.3 2.2 2.2 2.1 2.1 2.0 2.0
16
--- Content provided by FirstRanker.com ---
3.02.6
2.4 2.3 2.2 2.1 2.1 2.0 2.0 2.0
--- Content provided by FirstRanker.com ---
17
3.0
--- Content provided by FirstRanker.com ---
2.62.4 2.3 2.2 2.1 2.1 2.0 2.0 2.0
18
--- Content provided by FirstRanker.com ---
3.0
2.6
--- Content provided by FirstRanker.com ---
2.4 2.2 2.1 2.1 2.0 2.0 2.0 1.919
2.9
--- Content provided by FirstRanker.com ---
2.6
2.3 2.2 2.1 2.1 2.0 2.0 1.9 1.9
--- Content provided by FirstRanker.com ---
202.9
2.5
--- Content provided by FirstRanker.com ---
2.3 2.2 2.1 2.0 2.0 1.9 1.9 1.9
21
--- Content provided by FirstRanker.com ---
2.92.5
2.3 2.2 2.1 2.0 2.0 1.9 1.9 1.9
--- Content provided by FirstRanker.com ---
22
2.9
--- Content provided by FirstRanker.com ---
2.52.3 2.2 2.1 2.0 2.0 1.9 1.9 1.9
23
--- Content provided by FirstRanker.com ---
2.9
2.5
--- Content provided by FirstRanker.com ---
2.3 2.2 2.1 2.0 1.9 1.9 1.9 1.824
2.9
--- Content provided by FirstRanker.com ---
2.5
2.3 2.1 2.1 2.0 1.9 1.9 1.9 1.8
--- Content provided by FirstRanker.com ---
252.9
2.5
--- Content provided by FirstRanker.com ---
2.3 2.1 2.0 2.0 1.9 1.9 1.8 1.8
26
--- Content provided by FirstRanker.com ---
2.92.5
2.3 2.1 2.0 2.0 1.9 1.9 1.8 1.8
--- Content provided by FirstRanker.com ---
27
2.9
--- Content provided by FirstRanker.com ---
2.52.2 2.1 2.0 2.0 1.9 1.9 1.8 1.8
28
--- Content provided by FirstRanker.com ---
2.8
2.5
--- Content provided by FirstRanker.com ---
2.2 2.1 2.0 1.9 1.9 1.9 1.8 1.829
2.8
--- Content provided by FirstRanker.com ---
2.4
2.2 2.1 2.0 1.9 1.9 1.8 1.8 1.8
--- Content provided by FirstRanker.com ---
302.8
2.4
--- Content provided by FirstRanker.com ---
2.2 2.1 2.0 1.9 1.9 1.8 1.8 1.8
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT V
1. STRUCTURE AND COMPONENTS OF RESEARCH REPORTS
--- Content provided by FirstRanker.com ---
Lesson Objectives:What is a Report?
--- Content provided by FirstRanker.com ---
Characteristics of a good report
--- Content provided by FirstRanker.com ---
Framework of a Report
--- Content provided by FirstRanker.com ---
Practical Reports Vs Academic Reports
--- Content provided by FirstRanker.com ---
Parts of a Research ReportA note on Literature Review
--- Content provided by FirstRanker.com ---
Learning Objectives:
--- Content provided by FirstRanker.com ---
After reading this lesson, you should be able to:Understand the meaning of a research report
--- Content provided by FirstRanker.com ---
Analyze the components of a good report
--- Content provided by FirstRanker.com ---
Structure of a report
--- Content provided by FirstRanker.com ---
Characteristic differences in Research Reporting
--- Content provided by FirstRanker.com ---
WHAT IS A REPORT?
--- Content provided by FirstRanker.com ---
A report is a written document on a particular topic, which conveysinformation and ideas and may also make recommendations. Reports often form
the basis of crucial decision making. Inaccurate, incomplete and poorly written
--- Content provided by FirstRanker.com ---
reports fail to achieve their purpose and reflect on the decision, which will
ultimately be made. This will also be the case if the report is excessively long,
--- Content provided by FirstRanker.com ---
jargonistic and/ or structureless. A good report can be written by keeping thefollowing features in mind:
1.
--- Content provided by FirstRanker.com ---
All points in the report should be clear to the intended reader.
2.
--- Content provided by FirstRanker.com ---
The report should be concise with information kept to a necessaryminimum and arranged logically under various headings and sub-headings.
3.
--- Content provided by FirstRanker.com ---
All information should be correct and supported by evidence.
4.
--- Content provided by FirstRanker.com ---
All relevant material should be included in a complete report.Purpose of Research Report
--- Content provided by FirstRanker.com ---
1.
Why am I writing this report? Do I want to inform/ explain/
--- Content provided by FirstRanker.com ---
persuade, or indeed all of these.2.
Who is going to read this report? Managers/ academicians/
--- Content provided by FirstRanker.com ---
researchers! What do they already know? What do they need to know? Do any
of them have certain attitudes or prejudices?
--- Content provided by FirstRanker.com ---
3.What resources do we have? Do I have access to a computer? Do I
have enough time? Can any of my colleagues help?
--- Content provided by FirstRanker.com ---
4.
Think about the content of your report ? what am I going to put in it?
--- Content provided by FirstRanker.com ---
What are my main themes? How much should be the text, and how much shouldbe the illustrations?
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Framework of a Report
The various frameworks can be used depending on the content of the
--- Content provided by FirstRanker.com ---
report, but generally the same rules apply. Introduction, method, results anddiscussion with references or bibliography at the end, and an abstract at the
beginning could form the framework.
--- Content provided by FirstRanker.com ---
STRUCTURE OF A REPORT
Structure your writing around the IMR&D framework and you will
--- Content provided by FirstRanker.com ---
ensure a beginning, middle and end to your report.I
Introduction Why did I do this research?
--- Content provided by FirstRanker.com ---
(beginning)
M
--- Content provided by FirstRanker.com ---
MethodWhat did I do and how did I go about (middle)
doing it?
--- Content provided by FirstRanker.com ---
R
Results
--- Content provided by FirstRanker.com ---
What did I find?(middle)
AND
--- Content provided by FirstRanker.com ---
D
Discussion
--- Content provided by FirstRanker.com ---
What does it all mean?(end)
--- Content provided by FirstRanker.com ---
What do I put in the beginning part?
TITLE PAGE
--- Content provided by FirstRanker.com ---
Title of project, Sub?title (whereappropriate), Date, Author, Organization,
Logo
--- Content provided by FirstRanker.com ---
BACKGROUND
History(if any) behind project
--- Content provided by FirstRanker.com ---
ACKNOWLEDGEMENTAuthor thanks people and organization who
helped during the project
--- Content provided by FirstRanker.com ---
SUMMARY(sometimes called A condensed version of a report ? outlines
abstract of the synopsis)
--- Content provided by FirstRanker.com ---
salient points, emphasizes main conclusionsand (where appropriate) the main
recommendations. N.B this is often
--- Content provided by FirstRanker.com ---
difficult to write and it is suggested that you
write it last.
--- Content provided by FirstRanker.com ---
LIST OF CONTENTS
--- Content provided by FirstRanker.com ---
An at- a ? glance list that tells the readerwhat is in the report and what page
number(s) to find it on.
--- Content provided by FirstRanker.com ---
LIST OF TABLES
As above, specifically for tables.
--- Content provided by FirstRanker.com ---
LIST OF APPENDICESAs above, specifically for appendices.
INTRODUCTION
--- Content provided by FirstRanker.com ---
Author sets the scene and states his/ her
intentions.
--- Content provided by FirstRanker.com ---
AIMS AND OBJECTIVESAIMS ? general aims of the audit/ project,
broad statement of intent. OBJECTIVES ?
--- Content provided by FirstRanker.com ---
specific things expected to do/ deliver(e.g.
expected outcomes)
--- Content provided by FirstRanker.com ---
What do I put in the middle part?
METHOD
--- Content provided by FirstRanker.com ---
Work steps; what was done ? how, by
whom, when?
--- Content provided by FirstRanker.com ---
RESULT/FINDINGSHonest presentation of the findings,
whether these were as expected or not.
--- Content provided by FirstRanker.com ---
give the facts, including any
inconsistencies or difficulties
--- Content provided by FirstRanker.com ---
encounteredWhat do I put in the end part?
DISCUSSION
--- Content provided by FirstRanker.com ---
Explanation of the results.( you might like to
keep the SWOT analysis in mind and think about
--- Content provided by FirstRanker.com ---
your project's strengths, weakness, opportunitiesand threats, as you write)
CONCLUSIONS
--- Content provided by FirstRanker.com ---
The author links the results/ findings with the
points made in the introduction and strives to
--- Content provided by FirstRanker.com ---
reach clear, simply stated and unbiased--- Content provided by FirstRanker.com ---
conclusions. Make sure they are fully supportedby evidence and arguments of the main body of
your audit/project.
--- Content provided by FirstRanker.com ---
RECOMMENDATIONS
The author states what specific actions should be
--- Content provided by FirstRanker.com ---
taken, by whom and why. They must always belinked to the future and should always be
realistic. Don't make them unless asked to.
--- Content provided by FirstRanker.com ---
REFERENCES
A section of a report, which provides full details
--- Content provided by FirstRanker.com ---
of publications mentioned in the text, or fromwhich extracts have been quoted.
APPENDIX
--- Content provided by FirstRanker.com ---
The purpose of an appendix is to supplement the
information contained in the main body of the
--- Content provided by FirstRanker.com ---
report.PRACTICAL REPORTS VS. ACADEMIC REPORTS
--- Content provided by FirstRanker.com ---
Practical Reports:
In the practical world of business or government, a report conveys an
--- Content provided by FirstRanker.com ---
information and (sometimes) recommendations from a researcher who hasinvestigated a topic in detail. A report like this will usually be requested
by people who need the information for a specific purpose and their
--- Content provided by FirstRanker.com ---
request may be written in terms of reference or the brief. whatever the
report, it is important to look at the instruction for what is wanted. A
--- Content provided by FirstRanker.com ---
report like this differs from an essay in that it is designed to provideinformation which will be acted on, rather than to be read by people
interested in the ideas for their own sake. Because of this, it has a different
--- Content provided by FirstRanker.com ---
structure and layout.
--- Content provided by FirstRanker.com ---
Academic Reports:--- Content provided by FirstRanker.com ---
A report written for an academic course can be thought of as asimulation. We can imagine that someone wants the report for a practical
purpose, although we are really writing the report as an academic exercise for
--- Content provided by FirstRanker.com ---
assessment. Theoretical ideas will be more to the front in an academic report
than in a practical one. Sometimes a report seems to serve academic and
--- Content provided by FirstRanker.com ---
practical purposes. Students on placement with organizations often have toproduce a report for the organization and for assessment on the course.
Although the background work for both will be related, in practice, the report
--- Content provided by FirstRanker.com ---
the student produces for academic assessment will be different from the report
produced for the organization, because the needs of each are different.
--- Content provided by FirstRanker.com ---
RESEARCH REPORT: PRELIMINARIES
It is not sensible to leave all your writing until the end. There is always
--- Content provided by FirstRanker.com ---
the possibility that it will take much longer than you anticipate and you will not
have enough time. There could also be pressure upon available word processors
--- Content provided by FirstRanker.com ---
as other students try to complete their own reports. It is wise to begin writing upsome aspects of your research as you go along. Remember that you do not have
to write your report in the order than it will be read. Often it is easiest to start
--- Content provided by FirstRanker.com ---
with the method section. Leave the introduction and the abstract to last. The
use of a word processor makes it very straightforward to modify and rearrange
--- Content provided by FirstRanker.com ---
what you have written as your research progresses and your ideas change. Thevery process of writing will help your ideas to develop. Last but by no means
least, ask someone to proofread your work.
--- Content provided by FirstRanker.com ---
STRUCTURE OF A RESEARCH REPORT
--- Content provided by FirstRanker.com ---
A research report has a different structure and layout in comparison to aproject report. A research report is for reference and is often quite a long
document. It has to be clearly structured for the readers to quickly find the
--- Content provided by FirstRanker.com ---
information wanted. It needs to be planned carefully to make sure that the
--- Content provided by FirstRanker.com ---
information given in the report is put under correct headings.
--- Content provided by FirstRanker.com ---
PARTS OF RESEARCH REPORTCover sheet: This should contain some or all of the following:
Full title of the report
--- Content provided by FirstRanker.com ---
Name of the researcher
Name of the unit of which the project is a part
--- Content provided by FirstRanker.com ---
Name of the institutionDate/Year.
--- Content provided by FirstRanker.com ---
Title page: Full title of the report.
Your name
--- Content provided by FirstRanker.com ---
Acknowledgement: A thank you to the people who helped you.Contents
List of the Tables
--- Content provided by FirstRanker.com ---
Headings and sub-headings used in the report should be given with their
--- Content provided by FirstRanker.com ---
page numbers. Each chapter should begin on a new page. Use a consistentsystem in dividing the report into parts. The simplest may be to use chapters for
each major part and subdivide these into sections and sub-sections. 1, 2, 3 etc.
--- Content provided by FirstRanker.com ---
can be used as the numbers for each chapter. The sections of chapter 3 (for
example) would be 3.1, 3.2, 3.3, and so on. For further sub-division of a sub-
--- Content provided by FirstRanker.com ---
section you may use 3.2.1, 3.2.2, and so on.Abstract or Summary or Executive Summary or Introduction:
This presents an overview of the whole report. It should let the reader see
--- Content provided by FirstRanker.com ---
in advance, what is in the report. This includes what you set out to do, how
review of literature is focused and narrowed in your research, the relation of the
--- Content provided by FirstRanker.com ---
methodology you chose to your objectives, a summary of your findings and
--- Content provided by FirstRanker.com ---
analysis of the findings--- Content provided by FirstRanker.com ---
BODY
Aims and Purpose or Aims and Objectives:
--- Content provided by FirstRanker.com ---
Why did you do this work? What was the problem you wereinvestigating? If you are not including review of literature, mention the specific
research/es which is/are relevant to your work.
--- Content provided by FirstRanker.com ---
Review of Literature
This should help to put your research into a background context and to
--- Content provided by FirstRanker.com ---
explain its importance. Include only the books and articles which relate directlyto your topic. You need to be analytical and critical, and not just describe the
works that you have read.
--- Content provided by FirstRanker.com ---
Methodology
Methodology deals with the methods and principles used in an activity,
--- Content provided by FirstRanker.com ---
in this case research. In the methodology chapter, explain the method/s you usedfor the research and why you thought they were the appropriate ones. You may,
for example, be depending mostly upon secondary data or you may have
--- Content provided by FirstRanker.com ---
collected your own data. You should explain the method of data collection,
materials used, subjects interviewed, or places you visited. Give a detailed
--- Content provided by FirstRanker.com ---
account of how and when you carried out your research and explain why youused the particular method/s, rather than other methods. Included in this chapter
should be an examination of ethical issues, if any.
--- Content provided by FirstRanker.com ---
Results or Findings
What did you find out? Give a clear presentation of your results. Show
--- Content provided by FirstRanker.com ---
the essential data and calculations here. You may use tables, graphs and figures.--- Content provided by FirstRanker.com ---
Analysis and Discussion
Interpret your results. What do you make out of them? How do they
--- Content provided by FirstRanker.com ---
compare with those of others who have done research in this area? The accuracy
of your measurements/results should be discussed and deficiencies, if any, in the
--- Content provided by FirstRanker.com ---
research design should be mentioned.Conclusions
What do you conclude? Summarize briefly the main conclusions which
--- Content provided by FirstRanker.com ---
you discussed under "Results." Were you able to answer some or all of the
questions which you raised in your aims and objectives? Do not be tempted to
--- Content provided by FirstRanker.com ---
draw conclusions which are not backed up by your evidence. Note thedeviation/s from expected results and any failure to achieve all that you had
hoped.
--- Content provided by FirstRanker.com ---
Recommendations
Make your recommendations, if required. The suggestions for action and
--- Content provided by FirstRanker.com ---
further research should be given.Appendix
You may not need an appendix, or you may need several. If you have
--- Content provided by FirstRanker.com ---
used questionnaires, it is usual to include a blank copy in the appendix. You
could include data or calculations, not given in the body, that are necessary, or
--- Content provided by FirstRanker.com ---
useful, to get the full benefit from your report. There may be maps, drawings,photographs or plans that you want to include. If you have used special
equipment, you may include information about it.
--- Content provided by FirstRanker.com ---
The plural of an appendix is appendices. If an appendix or appendices
are needed, design them thoughtfully in a way that your readers find it/them
--- Content provided by FirstRanker.com ---
convenient to use.References
--- Content provided by FirstRanker.com ---
List all the sources which you referred in the body of the report. You
may use the pattern prescribed by American Psychological Association, or any
--- Content provided by FirstRanker.com ---
other standard pattern recognized internationally.
REVIEW OF LITERATURE
--- Content provided by FirstRanker.com ---
In the case of small projects, this may not be in the form of a critical reviewof the literature, but this is often asked for and is a standard part of larger
projects. Sometimes students are asked to write Review of Literature on a topic
--- Content provided by FirstRanker.com ---
as a piece of work in its own right. In its simplest form, the review of literature
is a list of relevant books and other sources, each followed by a description and
--- Content provided by FirstRanker.com ---
comment on its relevance.The literature review should demonstrate that you have read and analysed
--- Content provided by FirstRanker.com ---
the literature relevant to your topic. From your readings, you may get ideas
about methods of data collection and analysis. If the review is part of a project,
--- Content provided by FirstRanker.com ---
you will be required to relate your readings to the issues in the project, and whiledescribing the readings, you should apply them to your topic. A review should
include only relevant studies. The review should provide the reader with a
--- Content provided by FirstRanker.com ---
picture of the state of knowledge in the subject.
--- Content provided by FirstRanker.com ---
Your literature search should establish what previous researches have beencarried out in the subject area. Broadly speaking, there are three kinds of sources
that you should consult:
--- Content provided by FirstRanker.com ---
1. Introductory material;
2. Journal articles and
--- Content provided by FirstRanker.com ---
3. Books.To get an idea about the background of your topic, you may consult one or
--- Content provided by FirstRanker.com ---
more textbooks at the appropriate time. It is a good practice to review in
cumulative stages - that is, do not think you can do it all at one go. Keep a
--- Content provided by FirstRanker.com ---
careful record of what you have searched, how you have gone about it, and the--- Content provided by FirstRanker.com ---
exact citations and page numbers of your readings. Write notes as you go along.Record suitable notes on everything you read and note the methods of
investigations. Make sure that you keep a full reference, complete with page
--- Content provided by FirstRanker.com ---
numbers. You will have to find your own balance between taking notes that are
too long and detailed, and ones too brief to be of any use. It is best to write your
--- Content provided by FirstRanker.com ---
notes in complete sentences and paragraphs, because research has shown thatyou are more likely to understand your notes later if they are written in a way
that other people would understand. Keep your notes from different sources
--- Content provided by FirstRanker.com ---
and/or about different points on separate index cards or on separate sheets of
paper. You will do mainly basic reading while you are trying to decide on your
--- Content provided by FirstRanker.com ---
topic. You may scan and make notes on the abstracts or summaries of work inthe area. Then do a more thorough job of reading later on, when you are more
confident of what you are doing. If your project spans several months, it would
--- Content provided by FirstRanker.com ---
be advisable towards the end to check whether there are any new and recent
references.
--- Content provided by FirstRanker.com ---
REFERENCES
There are many methods of referencing your work; some of the most
--- Content provided by FirstRanker.com ---
common ones are the Numbered Style, American Psychological Association
Style and the Harvard Method, with many other variations. Just use the one
--- Content provided by FirstRanker.com ---
you are most familiar and comfortable with. Details of all the works referredby you should be given in the reference section.
THE PRESENTATION OF REPORT
--- Content provided by FirstRanker.com ---
Well-produced, appropriate illustrations enhance the presentability of
a report. With today's computer packages, almost anything is possible.
--- Content provided by FirstRanker.com ---
However, histograms, bar charts and pie charts are still the three 'staples'.Readers like illustrated information, because it is easier to absorb and it's more
memorable. Illustrations are useful only when they are easier to understand than
--- Content provided by FirstRanker.com ---
words or figures and they must be relevant to the text. Use the algorithm
--- Content provided by FirstRanker.com ---
included to help you decide whether or not to use an illustration. They should
never be included for their own sake, and don't overdo it; too many
--- Content provided by FirstRanker.com ---
illustrations distract the attention of readers.--- Content provided by FirstRanker.com ---
UNIT V2. TYPES OF REPORTS: CHARACTERISTICS OF GOOD RESEARCH
--- Content provided by FirstRanker.com ---
REPORT
--- Content provided by FirstRanker.com ---
Lesson Outline:Different types of Reports
--- Content provided by FirstRanker.com ---
Technical Reports
--- Content provided by FirstRanker.com ---
General Reports
--- Content provided by FirstRanker.com ---
Reporting Styles
--- Content provided by FirstRanker.com ---
Characteristics of a Good ReportLearning Objectives:
--- Content provided by FirstRanker.com ---
After reading this lesson, you should be able to:
o Understand different types of reports
--- Content provided by FirstRanker.com ---
o Technical Reports and their contentso General Reports
o Different types of Writing styles
--- Content provided by FirstRanker.com ---
o Essential characteristics of a Good Report
--- Content provided by FirstRanker.com ---
Reports vary in length and type. Students' study reports are often calledTerm papers, project reports, theses, dissertations depending on the nature of the
report. Reports of researchers are in the form of monographs, research papers,
--- Content provided by FirstRanker.com ---
research thesis, etc. In business organizations a wide variety of reports are
under use: project reports, annual reports of financial statements, report of
--- Content provided by FirstRanker.com ---
consulting groups, project proposals etc. News items in daily papers are alsoone form of report writing. In this lesson, let us identify different forms of
reports and their major components.
--- Content provided by FirstRanker.com ---
Types of Reports
--- Content provided by FirstRanker.com ---
Reports may be categorized broadly as Technical Reports and General
--- Content provided by FirstRanker.com ---
Reports based on the nature of methods, terms of reference and the extent of in-depth enquiry made etc. On the basis of usage pattern, the reports may also be
classified as Information oriented reports, decision oriented reports and research
--- Content provided by FirstRanker.com ---
based reports. Further, reports may also differ based on the communication
situation. For example, the reports may be in the form of Memo, which is
--- Content provided by FirstRanker.com ---
appropriate for informal situations or for short periods. On the other hand, theprojects that extend over a period of time, often call for project reports. Thus,
there is no standard format of reports. The most important thing that helps in
--- Content provided by FirstRanker.com ---
classifying the reports is the outline of its purpose and answers for the following
questions:
--- Content provided by FirstRanker.com ---
What did you do?
--- Content provided by FirstRanker.com ---
Why did you choose the particular research method that you used?
--- Content provided by FirstRanker.com ---
What did you learn and what are the implications of what you learned?
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
If you are writing a recommendation report, what action are yourecommending in response to what you learned?
--- Content provided by FirstRanker.com ---
Two types of report formats are described below:
--- Content provided by FirstRanker.com ---
A Technical ReportA Technical report mainly focuses on methods employed, assumptions
--- Content provided by FirstRanker.com ---
made while conducting a study, detailed presentation of findings and drawing
--- Content provided by FirstRanker.com ---
inferences and comparisons with earlier findings based on the type of data
drawn from the empirical work.
--- Content provided by FirstRanker.com ---
An outline of a Technical Report mostly consists of the following:Title and Nature of the Study:
Brief title and the nature of work sometimes followed by subtitle indicate more
--- Content provided by FirstRanker.com ---
appropriately either the method or tools used. Description of objectives of the
study, research design, operational terms, working hypothesis, type of analysis
--- Content provided by FirstRanker.com ---
and data required should be present.Abstract
of
--- Content provided by FirstRanker.com ---
Findings:
--- Content provided by FirstRanker.com ---
A brief review of the main findings just can be made either in a paragraph or inone/two pages.
Review of current status:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
A quick review of past observations and contradictions reported, applicationsobserved and reported are reviewed based on the in-house resources or based on
published observations.
--- Content provided by FirstRanker.com ---
Sampling and Methods employed
--- Content provided by FirstRanker.com ---
Specific methods used in the study and their limitations. In the case of
experimental methods, the nature of subjects and control conditions are to be
--- Content provided by FirstRanker.com ---
specified. In the case of sample studies, details of the sample design i.e., sample
size, sample selection etc are given.
--- Content provided by FirstRanker.com ---
Data sources and experiment conductedSources of data, their characteristics and limitations should be specified. In the
--- Content provided by FirstRanker.com ---
case of primary survey, the manner in which data has been collected should be
described.
--- Content provided by FirstRanker.com ---
Analysis of data and tools used.--- Content provided by FirstRanker.com ---
The analysis of data and presentation of findings of the study with supporting
data in the form of tables and charts are to be narrated. This constitutes the
--- Content provided by FirstRanker.com ---
major component of the research report.--- Content provided by FirstRanker.com ---
Summary of findings--- Content provided by FirstRanker.com ---
A detailed summary of findings of the study and major observations should be
stated. Decision inputs if any, policy implications from the observations should
--- Content provided by FirstRanker.com ---
be specified.References
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
A brief list of studies conducted on similar lines, either preceding the presentstudy or conducted under different experimental conditions is listed.
Technical appendices
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
These appendices include the design of experiments or questionnaires used inconducting the study, mathematical derivations, elaboration on particular
techniques of analysis etc.
--- Content provided by FirstRanker.com ---
General Reports
General reports often relate popular policy issues mostly related to social
--- Content provided by FirstRanker.com ---
issues. These reports are generally simple, less technical, good use of tables andcharts. Most often they reflect the journalistic style. Example for this type of
report is the "Best B-Schools Survey in Business Magazines". The outline of
--- Content provided by FirstRanker.com ---
these reports is as follows:
1.
--- Content provided by FirstRanker.com ---
Major Findings and their implications2.
Recommendations for Action
--- Content provided by FirstRanker.com ---
3.
Objectives of the Study
--- Content provided by FirstRanker.com ---
4.Method employed for collecting data
5.
--- Content provided by FirstRanker.com ---
Results
--- Content provided by FirstRanker.com ---
Writing StylesThere are atleast 3 distinct report writing styles that can be applied by
--- Content provided by FirstRanker.com ---
students of Business Studies. They are called:
i.
--- Content provided by FirstRanker.com ---
Conservative--- Content provided by FirstRanker.com ---
ii.Key points
iii.
--- Content provided by FirstRanker.com ---
Holistic
--- Content provided by FirstRanker.com ---
i.Conservative Style
Essentially, the conservative approach takes the best structural elements from
--- Content provided by FirstRanker.com ---
essay writing and integrates these with appropriate report writing tools. Thus,
headings are used to deliberate upon different sections of the answer. In
--- Content provided by FirstRanker.com ---
addition, the space is well utilized by ensuring that each paragraph is distinct(perhaps separated from other paragraphs by leaving two blank lines in
between).
--- Content provided by FirstRanker.com ---
ii.
--- Content provided by FirstRanker.com ---
Key Point StyleThis style utilizes all of the report writing tools and is thus more overtly `report-
looking'. Use of headings, underlining, margins, diagrams and tables are
--- Content provided by FirstRanker.com ---
common. Occasionally reporting might even use indentation and dot points. The
important thing to remember is that the tools should be applied in a way that
--- Content provided by FirstRanker.com ---
adds to the report. The question must be addressed and the tools applied shouldassist in doing that. An advantage of this style is the enormous amount of
information that can be delivered relatively quickly.
--- Content provided by FirstRanker.com ---
iii.
--- Content provided by FirstRanker.com ---
Holistic StyleThe most complex and unusual of the styles, holistic report writing aims to
answer the question from a thematic and integrative perspective. This style of
--- Content provided by FirstRanker.com ---
report writing requires the researcher to have a strong understanding of the
course and is able to see which outcomes are being targeted by the question.
--- Content provided by FirstRanker.com ---
Essentials of a Good Report:
Good research report should satisfy some of the following basic characteristics:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
STYLE
Reports should be easy to read and understand. The style of the writer
--- Content provided by FirstRanker.com ---
should ensure that sentences are succinct and the language used is simple, to thepoint and avoiding excessive jargon.
LAYOUT
--- Content provided by FirstRanker.com ---
A good layout enables the reader to follow the report's intentions,
and aids the communication process. Sections and paragraphs should be given
--- Content provided by FirstRanker.com ---
headings and sub-headings. You may also consider a system of numbering orlettering to identify the relative importance of paragraphs and sub-paragraphs.
Bullet points are an option for highlighting important points in your report.
--- Content provided by FirstRanker.com ---
ACCURACY
Make sure everything you write is factually accurate. If you would
--- Content provided by FirstRanker.com ---
mislead or misinform, you will be doing a disservice not only to yourself butalso to the readers, and your credibility will be destroyed. Remember to refer to
any information you have used to support your work.
--- Content provided by FirstRanker.com ---
CLARITY
--- Content provided by FirstRanker.com ---
Take a break from writing. When you would come back to it, you'll
have the degree of objectivity that you need. Use simple language to express
--- Content provided by FirstRanker.com ---
your point of view.
READABILITY
--- Content provided by FirstRanker.com ---
Experts agree that the factors, which affect readability the most, are:>
Attractive appearance
--- Content provided by FirstRanker.com ---
>
Non-technical subject matter
--- Content provided by FirstRanker.com ---
>Clear and direct style
>
--- Content provided by FirstRanker.com ---
Short sentences
--- Content provided by FirstRanker.com ---
>
Short and familiar words
--- Content provided by FirstRanker.com ---
REVISION
When first draft of the report is completed, it should be put to one side
--- Content provided by FirstRanker.com ---
atleast for 24 hours. The report should then be read as if with eyes of the intended
reader. It should be checked for spelling and grammatical errors. Remember the
--- Content provided by FirstRanker.com ---
spell and grammar check on your computer. Use it!REINFORCEMENT
--- Content provided by FirstRanker.com ---
Reinforcement usually gets the message across. This old adage is well
known and is used to good effect in all sorts of circumstances e.g., presentations
--- Content provided by FirstRanker.com ---
- not just report writing.>
TELL THEM WHAT YOU ARE GOING TO SAY: in the introduction and
--- Content provided by FirstRanker.com ---
summary you set the scene for what follows in your report.
>
--- Content provided by FirstRanker.com ---
THEN SAY IT : you spell things out in results/findings>
THEN TELL THEM WHAT YOU SAID: you remind your readers through the
--- Content provided by FirstRanker.com ---
discussion what it was all about.
--- Content provided by FirstRanker.com ---
FEEDBACK MEETINGIt is useful to circulate copies of your report prior to the feedback
meeting. Meaningful discussion can then take place during the feedback meeting
--- Content provided by FirstRanker.com ---
with recommendations for change more likely to be agreed upon which can then
be included in your conclusion. The following questions should be asked at this
--- Content provided by FirstRanker.com ---
stage to check whether the Report served the purpose:>
Does the report have impact?
--- Content provided by FirstRanker.com ---
>
Do the summary /abstract do justice to the report?
--- Content provided by FirstRanker.com ---
>Does the introduction encourage the reader to read more?
--- Content provided by FirstRanker.com ---
>
--- Content provided by FirstRanker.com ---
Is the content consistent with the purpose of the report?
>
--- Content provided by FirstRanker.com ---
Have the objectives been met?>
Is the structure logical and clear?
--- Content provided by FirstRanker.com ---
>
Have the conclusions been clearly stated?
--- Content provided by FirstRanker.com ---
>Are the recommendations based on the conclusions and expressed
clearly and logically?
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
UNIT V
--- Content provided by FirstRanker.com ---
3. FORMAT AND PRESENTATION OF A REPORT
--- Content provided by FirstRanker.com ---
Lesson Outline:
--- Content provided by FirstRanker.com ---
Importance of Presentation of a ReportCommon Elements of a Format
--- Content provided by FirstRanker.com ---
Title Page
--- Content provided by FirstRanker.com ---
Introductory Pages
--- Content provided by FirstRanker.com ---
Body of the Text
--- Content provided by FirstRanker.com ---
ReferencesAppendix
--- Content provided by FirstRanker.com ---
Dos and Don'ts
--- Content provided by FirstRanker.com ---
Presentation of Reports
--- Content provided by FirstRanker.com ---
Learning Objectives:
--- Content provided by FirstRanker.com ---
After reading this Lesson, you should be able to:--- Content provided by FirstRanker.com ---
Understand the importance of Format of a Report
--- Content provided by FirstRanker.com ---
Contents of a Title PageWhat should be in Introductory pages
--- Content provided by FirstRanker.com ---
Contents of a Body Text
--- Content provided by FirstRanker.com ---
How to report other studies
--- Content provided by FirstRanker.com ---
Contents of an Appendix
--- Content provided by FirstRanker.com ---
Dos and Don'ts of a Report--- Content provided by FirstRanker.com ---
Any report serves its purpose, if it is finally presented before the
stakeholders of the work. In the case of an MBA student, Project Work
--- Content provided by FirstRanker.com ---
undertaken in an industrial enterprise and the findings of the study would be
more relevant, if they are presented before the internal managers of the
--- Content provided by FirstRanker.com ---
company. In the case of reports prepared out of consultancy projects, apresentation would help the users to interact with the research team and get
clarification on any issue of their interest. Business Reports or Feasibility
--- Content provided by FirstRanker.com ---
Reports do need a summary presentation, if they have to serve the intended
purpose. Finally, the Research Reports of the scholars would help in achieving
--- Content provided by FirstRanker.com ---
the intended academic purpose, if they are made public in academicsymposiums, seminars or in Public Viva Voce examinations. Thus, the
presentation of a report goes along with preparation of a good report. Further,
--- Content provided by FirstRanker.com ---
the use of graphs, charts, citations and pictures draw the attention of readers and
audience of any type. In this lesson, it is intended to provide a general outline
--- Content provided by FirstRanker.com ---
related to the presentation of any type of report. See Exhibit IExhibit I
Common Elements of a Report
--- Content provided by FirstRanker.com ---
A report may contain some or all of the following, please refer to your
--- Content provided by FirstRanker.com ---
departmental guidelines.MEMORANDUM OR COVERING LETTER
--- Content provided by FirstRanker.com ---
Memorandum Or Covering Letter is a brief note stating the purpose or giving an
explanation that is used when the report is sent to someone within the same
--- Content provided by FirstRanker.com ---
organization.--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
TITLE PAGE
It is addressed to the receiver of a report while giving an explanation for
--- Content provided by FirstRanker.com ---
it, and is used when the report is for someone who does not belong to the sameorganization as the writer. It contains a descriptive heading or name. It may also
contain author's name, position, company's name and so on.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
EXECUTIVE SUMMARY
--- Content provided by FirstRanker.com ---
Executive Summary summarizes the main contents and is usually of about300-350 words.
TABLE OF CONTENTS
--- Content provided by FirstRanker.com ---
Table of Contents consists of a list of the main sections, indicating the page on
which each section begins.
--- Content provided by FirstRanker.com ---
INTRODUCTIONInforms the reader of what the report is about--aim and purpose, significant
issues, any relevant background information.
--- Content provided by FirstRanker.com ---
REVIEW OF LITERATURE
Presents critical analysis of the available research to build a base for the present
--- Content provided by FirstRanker.com ---
study.METHODOLOGY
Gives details about nature of the study, research design, sample, and tools used
--- Content provided by FirstRanker.com ---
for data collection and analysis.
RESULTS
--- Content provided by FirstRanker.com ---
Presents findings of the study.DISCUSSION
Describes the reasoning and research in detail.
--- Content provided by FirstRanker.com ---
CONCLUSION/S
Summarizes the main points made in the written work in the light of objectives.
--- Content provided by FirstRanker.com ---
It often includes an overall answer to the problem/s addressed; or an overallstatement synthesizing the strands of information dealt with.
--- Content provided by FirstRanker.com ---
RECOMMENDATION/S OR IMPLICATIONS
Gives suggestions related to the issue(s) or problem(s) dealt with. It may
--- Content provided by FirstRanker.com ---
highlight the applications of the findings under Implications Section.
--- Content provided by FirstRanker.com ---
REFERENCESAn alphabetical list of all sources referred in the report.
APPENDICES
--- Content provided by FirstRanker.com ---
Extra information of further details placed after the main body of the text.
FORMATS OF REPORTS
--- Content provided by FirstRanker.com ---
Before attempting to look into Presentation dimensions of a Report, a quick lookinto standard format associated with a Research Report is examined hereunder.
The format generally includes the steps one should follow while writing and
--- Content provided by FirstRanker.com ---
finalizing their research report.
Different Parts of a Report
--- Content provided by FirstRanker.com ---
Generally different parts of a report include:1.
Cover Page / Title Page
--- Content provided by FirstRanker.com ---
2.
Introductory Pages ( Foreword, Preface, Acknowledgement, Table of
--- Content provided by FirstRanker.com ---
Contents, List of Tables, List of Illustrations or Figures, Key words /Abbreviations used etc.)
3.
--- Content provided by FirstRanker.com ---
Contents of the Report (which generally includes a Macro setting,
Research Problem, Methodology used, Objectives of the study, Review of
--- Content provided by FirstRanker.com ---
studies, Tools Used for Data Collection and Analysis, Empirical results inone/two sections, Summary of Observations etc.)
4.
--- Content provided by FirstRanker.com ---
References (including Appendices, Glossary of terms used, Source data,
Derivations of Formulas for Models used in the analysis etc.)
--- Content provided by FirstRanker.com ---
Title Page:
The Cover page or Title Page of a Research Report should contain the following
--- Content provided by FirstRanker.com ---
information:
--- Content provided by FirstRanker.com ---
1.
Title of the Project / Subject
--- Content provided by FirstRanker.com ---
2.Who has conducted the study
3.
--- Content provided by FirstRanker.com ---
For What purpose
4.
--- Content provided by FirstRanker.com ---
Organization5.
Period of submission
--- Content provided by FirstRanker.com ---
A Model:
--- Content provided by FirstRanker.com ---
An example of a Summer Project Report conducted by an MBA studentgenerally follows the following Title Page:
A STUDY ON THE USE OF COMPUTER TECHNOLOGY IN BANKING
--- Content provided by FirstRanker.com ---
OPERATIONS IN XXX BANK LTD., PONDICHERRY
--- Content provided by FirstRanker.com ---
A SUMMER PROJECT REPORTPREPARED BY
Ms. MADAVI LATHA
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Submitted at--- Content provided by FirstRanker.com ---
SCHOOL OF MANAGEMENTPONDICHERRY UNIVERSITY
PONDICHERRY ? 605 014
--- Content provided by FirstRanker.com ---
2006
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Introductory Pages:
--- Content provided by FirstRanker.com ---
Introductory pages generally do not constitute the Write up of the Research
work done. These introductory pages basically form the Index of the work
--- Content provided by FirstRanker.com ---
done. These pages are usually numbered in Roman numerical (eg, I, ii, iii etc).The introductory pages include the following components
--- Content provided by FirstRanker.com ---
Foreword
--- Content provided by FirstRanker.com ---
PrefaceAcknowledgements
--- Content provided by FirstRanker.com ---
Table of Contents
--- Content provided by FirstRanker.com ---
List of Tables
--- Content provided by FirstRanker.com ---
List of Figures / Charts
--- Content provided by FirstRanker.com ---
Foreword is usually one page write up or a citation about the work byany eminent / popular personality or a specialist in the given field of study.
Generally, the write up includes a brief background on the contemporary issues
--- Content provided by FirstRanker.com ---
and suitability of the present subject and its timeliness, major highlights of the
--- Content provided by FirstRanker.com ---
present work, brief background of the author etc. The writer of the Foreword
generally gives the Foreword on his letter head
--- Content provided by FirstRanker.com ---
Preface is again one/two pages write up by the author of the book /
report stating circumstances under which the present work is taken up,
--- Content provided by FirstRanker.com ---
importance of the work, major dimensions examined and intended audience for
the given work. The author gives his signature and address at the bottom of the
--- Content provided by FirstRanker.com ---
page along with date and year of the workAcknowledgements is a short section, mostly a paragraph. It mostly
consists of sentences giving thanks to all those associated and encouraged to
--- Content provided by FirstRanker.com ---
carry out the present work. Generally, author takes time to acknowledge the
liberal funding by any funding agency to carry out the work, and agencies which
--- Content provided by FirstRanker.com ---
had given permission to use their resources. At the end, the author thankseverybody and gives his signature.
Table of Contents refers to the index of all pages of the said Research
--- Content provided by FirstRanker.com ---
Report. These contents provide the information about the chapters, sub-
sections, annexure for each chapter, if any, etc. Further, the page numbers of the
--- Content provided by FirstRanker.com ---
content of the report greatly helps any one to refer to those pages for necessarydetails. Most authors use different forms while listing the sub contents. These
include alphabet classification and decimal classification. Examples for both of
--- Content provided by FirstRanker.com ---
them are given below:
Example of content sheet (alphabet classification)
--- Content provided by FirstRanker.com ---
An example of Content Sheet with decimal classification
--- Content provided by FirstRanker.com ---
CONTENTS
Foreword
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
i
Preface
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
iii
--- Content provided by FirstRanker.com ---
Acknowledgement
--- Content provided by FirstRanker.com ---
vChapter I (Title of the Chapter) INTRODUCTION
--- Content provided by FirstRanker.com ---
1.Macro
Economic
--- Content provided by FirstRanker.com ---
Background
--- Content provided by FirstRanker.com ---
12. Performance of a specific industry sector
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
63. Different studies conducted so far
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
94
Nature
--- Content provided by FirstRanker.com ---
and
Scope
--- Content provided by FirstRanker.com ---
174.1.
Objectives
--- Content provided by FirstRanker.com ---
of
the
--- Content provided by FirstRanker.com ---
study18
4.2.
--- Content provided by FirstRanker.com ---
Methodology
adopted
--- Content provided by FirstRanker.com ---
194.2.
a.
--- Content provided by FirstRanker.com ---
Sampling
Procedure
--- Content provided by FirstRanker.com ---
adopted20
4.2.b.
--- Content provided by FirstRanker.com ---
Year
of
--- Content provided by FirstRanker.com ---
thestudy 20
--- Content provided by FirstRanker.com ---
Chapter II (Title of the Chapter): Empirical Results I--- Content provided by FirstRanker.com ---
22
--- Content provided by FirstRanker.com ---
1. Test results of H1
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
22
--- Content provided by FirstRanker.com ---
2. Test Results of H2
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
27
--- Content provided by FirstRanker.com ---
3 Test Results of H3
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
323.1.
Sub
--- Content provided by FirstRanker.com ---
Hypothesis
of
--- Content provided by FirstRanker.com ---
H333
3.2.
--- Content provided by FirstRanker.com ---
Sub
Hypothesis
--- Content provided by FirstRanker.com ---
ofH2
37
--- Content provided by FirstRanker.com ---
Chapter III
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
45Chapter IV
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
85
Chapter V (Summary & Conclusions)
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
120
--- Content provided by FirstRanker.com ---
Appendices
132
--- Content provided by FirstRanker.com ---
References/Bibliography
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
135
--- Content provided by FirstRanker.com ---
Glossary
140
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
List of Tables and Charts:
--- Content provided by FirstRanker.com ---
Details of Charts and Tables given in the research Report are numberedand presented on separate pages and the lists of such tables and charts are given
on a separate page. Tables are generally numbered either in Arabic numerals or
--- Content provided by FirstRanker.com ---
in decimal form. In the case of decimal form, it is possible to indicate the
chapter to which the said table belongs. For example, Table 2.1 refers to Table
--- Content provided by FirstRanker.com ---
1 in Chapter 2.Executive Summary:
--- Content provided by FirstRanker.com ---
Most Business Reports or Project works conducted on a specific issue
carry one or two pages of Executive Summary. This summary precedes the
--- Content provided by FirstRanker.com ---
Chapters of the Regular Research Report. This summary generally contains abrief description of problem under enquiry, methods used and the findings. A
line about the possible alternatives for decision making would be the last line of
--- Content provided by FirstRanker.com ---
the Executive Summary.
--- Content provided by FirstRanker.com ---
BODY OF THE REPORT:The body of the Report is the most important part of the report. This
body of report may be segmented into a handful of Units or Chapters arranged in
--- Content provided by FirstRanker.com ---
a sequential order. Research Report often present the Methodology, Objectives
of the study, Data tools, etc in the first or second chapters along with a brief
--- Content provided by FirstRanker.com ---
background of the study, review of relevant studies. The major findings of thestudy are incorporated into two or three chapters based on the major or minor
hypothesis tested or based on the sequence of objectives of the study. Further,
--- Content provided by FirstRanker.com ---
the chapter plan may also be based likely on different dimensions of the problem
under enquiry.
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
Each Chapter may be divided into sections. While the first section may narratethe descriptive characteristics of the problem under enquiry, the second and
subsequent sections may focus on empirical results based on deeper insights of
--- Content provided by FirstRanker.com ---
the problem of study. Each chapter based on Research Studies mostly contain
Major Headings, Sub headings, quotations drawn from observations made by
--- Content provided by FirstRanker.com ---
earlier writers, footnotes and exhibits.Use of References:
There are two types of reference formatting. The first is the 'in-text' reference
--- Content provided by FirstRanker.com ---
format, where previous researchers and authors are cited during the building of
arguments in the Introduction and Discussion sections. The second type of
--- Content provided by FirstRanker.com ---
format is that adopted for the Reference section for writing footnotes orBibliography.
Citations in the text
--- Content provided by FirstRanker.com ---
The names and dates of researchers go in the text as they are mentioned e.g.,
"This idea has been explored in the work of Smith (1992)." It is generally
--- Content provided by FirstRanker.com ---
unacceptable to refer to authors and previous researchers etc.Examples of Citing References (Single author)
Duranti (1995) has argued or It has been argued that (Duranti, 1995)
--- Content provided by FirstRanker.com ---
In the case of more authors,
Moore, Maguire, and Smyth (1992) proposed or It has been proposed that
--- Content provided by FirstRanker.com ---
(Moore, Macquire, & Smyth, 1992)
For subsequent citations in the same report: Moore et al.(1992) also proposed... or It
--- Content provided by FirstRanker.com ---
has also been proposed that. . . . (Moore et al., 1992)
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
The reference section:
--- Content provided by FirstRanker.com ---
The report ends with reference section, which comes immediately after the
Recommendations and begins on a new page. It is titled as 'References' in upper
--- Content provided by FirstRanker.com ---
and lower case letters centered across the page.Published Journal Articles
Beckerian, D.A. (1993). In search of the typical eyewitness. American
--- Content provided by FirstRanker.com ---
Psychologist, 48, 574-576.
Gubbay, S.S., Ellis, W., Walton, J.N., and Court, S.D.M. (1965). Clumsy
--- Content provided by FirstRanker.com ---
children: A study of apraxic and agnosic defects in 21 children. Brain, 88, 295-312.
Authored Books
--- Content provided by FirstRanker.com ---
Cone, J.D., and Foster, S.L. (1993). Dissertations and theses from start to finish:
Psychology and related fields. Washington, DC: American Psychological
--- Content provided by FirstRanker.com ---
Association.Cone, J.D., and Foster, S.L. (1993). Dissertations and theses from start to finish:
Psychology and related fields (2nd ed.). Washington, DC: American
--- Content provided by FirstRanker.com ---
Psychological Association.
--- Content provided by FirstRanker.com ---
APPENDICES:The purpose of the appendices is to supplement the main body of your text and
provide additional information that may be of interest to the reader.
--- Content provided by FirstRanker.com ---
There is no major heading for the Appendices. You simply need to
include each one, starting on a new page, numbered, using capital letters, and
--- Content provided by FirstRanker.com ---
headed with a centered brief descriptive title. For example:Appendix A: List of stimulus words presented to the participants
Dos and Don'ts of Report Writing
--- Content provided by FirstRanker.com ---
1. Choose a font size that is not too small or too large; 11 or 12 is a good font
size to use.
--- Content provided by FirstRanker.com ---
2. Acknowledgment need not be a separate page, except in the final report. In fact,--- Content provided by FirstRanker.com ---
you could just drop it altogether for the first- and second-stage reports. Yourguide already knows how much you appreciate his/her support. Express your
gratitude by working harder instead of writing a flowery acknowledgment.
--- Content provided by FirstRanker.com ---
3. Make sure your paragraphs have some indentation and that it is not too large.
Refer to some text books or journal papers if you are not sure.
--- Content provided by FirstRanker.com ---
4. If figures, equations, or trends are taken from some reference, the reference mustbe cited right there, even if you have cited it earlier.
5. The correct way of referring to a figure is Fig. 4 or Fig. 1.2 (note that there is a
--- Content provided by FirstRanker.com ---
space after Fig.). The same applies to Section, Equation, etc. (e.g., Sec. 2, Eq.
3.1).
--- Content provided by FirstRanker.com ---
6. Cite a reference as, for example, "The threshold voltage is a strong function ofthe implant dose [1]." Note that there must be a space before the bracket.
7. Follow some standard format while writing references. For example, you could
--- Content provided by FirstRanker.com ---
look up any IEEE transactions issue and check out the format for journal papers,
books, conference papers, etc.
--- Content provided by FirstRanker.com ---
8. Do not type references (for that matter, any titles or captions) entirely in capitalletters. The only capital letters required are (i) the first letter of a name, (ii)
acronyms, (iii) the first letter of the title of an article (iv) the first letter of a
--- Content provided by FirstRanker.com ---
sentence.
9. The order of references is very important. In the list of your references, the first
--- Content provided by FirstRanker.com ---
reference must be the one which is cited before any other reference, and so on.Also, every reference in the list must be cited at least once (this also applies to
figures). In handling references and figure numbers, Latex turns out to be far
--- Content provided by FirstRanker.com ---
better than Word.
12. Many commercial packages allow "screen dump" of figures. While this is useful
--- Content provided by FirstRanker.com ---
in preparing reports, it is often very wasteful (in terms of toner or ink) since thebackground is black. Please see if you can invert the image or use a plotting
program with the raw data such that the background is white.
--- Content provided by FirstRanker.com ---
13. The following tips may be useful: (a) For Windows, open the file in
--- Content provided by FirstRanker.com ---
Paint and select Image/Invert Colors. (b) For Linux, open the file in Image
Magick (this can be done by typing display) and then selecting Enhance/Negate.
--- Content provided by FirstRanker.com ---
14. As far as possible, place each figure close to the part of the text where it isreferred to.
15. A list of figures is not required except for the final project report. It generally
--- Content provided by FirstRanker.com ---
does not do more than wasting paper.
16. The figures, when viewed together with the caption, must be, as far as possible,
--- Content provided by FirstRanker.com ---
self-explanatory. There are times when one must say, "see text for details".However, this is an exception and not a rule.
17. The purpose of a figure caption is simply to state what is being presented in the
--- Content provided by FirstRanker.com ---
figure. It is not the right place for making comments or comparisons; that should
appear only in the text.
--- Content provided by FirstRanker.com ---
18. If you are showing comparison of two (or more) quantities, use the samenotation through out the report. For example, suppose you want to compare
measured data with analytical model in four different figures, in each figure,
--- Content provided by FirstRanker.com ---
make sure that the measured data is represented by the same line type or symbol.
The same should be followed for the analytical model. This makes it easier for
--- Content provided by FirstRanker.com ---
the reader to focus on the important aspects of the report rather than getting lostin lines and symbols.
19. If you must resize a plot or a figure, make sure that you do it simultaneously in
--- Content provided by FirstRanker.com ---
both x and y directions. Otherwise, circles in the original figure will appear as
ellipses, letters will appear too fat or too narrow, and other similar calamities
--- Content provided by FirstRanker.com ---
will occur.20. In the beginning of any chapter, you need to add a brief introduction and then
start sections. The same is true about sections and subsections. If you have
--- Content provided by FirstRanker.com ---
sections that are too small, it only means that there is not enough material to
make a separate section. In that case, do not make a separate section. Include the
--- Content provided by FirstRanker.com ---
same material in the main section or elsewhere.
--- Content provided by FirstRanker.com ---
Remember, a short report is perfectly acceptable if you have put in the effort andcovered all important aspects of your work. Adding unnecessary sections and
subsections will create the impression that you are only covering up the lack of
--- Content provided by FirstRanker.com ---
effort.
22. Do not make one-line paragraphs.
--- Content provided by FirstRanker.com ---
23. Always add a space after a full stop, comma, colon, etc. Also, leave a spacebefore opening a bracket. If the sentence ends with a closing bracket, add the
full stop (or comma or semicolon, etc) after the bracket.
--- Content provided by FirstRanker.com ---
24. Do not add a space before a full stop, comma, colon, etc.
25. Using a hyphen can be tricky. If two (or more) words form a single adjective, a
--- Content provided by FirstRanker.com ---
hyphen is required; otherwise, it should not be used. For example, (a) A short-channel device shows a finite output conductance. (b) This is a good example of
mixed-signal simulation. (c)Several devices with short channels were studied.
--- Content provided by FirstRanker.com ---
26. If you are using Latex, do not use the quotation marks to open. If you do that,
you get "this". Use the single opening quotes (twice) to get "this".
--- Content provided by FirstRanker.com ---
27. Do not use very informal language. Instead of "This theory should be taken witha pinch of salt," you might say, "This theory is not convincing," or "It needs
more work to show that this theory applies in all cases."
--- Content provided by FirstRanker.com ---
28. Do not use "&"; write "and" instead. Do not write "There're" for "There are" etc.
29. If you are describing several items of the same type (e.g., short-channel effects
--- Content provided by FirstRanker.com ---
in a MOS transistor), use the "list" option; it enhances the clarity of your report.30. Do not use "bullets" in your report. They are acceptable in a presentation, but
not in a formal report. You may use numerals or letters instead.
--- Content provided by FirstRanker.com ---
31. Whenever in doubt, look up a text book or a journal paper to verify whether
your grammar and punctuation are correct.
--- Content provided by FirstRanker.com ---
32. Do a spell check before you print out your document. It always helps.33. Always write the report so that the reader can easily make out what your
--- Content provided by FirstRanker.com ---
contribution is. Do not leave the reader guessing in this respect.
34. Above all, be clear. Your report must have a flow, i.e., the reader must be able to
--- Content provided by FirstRanker.com ---
appreciate continuity in the report. After the first reading, the reader should be
able to understand (a) the overall theme and (b) what is new (if it is a project
--- Content provided by FirstRanker.com ---
report).35. Plagiarism is a very serious offense. You simply cannot copy material from an
existing report or paper and put it verbatim in your report. The idea of writing a
--- Content provided by FirstRanker.com ---
report is to convey in your words what you have understood from the literature.
The above list may seem a little intimidating. However, if you make a
--- Content provided by FirstRanker.com ---
sincere effort, most of the points are easy to remember and practice. Asupplementary exercise that will help you immensely is that of looking for all
major and minor details when you read an article from a newspaper or a
--- Content provided by FirstRanker.com ---
magazine, such as grammar, punctuation, organization of the material, etc.
--- Content provided by FirstRanker.com ---
PRESENTATION OF A REPORTIn this section, we will look into the issues associated with presentation of a
Research Report by the Researcher or principal investigator. While preparing
--- Content provided by FirstRanker.com ---
for the presentation of a report, the researchers should focus on the following
issues:
--- Content provided by FirstRanker.com ---
--- Content provided by FirstRanker.com ---
1.
--- Content provided by FirstRanker.com ---
What is the purpose of the report and issues on which the Presentationhas to focus?
2.
--- Content provided by FirstRanker.com ---
Who are the stakeholders and what are their areas of interest?
3.
--- Content provided by FirstRanker.com ---
The mode and media of presentation.4.
Extent of Coverage and depth to address at.
--- Content provided by FirstRanker.com ---
5.
--- Content provided by FirstRanker.com ---
Time, Place and cost associated with presentation.
6.
--- Content provided by FirstRanker.com ---
Audio ? Visual aids intended to be used.--- Content provided by FirstRanker.com ---
Document Outline
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??
?
?
?
--- Content provided by FirstRanker.com ---
??