Loading [MathJax]/jax/output/CommonHTML/jax.js
Skip to main content

Correlation: B.Com. II Notes

 Correlation

In previous blog we discussed about the measure central tendency and Dispersion to use to study the variable. the correlation is a statistical concept that allows us to measure and understand the relationship between two or more variables. it provided a valuable information about that variables. e.g. price and demand of commodity, income and expenditure of family, height and weight of group of persons. their we use the relation   between this two variables. in above examples we see the one variable increases other variable is also changes in same or opposite direction. 

definition: Correlation is statistical tool which study the relationship between two or more variables. for analysis of correlation various method and techniques are used. 

example: i. Demand and supply of product                                                                                                                       ii. price and demand                                                                                                                                           iii. Income and expenditure

Use Of Correlation:

the correlation analysis is widely used in economic, business and other fields. 

i. To Predict: if we know the relation between two variables, we can estimate the value of them when value of other is known. e.g. we know the correlation between height and weight then we calculate the weight for known value of height.

ii. To control: The correlation also enables us to control our activity. e.g. we know the correlation between fertilizer and crop, then we control the yield and life of the crop, (use of more fertilizer is hazard to crop. 

iii. To Plan: the knowledge of correlation help to planning. e.g. if we know the relation between the rainfall and yield of crop, then we know the rainfall we calculate the yield of crop, depend on the yield of crop we  plan for import and export.

Types of Correlation:

i. Positive and Negative Correlation

ii. Linear and non-liner Correlation.

iii, simple, Multiple and  Partial Correlation.

Positive correlation: If both the variables changes in same direction i.e. if one variable increases other variable also increases or if the one variable decreases then other variable also decreases, the correlation is said to be positive correlation. e.g. i. height and weight mean height is increases weight also increases.

Negative Correlation: if the both variables are changes in opposite direction i.e. if one variable increases other variable decreases or if the one variable decreases then other variable also increases. e.g. price and demand mean the price increases demand is decreases.

the difference between the positive and negative correlation depends on the direction of change of two variables.

when change in one variable is not affected on other variable it is no correlation between two variables.

Linear Correlation:  this type of correlation is based on the nature of the graph of two variables. if the graph is straight line the correlation is Linear correlation.

if the graph is not a straight line but is curve is called Non-Linear Correlation.

Simple Correlation: we study only two variable say price and demand it is simple correlation.

Multiple Correlation: we study more than two variables is called multiple correlation.

Partial Correlation: we study the more than two variables but correlation is studied between two variables only, and the effect of the other variable is assumed as constant.

Methods of studying Correlation:

I. Graphical method : Scatter Diagram

II. Mathematical Method:  

i. Karl Pearson Coefficient of correlation 'r'

                                                   ii. Spearman's Rank Correlation Coefficient 'R'

i. Scatter diagram: scatter diagram is a graph showing correlation between two variable. the N pair of values (x1,y1), (x2,y2) .....(xn,yn) of two variables x and y are plotted on the graph or XY plane we get the points or dot's on graph and they are generally scattered so it is called scatter diagram or dot diagram.

the scatter point show the direction and degree of correlation between x and y or any two variable the direction of correlation is denoted as + or -sign and degree by r. 

from the  scattered diagram we interpret the degree and direction of correlation. 

Interpretation of scatter diagram :

i. if the graph shows all the point lie on a rising straight line towards the right, the correlation is perfect positive and r = +1 e.g. correlation between age and height of children , their is strong correlation between  age and height , as children grow older they get taller and this relationship is quite linear in their growth phases.  we see the graph 





ii. if the graph shows all the point lie in falling straight line towards the right, the correlation s perfect negative and r=-1.e.g. the correlation between price and demand if price increases demand decreases.


iii. if all the points lies in narrow strip which rise towards right.  Then the correlation is high degree positive correlation, i.e. 08<r>1.





iv. if all the points lie in narrow strip which rise towards left, then the correlation is high –ve correlation it is -1<r>-0.8





v. if all point rise on broad line or strip rising towards the right the correlation is low degree positive correlation.





vi. if all point rise on broad line or strip rising towards the left the correlation is low degree positive correlation.



vii. if all point are scattered without any pattern, then there is no correlation, i.e. r=0.



Merit and demerit:

Merit:

i.                    It is very easy to draw.

ii.                  It is easy to understand.

iii.                It gives quick idea about the correlation between variables.

Demerits:

i.                    It is not used for mathematical calculation.

ii.                  It no measure the degree of correlation.

iii.                It is not suitable when values are large.

Example 1. To draw a scatter diagram & indicate whether correlation is positive or negative.

Price:

17

18

19

21

25

26

30

Supply:

30

37

36

42

50

51

54

 

To draw scattered diagram we plot the price on x –axis and supply on y-axis. Then we get a scatted diagram.



From this diagram we see the the point are rising hence it is high positive correlation.

Karl Pearson's Coefficient of correlation:

    Karl Pearson's gives the mathematical method to measure the correlation between two variables. based on following assumptions. 

i. there is linear relationship between two variables.

ii. there is cause and effect relationship between two variables.

definition: Karl Pearson's Coefficient of correlation between two variables X and Y, denoted as corr(X,Y) or "r" and it is given as      

r=Cov(X,Y)σXσY

Where:

  • Cov(X,Y) is the covariance of X and Y, calculated as: Cov(X,Y)=1nni=1(xiˉx)(yiˉy)
  • σX is the standard deviation of X.
  • σY is the standard deviation of Y.

The correlation coefficient formula can also be written as:

r=xyx2y2

Where:

  • x=(xˉx)
  • y=(yˉy)

Properties of r:

  • 1. r always lies between -1 to +1.
  • 2. r is unaffected by a change of origin and scale.
  • 3. r is equal to the square root of the product of two regression coefficients, i.e., r=bxybyx.
  • 4. r is free from unit.

Interpretation of r:

  • If r=+1, there is a perfect positive correlation between two variables.
  • If r=1, there is a perfect negative correlation between two variables.
  • If r=0, there is no correlation between two variables.
  • If 0.8<r1, there is a high degree of positive correlation between two variables.
  • If 1r0.8, there is a high degree of negative correlation between two variables.
  • If 0.4r1, there is a low degree of positive correlation between two variables.
  • If 0.4r1, there is a low degree of negative correlation between two variables.
  • The type of correlation depends on the sign of r: if r has a negative sign, it means negative correlation, and if r has a positive sign, it indicates positive correlation.

Merits and Demerits:

Merits:

  • r gives the numerical value of correlation.
  • It is also useful for estimation.

Demerits:

  • The value of r is not affected by extreme values, so it may not provide proper results in some cases.
  • Sometimes, it is difficult to calculate.

Comments

Popular posts from this blog

MCQ'S based on Basic Statistics (For B. Com. II Business Statistics)

    (MCQ Based on Probability, Index Number, Time Series   and Statistical Quality Control Sem - IV)                                                            1.The control chart were developed by ……         A) Karl Pearson B) R.A. fisher C) W.A. Shewhart D) B. Benjamin   2.the mean = 4 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9   3.the mean = 3 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9 4. If sampl...

Measures of Central Tendency :Mean, Median and Mode

Changing Color Blog Name  Measures of Central Tendency  I. Introduction. II. Requirements of good measures. III. Mean Definition. IV . Properties  V. Merits and Demerits. VI. Examples VII.  Weighted Arithmetic Mean VIII. Median IX. Quartiles I. Introduction Everybody is familiar with the word Average. and everybody are used the word average in daily life as, average marks, average of bike, average speed etc. In real life the average is used to represent the whole data, or it is a single figure is represent the whole data. the average value is lies around the centre of the data. consider the example if we are interested to measure the height of the all student and remember the heights of all student, in that case there are 2700 students then it is not possible to remember the all 2700 students height so we find out the one value that represent the height of the all 2700 students in college. therefore the single value represent ...

Business Statistics Notes ( Meaning, Scope, Limitations of statistics and sampling Methods)

  Business Statistics Paper I Notes. Welcome to our comprehensive collection of notes for the Business Statistics!  my aim is to provided you  with the knowledge you need as you begin your journey to comprehend the essential ideas of this subject. Statistics is a science of collecting, Presenting, analyzing, interpreting data to make informed business decisions. It forms the backbone of modern-day business practices, guiding organizations in optimizing processes, identifying trends, and predicting outcomes. I will explore several important topics through these notes, such as: 1. Introduction to Statistics. :  meaning definition and scope of  Statistics. 2. Data collection methods. 3. Sampling techniques. 4. Measures of  central tendency : Mean, Median, Mode. 5. Measures of Dispersion : Relative and Absolute Measures of dispersion,  Range, Q.D., Standard deviation, Variance. coefficient of variation.  6.Analysis of bivariate data: Correlation, Regr...

Classification, Tabulation, Frequency Distribution, Diagrams & Graphical Presentation.

Business Statistics I    Classification, Tabulation, Frequency Distribution ,  Diagrams & Graphical Presentation. In this section we study the following point : i. Classification and it types. ii. Tabulation. iii. Frequency and Frequency Distribution. iv. Some important concepts. v. Diagrams & Graphical Presentation   I. Classification and it's types:        Classification:- The process of arranging data into different classes or groups according to their common  characteristics is called classification. e.g. we dividing students into age, gender and religion. It is a classification of students into age, gender and religion.  Or  Classification is a method used to categorize data into different groups based on the values of specific variable.  The purpose of classification is to condenses the data, simplifies complexities, it useful to comparison and helps to analysis. The following are some criteria to classi...

Measures of Dispersion : Range , Quartile Deviation, Standard Deviation and Variance.

Measures of Dispersion :  I.  Introduction. II. Requirements of good measures. III. Uses of Measures of Dispersion. IV.  Methods Of Studying Dispersion:     i.  Absolute Measures of Dispersions :             i. Range (R)          ii. Quartile Deviation (Q.D.)          iii. Mean Deviation (M.D.)         iv. Standard Deviation (S. D.)         v. Variance    ii.   Relative Measures of Dispersions :              i. Coefficient of Range          ii. Coefficient of Quartile Deviation (Q.D.)          iii. Coefficient of Mean Deviation (M.D.)         iv. Coefficient of Standard Deviation (S. D.)         v. Coefficien...

Basic Concepts of Probability and Binomial Distribution , Poisson Distribution.

 Probability:  Basic concepts of Probability:  Probability is a way to measure hoe likely something is to happen. Probability is number between 0 and 1, where probability is 0 means is not happen at all and probability is 1 means it will be definitely happen, e.g. if we tossed coin there is a 50% chance to get head and 50% chance to get tail, it can be represented in probability as 0.5 for each outcome to get head and tail. Probability is used to help us taking decision and predicting the likelihood of the event in many areas, that are science, finance and Statistics.  Now we learn the some basic concepts that used in Probability:  i) Random Experiment OR Trail: A Random Experiment is an process that get one or more possible outcomes. examples of random experiment include tossing a coin, rolling a die, drawing  a card from pack of card etc. using this we specify the possible outcomes known as sample pace.  ii)Outcome: An outcome is a result of experi...

Statistical Inference I ( Theory of estimation : Efficiency)

🔖Statistical Inference I ( Theory of estimation : Efficiency)  In this article we see the  terms:  I. Efficiency. II. Mean Square Error. III. Consistency. 📚 Efficiency:  We know that  two unbiased estimator of parameter gives rise to infinitely many unbiased estimators of parameter. there if one of parameter have two estimators then the problem is to choose one of the best estimator among the class of unbiased estimators. in that case we need to some other criteria to to find out best estimator. therefore, that situation  we check the variability of that estimator, the measure of variability of estimator T around it mean is Var(T). hence If T is an Unbiased estimator of parameter then it's variance gives good precision. the variance is smaller then it give's greater precision. 📑 i. Efficient estimator: An estimator T is said to be an Efficient Estimator of 𝚹, if T is unbiased estimator of    𝛉. and it's variance is less than any other estima...

The Power of Statistics: A Gateway to Exciting Opportunities

  My Blog The Power of Statistics: A Gateway to Exciting Opportunities     Hey there, future statistician! Ever wondered how Netflix seems to know exactly what shows you'll love, how sports teams break down player performance, or how businesses figure out their pricing strategies? The answer is statistics—a fascinating field that helps us make sense of data in our everyday lives. Let's dive into why choosing statistics for your B.Sc. Part First can lead you to some exciting opportunities.     Why Statistics Matters in Everyday Life     From predicting election outcomes and analyzing social media trends to understanding consumer behavior and optimizing public transport routes, statistics are crucial. It's the backbone of modern decision-making, helping us sift through complex data to uncover meaningful insights that drive innovation and progress.   The Role of Statistics in Future Opportunities ...

Statistical Inference I ( Theory of Estimation) : Unbiased it's properties and examples

 📚Statistical Inference I Notes The theory of  estimation invented by Prof. R. A. Fisher in a series of fundamental papers in around 1930. Statistical inference is a process of drawing conclusions about a population based on the information gathered from a sample. It involves using statistical techniques to analyse data, estimate parameters, test hypotheses, and quantify uncertainty. In essence, it allows us to make inferences about a larger group (i.e. population) based on the characteristics observed in a smaller subset (i.e. sample) of that group. Notation of parameter: Let x be a random variable having distribution function F or f is a population distribution. the constant of  distribution function of F is known as Parameter. In general the parameter is denoted as any Greek Letters as θ.   now we see the some basic terms :  i. Population : in a statistics, The group of individual under study is called Population. the population is may be a group of obj...