Skip to main content

Non-Parametric Test 1.Sign test

Types of Non-Parametric Test :

the non-parametric test is Broadly divided into three categories as:

1. One-sample Test 

2.Two-sample test

3. K-sample test

In this blog we discuss the one-sample tests, 

there are four types of one-sample non-parametric test as follows: 

1. Run Test.

2.Sign Test

3. Wilcoxon Sing-Rank test

4. Kolmogorov-smirnov test.

first we discuss the sign test.

1.Sign Test

The sign test id one of the very simplest non-parametric test. the test is based on signs (i.e. plus and minus signs) hence is called sign test. the non-parametric test are alternative to the parametric one, therefore the sign test is alternative to parametric T-test, is used to test the median of population instead of mean of population. (i.e. Parametric T-test is used to testing mean of population and sign test is used to testing the median of the population they are alternative to each other when the data follows assumption of T-test we use parametric t-test, but i some situation the data dose not following the assumption of parametric test then we use non-parametric Sign test.) also it used if data is in ordinal scale.

assumptions of sign test.:

the sign test id used when the follows following assumptions.

1. the sample selected form population with  un-known median.

2. the selected variable for study is continuous.

3. the selected variable for study ids at least in ordinal scale.

Procedure of test as follows.:

let X1 ,X2, X3, ..........Xn. be a random sample of size n arranged in order of  occurrence. form the population with unknown median. now we wish to test the hypothesis that the specified value m0 (i.e. the hypothetical median is m ) of population median. 

the null and alternative hypothesis is as 

H0m =m0              for two tailed test

V/S H1m ¹m          

the test consist following steps as

Step I: the test based on the signs then we firstly converting the all observations into the sign (means observation converted into plus and minus signs). for converting we subtract the median minto each observation. (i.e.  X1 - m0 ) that is we obtaining the difference for all observations from the median and observing there signs.(i.e. if difference is negative we get negative sign and difference is positive we get positive sign) in that one of the observation is equal to the median (m0 ) then we remove this observation or not consider in analysis part. then the size of sample is reduced and it is denoted as n.

Step II: now we get the Plus and minus signs then we count the number of plus sign and minus signs and it is denoted as S+ for total number of plus signs and S-  for total number of minus signs.

Step III.:  Now we consider the null hypothesis is true then on the basis of postulated value of median  we expect that the value of variable is greater than median mean we get plus sign then the number of plus sign it consider as success and   number of minus sign is consider as failure approximately equal. then the distribution of sign is binomial distribution with parameter (n, p=0.5). for simplicity we consider smaller number of sign. that means the number of plus sign is less than number of minus sign then plus sign is success and minus is failure. similarly if minus signs are less than number of plus sign then minus sign is success and number of plus sign is failure.

Step IV

i. Small Sample Test : (i.e. n  is less than  or equal to 20 ).

if the number of observations are is less than or equal to 20 (i.e. n =<20 ).  is called small sample test.

for Decision about hypothesis we we the p-value and it is determined as 

P-value = P(S <  s)  and where s is equal to 

S = min ( S+,S-)

if the number of observations are is less than or equal to 20 (i.e. n  =<20 ).  is called small sample test.

if the p-value is less than or equal to a%  level of significance then we reject the null hypothesis at a% level of significance otherwise accept the null hypothesis.


Large sample test:(i.e. n  greater than 20 )

if the number of observation is greater than 20 we use large sample test  (i.e. n  >20 ).

  for large sample test we use normal approximation for binomial distribution.

as E(S) = n*1/2 = n/2

and S.D.( S) = n*1/4= n/4

the normal approximation is z test gives as

z = (S - E(S))/S.D.(S) 

Z =  (S - (n/2))/(n/4) 

then we comparing the calculated and tabulated value of z. at  a%  level of significance.

if calculated z is less than or equal to  tabulated (critical value ) then we accept null hypothesis other wise reject the null hypothesis.

Sign Test Example:

The sign test id one of the very simplest non-parametric test. the test is based on signs (i.e. plus and minus signs) hence is called sign test. the non-parametric test are alternative to the parametric one, therefore the sign test is alternative to parametric T-test, is used to test the median of population instead of mean of population. (i.e. Parametric T-test is used to testing mean of population and sign test is used to testing the median of the population they are alternative to each other when the data follows assumption of T-test we use parametric t-test, but i some situation the data dose not following the assumption of parametric test then we use non-parametric Sign test.) also it used if data is in ordinal scale.

if these assumptions follow data set then we apply  sign test to analyse the data.:

the sign test id used when the follows following assumptions.

1. the sample selected form population with  un-known median.

2. the selected variable for study is continuous.

3. the selected variable for study ids at least in ordinal scale.

let X1 ,X2, X3, ..........Xn. be a random sample of size n arranged in order of  occurrence. form the population with unknown median. now we wish to test the hypothesis that the specified value m0 (i.e. the hypothetical median is m ) of population median. 

the null and alternative hypothesis is as 

H0m =m0              for two tailed test

V/S H1m ¹m0             

Example 1. Radom sample of 10 are given by 150, 155, 157, 178, 148, 159, 145, 147, 152, 154.

use sign test to test the average of population is 150. at 5% level of significance.

Answer: Here the distribution of population is not given, then parametric test is not used for that because assumption of normality is for parametric test is not fulfil. then we use non-parametric test.

for testing the average is 150 we use sign test for median  ( in case we consider median instead of mean) now hypothesis is

H0μ = 150

V/S H1μ  150

Firstly we converting the observation into sign because the sign test based on signs if one of the observation is equal to median we remove that observation, observation less than median we take  (-) minus sign and observation greater  than median we take (+) plus sign as follow.

    +       +       +                     +         -          -         +            +.

first observation 150 is equal to median then it remove and 155 is greater than 150 taking (+) plus sign and  145 is less than 150 taking (-) minus sign.

now we counting the number of signs

S+ =Number of positive signs = 6

S- = Number of negative signs = 3

n = total number of plus and minus signs = 9

then for  test statistic is consider S = min(S+, S-) = S = min(5, 3) = 3

here the  n =9 is less than 20 then we use small sample test.

the test statistics is calculated as P-Value = P(S ≤ S) 

where S Has binomial distribution.

Note that   we consider the null hypothesis is true then on the basis of postulated value of median  we expect that the value of variable is greater than median mean we get plus sign then the number of plus sign it consider as success and   number of minus sign is consider as failure approximately equal. then the distribution of sign is binomial distribution with parameter (n, p=0.5). for simplicity we consider smaller number of sign. that means the number of plus sign is less than number of minus sign then plus sign is success and minus is failure. similarly if minus signs are less than number of plus sign then minus sign is success and number of plus sign is failure.

P-Value = P(S ≤ 3) =0.2539


The P-Value is compared with α% level of significance.

here  P-Value = 0.2539 is greater  then 5% level of significance = 0.05 hence we Accept null hypothesis, that means the given sample drawn from population having mean (Median) is 150.






Comments

Post a Comment

Popular posts from this blog

MCQ'S based on Basic Statistics (For B. Com. II Business Statistics)

    (MCQ Based on Probability, Index Number, Time Series   and Statistical Quality Control Sem - IV)                                                            1.The control chart were developed by ……         A) Karl Pearson B) R.A. fisher C) W.A. Shewhart D) B. Benjamin   2.the mean = 4 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9   3.the mean = 3 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9 4. If sampl...

Measures of Central Tendency :Mean, Median and Mode

Changing Color Blog Name  Measures of Central Tendency  I. Introduction. II. Requirements of good measures. III. Mean Definition. IV . Properties  V. Merits and Demerits. VI. Examples VII.  Weighted Arithmetic Mean VIII. Median IX. Quartiles I. Introduction Everybody is familiar with the word Average. and everybody are used the word average in daily life as, average marks, average of bike, average speed etc. In real life the average is used to represent the whole data, or it is a single figure is represent the whole data. the average value is lies around the centre of the data. consider the example if we are interested to measure the height of the all student and remember the heights of all student, in that case there are 2700 students then it is not possible to remember the all 2700 students height so we find out the one value that represent the height of the all 2700 students in college. therefore the single value represent ...

Business Statistics Notes ( Meaning, Scope, Limitations of statistics and sampling Methods)

  Business Statistics Paper I Notes. Welcome to our comprehensive collection of notes for the Business Statistics!  my aim is to provided you  with the knowledge you need as you begin your journey to comprehend the essential ideas of this subject. Statistics is a science of collecting, Presenting, analyzing, interpreting data to make informed business decisions. It forms the backbone of modern-day business practices, guiding organizations in optimizing processes, identifying trends, and predicting outcomes. I will explore several important topics through these notes, such as: 1. Introduction to Statistics. :  meaning definition and scope of  Statistics. 2. Data collection methods. 3. Sampling techniques. 4. Measures of  central tendency : Mean, Median, Mode. 5. Measures of Dispersion : Relative and Absolute Measures of dispersion,  Range, Q.D., Standard deviation, Variance. coefficient of variation.  6.Analysis of bivariate data: Correlation, Regr...

Classification, Tabulation, Frequency Distribution, Diagrams & Graphical Presentation.

Business Statistics I    Classification, Tabulation, Frequency Distribution ,  Diagrams & Graphical Presentation. In this section we study the following point : i. Classification and it types. ii. Tabulation. iii. Frequency and Frequency Distribution. iv. Some important concepts. v. Diagrams & Graphical Presentation   I. Classification and it's types:        Classification:- The process of arranging data into different classes or groups according to their common  characteristics is called classification. e.g. we dividing students into age, gender and religion. It is a classification of students into age, gender and religion.  Or  Classification is a method used to categorize data into different groups based on the values of specific variable.  The purpose of classification is to condenses the data, simplifies complexities, it useful to comparison and helps to analysis. The following are some criteria to classi...

Measures of Dispersion : Range , Quartile Deviation, Standard Deviation and Variance.

Measures of Dispersion :  I.  Introduction. II. Requirements of good measures. III. Uses of Measures of Dispersion. IV.  Methods Of Studying Dispersion:     i.  Absolute Measures of Dispersions :             i. Range (R)          ii. Quartile Deviation (Q.D.)          iii. Mean Deviation (M.D.)         iv. Standard Deviation (S. D.)         v. Variance    ii.   Relative Measures of Dispersions :              i. Coefficient of Range          ii. Coefficient of Quartile Deviation (Q.D.)          iii. Coefficient of Mean Deviation (M.D.)         iv. Coefficient of Standard Deviation (S. D.)         v. Coefficien...

Basic Concepts of Probability and Binomial Distribution , Poisson Distribution.

 Probability:  Basic concepts of Probability:  Probability is a way to measure hoe likely something is to happen. Probability is number between 0 and 1, where probability is 0 means is not happen at all and probability is 1 means it will be definitely happen, e.g. if we tossed coin there is a 50% chance to get head and 50% chance to get tail, it can be represented in probability as 0.5 for each outcome to get head and tail. Probability is used to help us taking decision and predicting the likelihood of the event in many areas, that are science, finance and Statistics.  Now we learn the some basic concepts that used in Probability:  i) Random Experiment OR Trail: A Random Experiment is an process that get one or more possible outcomes. examples of random experiment include tossing a coin, rolling a die, drawing  a card from pack of card etc. using this we specify the possible outcomes known as sample pace.  ii)Outcome: An outcome is a result of experi...

Statistical Inference I ( Theory of estimation : Efficiency)

🔖Statistical Inference I ( Theory of estimation : Efficiency)  In this article we see the  terms:  I. Efficiency. II. Mean Square Error. III. Consistency. 📚 Efficiency:  We know that  two unbiased estimator of parameter gives rise to infinitely many unbiased estimators of parameter. there if one of parameter have two estimators then the problem is to choose one of the best estimator among the class of unbiased estimators. in that case we need to some other criteria to to find out best estimator. therefore, that situation  we check the variability of that estimator, the measure of variability of estimator T around it mean is Var(T). hence If T is an Unbiased estimator of parameter then it's variance gives good precision. the variance is smaller then it give's greater precision. 📑 i. Efficient estimator: An estimator T is said to be an Efficient Estimator of 𝚹, if T is unbiased estimator of    𝛉. and it's variance is less than any other estima...

The Power of Statistics: A Gateway to Exciting Opportunities

  My Blog The Power of Statistics: A Gateway to Exciting Opportunities     Hey there, future statistician! Ever wondered how Netflix seems to know exactly what shows you'll love, how sports teams break down player performance, or how businesses figure out their pricing strategies? The answer is statistics—a fascinating field that helps us make sense of data in our everyday lives. Let's dive into why choosing statistics for your B.Sc. Part First can lead you to some exciting opportunities.     Why Statistics Matters in Everyday Life     From predicting election outcomes and analyzing social media trends to understanding consumer behavior and optimizing public transport routes, statistics are crucial. It's the backbone of modern decision-making, helping us sift through complex data to uncover meaningful insights that drive innovation and progress.   The Role of Statistics in Future Opportunities ...

Statistical Inference I ( Theory of Estimation) : Unbiased it's properties and examples

 📚Statistical Inference I Notes The theory of  estimation invented by Prof. R. A. Fisher in a series of fundamental papers in around 1930. Statistical inference is a process of drawing conclusions about a population based on the information gathered from a sample. It involves using statistical techniques to analyse data, estimate parameters, test hypotheses, and quantify uncertainty. In essence, it allows us to make inferences about a larger group (i.e. population) based on the characteristics observed in a smaller subset (i.e. sample) of that group. Notation of parameter: Let x be a random variable having distribution function F or f is a population distribution. the constant of  distribution function of F is known as Parameter. In general the parameter is denoted as any Greek Letters as θ.   now we see the some basic terms :  i. Population : in a statistics, The group of individual under study is called Population. the population is may be a group of obj...