Run test

 ONE-SAMPLE TESTS

RUN TEST

One of the fundamental assumptions of parametric tests is that the observed data are random; the test statistic and the subsequent analysis are based on this assumption. It is always better to check whether this assumption holds.

A very simple tool for checking this assumption is the run test, which this section describes. Before discussing the run test, we first have to explain what we mean by a “run”.

A run in a sequence of observations is defined as a sequence of letters or symbols of one kind, immediately preceded and succeeded by letters of the other kind or by no letters. For example, consider the following sequence of two letters, H and T:

HHTHTTTHTHHHTTT                                                                                                            

In this sequence, we start with the first letter H and go on until we meet the other kind of letter, that is, T. In this way we get the first run, of two H’s. Then we start with this T and go on until we meet the other kind of letter, that is, H. This gives a run of one T, and so on, ending with a run of three T’s. In all, there are eight runs, which we denote as r = 8, as shown below.

HH T H TTT H T HHH TTT

Runs: (1) HH, (2) T, (3) H, (4) TTT, (5) H, (6) T, (7) HHH, (8) TTT
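The splitting described above can be sketched in Python. This is only an illustration: the function name `split_into_runs` is an assumption made here and does not come from the original text.

```python
from itertools import groupby

def split_into_runs(sequence):
    """Return the runs of a sequence as a list of strings, in order."""
    # groupby groups consecutive identical symbols, which is exactly a run.
    return ["".join(group) for _, group in groupby(sequence)]

runs = split_into_runs("HHTHTTTHTHHHTTT")
print(runs)       # ['HH', 'T', 'H', 'TTT', 'H', 'T', 'HHH', 'TTT']
print(len(runs))  # 8
```

The length of the returned list is the number of runs, r = 8, for the sequence above.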

In the run test, we judge the randomness of the observations by the number of runs in the observed sequence. Too few runs indicate some clustering or trend, and too many runs indicate some kind of repeated or cyclic pattern.

For example, the following sequence of H's and T's is obtained when tossing a coin 10 times:

HHHHHH TTTT

Runs: (1) HHHHHH, (2) TTTT

In the above sequence there are only 2 runs: one run of 6 heads and one run of 4 tails. Similar items tend to cluster together, so such a sequence of observations is not considered random. Now consider another sequence of 10 tosses:

H T H T H T H T H T

Runs: (1) H, (2) T, (3) H, (4) T, (5) H, (6) T, (7) H, (8) T, (9) H, (10) T

In this sequence there are 10 runs: 5 runs of one head each and 5 runs of one tail each. This sequence also cannot be considered random, because there are too many runs, which indicates a regular alternating pattern.

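Now consider a third sequence of H's and T's:

T HHH TT H T HH T HHH TT H T H T

Runs: (1) T, (2) HHH, (3) TT, (4) H, (5) T, (6) HH, (7) T, (8) HHH, (9) TT, (10) H, (11) T, (12) H, (13) T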

Here the number of runs (13) is neither too small nor too large, so this type of sequence may be considered random.
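To give "too few" and "too many" a reference point, the sketch below compares the observed number of runs in the three sequences with the mean number of runs expected under randomness, 1 + 2·n1·n2/(n1 + n2). This formula is the standard result for a two-symbol sequence and is not stated in the original text; the function names are assumptions made for illustration.

```python
from collections import Counter
from itertools import groupby

def count_runs(sequence):
    """Number of runs: one per group of consecutive identical symbols."""
    return sum(1 for _ in groupby(sequence))

def expected_runs(sequence):
    """Mean number of runs under randomness: 1 + 2*n1*n2 / (n1 + n2)."""
    n1, n2 = Counter(sequence).values()  # assumes exactly two distinct symbols
    return 1 + 2 * n1 * n2 / (n1 + n2)

for seq in ("HHHHHHTTTT", "HTHTHTHTHT", "THHHTTHTHHTHHHTTHTHT"):
    print(seq, count_runs(seq), round(expected_runs(seq), 1))
```

The observed counts are 2, 10 and 13 against expected values of about 5.8, 6.0 and 10.9. The first two sequences are far from their expected values, while the third is comparatively close. Whether a difference is significant is decided by the critical values used in the procedure, not by this comparison alone.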

Assumptions: 

The run test makes the following assumptions:

(i) Observed data should be such that we can categorise the observations into two mutually exclusive types.

 (ii) The variable under study is continuous.                    

Procedure for RUN test

Let X1, X2, ..., Xn be a set of n observations arranged in the order in which they occur. Generally, we are interested in testing whether a population, a sample or a sequence is random or not, so here we consider only the two-tailed case. Thus, we can take the null and alternative hypotheses as

H0 :  The observations are random

H1 :  The observations are not random [two-tailed test]

The test consists of the following steps:

Step 1: First of all, we check the form of the given data, that is, whether the data are in symbolic form, such as a sequence of H and T, A and B, etc., or in numeric form. If the data are in symbolic form, nothing more is needed; if the data are in numeric form, we first convert them to symbolic form. For this, we calculate the median of the given observations by using either of the following formulae:

Median = size of [(n+1)/2]th observation, if n is odd

Median = average of the sizes of the (n/2)th and [(n/2)+1]th observations, if n is even

provided the observations are arranged in either ascending or descending order of magnitude.

After that, we replace each observation above the median by a symbol ‘A’ (say) and each observation below the median by a symbol ‘B’ (say), without altering the observed order. Observations equal to the median are discarded from the analysis, and the reduced sample size is denoted by n.
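Step 1 can be sketched as follows. The data values and the function name `to_symbols` are hypothetical and chosen only for illustration. `statistics.median` from the Python standard library returns the middle value for odd n and the average of the two middle values for even n.

```python
from statistics import median

def to_symbols(values):
    """Convert numeric data to 'A' (above median) / 'B' (below median).

    Observations equal to the median are dropped; the order is preserved.
    """
    m = median(values)
    return "".join("A" if x > m else "B" for x in values if x != m)

data = [12, 15, 7, 9, 11, 20, 5, 14, 18]  # hypothetical observations
print(median(data))      # 12
print(to_symbols(data))  # ABBBABAA
```

Here one observation equals the median and is discarded, so the reduced sample size is n = 8.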

 

Step 2: Count the number of times the first symbol (A) occurs and denote it by n1.

 

Step 3: Count the number of times the second symbol (B) occurs and denote it by n2, where n = n1 + n2.

 

Step 4: For testing the null hypothesis, the test statistic is the total number of runs. So in this step we count the total number of runs in the sequence of symbols and denote it by R.
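Steps 2 to 4 amount to three counts on the symbol sequence. A minimal sketch, assuming the symbols are the letters 'A' and 'B' and using a hypothetical helper name `run_test_counts`:

```python
def run_test_counts(symbols, first="A", second="B"):
    """Return (n1, n2, R) for a sequence made of two kinds of symbols."""
    if not symbols:
        return 0, 0, 0
    n1 = symbols.count(first)   # Step 2: occurrences of the first symbol
    n2 = symbols.count(second)  # Step 3: occurrences of the second symbol
    # Step 4: a new run starts at the first symbol and at every change of symbol.
    changes = sum(prev != cur for prev, cur in zip(symbols, symbols[1:]))
    return n1, n2, 1 + changes

print(run_test_counts("AABABBBABAAABBB"))  # (7, 8, 8)
```

The example sequence is the earlier H/T sequence with H written as A and T written as B, so R = 8 agrees with the count obtained there.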

 

Step 5: Obtain the critical values of the test statistic corresponding to n1 and n2 at the α% level of significance, under the condition that the null hypothesis is true. The table of critical values for the run test gives the lower (RL) and upper (RU) critical values of the number of runs for a given combination of n1 and n2 at the 5% level of significance.

Note 1: Generally, critical values for the run test are available at the 5% level of significance, so we test our hypotheses at the 5% level of significance.
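When a printed table is not at hand, the critical values can be computed from the exact distribution of the number of runs under the null hypothesis. The sketch below is not part of the original procedure and rests on two assumptions: the standard combinatorial formula for P(R = r), and the convention that each tail has probability at most α/2. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def runs_pmf(n1, n2):
    """Exact P(R = r) under H0 for n1 symbols of one kind and n2 of the other."""
    total = comb(n1 + n2, n1)  # number of equally likely arrangements
    pmf = {}
    for r in range(2, n1 + n2 + 1):
        k, odd = divmod(r, 2)
        if odd:   # r = 2k + 1 runs
            ways = (comb(n1 - 1, k) * comb(n2 - 1, k - 1)
                    + comb(n1 - 1, k - 1) * comb(n2 - 1, k))
        else:     # r = 2k runs
            ways = 2 * comb(n1 - 1, k - 1) * comb(n2 - 1, k - 1)
        if ways:
            pmf[r] = Fraction(ways, total)
    return pmf

def critical_values(n1, n2, alpha=0.05):
    """Largest RL with P(R <= RL) <= alpha/2 and smallest RU with P(R >= RU) <= alpha/2.

    Either value is None when no such critical value exists (very small samples).
    """
    pmf = runs_pmf(n1, n2)
    lower, cum = None, 0
    for r in sorted(pmf):
        cum += pmf[r]
        if cum > alpha / 2:
            break
        lower = r
    upper, cum = None, 0
    for r in sorted(pmf, reverse=True):
        cum += pmf[r]
        if cum > alpha / 2:
            break
        upper = r
    return lower, upper

print(critical_values(7, 8))  # (4, 13)
```

For n1 = 7 and n2 = 8 this gives RL = 4 and RU = 13. A published table may follow a slightly different convention, so the tabulated values should take precedence where they differ.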

Step 6: Decision Rule:

To take the decision about the null hypothesis, the test statistic is compared with the critical (tabulated) values.

 

If the observed number of runs (R) is either less than or equal to the lower critical value (RL) or greater than or equal to the upper critical value (RU), that is, if R ≤ RL or R ≥ RU, then we reject the null hypothesis at the 5% level of significance.

 

If R lies between RL and RU, that is, RL < R < RU, then we do not reject (accept) the null hypothesis at the 5% level of significance.
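The decision rule can be written as a small function. This is a sketch only: the function name is hypothetical, and the critical values passed in the example are placeholders for illustration, so the actual RL and RU must be taken from the table for the given n1 and n2.

```python
def run_test_decision(runs, r_lower, r_upper):
    """Two-tailed decision rule of the run test at the tabulated level."""
    if runs <= r_lower or runs >= r_upper:
        return "reject H0: the observations are not random"
    return "do not reject H0: the observations may be considered random"

# Placeholder critical values for illustration only; look them up in the table.
print(run_test_decision(8, r_lower=4, r_upper=13))
print(run_test_decision(2, r_lower=4, r_upper=13))
```

With these placeholder values, R = 8 lies strictly between the two critical values, so the null hypothesis is not rejected, whereas R = 2 leads to rejection.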
