Skip to main content

Run test

 ONE-SAMPLE TESTS

RUN TEST

One of the fundamental assumptions of the parametric test is that the observed data are random and test statistic and the subsequent analysis are based on this assumption. It is always better to check whether this assumption is true or not.

A very simple tool for checking this assumption is run test. This section is devoted to throw light on the run test. Before discussing the run test first we have to explain what we mean by a “run”.

A run in observations, is defined as a sequence of letters or symbols of one kind, immediately preceded and succeeded by letters of other kind or no letters. For example a sequence of two letters H and T as given below:

HHTHTTTHTHHHTTT                                                                                                            

In this sequence, we start with first letter H and go up to other kind of letter, that is, T. In this way, we get first run of two H’s. Then we start with this T and go up to other kind of letter, that is, H. Then we get a run of one T and so on and finally a run of three T’s. In all, we see that there are eight runs. And it is denoted as r=8. it is shown as below.

HH T H TTT H  T HHH TTT                                                                                                  1   2   3   4     5   6  7         8

Under the run test we Judged randomness of observations by using number of runs in the observed sequence. Too few runs indicates that there is some clustering or trend and too large runs indicates that there is some kind of repeated or cycles according to some patterns.

for example, the following sequence of H's and T's is obtained when tossing a coin 10 times.

HHHHHHTTTT                                                                                                                        1             2 

in the above sequence we see there is only 2 run's of 6 heads and 4 Tails, hence from this we say the similar item tend to cluster together, therefore such sequence of observation is not considered as random. now the anther sequence of 10 tosses is as following:

H T H T H T H T H T                                                                                                        1  2  3 4 5   6  7 8 9 10 .

this numbers indicates the runs for all observation  in this sequence there are 10runs of 5 runs of one head each  and 5 runs of one tail each. this sequence is cloud not be considered as random because there are too many runs they indicates the pattern.

T HHH TT H T HH T HHH TT H T H T                                                                          1    2      3   4  5  6    7   8       9 10 111213 

Here neither the number of runs too small nor too large, this type of sequence may be considered as random. 

Assumptions: 

Run test make the following assumptions:

(i) Observed data should be such that we can categorise the observations into two mutually exclusive types.

 (ii) The variable under study is continuous.                    

Procedure for RUN test

Let X1,X2,...,Xn be a set of no observations arranged in the order in which they occur. Generally, we are interested to test whether a population or a sample or a sequence is random or not. So here we consider only two-tailed case. Thus, we can take the null and alternative hypotheses as

H0 :  The observations are random

H1 :  The observations are not random [two-tailed test]

test consist following steps:

Step 1: First of all, we check the form of the given dada that the given data are in symbolical form such as sequence of H and T, A and B, etc. or in the numeric form. If the data in symbolical form then it is ok, but if data in numeric form then first we convert numeric data in symbolical form. For this, we calculate median of the given observations by using either of the following formula given below             

Median = size of [(n+1) /2 ] th observation.           

provided observations should be either in ascending or descending order of magnitude.

After that, we replace the observations which are above the median by  a symbol ‘A’ (say) and the observation which are below the median by a symbol ‘B’ (say) without altering the observed order. The observations which are equal to median are discarded form the analysis and let reduced size of the sample denoted by n.

 

Step 2: Counts number of times first symbol (A) occurs and denote it by n1

 

Step 3: Counts number of times second symbol (B) occurs and denote it by n2 where, n = n1+n2

 

Step 4: For testing the null hypothesis, the test statistic is the total number of runs so in this step we count total number of runs in the sequence of symbols and denote it by R.

 

Step 5: Obtain critical values of test statistic corresponding n1,n2  at α % level of significance under the condition that null hypothesis is true.  From the Table  of critical value for run test is used to obtain respectively lower (RL) and upper (RU) critical values of the number of runs for a given combination of n1 and n2 at 5% level of significance.

Note1: Generally, critical values for run test are available at 5% level of

significance so we test our hypotheses for 5% level of significance.

Step 6: Decision Rule:

To take the decision about null hypothesis, the test statistic is compared with the critical (tabulated) values.

 

If the observed number of runs(R) is either less than or equal to the lower critical value (RL) or greater than or equal to the upper critical value (RU), that is, if R < RL or R > RU then we reject the null hypothesis at 5% level of significance.

 

If R lies between  Rl and Ru , that is,  Rl < R> Ru, then we Accept

 null hypothesis at 5% level of significance     

Comments

Post a Comment

Popular posts from this blog

Basic Concepts of Probability and Binomial Distribution , Poisson Distribution.

 Probability:  Basic concepts of Probability:  Probability is a way to measure hoe likely something is to happen. Probability is number between 0 and 1, where probability is 0 means is not happen at all and probability is 1 means it will be definitely happen, e.g. if we tossed coin there is a 50% chance to get head and 50% chance to get tail, it can be represented in probability as 0.5 for each outcome to get head and tail. Probability is used to help us taking decision and predicting the likelihood of the event in many areas, that are science, finance and Statistics.  Now we learn the some basic concepts that used in Probability:  i) Random Experiment OR Trail: A Random Experiment is an process that get one or more possible outcomes. examples of random experiment include tossing a coin, rolling a die, drawing  a card from pack of card etc. using this we specify the possible outcomes known as sample pace.  ii)Outcome: An outcome is a result of experi...

Statistical Inference: Basic Terms and Definitions.

  📚📖 Statistical Inference: Basic Terms. The theory of estimation is of paramount importance in statistics for several reasons. Firstly, it allows researchers to make informed inferences about population characteristics based on limited sample data. Since it is often impractical or impossible to measure an entire population, estimation provides a framework to generalize findings from a sample to the larger population. By employing various estimation methods, statisticians can estimate population parameters such as means, proportions, and variances, providing valuable insights into the population's characteristics. Second, the theory of estimating aids in quantifying the estimates' inherent uncertainty. Measures like standard errors, confidence intervals, and p-values are included with estimators to provide  an idea of how accurate and reliable the estimates are. The range of possible values for the population characteristics and the degree of confidence attached to those est...

MCQ'S based on Basic Statistics (For B. Com. II Business Statistics)

    (MCQ Based on Probability, Index Number, Time Series   and Statistical Quality Control Sem - IV)                                                            1.The control chart were developed by ……         A) Karl Pearson B) R.A. fisher C) W.A. Shewhart D) B. Benjamin   2.the mean = 4 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9   3.the mean = 3 and variance = 2 for binomial r.v. x then value of n is….. A) 7 B) 10 C) 8 D)9 4. If sampl...

B. Com. -I Statistics Practical No. 1 Classification, tabulation and frequency distribution –I: Qualitative data.

  Shree GaneshA B. Com. Part – I: Semester – I OE–I    Semester – I (BASIC STATISTICS PRACTICAL-I) Practical: 60 Hrs. Marks: 50 (Credits: 02) Course Outcomes: After completion of this practical course, the student will be able to: i) apply sampling techniques in real life. ii) perform classification and tabulation of primary data. iii) represent the data by means of simple diagrams and graphs. iv) summarize data by computing measures of central tendency.   LIST OF PRACTICALS: 1. Classification, tabulation and frequency distribution –I: Qualitative data. 2. Classification, tabulation and frequency distribution –II : Quantitative data. 3. Diagrammatic representation of data by using Pie Diagram and Bar Diagrams. 4. Graphical representation of data by using Histogram, Frequency Polygon, Frequency Curve and     Locating Modal Value. 5. Graphical representation of data by using Ogive Curves and Locating Quartile Values....

Index Number

 Index Number      Introduction  We seen in measures of central tendency the data can be reduced to a single figure by calculating an average and two series can be compared by their averages. But the data are homogeneous then the average is meaningful. (Data is homogeneous means data in same type). If the two series of the price of commodity for two years. It is clear that we cannot compare the cost of living for two years by using simple average of the price of the commodities. For that type of problem we need type of average is called Index number. Index number firstly defined or developed to study the effect of price change on the cost of living. But now days the theory of index number is extended to the field of wholesale price, industrial production, agricultural production etc. Index number is like barometers to measure the change in change in economics activities.   An index may be defined as a " specialized  average designed to measure the...

Statistical Inference Practical: Point Estimation by Method of Moment

 

B. Com. I Practical No. 3 :Diagrammatic representation of data by using Pie Diagram and Bar Diagrams.

Practical No. 3 :Diagrammatic representation of data by using Pie Diagram and Bar Diagrams. Diagrammatic Presentation. We have observed the classification and tabulation method. We use this method to take a lot of information and make it fit into a small table. The reason we do this is to make the information more organized and easier to understand. Tabulation helps us arrange data neatly so that it's not messy and confusing. tabulation is a way to make big files of information look neat and tidy in a table.  but better and beautiful way to represent data using diagrams and graphs. the diagram and graph have some advantages because that used to visualise the data. that helps to understand and give information easily to any common man or any one, following are the some  advantages of diagram and graph.  I. Advantages i. Data Representation: Diagrams and graphs are excellent for presenting data visually, making trends, comparisons, and statistical information easier to...

Time Series

 Time series  Introduction:-         We see the many variables are changes over period of time that are population (I.e. population are changes over time means population increase day by day), monthly demand of commodity, food production, agriculture production increases and that can be observed over period of times known as time series. Time series is defined as a set of observation arranged according to time is called time series. Or a time Series is a set of statistical observation arnging chronological order. ( Chronological order means it is arrangements of variable according to time) and it gives information about variable.  Also we draw the graph of time series to see the behaviour of variable over time. It can be used of forecasting. The analysis of time series is helpful to economist, business men, also for scientist etc. Because it used to forecasting the future, observing the past behaviour of that variable or items. Also planning for future...

Method of Moment & Maximum Likelihood Estimator: Method, Properties and Examples.

 Statistical Inference I: Method Of Moment:   One of the oldest method of finding estimator is Method of Moment, it was discovered by Karl Pearson in 1884.  Method of Moment Estimator Let X1, X2, ........Xn be a random sample from a population with probability density function (pdf) f(x, θ) or probability mass function (pmf) p(x) with parameters θ1, θ2,……..θk. If μ r ' (r-th raw moment about the origin) then μ r ' = ∫ -∞ ∞ x r f(x,θ) dx for r=1,2,3,….k .........Equation i In general, μ 1 ' , μ 2 ' ,…..μ k ' will be functions of parameters θ 1 , θ 2 ,……..θ k . Let X 1 , X 2 ,……X n be the random sample of size n from the population. The method of moments consists of solving "k" equations (in Equation i) for θ 1 , θ 2 ,……..θ k to obtain estimators for the parameters by equating μ 1 ' , μ 2 ' ,…..μ k ' with the corresponding sample moments m 1 ' , m 2 ' ,…..m k ' . Where m r ' = sample m...