SlideShare a Scribd company logo
1 of 19
Download to read offline
VTU Edusat Programme – 16
Subject : Engineering Mathematics
Sub Code: 10MAT41
UNIT – 8: Sampling Theory
Dr. K.S.Basavarajappa
Professor & Head
Department of Mathematics
Bapuji Institute of Engineering and of Technology
Davangere-577004
Email: ksbraju@hotmail.com
Statistical Inference:
It is necessary to draw some valid and reasonable conclusions
concerning a large mass of individuals or things. Every individual
or the entire group is known as population. Small part of this
population is known as a sample. The process of drawing some
valid and reasonable conclusion about the entire population is
Statistical Inference.
Random sampling:
A large collection of individuals or attributed or numerical
data can be understood as population or universe.
A finite subset of the universe is called a sample. The
number of individuals in a sample is called a Sample Size (n).
Sampling distribution:
For every sample size (n) we can compute quantities like
mean, median, standard deviation etc., obviously these will not be
the same.
Suppose we group these characteristics according to their
frequencies, the frequency distributions so generated are called
Sampling Distributions.
The sampling distribution of large samples are assumed to be
a normal distribution. The standard deviation of a sampling
distribution is also called as the standard error (SE).
Testing of Hypothesis
Making certain assumption to arrive at a decision regarding
the population a sample population will be referred to as
hypothesis
The hypothesis formulated for the purpose of its rejection
under the assumption that the true is called as the null hypothesis
denoted as H0 .
Errors
In a test process there can be four possible situations lead to
the two types of errors and same is tabulated as follows:
Accepting the
hypothesis
Rejecting the
hypothesis
Hypothesis is true Correct decision Wrong decision
Type I error
Hypothesis is false Wrong decision
Type II error
Correct decision
In order to minimize both these types of errors we need to increase
the sample size.
Significance level:
The probability level below which leads to the hypothesis is
known as the significance level. This probability is conventionally
fixed at 0.05 or 0.01 i.e., 5% or 1%
Therefore rejecting hypothesis at 1% level of significance,
implies that at 5% level of significance, there may be errors of
either types (Type I or II) is 0.05.
TESTS OF SIGNIFICANCE AND CONFIDENCE
INTERVALS
The process which helps us to decide about the acceptance or
rejection of the hypothesis is called as the test of significance.
Suppose that we have a normal population with mean μ and S
D as		ߪ. If xത	 is the sample mean of a random sample size (n), the
quantity “t” defined by
‫ܜ‬ =
ሺ‫ܠ‬തି	ࣆሻ√࢔
࣌
(1)
is called as the standard normal variate (SNV) whose xത = 0	, σ =
1
From the table of the normal areas, we find that 95% of the
area lies between
t = -1.96 and t = 1.96
Further 5% level of significance is denoted by t0.05, therefore,
−1.96 ≤	
ሺ୶തି	ఓሻ√௡
ఙ
	≤ 1.96
ఙ
√௡	
൫– 1.96൯ ≤ xത − 	ߤ		 ≤
ఙ
√௡	
1.96 (2)
ߤ	 ≤ xത +	
ߪ
√݊	
ሺ1.96ሻ	and				xത −	
ߪ
√݊	
ሺ1.96ሻ ≤ ߤ
∴		‫ܠ‬ത −	
࣌
√࢔	
ሺ૚. ૢ૟ሻ ≤ ࣆ	 ≤	‫ܠ‬ത +	
࣌
√࢔	
ሺ૚. ૢ૟ሻ
(3)
Similarly from the table of the normal areas 99% of the area
lies between
-2.58 and 2.58. This is equivalent to the form,
∴		‫ܠ‬ത −	
࣌
√࢔	
ሺ૛. ૞ૡሻ ≤ ૄ	 ≤	‫ܠ‬ത +	
࣌
√࢔	
ሺ૛. ૞ૡሻ
(4)
Therefore representation (3) is that 95% confidence interval
and Representation (3) is the 99% confidence level.
Graph:
Tests of significance for large samples:
Let N be the large sample having n members. Let p and q
denote number of success and failure respectively, then p+ q = 1.
By binomial distribution, N (p + q) n
denotes the frequencies of
samples. Therefore N (p + q) n
denotes the sampling distribution of
the number of successes in the sample.
We know that by binomial distribution ‫ܠ‬ത = ‫ܘܖ‬ and ો =	ඥ‫ܙܘܖ‬
then,
• Mean proportion of success =
୬୮
୬
= p
• S.D.(or S.E) proportion of success =
ඥ‫ܙܘܖ‬
‫ܖ‬
= ටቀ
‫ܙܘ‬
‫ܖ‬
ቁ
Let ‘x’ be the observed number of successes in a sample size
(n) and ૄ = ‫	.ܘܖ‬
The standard normal variate Z is defined as,
Z =
୶ିஜ
஢
=
୶ି୬୮
ඥ୬୮୯
If Z ≤ 2.58, we conclude that the differences is highly significant
and reject the hypothesis. Then p	 ± 2.58ටቀ
‫ܙܘ‬
‫ܖ‬
ቁ be the probable
limits of Z.
p − 2.58ටቀ
‫ܙܘ‬
‫ܖ‬
ቁ			≤ ‫	܈‬ ≤ 	p + 2.58ටቀ
‫ܙܘ‬
‫ܖ‬
ቁ			
For a normal distribution, only 5% of members lie outside
μ	 ± 1.96	σ while only 1% of the members lie outside μ	 ± 2.58	σ
If x be the observed number of successes in the sample and Z is the
standard normal variate the Z =
୶ିஜ
஢
=
୶ି୬୮
ඥ୬୮୯
We have the following test of significance
• If Z < 1.96, difference between the observed and expected
number of successes is not significant.
• If Z > 1.96 difference is significant at 5% level of
significance.
• If Z > 2.58, difference is significant at 1% level of
significance.
Example:
A coin is tossed 1000 times and it turns up head 540 times , decide
on the hypothesis is un biased .
Solution: Let	us	suppose	that		the	coin	is		unbiased
P = probability	of	getting	a	head	in	one	toss = 1/2
Since p + q = 1, q =
ଵ
ଶ
Expected number of heads in 1000 tosses	ൟ = np
= 1000 ×
ଵ
ଶ
= 500
‫	݂݋	ݎܾ݁݉ݑܰ	݈ܽݑݐܿܣ‬ℎ݁ܽ݀‫ݏ‬ = 540 = ‫ݔ‬
‫ݐ‬ℎ݁݊	‫ݔ‬ − ݊‫݌‬ = 540 − 500 = 40
ܿ‫ݖ	ݎ݁݀݅ݏ݊݋‬ =
‫ݔ‬ − ݊‫݌‬
ඥ݊‫ݍ݌‬
=
40
ට1000 ×
ଵ
ଶ
×
ଵ
ଶ
= 2.53 < 2.58
∴ ‫ݖ‬ = 2.53 < 2.58 ⇉ 99%	ሺܷ݊݀݁‫ݎ‬ሻ
⇉ ܶℎ݁	‫݀݁ݏܾܽ݅݊ݑ	ݏ݅	݊݅݋ܥ‬
Example:
A survey was conducted in one locality of 2000 families by
selecting a sample size 800. It was revealed that 180 families were
illiterates. Find the probable limits of the literate families in a
population of 2000.
Solution: Probability of illiterate families = ܲ =
ଵ଼଴
଼଴଴
= 0.225
Also ‫ݍ‬ = 1 − ܲ = 1 − 0.225 = 0.775
Probability limits of illiterate families = ܲ ± 2.58ට
௣௤
௡
= 0.225 ± 2.58ඨ
ሺ0.225ሻሺ0.775ሻ
800
= 0.187	ܽ݊݀	0.263
Therefore Probable limits of illiterate families in a sample of 2000
is
= 0.187ሺ2000ሻ	ܽ݊݀		0.263ሺ2000ሻ
= 374 and 526
Example:
A die was thrown 9000 times and a throw of 5 or 6 was
obtained 3240 times. On the assumption of random throwing, do
the data indicate an unbiased die.
Solution:
Suppose ‘the die is unbiased’
then Probability of throwing 5 or 6 with one die
= p(5) or p(6) = p(5) + p(6) = (1/6 ) + (1/6) = 1/3
q = 1-p = 1- (1/3) = 2/3
Then		expected		number	of	successes	ሺnpሻ =
1
3
× 9000 = 3000
= μ	ሺsayሻ
But	the	observed	value	of	successes = 3240
Excess	of	observed	value	of	successes = x − np = 3240 − 3000
= 240
Here	n = 9000, p =
1
3
, q =, np = 3000
∴ S. D = ඥnpq = ඨ9000 × ൬
1
3
൰ × ൬
2
3
൰ = 44.72
∴ Z	ሺSNVሻ =	
x − np
ඥnpq
=	
240
44.72
= 5.36 ≈ 5.4 > 2.58
⇉ Highly	Signiϐicant
⇉ hypotheses	is	to	be	rejected	at	1%	level	of	Signiϐicance	
∴ die	is	biased.
Example:
A biased dice is tossed 500 times a particular appears120 times.
Find the 95% confidence limit of obtaining the value. Also find the
standard error of proportion of success (Use binomial distribution).
Solution:
Let p =
ଵଶ଴
ହ଴଴
= 0.24
then q = 0.76, n = 500.
Standard error = 9.55
Then mean proportion of success = np/n = p = 0.24 and
mean proportion of S. E =	ඥnpq /n= 0.019
then 95% confidence interval for proportion of success is 	
	nሺ0.203ሻ ≤ np	 ≤ nሺ0.277ሻ
⇉ 500ሺ0.203ሻ ≤ np	 ≤ 500ሺ0.277ሻ
101 ≤ np	 ≤ 138
The interval is [101 , 138 ].
We say that with 95% confidence that out of 500 times always we
get particular number between 101 and 138 times.
Degrees of freedom (d.f )
It is the number of values in a set which may be set
arbitrarily.
d.f = n -1 for n number of observations
d.f = n -2 for n -1 number of observations
d.f = n -3 for n - 2 number of observations etc.
Ex: for 25 observations we have 24 d.f
Student’s t distribution
It is to test the significance of a sample mean for a normal
population where the population S is not known.
It is given by 	‫ܜ‬ =
ሺ‫ܠ‬തି	ࣆሻ√࢔
࢙
where
‫̅ݔ‬ = ݉݁ܽ݊ =
∑ ௫
௡
,	ߤ = ‫	,	݊ܽ݁݉	݊݋݅ݐ݈ܽݑ݌݋݌‬
‫ݏ‬ଶ
=	
1
ሺ݊ − 1ሻ
	෍ሺ‫ݔ‬ − ‫̅ݔ‬ሻଶ
We need to test the hypothesis, whether the sample mean ‫̅ݔ‬ differs
significantly from the population mean ߤ.
If the calculated value of t i.e. |‫ݐ‬|	is greater than the table value of t
say t 0.05, we say that the difference between ‫̅ݔ‬ and ߤ is significant
at 5% level.
If 	|‫ݐ‬| > t 0.01, the difference is significant at 1% level.
Note: 95% confidence limits for the population mean ߤ. Is
‫̅ݔ‬ 	± ሺ
௦
√௡
ሻ
Example:
A machine is expected to produce nails of length 3 inches. A
random sample of 25 nails gave an average length of 3.1 inches
with standard deviation 0.3 can it be said that the machine is
producing nail as per the specification.(value of students t 0.05 for
24 d.f is 2.064 )
Solution:
Given ߤ = 3 , ‫̅ݔ‬ = 3.1 , n = 25 , s = 0.3
‫ܜ‬ =
ሺ‫ܠ‬തି	ࣆሻ√࢔
࢙
= 1.67 < 2.064
∴ The	hypothesis	that	the	machine	is	producing	nails	as	per	
speciϐication	is	accepted	at	5%	level	of		signiϐicance	.
Example:
Ten individuals are chosen at random from a population and
their heights in inches are found to be 63, 63, 66, 67, 68, 69, 70,
71,71, test the hypothesis that the mean height of the universe is 66
inches (value of t 0.05 = 2.262 for 9 d.f).
Solution:
We have ߤ = 66 , n = 10, ∴ d.f = 9
‫̅ݔ‬ =
∑ ௫
௡
=
଺଻଼
ଵ଴
= 67.8
‫ݏ‬ଶ
=
ଵ
௡ିଵ
∑ሺ‫ݔ‬ − ‫̅ݔ‬ሻଶ
	=
ଵ
ଽ
	ሾሺ63 − 67.8ሻଶ
+	… … +
ሺ71 − 67.8ሻଶሿ =9.067
S = 3.011
We have ‫ܜ‬ =
ሺ‫ܠ‬തି	ࣆሻ√࢔
࢙
=
ሺ૟ૠ.ૡି૟૟	ሻ
૜.૙૚૚
√૚૙ = 1.89 < 2.262	 (given in
the problem)
⇉ The hypothesis is accepted at 5% level of significance.
Example:
Eleven school boys were given a test in drawing. They were
given a month’s further tution and a second test of equal difficulty
was healed at the end of it do the marks give evidence that the
students have benefitted by extra coaching (t 0.05 for d.f = 10) =
2.228
Boys 1 2 3 4 5 6 7 8 9 10 11
Marks
test 1
23 20 19 21 18 20 18 17 23 16 19
Marks
test 2
24 19 22 18 20 22 20 20 23 20 17
Chi-Square distribution: (࣑૛
)
It provides a measure of correspondence between the
Theoretical frequencies and observed frequencies
Let Oi ( i = 1 , 2 , ….. n ) – observed frequencies
Ei ( i = 1 , 2 , ….. n ) – estimated frequencies
The quantity ࣑૛
(chi square) distribution is defined as
࣑૛
= ∑
ሺ୓౟ି	୉౟	ሻమ
	୉౟	
௡
௜ୀଵ ; degrees of freedom = n-1
Chi – square test as a test of goodness of fit:
࣑૛
test helps us to test the goodness of fit of the distributions
such as Binomial, Poisson and Normal distributions.
If the calculated value of ࣑૛
is less than the table value of ࣑૛
at a specified level of significance, the hypothesis is accepted.
Otherwise the hypothesis is rejected.
Example:
A die is thrown 264 times and the number appearing on the
face (x) follows the following frequency distribution
x 1 2 3 4 5 6
f 40 32 28 58 54 60
Calculate the value of ࣑૛
Solution:
Frequencies in the given table are the observed frequencies.
Assuming that the die is unbiased the expected number of
frequencies for the numbers 1, 2, 3,4,5,6 to appear on the face is
264/6 = 44 each
Then the data is as follows
No. on the
die
1 2 3 4 5 6
Observed
frequency(Oi)
40 32 28 58 54 60
Expected
frequency(Ei)
44 44 44 44 44 44
࣑૛
= ∑
ሺ୓౟ି	୉౟	ሻమ
	୉౟	
଺
௜ୀଵ
࣑૛
= 22
Example:
Five dice were thrown 96 times and the numbers 1 or 2 or 3
appearing on the face of the die follows the following frequency
distribution
No. of
dice
showing 1
or 2 or 3
5 4 3 2 1 0
Frequency 7 19 35 24 8 3
Test the hypothesis that the data follows a binomial
distribution.
Solution:
Probability of a single die throwing 1 or 2 or 3 is
P = 1/6+1/6+1/6 = ½
q = ½
Binomial distribution to fit the data
Nሺq + pሻ୬
= 96 ൤
1
2
+
1
2
൨
ହ
=96 ቀ
ଵ
ଶ
ቁ
ହ
,96 × 5ܿଵ ×
ቀ
ଵ
ଶ
ቁ
ହ
, … … 96 ቀ
ଵ
ଶ
ቁ
ହ
∴ New table of values are
No. on the
die 12 or 3
5 4 3 2 1 0
Observed
frequency(Oi)
7 19 35 24 8 3
Expected
frequency(Ei)
3 15 30 30 15 3
࣑૛
= ∑
ሺ୓౟ି	୉౟	ሻమ
	୉౟	
ହ
௜ୀ଴
࣑૛
0.05 = 11.7	> 11.07ሺ	table	valueሻ
Hence the hypothesis that the data follows a binomial
distribution is rejected.
Example:
Fit the Poisson distribution for the following data and test the
goodness of fit given that X2
0.05 = 7.815 for degrees of freedom =
4
Solution:
Poisson distribution to fit the data = Np(x) = Ne-m
mx
/x!
m = np =
∑ ௙௫
ே
=	
ଵ
ଶ
x 0 1 2 3 4
f 122 60 15 2 1
Ne-m
mx
/x! = 200 ቂ
௘షభ/మሺଵ/ଶሻೣ
௫!
ቃ where x = 0, 1, 2, 3, 4
= 121, 61, 15, 3, 0
Therefore new table is
࣑૛
= ∑
ሺ୓౟ି	୉౟	ሻమ
	୉౟	
ସ
௜ୀ଴
࣑૛
= 0.025 < ࣑૛
0.05 = 7.815
Therefore the fitness is considered good.
∴ The hypothesis that the fitness is good can be accepted.
Example:
The number of accidents per day (x) as recorded in a textile
industry over a period of 400 is given below. Test the
goodness of fit in respect of Poisson distribution of fit to the
given data
x 0 1 2 3 4
f(oi) 122 60 15 2 1
Ei 121 61 15 3 0
x 0 1 2 3 4 5
f 173 168 37 18 3 1
Solution:
Poisson distribution to fit the data = Np(x) = Ne-m
mx
/x!
m = np =
∑ ௙௫
ே
= 0.7825
Therefore new table is
࣑૛
= ∑
ሺ୓౟ି	୉౟	ሻమ
	୉౟	
ହ
௜ୀ଴
࣑૛
= 12.297 ≈ 12.3 > ࣑૛
0.05 = 9.49
Therefore the fitness is not good
∴ The hypothesis that the fitness is good is rejected.
Example:
In experiments of pea breeding, the following frequencies of seeds
were obtained
Round &
yellow
Wrinkled &
yellow
Round &
green
Wrinkled &
green
total
315 101 108 32 556
Theory predicts that the frequency should be in proportion 9:3:3:1.
Examine the correspondence between theory and experiment.
x 0 1 2 3 4 5
f(oi) 173 168 37 18 3 1
Ei 183 143 56 15 3 0
Solution:
Corresponding frequencies are 313, 104, 104, 35.
࣑૛
= 0.51 < ࣑૛
0.05 = 7.815
⟹ The calculated value of ࣑૛
is much less than ࣑૛
0.05
⟹ There exists agreement between theory and experiment.

More Related Content

What's hot

Central limit theorem
Central limit theoremCentral limit theorem
Central limit theoremVijeesh Soman
 
Normal curve
Normal curveNormal curve
Normal curveLori Rapp
 
Function overloading(c++)
Function overloading(c++)Function overloading(c++)
Function overloading(c++)Ritika Sharma
 
Interpolation and its applications
Interpolation and its applicationsInterpolation and its applications
Interpolation and its applicationsRinkuMonani
 
6.5 central limit
6.5 central limit6.5 central limit
6.5 central limitleblance
 
2 d viewing computer graphics
2 d viewing computer graphics2 d viewing computer graphics
2 d viewing computer graphicsKALESHWAR KUMAR
 
CMSC 56 | Lecture 8: Growth of Functions
CMSC 56 | Lecture 8: Growth of FunctionsCMSC 56 | Lecture 8: Growth of Functions
CMSC 56 | Lecture 8: Growth of Functionsallyn joy calcaben
 
Partial differential equations
Partial differential equationsPartial differential equations
Partial differential equationsaman1894
 
FUNCTIONS IN c++ PPT
FUNCTIONS IN c++ PPTFUNCTIONS IN c++ PPT
FUNCTIONS IN c++ PPT03062679929
 
Liang barsky Line Clipping Algorithm
Liang barsky Line Clipping AlgorithmLiang barsky Line Clipping Algorithm
Liang barsky Line Clipping AlgorithmArvind Kumar
 
Simplex Method
Simplex MethodSimplex Method
Simplex MethodSachin MK
 
Application of-differential-equation-in-real-life
Application of-differential-equation-in-real-lifeApplication of-differential-equation-in-real-life
Application of-differential-equation-in-real-lifeRazwanul Ghani
 

What's hot (20)

Control statements
Control statementsControl statements
Control statements
 
Confidence Intervals
Confidence IntervalsConfidence Intervals
Confidence Intervals
 
Central limit theorem
Central limit theoremCentral limit theorem
Central limit theorem
 
Nested loops
Nested loopsNested loops
Nested loops
 
Normal curve
Normal curveNormal curve
Normal curve
 
Function overloading(c++)
Function overloading(c++)Function overloading(c++)
Function overloading(c++)
 
Template Matching
Template MatchingTemplate Matching
Template Matching
 
Interpolation and its applications
Interpolation and its applicationsInterpolation and its applications
Interpolation and its applications
 
Operations Research - The Dual Simplex Method
Operations Research - The Dual Simplex MethodOperations Research - The Dual Simplex Method
Operations Research - The Dual Simplex Method
 
6.5 central limit
6.5 central limit6.5 central limit
6.5 central limit
 
Bender schmidt method
Bender schmidt methodBender schmidt method
Bender schmidt method
 
2 d viewing computer graphics
2 d viewing computer graphics2 d viewing computer graphics
2 d viewing computer graphics
 
Boundary fill algm
Boundary fill algmBoundary fill algm
Boundary fill algm
 
Linear Programming
Linear ProgrammingLinear Programming
Linear Programming
 
CMSC 56 | Lecture 8: Growth of Functions
CMSC 56 | Lecture 8: Growth of FunctionsCMSC 56 | Lecture 8: Growth of Functions
CMSC 56 | Lecture 8: Growth of Functions
 
Partial differential equations
Partial differential equationsPartial differential equations
Partial differential equations
 
FUNCTIONS IN c++ PPT
FUNCTIONS IN c++ PPTFUNCTIONS IN c++ PPT
FUNCTIONS IN c++ PPT
 
Liang barsky Line Clipping Algorithm
Liang barsky Line Clipping AlgorithmLiang barsky Line Clipping Algorithm
Liang barsky Line Clipping Algorithm
 
Simplex Method
Simplex MethodSimplex Method
Simplex Method
 
Application of-differential-equation-in-real-life
Application of-differential-equation-in-real-lifeApplication of-differential-equation-in-real-life
Application of-differential-equation-in-real-life
 

Similar to U unit8 ksb

Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsUniversity of Salerno
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciSelvin Hadi
 
Lec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataLec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataMohamadKharseh1
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Long Beach City College
 
5-Propability-2-87.pdf
5-Propability-2-87.pdf5-Propability-2-87.pdf
5-Propability-2-87.pdfelenashahriari
 
Sampling distribution.pptx
Sampling distribution.pptxSampling distribution.pptx
Sampling distribution.pptxssusera0e0e9
 
raghu veera stats.ppt
raghu veera stats.pptraghu veera stats.ppt
raghu veera stats.pptDevarajuBn
 
C2 st lecture 10 basic statistics and the z test handout
C2 st lecture 10   basic statistics and the z test handoutC2 st lecture 10   basic statistics and the z test handout
C2 st lecture 10 basic statistics and the z test handoutfatima d
 
Pengenalan Ekonometrika
Pengenalan EkonometrikaPengenalan Ekonometrika
Pengenalan EkonometrikaXYZ Williams
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theoremlovemucheca
 
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptxHASDINABKARIANEBRAHI
 

Similar to U unit8 ksb (20)

Estimation by c.i
Estimation by c.iEstimation by c.i
Estimation by c.i
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 
Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis tests
 
Inorganic CHEMISTRY
Inorganic CHEMISTRYInorganic CHEMISTRY
Inorganic CHEMISTRY
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ci
 
Talk 3
Talk 3Talk 3
Talk 3
 
Lec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing dataLec. 10: Making Assumptions of Missing data
Lec. 10: Making Assumptions of Missing data
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
5-Propability-2-87.pdf
5-Propability-2-87.pdf5-Propability-2-87.pdf
5-Propability-2-87.pdf
 
Sampling distribution.pptx
Sampling distribution.pptxSampling distribution.pptx
Sampling distribution.pptx
 
raghu veera stats.ppt
raghu veera stats.pptraghu veera stats.ppt
raghu veera stats.ppt
 
Inferential statistics-estimation
Inferential statistics-estimationInferential statistics-estimation
Inferential statistics-estimation
 
C2 st lecture 10 basic statistics and the z test handout
C2 st lecture 10   basic statistics and the z test handoutC2 st lecture 10   basic statistics and the z test handout
C2 st lecture 10 basic statistics and the z test handout
 
05inference_2011.ppt
05inference_2011.ppt05inference_2011.ppt
05inference_2011.ppt
 
Estimating a Population Mean
Estimating a Population Mean  Estimating a Population Mean
Estimating a Population Mean
 
Pengenalan Ekonometrika
Pengenalan EkonometrikaPengenalan Ekonometrika
Pengenalan Ekonometrika
 
Estimating a Population Mean
Estimating a Population MeanEstimating a Population Mean
Estimating a Population Mean
 
law of large number and central limit theorem
 law of large number and central limit theorem law of large number and central limit theorem
law of large number and central limit theorem
 
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx
7-THE-SAMPLING-DISTRIBUTION-OF-SAMPLE-MEANS-CLT.pptx
 

More from Akhilesh Deshpande (6)

U unit7 ssb
U unit7 ssbU unit7 ssb
U unit7 ssb
 
U unit4 vm
U unit4 vmU unit4 vm
U unit4 vm
 
U unit3 vm
U unit3 vmU unit3 vm
U unit3 vm
 
U uni 6 ssb
U uni 6 ssbU uni 6 ssb
U uni 6 ssb
 
Unit2 vrs
Unit2 vrsUnit2 vrs
Unit2 vrs
 
Unit1 vrs
Unit1 vrsUnit1 vrs
Unit1 vrs
 

Recently uploaded

Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 

Recently uploaded (20)

Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 

U unit8 ksb

  • 1. VTU Edusat Programme – 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT – 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and of Technology Davangere-577004 Email: ksbraju@hotmail.com
  • 2. Statistical Inference: It is necessary to draw some valid and reasonable conclusions concerning a large mass of individuals or things. Every individual or the entire group is known as population. Small part of this population is known as a sample. The process of drawing some valid and reasonable conclusion about the entire population is Statistical Inference. Random sampling: A large collection of individuals or attributed or numerical data can be understood as population or universe. A finite subset of the universe is called a sample. The number of individuals in a sample is called a Sample Size (n). Sampling distribution: For every sample size (n) we can compute quantities like mean, median, standard deviation etc., obviously these will not be the same. Suppose we group these characteristics according to their frequencies, the frequency distributions so generated are called Sampling Distributions. The sampling distribution of large samples are assumed to be a normal distribution. The standard deviation of a sampling distribution is also called as the standard error (SE). Testing of Hypothesis
  • 3. Making certain assumption to arrive at a decision regarding the population a sample population will be referred to as hypothesis The hypothesis formulated for the purpose of its rejection under the assumption that the true is called as the null hypothesis denoted as H0 . Errors In a test process there can be four possible situations lead to the two types of errors and same is tabulated as follows: Accepting the hypothesis Rejecting the hypothesis Hypothesis is true Correct decision Wrong decision Type I error Hypothesis is false Wrong decision Type II error Correct decision In order to minimize both these types of errors we need to increase the sample size. Significance level: The probability level below which leads to the hypothesis is known as the significance level. This probability is conventionally fixed at 0.05 or 0.01 i.e., 5% or 1% Therefore rejecting hypothesis at 1% level of significance, implies that at 5% level of significance, there may be errors of either types (Type I or II) is 0.05.
  • 4. TESTS OF SIGNIFICANCE AND CONFIDENCE INTERVALS The process which helps us to decide about the acceptance or rejection of the hypothesis is called as the test of significance. Suppose that we have a normal population with mean μ and S D as ߪ. If xത is the sample mean of a random sample size (n), the quantity “t” defined by ‫ܜ‬ = ሺ‫ܠ‬തି ࣆሻ√࢔ ࣌ (1) is called as the standard normal variate (SNV) whose xത = 0 , σ = 1 From the table of the normal areas, we find that 95% of the area lies between t = -1.96 and t = 1.96 Further 5% level of significance is denoted by t0.05, therefore, −1.96 ≤ ሺ୶തି ఓሻ√௡ ఙ ≤ 1.96 ఙ √௡ ൫– 1.96൯ ≤ xത − ߤ ≤ ఙ √௡ 1.96 (2) ߤ ≤ xത + ߪ √݊ ሺ1.96ሻ and xത − ߪ √݊ ሺ1.96ሻ ≤ ߤ ∴ ‫ܠ‬ത − ࣌ √࢔ ሺ૚. ૢ૟ሻ ≤ ࣆ ≤ ‫ܠ‬ത + ࣌ √࢔ ሺ૚. ૢ૟ሻ (3) Similarly from the table of the normal areas 99% of the area lies between
  • 5. -2.58 and 2.58. This is equivalent to the form, ∴ ‫ܠ‬ത − ࣌ √࢔ ሺ૛. ૞ૡሻ ≤ ૄ ≤ ‫ܠ‬ത + ࣌ √࢔ ሺ૛. ૞ૡሻ (4) Therefore representation (3) is that 95% confidence interval and Representation (3) is the 99% confidence level. Graph: Tests of significance for large samples: Let N be the large sample having n members. Let p and q denote number of success and failure respectively, then p+ q = 1. By binomial distribution, N (p + q) n denotes the frequencies of samples. Therefore N (p + q) n denotes the sampling distribution of the number of successes in the sample. We know that by binomial distribution ‫ܠ‬ത = ‫ܘܖ‬ and ો = ඥ‫ܙܘܖ‬ then,
  • 6. • Mean proportion of success = ୬୮ ୬ = p • S.D.(or S.E) proportion of success = ඥ‫ܙܘܖ‬ ‫ܖ‬ = ටቀ ‫ܙܘ‬ ‫ܖ‬ ቁ Let ‘x’ be the observed number of successes in a sample size (n) and ૄ = ‫ .ܘܖ‬ The standard normal variate Z is defined as, Z = ୶ିஜ ஢ = ୶ି୬୮ ඥ୬୮୯ If Z ≤ 2.58, we conclude that the differences is highly significant and reject the hypothesis. Then p ± 2.58ටቀ ‫ܙܘ‬ ‫ܖ‬ ቁ be the probable limits of Z. p − 2.58ටቀ ‫ܙܘ‬ ‫ܖ‬ ቁ ≤ ‫ ܈‬ ≤ p + 2.58ටቀ ‫ܙܘ‬ ‫ܖ‬ ቁ For a normal distribution, only 5% of members lie outside μ ± 1.96 σ while only 1% of the members lie outside μ ± 2.58 σ If x be the observed number of successes in the sample and Z is the standard normal variate the Z = ୶ିஜ ஢ = ୶ି୬୮ ඥ୬୮୯ We have the following test of significance • If Z < 1.96, difference between the observed and expected number of successes is not significant. • If Z > 1.96 difference is significant at 5% level of significance. • If Z > 2.58, difference is significant at 1% level of significance.
  • 7. Example: A coin is tossed 1000 times and it turns up head 540 times , decide on the hypothesis is un biased . Solution: Let us suppose that the coin is unbiased P = probability of getting a head in one toss = 1/2 Since p + q = 1, q = ଵ ଶ Expected number of heads in 1000 tosses ൟ = np = 1000 × ଵ ଶ = 500 ‫ ݂݋ ݎܾ݁݉ݑܰ ݈ܽݑݐܿܣ‬ℎ݁ܽ݀‫ݏ‬ = 540 = ‫ݔ‬ ‫ݐ‬ℎ݁݊ ‫ݔ‬ − ݊‫݌‬ = 540 − 500 = 40 ܿ‫ݖ ݎ݁݀݅ݏ݊݋‬ = ‫ݔ‬ − ݊‫݌‬ ඥ݊‫ݍ݌‬ = 40 ට1000 × ଵ ଶ × ଵ ଶ = 2.53 < 2.58 ∴ ‫ݖ‬ = 2.53 < 2.58 ⇉ 99% ሺܷ݊݀݁‫ݎ‬ሻ ⇉ ܶℎ݁ ‫݀݁ݏܾܽ݅݊ݑ ݏ݅ ݊݅݋ܥ‬ Example: A survey was conducted in one locality of 2000 families by selecting a sample size 800. It was revealed that 180 families were illiterates. Find the probable limits of the literate families in a population of 2000.
  • 8. Solution: Probability of illiterate families = ܲ = ଵ଼଴ ଼଴଴ = 0.225 Also ‫ݍ‬ = 1 − ܲ = 1 − 0.225 = 0.775 Probability limits of illiterate families = ܲ ± 2.58ට ௣௤ ௡ = 0.225 ± 2.58ඨ ሺ0.225ሻሺ0.775ሻ 800 = 0.187 ܽ݊݀ 0.263 Therefore Probable limits of illiterate families in a sample of 2000 is = 0.187ሺ2000ሻ ܽ݊݀ 0.263ሺ2000ሻ = 374 and 526 Example: A die was thrown 9000 times and a throw of 5 or 6 was obtained 3240 times. On the assumption of random throwing, do the data indicate an unbiased die. Solution: Suppose ‘the die is unbiased’ then Probability of throwing 5 or 6 with one die = p(5) or p(6) = p(5) + p(6) = (1/6 ) + (1/6) = 1/3 q = 1-p = 1- (1/3) = 2/3
  • 9. Then expected number of successes ሺnpሻ = 1 3 × 9000 = 3000 = μ ሺsayሻ But the observed value of successes = 3240 Excess of observed value of successes = x − np = 3240 − 3000 = 240 Here n = 9000, p = 1 3 , q =, np = 3000 ∴ S. D = ඥnpq = ඨ9000 × ൬ 1 3 ൰ × ൬ 2 3 ൰ = 44.72 ∴ Z ሺSNVሻ = x − np ඥnpq = 240 44.72 = 5.36 ≈ 5.4 > 2.58 ⇉ Highly Signiϐicant ⇉ hypotheses is to be rejected at 1% level of Signiϐicance ∴ die is biased. Example: A biased dice is tossed 500 times a particular appears120 times. Find the 95% confidence limit of obtaining the value. Also find the standard error of proportion of success (Use binomial distribution). Solution: Let p = ଵଶ଴ ହ଴଴ = 0.24 then q = 0.76, n = 500. Standard error = 9.55 Then mean proportion of success = np/n = p = 0.24 and
  • 10. mean proportion of S. E = ඥnpq /n= 0.019 then 95% confidence interval for proportion of success is nሺ0.203ሻ ≤ np ≤ nሺ0.277ሻ ⇉ 500ሺ0.203ሻ ≤ np ≤ 500ሺ0.277ሻ 101 ≤ np ≤ 138 The interval is [101 , 138 ]. We say that with 95% confidence that out of 500 times always we get particular number between 101 and 138 times. Degrees of freedom (d.f ) It is the number of values in a set which may be set arbitrarily. d.f = n -1 for n number of observations d.f = n -2 for n -1 number of observations d.f = n -3 for n - 2 number of observations etc. Ex: for 25 observations we have 24 d.f Student’s t distribution It is to test the significance of a sample mean for a normal population where the population S is not known. It is given by ‫ܜ‬ = ሺ‫ܠ‬തି ࣆሻ√࢔ ࢙ where ‫̅ݔ‬ = ݉݁ܽ݊ = ∑ ௫ ௡ , ߤ = ‫ , ݊ܽ݁݉ ݊݋݅ݐ݈ܽݑ݌݋݌‬
  • 11. ‫ݏ‬ଶ = 1 ሺ݊ − 1ሻ ෍ሺ‫ݔ‬ − ‫̅ݔ‬ሻଶ We need to test the hypothesis, whether the sample mean ‫̅ݔ‬ differs significantly from the population mean ߤ. If the calculated value of t i.e. |‫ݐ‬| is greater than the table value of t say t 0.05, we say that the difference between ‫̅ݔ‬ and ߤ is significant at 5% level. If |‫ݐ‬| > t 0.01, the difference is significant at 1% level. Note: 95% confidence limits for the population mean ߤ. Is ‫̅ݔ‬ ± ሺ ௦ √௡ ሻ Example: A machine is expected to produce nails of length 3 inches. A random sample of 25 nails gave an average length of 3.1 inches with standard deviation 0.3 can it be said that the machine is producing nail as per the specification.(value of students t 0.05 for 24 d.f is 2.064 ) Solution: Given ߤ = 3 , ‫̅ݔ‬ = 3.1 , n = 25 , s = 0.3 ‫ܜ‬ = ሺ‫ܠ‬തି ࣆሻ√࢔ ࢙ = 1.67 < 2.064 ∴ The hypothesis that the machine is producing nails as per speciϐication is accepted at 5% level of signiϐicance .
  • 12. Example: Ten individuals are chosen at random from a population and their heights in inches are found to be 63, 63, 66, 67, 68, 69, 70, 71,71, test the hypothesis that the mean height of the universe is 66 inches (value of t 0.05 = 2.262 for 9 d.f). Solution: We have ߤ = 66 , n = 10, ∴ d.f = 9 ‫̅ݔ‬ = ∑ ௫ ௡ = ଺଻଼ ଵ଴ = 67.8 ‫ݏ‬ଶ = ଵ ௡ିଵ ∑ሺ‫ݔ‬ − ‫̅ݔ‬ሻଶ = ଵ ଽ ሾሺ63 − 67.8ሻଶ + … … + ሺ71 − 67.8ሻଶሿ =9.067 S = 3.011 We have ‫ܜ‬ = ሺ‫ܠ‬തି ࣆሻ√࢔ ࢙ = ሺ૟ૠ.ૡି૟૟ ሻ ૜.૙૚૚ √૚૙ = 1.89 < 2.262 (given in the problem) ⇉ The hypothesis is accepted at 5% level of significance. Example: Eleven school boys were given a test in drawing. They were given a month’s further tution and a second test of equal difficulty was healed at the end of it do the marks give evidence that the students have benefitted by extra coaching (t 0.05 for d.f = 10) = 2.228
  • 13. Boys 1 2 3 4 5 6 7 8 9 10 11 Marks test 1 23 20 19 21 18 20 18 17 23 16 19 Marks test 2 24 19 22 18 20 22 20 20 23 20 17 Chi-Square distribution: (࣑૛ ) It provides a measure of correspondence between the Theoretical frequencies and observed frequencies Let Oi ( i = 1 , 2 , ….. n ) – observed frequencies Ei ( i = 1 , 2 , ….. n ) – estimated frequencies The quantity ࣑૛ (chi square) distribution is defined as ࣑૛ = ∑ ሺ୓౟ି ୉౟ ሻమ ୉౟ ௡ ௜ୀଵ ; degrees of freedom = n-1 Chi – square test as a test of goodness of fit: ࣑૛ test helps us to test the goodness of fit of the distributions such as Binomial, Poisson and Normal distributions. If the calculated value of ࣑૛ is less than the table value of ࣑૛ at a specified level of significance, the hypothesis is accepted. Otherwise the hypothesis is rejected.
  • 14. Example: A die is thrown 264 times and the number appearing on the face (x) follows the following frequency distribution x 1 2 3 4 5 6 f 40 32 28 58 54 60 Calculate the value of ࣑૛ Solution: Frequencies in the given table are the observed frequencies. Assuming that the die is unbiased the expected number of frequencies for the numbers 1, 2, 3,4,5,6 to appear on the face is 264/6 = 44 each Then the data is as follows No. on the die 1 2 3 4 5 6 Observed frequency(Oi) 40 32 28 58 54 60 Expected frequency(Ei) 44 44 44 44 44 44 ࣑૛ = ∑ ሺ୓౟ି ୉౟ ሻమ ୉౟ ଺ ௜ୀଵ ࣑૛ = 22
  • 15. Example: Five dice were thrown 96 times and the numbers 1 or 2 or 3 appearing on the face of the die follows the following frequency distribution No. of dice showing 1 or 2 or 3 5 4 3 2 1 0 Frequency 7 19 35 24 8 3 Test the hypothesis that the data follows a binomial distribution. Solution: Probability of a single die throwing 1 or 2 or 3 is P = 1/6+1/6+1/6 = ½ q = ½ Binomial distribution to fit the data Nሺq + pሻ୬ = 96 ൤ 1 2 + 1 2 ൨ ହ =96 ቀ ଵ ଶ ቁ ହ ,96 × 5ܿଵ × ቀ ଵ ଶ ቁ ହ , … … 96 ቀ ଵ ଶ ቁ ହ ∴ New table of values are
  • 16. No. on the die 12 or 3 5 4 3 2 1 0 Observed frequency(Oi) 7 19 35 24 8 3 Expected frequency(Ei) 3 15 30 30 15 3 ࣑૛ = ∑ ሺ୓౟ି ୉౟ ሻమ ୉౟ ହ ௜ୀ଴ ࣑૛ 0.05 = 11.7 > 11.07ሺ table valueሻ Hence the hypothesis that the data follows a binomial distribution is rejected. Example: Fit the Poisson distribution for the following data and test the goodness of fit given that X2 0.05 = 7.815 for degrees of freedom = 4 Solution: Poisson distribution to fit the data = Np(x) = Ne-m mx /x! m = np = ∑ ௙௫ ே = ଵ ଶ x 0 1 2 3 4 f 122 60 15 2 1
  • 17. Ne-m mx /x! = 200 ቂ ௘షభ/మሺଵ/ଶሻೣ ௫! ቃ where x = 0, 1, 2, 3, 4 = 121, 61, 15, 3, 0 Therefore new table is ࣑૛ = ∑ ሺ୓౟ି ୉౟ ሻమ ୉౟ ସ ௜ୀ଴ ࣑૛ = 0.025 < ࣑૛ 0.05 = 7.815 Therefore the fitness is considered good. ∴ The hypothesis that the fitness is good can be accepted. Example: The number of accidents per day (x) as recorded in a textile industry over a period of 400 is given below. Test the goodness of fit in respect of Poisson distribution of fit to the given data x 0 1 2 3 4 f(oi) 122 60 15 2 1 Ei 121 61 15 3 0 x 0 1 2 3 4 5 f 173 168 37 18 3 1
  • 18. Solution: Poisson distribution to fit the data = Np(x) = Ne-m mx /x! m = np = ∑ ௙௫ ே = 0.7825 Therefore new table is ࣑૛ = ∑ ሺ୓౟ି ୉౟ ሻమ ୉౟ ହ ௜ୀ଴ ࣑૛ = 12.297 ≈ 12.3 > ࣑૛ 0.05 = 9.49 Therefore the fitness is not good ∴ The hypothesis that the fitness is good is rejected. Example: In experiments of pea breeding, the following frequencies of seeds were obtained Round & yellow Wrinkled & yellow Round & green Wrinkled & green total 315 101 108 32 556 Theory predicts that the frequency should be in proportion 9:3:3:1. Examine the correspondence between theory and experiment. x 0 1 2 3 4 5 f(oi) 173 168 37 18 3 1 Ei 183 143 56 15 3 0
  • 19. Solution: Corresponding frequencies are 313, 104, 104, 35. ࣑૛ = 0.51 < ࣑૛ 0.05 = 7.815 ⟹ The calculated value of ࣑૛ is much less than ࣑૛ 0.05 ⟹ There exists agreement between theory and experiment.