continution hypothesis testing
Post on 04-Jun-2018
225 Views
Preview:
TRANSCRIPT
-
8/14/2019 Continution Hypothesis Testing
1/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.3
Inference With Variance Unknown Previously, we looked at estimating and testing the
population mean when the population standard deviation ( )was known or given:
But how often do we know the actual population variance?
Instead, we use the Student t-statistic , given by:
-
8/14/2019 Continution Hypothesis Testing
2/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.4
Inference With Variance Unknown When is unknown, we use its point estimator s
and the z-statistic is replaced by the the t-statistic, where thenumber of degrees of freedom , is n 1.
-
8/14/2019 Continution Hypothesis Testing
3/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.5
Testing when is unknown When the population standard deviation is unknown and the
population is normal, the test statistic for testing hypothesesabout is:
which is Student t distributed with = n 1 degrees offreedom. The confidence interval estimator of is given
by:
-
8/14/2019 Continution Hypothesis Testing
4/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.6
Example 12.1 Will new workers achieve 90% of the level of experienced
workers within one week of being hired and trained?
Experienced workers can process 500 packages/hour, thus if
our conjecture is correct, we expect new workers to be ableto process .90(500) = 450 packages per hour.
Given the data , is this the case?
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xls -
8/14/2019 Continution Hypothesis Testing
5/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.7
Example 12.1 Our objective is to describe the population of the numbers of
packages processed in 1 hour by new workers, that is wewant to know whether the new workers productivity is morethan 90% of that of experienced workers. Thus we have:
H1: > 450
Therefore we set our usual null hypothesis to:
H0: = 450
IDENTIFY
-
8/14/2019 Continution Hypothesis Testing
6/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.8
Example 12.1 Our test statistic is:
With n=50 data points, we have n 1=49 degrees of freedom.
Our hypothesis under question is:H1: > 450
Our rejection region becomes:
Thus we will reject the null hypothesis in favor of the alternative if ourcalculated test static falls in this region.
COMPUTE
-
8/14/2019 Continution Hypothesis Testing
7/38
-
8/14/2019 Continution Hypothesis Testing
8/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.10
Example 12.1 Alternatively, we can use t-test:Mean from
Tools > Data Analysis Plus in Excel
COMPUTE
:: r e
j e c
t i o n
r e g
i o n
-
8/14/2019 Continution Hypothesis Testing
9/38
-
8/14/2019 Continution Hypothesis Testing
10/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.12
Example 12.2 Can we estimate the return on investment for companies that
won quality awards?
We have are given a random sample of n = 83 such
companies. We want to construct a 95% confidence intervalfor the mean return, i.e. what is: ??
IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xls -
8/14/2019 Continution Hypothesis Testing
11/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.13
Example 12.2 From the data, we calculate:
For this term
and so:
COMPUTE
-
8/14/2019 Continution Hypothesis Testing
12/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.14
Example 12.2
We are 95% confident that the population mean, , i.e.the mean return of all publicly traded companies that win
quality awards, lies between 13.20% and 16.84%
Tools > Data Analysis Plus > t-Estimate: Mean is an alternative to themanual calculation
INTERPRET
-
8/14/2019 Continution Hypothesis Testing
13/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.15
Check Requisite Conditions The Student t distribution is robust , which means that if the
population is nonnormal, the results of the t-test andconfidence interval estimate are still valid provided that the
population is not extremely nonnormal.
To check this requirement, draw a histogram of the data andsee how bell shaped the resulting figure is. If a histogramis extremely skewed (say in the case of an exponential
distribution), that could be considered extremelynonnormal and hence t -statistics would be not be valid inthis case.
-
8/14/2019 Continution Hypothesis Testing
14/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.16
Estimating Totals of Finite Populations Large populations are defined as populations that are at
least 20 times the sample size
We can use the confidence interval estimator of a mean to
produce a confidence interval estimator of the populationtotal :
Where N is the size of the finite population.
-
8/14/2019 Continution Hypothesis Testing
15/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.17
Estimating Totals of Finite Populations For example, a sample of 500 households (in a city of 1
million households) reveals a 95% confidence intervalestimate that the household mean spent on Halloween candylies between $20 & $30.
We can estimate the total amount spent in the city bymultiplying these lower and upper confidence limits by thetotal population:
Thus we estimate that the total amount spent on Halloweenin the city lies between $20 million and $30 million.
-
8/14/2019 Continution Hypothesis Testing
16/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.18
Identifying Factors Factors that identify the t-test and estimator of :
-
8/14/2019 Continution Hypothesis Testing
17/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.19
Inference About Population Variance If we are interested in drawing inferences about a
populations variability , the parameter we need toinvestigate is the population variance:
The sample variance ( s 2 ) is an unbiased, consistent andefficient point estimator for . Moreover,
the statistic, , has a chi-squared distribution,with n 1 degrees of freedom.
-
8/14/2019 Continution Hypothesis Testing
18/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.20
Testing & Estimating Population VarianceThe test statistic used to test hypotheses about is:
(which is chi-squared with = n 1 degrees of freedom).
-
8/14/2019 Continution Hypothesis Testing
19/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.21
Testing & Estimating Population VarianceCombining this statistic:
With the probability statement:
Yields the confidence interval estimator for :
lower confidence limit upper confidence limit
-
8/14/2019 Continution Hypothesis Testing
20/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.22
Example 12.3 Consider a container filling machine. Management wants a
machine to fill 1 liter (1,000 ccs) so that that variance of thefills is less than 1 cc 2. A random sample of n=25 1 liter fillswere taken. Does the machine perform as it should at the 5%significance level?
We want to show that:H1: < 1
(so our null hypothesis becomes: H 0: = 1). We will usethis test statistic:
Variance is less than 1 cc 2
IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xls -
8/14/2019 Continution Hypothesis Testing
21/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.23
Example 12.3 Since our alternative hypothesis is phrased as:
H1: < 1
We will reject H 0 in favor of H 1 if our test statistic falls into
this rejection region:
We computer the sample variance to be: s 2=.8088
And thus our test statistic takes on this value
COMPUTE
-
8/14/2019 Continution Hypothesis Testing
22/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.24
Example 12.3 Since:
There is not enough evidence to infer that the claim is true.
Excel output can also be used for this test
INTERPRET
compare
-
8/14/2019 Continution Hypothesis Testing
23/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.25
Example 12.4 As we saw, we cannot reject the null hypothesis in favor of
the alternative. That is, there is not enough evidence to inferthat the claim is true.Note: the result does not say that the variance is greater than1, rather it merely states that we are unable to show that the variance is less than 1 .
We could estimate (at 99% confidence say) the variance of
the fills
-
8/14/2019 Continution Hypothesis Testing
24/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.26
Example 12.4 In order to create a confidence interval estimate of the
variance, we need these formulae:
we know (n 1)s2 = 19.41 from our previous calculation, andwe have from Table 5 in Appendix B:
COMPUTE
lower confidence limit upper confidence limit
-
8/14/2019 Continution Hypothesis Testing
25/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.27
Example 12.4 Thus the 99% confidence interval estimate is:
That is, the variance of fills lies between .426 and 1.963 cc 2.
COMPUTE
-
8/14/2019 Continution Hypothesis Testing
26/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.28
Identifying Factors Factors that identify the chi-squared test and estimator of :
-
8/14/2019 Continution Hypothesis Testing
27/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.29
Inference: Population Proportion When data are nominal, we count the number of occurrences
of each value and calculate proportions. Thus, the parameterof interest in describing a population of nominal data is the
population proportion p .
This parameter was based on the binomial experiment.
Recall the use of this statistic:
where p-hat ( ) is the sample proportion: x successes in asample size of n items.
-
8/14/2019 Continution Hypothesis Testing
28/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.30
Inference: Population Proportion When np and n(1 p) are both greater than 5, the sampling
distribution of is approximately normal with
mean:
standard deviation:
Hence:
-
8/14/2019 Continution Hypothesis Testing
29/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.31
Inference: Population Proportion Test statistic for p :
The confidence interval estimator for p is given by:
(both of which require that np >5 and n(1 p)>5)
-
8/14/2019 Continution Hypothesis Testing
30/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.32
Example 12.5 At an exit poll , voters are asked by a certain network if they
voted Democrat (code=1) or Republican (code=2). Based ontheir small sample, can the network conclude that theRepublican candidate will win the vote?
That is:H1: p > .50
And hence our null hypothesis becomes:H0: p = .50
IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xls -
8/14/2019 Continution Hypothesis Testing
31/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.33
Example 12.5 Since our research hypothesis is:
H1: p > .50
our rejection region becomes:
Looking at the data, we count 407 (of 765) votes for code=2.Hence, we calculate our test statistic as follows
COMPUTE
-
8/14/2019 Continution Hypothesis Testing
32/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.34
Example 12.5 Since:
we reject H 0 in favor of H 1, that is, there is enoughevidence to believe that the Republicans win the vote.
Likewise from Excel:
INTERPRET
compare these
or look at p -value
-
8/14/2019 Continution Hypothesis Testing
33/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.35
Selecting the Sample Size The confidence interval estimator for a population
proportion is:
Thus the (half) width of the interval is:
Solving for n , we have:
-
8/14/2019 Continution Hypothesis Testing
34/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.36
Selecting the Sample Size For example, we want to know how many customers to
survey in order to estimate the proportion of customers who prefer our brand to within .03 (with 95% confidence).
I.e. our confidence interval after surveying will be .03,that means W=.03
Substituting into the equation
Uh Oh. Since we havent takena sample yet, we dont have
this sample proportion
-
8/14/2019 Continution Hypothesis Testing
35/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.37
Selecting the Sample Size Two methods in each case we choose a value for then
solve the equation for n.
Method 1 : no knowledge of even a rough value of . This
is a worst case scenario so we substitute = .50
Method 2 : we have some idea about the value of . This isa better scenario and we substitute in our estimated value.
-
8/14/2019 Continution Hypothesis Testing
36/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.38
Selecting the Sample Size Method 1 : no knowledge of value of , use 50%:
Method 2 : some idea about a possible value, say 20%:
Thus, we can sample fewer people if we already have areasonable estimate of the population proportion beforestarting.
-
8/14/2019 Continution Hypothesis Testing
37/38
Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.39
Estimating Totals for Large Populations In much the same way as we saw earlier, when a population
is large and finite we can estimate the total number ofsuccesses in the population by taking the product of the sizeof the population ( N ) and the confidence interval estimator:
The Nielsen Ratings (used to measure TV audiences) usesthis technique. Results from a small sample audience (say2,000 viewers) is extrapolated to the total number of TV sets(say 100 million)
-
8/14/2019 Continution Hypothesis Testing
38/38
Nielsen Ratings Example Problem: describe the population of television shows
watched by viewers across the country (population), byexamining the results from 2,000 viewers (sample).
We take these values and multiply them by N =100 million toestimate that between 9.9 million and 12.7 million viewersare watching the Tonight Show.
COMPUTE
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xls
top related