continution hypothesis testing

Report

Post on 04-Jun-2018

225 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

8/14/2019 Continution Hypothesis Testing

1/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.3

Inference With Variance Unknown Previously, we looked at estimating and testing the

population mean when the population standard deviation ( )was known or given:

But how often do we know the actual population variance?

Instead, we use the Student t-statistic , given by:
8/14/2019 Continution Hypothesis Testing

2/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.4

Inference With Variance Unknown When is unknown, we use its point estimator s

and the z-statistic is replaced by the the t-statistic, where thenumber of degrees of freedom , is n 1.
8/14/2019 Continution Hypothesis Testing

3/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.5

Testing when is unknown When the population standard deviation is unknown and the

population is normal, the test statistic for testing hypothesesabout is:

which is Student t distributed with = n 1 degrees offreedom. The confidence interval estimator of is given

by:
8/14/2019 Continution Hypothesis Testing

4/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.6

Example 12.1 Will new workers achieve 90% of the level of experienced

workers within one week of being hired and trained?

Experienced workers can process 500 packages/hour, thus if

our conjecture is correct, we expect new workers to be ableto process .90(500) = 450 packages per hour.

Given the data , is this the case?
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xls
8/14/2019 Continution Hypothesis Testing

5/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.7

Example 12.1 Our objective is to describe the population of the numbers of

packages processed in 1 hour by new workers, that is wewant to know whether the new workers productivity is morethan 90% of that of experienced workers. Thus we have:

H1: > 450

Therefore we set our usual null hypothesis to:

H0: = 450

IDENTIFY
8/14/2019 Continution Hypothesis Testing

6/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.8

Example 12.1 Our test statistic is:

With n=50 data points, we have n 1=49 degrees of freedom.

Our hypothesis under question is:H1: > 450

Our rejection region becomes:

Thus we will reject the null hypothesis in favor of the alternative if ourcalculated test static falls in this region.

COMPUTE
8/14/2019 Continution Hypothesis Testing

7/38
8/14/2019 Continution Hypothesis Testing

8/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.10

Example 12.1 Alternatively, we can use t-test:Mean from

Tools > Data Analysis Plus in Excel

COMPUTE

:: r e

j e c

t i o n

r e g

i o n
8/14/2019 Continution Hypothesis Testing

9/38
8/14/2019 Continution Hypothesis Testing

10/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.12

Example 12.2 Can we estimate the return on investment for companies that

won quality awards?

We have are given a random sample of n = 83 such

companies. We want to construct a 95% confidence intervalfor the mean return, i.e. what is: ??

IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xls
8/14/2019 Continution Hypothesis Testing

11/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.13

Example 12.2 From the data, we calculate:

For this term

and so:

COMPUTE
8/14/2019 Continution Hypothesis Testing

12/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.14

Example 12.2

We are 95% confident that the population mean, , i.e.the mean return of all publicly traded companies that win

quality awards, lies between 13.20% and 16.84%

Tools > Data Analysis Plus > t-Estimate: Mean is an alternative to themanual calculation

INTERPRET
8/14/2019 Continution Hypothesis Testing

13/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.15

Check Requisite Conditions The Student t distribution is robust , which means that if the

population is nonnormal, the results of the t-test andconfidence interval estimate are still valid provided that the

population is not extremely nonnormal.

To check this requirement, draw a histogram of the data andsee how bell shaped the resulting figure is. If a histogramis extremely skewed (say in the case of an exponential

distribution), that could be considered extremelynonnormal and hence t -statistics would be not be valid inthis case.
8/14/2019 Continution Hypothesis Testing

14/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.16

Estimating Totals of Finite Populations Large populations are defined as populations that are at

least 20 times the sample size

We can use the confidence interval estimator of a mean to

produce a confidence interval estimator of the populationtotal :

Where N is the size of the finite population.
8/14/2019 Continution Hypothesis Testing

15/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.17

Estimating Totals of Finite Populations For example, a sample of 500 households (in a city of 1

million households) reveals a 95% confidence intervalestimate that the household mean spent on Halloween candylies between $20 & $30.

We can estimate the total amount spent in the city bymultiplying these lower and upper confidence limits by thetotal population:

Thus we estimate that the total amount spent on Halloweenin the city lies between $20 million and $30 million.
8/14/2019 Continution Hypothesis Testing

16/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.18

Identifying Factors Factors that identify the t-test and estimator of :
8/14/2019 Continution Hypothesis Testing

17/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.19

Inference About Population Variance If we are interested in drawing inferences about a

populations variability , the parameter we need toinvestigate is the population variance:

The sample variance ( s 2 ) is an unbiased, consistent andefficient point estimator for . Moreover,

the statistic, , has a chi-squared distribution,with n 1 degrees of freedom.
8/14/2019 Continution Hypothesis Testing

18/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.20

Testing & Estimating Population VarianceThe test statistic used to test hypotheses about is:

(which is chi-squared with = n 1 degrees of freedom).
8/14/2019 Continution Hypothesis Testing

19/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.21

Testing & Estimating Population VarianceCombining this statistic:

With the probability statement:

Yields the confidence interval estimator for :

lower confidence limit upper confidence limit
8/14/2019 Continution Hypothesis Testing

20/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.22

Example 12.3 Consider a container filling machine. Management wants a

machine to fill 1 liter (1,000 ccs) so that that variance of thefills is less than 1 cc 2. A random sample of n=25 1 liter fillswere taken. Does the machine perform as it should at the 5%significance level?

We want to show that:H1: < 1

(so our null hypothesis becomes: H 0: = 1). We will usethis test statistic:

Variance is less than 1 cc 2

IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xls
8/14/2019 Continution Hypothesis Testing

21/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.23

Example 12.3 Since our alternative hypothesis is phrased as:

H1: < 1

We will reject H 0 in favor of H 1 if our test statistic falls into

this rejection region:

We computer the sample variance to be: s 2=.8088

And thus our test statistic takes on this value

COMPUTE
8/14/2019 Continution Hypothesis Testing

22/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.24

Example 12.3 Since:

There is not enough evidence to infer that the claim is true.

Excel output can also be used for this test

INTERPRET

compare
8/14/2019 Continution Hypothesis Testing

23/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.25

Example 12.4 As we saw, we cannot reject the null hypothesis in favor of

the alternative. That is, there is not enough evidence to inferthat the claim is true.Note: the result does not say that the variance is greater than1, rather it merely states that we are unable to show that the variance is less than 1 .

We could estimate (at 99% confidence say) the variance of

the fills
8/14/2019 Continution Hypothesis Testing

24/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.26

Example 12.4 In order to create a confidence interval estimate of the

variance, we need these formulae:

we know (n 1)s2 = 19.41 from our previous calculation, andwe have from Table 5 in Appendix B:

COMPUTE

lower confidence limit upper confidence limit
8/14/2019 Continution Hypothesis Testing

25/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.27

Example 12.4 Thus the 99% confidence interval estimate is:

That is, the variance of fills lies between .426 and 1.963 cc 2.

COMPUTE
8/14/2019 Continution Hypothesis Testing

26/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.28

Identifying Factors Factors that identify the chi-squared test and estimator of :
8/14/2019 Continution Hypothesis Testing

27/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.29

Inference: Population Proportion When data are nominal, we count the number of occurrences

of each value and calculate proportions. Thus, the parameterof interest in describing a population of nominal data is the

population proportion p .

This parameter was based on the binomial experiment.

Recall the use of this statistic:

where p-hat ( ) is the sample proportion: x successes in asample size of n items.
8/14/2019 Continution Hypothesis Testing

28/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.30

Inference: Population Proportion When np and n(1 p) are both greater than 5, the sampling

distribution of is approximately normal with

mean:

standard deviation:

Hence:
8/14/2019 Continution Hypothesis Testing

29/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.31

Inference: Population Proportion Test statistic for p :

The confidence interval estimator for p is given by:

(both of which require that np >5 and n(1 p)>5)
8/14/2019 Continution Hypothesis Testing

30/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.32

Example 12.5 At an exit poll , voters are asked by a certain network if they

voted Democrat (code=1) or Republican (code=2). Based ontheir small sample, can the network conclude that theRepublican candidate will win the vote?

That is:H1: p > .50

And hence our null hypothesis becomes:H0: p = .50

IDENTIFY
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xls
8/14/2019 Continution Hypothesis Testing

31/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.33

Example 12.5 Since our research hypothesis is:

H1: p > .50

our rejection region becomes:

Looking at the data, we count 407 (of 765) votes for code=2.Hence, we calculate our test statistic as follows

COMPUTE
8/14/2019 Continution Hypothesis Testing

32/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.34

Example 12.5 Since:

we reject H 0 in favor of H 1, that is, there is enoughevidence to believe that the Republicans win the vote.

Likewise from Excel:

INTERPRET

compare these

or look at p -value
8/14/2019 Continution Hypothesis Testing

33/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.35

Selecting the Sample Size The confidence interval estimator for a population

proportion is:

Thus the (half) width of the interval is:

Solving for n , we have:
8/14/2019 Continution Hypothesis Testing

34/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.36

Selecting the Sample Size For example, we want to know how many customers to

survey in order to estimate the proportion of customers who prefer our brand to within .03 (with 95% confidence).

I.e. our confidence interval after surveying will be .03,that means W=.03

Substituting into the equation

Uh Oh. Since we havent takena sample yet, we dont have

this sample proportion
8/14/2019 Continution Hypothesis Testing

35/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.37

Selecting the Sample Size Two methods in each case we choose a value for then

solve the equation for n.

Method 1 : no knowledge of even a rough value of . This

is a worst case scenario so we substitute = .50

Method 2 : we have some idea about the value of . This isa better scenario and we substitute in our estimated value.
8/14/2019 Continution Hypothesis Testing

36/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.38

Selecting the Sample Size Method 1 : no knowledge of value of , use 50%:

Method 2 : some idea about a possible value, say 20%:

Thus, we can sample fewer people if we already have areasonable estimate of the population proportion beforestarting.
8/14/2019 Continution Hypothesis Testing

37/38

Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.39

Estimating Totals for Large Populations In much the same way as we saw earlier, when a population

is large and finite we can estimate the total number ofsuccesses in the population by taking the product of the sizeof the population ( N ) and the confidence interval estimator:

The Nielsen Ratings (used to measure TV audiences) usesthis technique. Results from a small sample audience (say2,000 viewers) is extrapolated to the total number of TV sets(say 100 million)
8/14/2019 Continution Hypothesis Testing

38/38

Nielsen Ratings Example Problem: describe the population of television shows

watched by viewers across the country (population), byexamining the results from 2,000 viewers (sample).

We take these values and multiply them by N =100 million toestimate that between 9.9 million and 12.7 million viewersare watching the Tonight Show.

COMPUTE
http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xls