continution hypothesis testing

Upload: hamshavathini-yohoratnam

Post on 04-Jun-2018

225 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/14/2019 Continution Hypothesis Testing

    1/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.3

    Inference With Variance Unknown Previously, we looked at estimating and testing the

    population mean when the population standard deviation ( )was known or given:

    But how often do we know the actual population variance?

    Instead, we use the Student t-statistic , given by:

  • 8/14/2019 Continution Hypothesis Testing

    2/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.4

    Inference With Variance Unknown When is unknown, we use its point estimator s

    and the z-statistic is replaced by the the t-statistic, where thenumber of degrees of freedom , is n 1.

  • 8/14/2019 Continution Hypothesis Testing

    3/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.5

    Testing when is unknown When the population standard deviation is unknown and the

    population is normal, the test statistic for testing hypothesesabout is:

    which is Student t distributed with = n 1 degrees offreedom. The confidence interval estimator of is given

    by:

  • 8/14/2019 Continution Hypothesis Testing

    4/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.6

    Example 12.1 Will new workers achieve 90% of the level of experienced

    workers within one week of being hired and trained?

    Experienced workers can process 500 packages/hour, thus if

    our conjecture is correct, we expect new workers to be ableto process .90(500) = 450 packages per hour.

    Given the data , is this the case?

    http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-01.xls
  • 8/14/2019 Continution Hypothesis Testing

    5/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.7

    Example 12.1 Our objective is to describe the population of the numbers of

    packages processed in 1 hour by new workers, that is wewant to know whether the new workers productivity is morethan 90% of that of experienced workers. Thus we have:

    H1: > 450

    Therefore we set our usual null hypothesis to:

    H0: = 450

    IDENTIFY

  • 8/14/2019 Continution Hypothesis Testing

    6/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.8

    Example 12.1 Our test statistic is:

    With n=50 data points, we have n 1=49 degrees of freedom.

    Our hypothesis under question is:H1: > 450

    Our rejection region becomes:

    Thus we will reject the null hypothesis in favor of the alternative if ourcalculated test static falls in this region.

    COMPUTE

  • 8/14/2019 Continution Hypothesis Testing

    7/38

  • 8/14/2019 Continution Hypothesis Testing

    8/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.10

    Example 12.1 Alternatively, we can use t-test:Mean from

    Tools > Data Analysis Plus in Excel

    COMPUTE

    :: r e

    j e c

    t i o n

    r e g

    i o n

  • 8/14/2019 Continution Hypothesis Testing

    9/38

  • 8/14/2019 Continution Hypothesis Testing

    10/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.12

    Example 12.2 Can we estimate the return on investment for companies that

    won quality awards?

    We have are given a random sample of n = 83 such

    companies. We want to construct a 95% confidence intervalfor the mean return, i.e. what is: ??

    IDENTIFY

    http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-02.xls
  • 8/14/2019 Continution Hypothesis Testing

    11/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.13

    Example 12.2 From the data, we calculate:

    For this term

    and so:

    COMPUTE

  • 8/14/2019 Continution Hypothesis Testing

    12/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.14

    Example 12.2

    We are 95% confident that the population mean, , i.e.the mean return of all publicly traded companies that win

    quality awards, lies between 13.20% and 16.84%

    Tools > Data Analysis Plus > t-Estimate: Mean is an alternative to themanual calculation

    INTERPRET

  • 8/14/2019 Continution Hypothesis Testing

    13/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.15

    Check Requisite Conditions The Student t distribution is robust , which means that if the

    population is nonnormal, the results of the t-test andconfidence interval estimate are still valid provided that the

    population is not extremely nonnormal.

    To check this requirement, draw a histogram of the data andsee how bell shaped the resulting figure is. If a histogramis extremely skewed (say in the case of an exponential

    distribution), that could be considered extremelynonnormal and hence t -statistics would be not be valid inthis case.

  • 8/14/2019 Continution Hypothesis Testing

    14/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.16

    Estimating Totals of Finite Populations Large populations are defined as populations that are at

    least 20 times the sample size

    We can use the confidence interval estimator of a mean to

    produce a confidence interval estimator of the populationtotal :

    Where N is the size of the finite population.

  • 8/14/2019 Continution Hypothesis Testing

    15/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.17

    Estimating Totals of Finite Populations For example, a sample of 500 households (in a city of 1

    million households) reveals a 95% confidence intervalestimate that the household mean spent on Halloween candylies between $20 & $30.

    We can estimate the total amount spent in the city bymultiplying these lower and upper confidence limits by thetotal population:

    Thus we estimate that the total amount spent on Halloweenin the city lies between $20 million and $30 million.

  • 8/14/2019 Continution Hypothesis Testing

    16/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.18

    Identifying Factors Factors that identify the t-test and estimator of :

  • 8/14/2019 Continution Hypothesis Testing

    17/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.19

    Inference About Population Variance If we are interested in drawing inferences about a

    populations variability , the parameter we need toinvestigate is the population variance:

    The sample variance ( s 2 ) is an unbiased, consistent andefficient point estimator for . Moreover,

    the statistic, , has a chi-squared distribution,with n 1 degrees of freedom.

  • 8/14/2019 Continution Hypothesis Testing

    18/38Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.20

    Testing & Estimating Population VarianceThe test statistic used to test hypotheses about is:

    (which is chi-squared with = n 1 degrees of freedom).

  • 8/14/2019 Continution Hypothesis Testing

    19/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.21

    Testing & Estimating Population VarianceCombining this statistic:

    With the probability statement:

    Yields the confidence interval estimator for :

    lower confidence limit upper confidence limit

  • 8/14/2019 Continution Hypothesis Testing

    20/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.22

    Example 12.3 Consider a container filling machine. Management wants a

    machine to fill 1 liter (1,000 ccs) so that that variance of thefills is less than 1 cc 2. A random sample of n=25 1 liter fillswere taken. Does the machine perform as it should at the 5%significance level?

    We want to show that:H1: < 1

    (so our null hypothesis becomes: H 0: = 1). We will usethis test statistic:

    Variance is less than 1 cc 2

    IDENTIFY

    http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-03.xls
  • 8/14/2019 Continution Hypothesis Testing

    21/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.23

    Example 12.3 Since our alternative hypothesis is phrased as:

    H1: < 1

    We will reject H 0 in favor of H 1 if our test statistic falls into

    this rejection region:

    We computer the sample variance to be: s 2=.8088

    And thus our test statistic takes on this value

    COMPUTE

  • 8/14/2019 Continution Hypothesis Testing

    22/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.24

    Example 12.3 Since:

    There is not enough evidence to infer that the claim is true.

    Excel output can also be used for this test

    INTERPRET

    compare

  • 8/14/2019 Continution Hypothesis Testing

    23/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.25

    Example 12.4 As we saw, we cannot reject the null hypothesis in favor of

    the alternative. That is, there is not enough evidence to inferthat the claim is true.Note: the result does not say that the variance is greater than1, rather it merely states that we are unable to show that the variance is less than 1 .

    We could estimate (at 99% confidence say) the variance of

    the fills

  • 8/14/2019 Continution Hypothesis Testing

    24/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.26

    Example 12.4 In order to create a confidence interval estimate of the

    variance, we need these formulae:

    we know (n 1)s2 = 19.41 from our previous calculation, andwe have from Table 5 in Appendix B:

    COMPUTE

    lower confidence limit upper confidence limit

  • 8/14/2019 Continution Hypothesis Testing

    25/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.27

    Example 12.4 Thus the 99% confidence interval estimate is:

    That is, the variance of fills lies between .426 and 1.963 cc 2.

    COMPUTE

  • 8/14/2019 Continution Hypothesis Testing

    26/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.28

    Identifying Factors Factors that identify the chi-squared test and estimator of :

  • 8/14/2019 Continution Hypothesis Testing

    27/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.29

    Inference: Population Proportion When data are nominal, we count the number of occurrences

    of each value and calculate proportions. Thus, the parameterof interest in describing a population of nominal data is the

    population proportion p .

    This parameter was based on the binomial experiment.

    Recall the use of this statistic:

    where p-hat ( ) is the sample proportion: x successes in asample size of n items.

  • 8/14/2019 Continution Hypothesis Testing

    28/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.30

    Inference: Population Proportion When np and n(1 p) are both greater than 5, the sampling

    distribution of is approximately normal with

    mean:

    standard deviation:

    Hence:

  • 8/14/2019 Continution Hypothesis Testing

    29/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.31

    Inference: Population Proportion Test statistic for p :

    The confidence interval estimator for p is given by:

    (both of which require that np >5 and n(1 p)>5)

  • 8/14/2019 Continution Hypothesis Testing

    30/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.32

    Example 12.5 At an exit poll , voters are asked by a certain network if they

    voted Democrat (code=1) or Republican (code=2). Based ontheir small sample, can the network conclude that theRepublican candidate will win the vote?

    That is:H1: p > .50

    And hence our null hypothesis becomes:H0: p = .50

    IDENTIFY

    http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Xm12-05.xls
  • 8/14/2019 Continution Hypothesis Testing

    31/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.33

    Example 12.5 Since our research hypothesis is:

    H1: p > .50

    our rejection region becomes:

    Looking at the data, we count 407 (of 765) votes for code=2.Hence, we calculate our test statistic as follows

    COMPUTE

  • 8/14/2019 Continution Hypothesis Testing

    32/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.34

    Example 12.5 Since:

    we reject H 0 in favor of H 1, that is, there is enoughevidence to believe that the Republicans win the vote.

    Likewise from Excel:

    INTERPRET

    compare these

    or look at p -value

  • 8/14/2019 Continution Hypothesis Testing

    33/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.35

    Selecting the Sample Size The confidence interval estimator for a population

    proportion is:

    Thus the (half) width of the interval is:

    Solving for n , we have:

  • 8/14/2019 Continution Hypothesis Testing

    34/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.36

    Selecting the Sample Size For example, we want to know how many customers to

    survey in order to estimate the proportion of customers who prefer our brand to within .03 (with 95% confidence).

    I.e. our confidence interval after surveying will be .03,that means W=.03

    Substituting into the equation

    Uh Oh. Since we havent takena sample yet, we dont have

    this sample proportion

  • 8/14/2019 Continution Hypothesis Testing

    35/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.37

    Selecting the Sample Size Two methods in each case we choose a value for then

    solve the equation for n.

    Method 1 : no knowledge of even a rough value of . This

    is a worst case scenario so we substitute = .50

    Method 2 : we have some idea about the value of . This isa better scenario and we substitute in our estimated value.

  • 8/14/2019 Continution Hypothesis Testing

    36/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.38

    Selecting the Sample Size Method 1 : no knowledge of value of , use 50%:

    Method 2 : some idea about a possible value, say 20%:

    Thus, we can sample fewer people if we already have areasonable estimate of the population proportion beforestarting.

  • 8/14/2019 Continution Hypothesis Testing

    37/38

    Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. 12.39

    Estimating Totals for Large Populations In much the same way as we saw earlier, when a population

    is large and finite we can estimate the total number ofsuccesses in the population by taking the product of the sizeof the population ( N ) and the confidence interval estimator:

    The Nielsen Ratings (used to measure TV audiences) usesthis technique. Results from a small sample audience (say2,000 viewers) is extrapolated to the total number of TV sets(say 100 million)

  • 8/14/2019 Continution Hypothesis Testing

    38/38

    Nielsen Ratings Example Problem: describe the population of television shows

    watched by viewers across the country (population), byexamining the results from 2,000 viewers (sample).

    We take these values and multiply them by N =100 million toestimate that between 9.9 million and 12.7 million viewersare watching the Tonight Show.

    COMPUTE

    http://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xlshttp://localhost/var/www/apps/conversion/tmp/scratch_6/References/Nielson.xls