Mathematics

Hypothesis Test for Correlation

A hypothesis test for correlation is a statistical method used to determine if there is a significant linear relationship between two variables. It involves testing the null hypothesis that there is no correlation against the alternative hypothesis that there is a correlation. The test produces a p-value, which indicates the strength of evidence against the null hypothesis.

Written by Perlego with AI-assistance

7 Key excerpts on "Hypothesis Test for Correlation"

Index pages curate the most relevant extracts from our library of academic textbooks. They’ve been created using an in-house natural language model (NLM), each adding context and meaning to key research topics.
  • Mathematics for Enzyme Reaction Kinetics and Reactor Performance
    • F. Xavier Malcata(Author)
    • 2020(Publication Date)
    • Wiley
      (Publisher)

    ...19 Statistical Hypothesis Testing A statistical hypothesis is a hypothesis that is testable, on the basis of observing a process modeled by a set of random variables that follow a probability distribution known a priori. Hypothesis testing (or confirmatory data analysis) is a method of statistical inference that resorts to tests of significance to determine the probability that a statement is true, and at what likelihood such a statement may be accepted as true. Usually, two datasets are compared, or data obtained from sampling is compared against synthetic data produced via an idealized model; a working hypothesis is then put forward for the statistical relationship between the data, versus a complementary hypothesis. Therefore, the basic process of hypothesis testing consists of four sequential steps: (i) formulate the null hypothesis, H 0 (commonly stated as if the observations are the result of pure chance), and the alternative hypothesis, H 1 (commonly stated as if the observations unfold an actual effect, along with an unavoidable component of variation by chance); (ii) identify an appropriate test statistic that can be used to assess the truth of the null hypothesis (dependent on the nature of the data and of the test); (iii) compute the P ‐value, or associated probability that a test statistic at least as significant as the one determined from the sample data would be obtained, assuming that the null hypothesis held true (the smaller said probability, the stronger the evidence against the null hypothesis); and (iv) compare the P ‐value with an acceptable significance level, α (or threshold of probability, often 1% or 5%) – if P ≤ α, then the observed effect is statistically significant, so the null hypothesis is ruled out, and the alternative hypothesis is concomitantly accepted as valid. The probability of rejecting the null hypothesis (H 0, expressed...

  • Correlation and Regression
    eBook - ePub

    Correlation and Regression

    Applications for Industrial Organizational Psychology and Management

    ...Rather, one wants some statistical indication that these nonzero sample results will generalize to other groups of individuals (i.e., that the nonnull result is true in the population of interest). 1 This is precisely what statistical significance testing is all about (see the brief review in Chapter I). In fact, testing Pearson product moment correlations for significance is relatively straightforward, although the procedures depend upon the type of hypothesis being tested. TESTING A SINGLE CORRELATION AGAINST ZERO The hypothesis H 0 : ρ = 0, where ρ is the underlying population correlation, is by far the most frequently tested correlational hypothesis. In fact, many researchers run around asking each other, “Was your r significant?” This is researcher shorthand for, “Was your sample value of r significantly different from the hypothesized value of ρ = 0?”or “Assuming the population correlation was 0, was your sample value of r so far away from 0 that it was not likely to have occurred by chance?” Notationally, let ρ be the value of the population correlation, let r be our old friend the sample correlation, and let n be the sample size. Then, it can be shown that the value has a t -distribution with (n − 2) degrees of freedom. Formally, we have Note that the value of r is tested by computing a fairly straightforward (but nonlinear) function of r and comparing the result to well-known critical values of Student’s t -distribution. 2 For example, suppose the predictive validity of the Scholastic Aptitude Test (SAT) is being questioned. Assume that a researcher uses a simple random sample of n = 122 and obtains a measure of success in college (Y) as well as previous SAT scores (X) for each individual. Suppose the resulting correlation is r =.27. Then, Equation III.A gives The obtained sample value of t = 3.07 is larger than the two-sided, p =.05 critical t -value of 1.980 (see Appendix, Table A.1)...

  • Research Methods for Public Administrators
    • Gary Rassel, Suzanne Leland, Zachary Mohr, Elizabethann O'Sullivan(Authors)
    • 2020(Publication Date)
    • Routledge
      (Publisher)

    ...A test of statistical significance cannot indicate that a relationship is strong or important; it may not even indicate if the direction is as hypothesized. Nor does a finding of statistical significance imply that other parts of the research were carried out correctly. The test only makes a statistical statement about the nature of a relationship. To carry out a test of statistical significance, a researcher states the null and the research hypotheses selects an alpha level selects and computes a test statistic makes a decision Figure 12.4 illustrates these steps. Figure 12.4 Testing Statistical Significance of a Relationship The null hypothesis postulates that an independent and a dependent variable are not related. For some statistical tests, the null hypothesis may state that the relationship goes in a certain direction or does not exceed a specific value. In hypothesis testing, a researcher runs the risk of making two errors. First, he may reject a null hypothesis that is actually true. The researcher, then, has “accepted” an untrue research hypothesis. This error is called a Type I error. Second, the researcher may accept a null hypothesis that is untrue. The researcher has failed to accept a true research hypothesis; that is called a Type II error. In practice, researchers are more likely to make a judgment about a hypothesis based on the associated probability, the sample size, the magnitude of the effect, and the practical consequences of their decisions than they are to decide on a specific alpha level set a priori. The associated probability is the probability of a specific Χ 2 or t -value occurring if the relationship in the sample does not occur in the population, that is, if the null hypothesis is true. To detect small effects, the investigator will tend to set higher, that is, less strict, alpha levels, accept higher associated probabilities or use larger samples...

  • Econometrics
    eBook - ePub
    • K. Nirmal Ravi Kumar(Author)
    • 2020(Publication Date)
    • CRC Press
      (Publisher)

    ...We know, ‘r’ is a sample correlation coefficient and it serves as an estimate of population correlation coefficient ‘ρ’. As a statistical estimate, ‘r’ is subjected to error because of sampling and hence, it should be tested for validity. Student’s ‘t’ test will be employed to test the significance of ‘r’ value. This test provides the researcher with some evidence that, there really is a relationship between the two variables and not due to peculiarity in the data. In order to test the correlation coefficient for statistical significance, it is necessary to define the true correlation coefficient that would be observed, if all population values were obtained. As mentioned earlier, this true correlation coefficient is usually given the Greek symbol ‘r’ (rho). So, the null hypothesis (H O) is that: H O : ρ = 0 (there is no relationship between the two variables X and Y, when all population values are observed) and an alternative hypothesis (H A) is framed against this as: H A : Alternative hypothesis could be any one of the following three forms viz., (ρ≠0) or (ρ<0) or (ρ>0). That is, if the researcher has no idea whether or how two variables are related, then a two tailed H A ie., H A : (ρ≠0) is formulated. If the researcher suspects, or has knowledge, that the two variables are negatively related then, H A : (ρ<0) is formulated (ie., one tailed H A) and if the researcher predicts a positive relationship between the variables, then H A : (ρ>0) is formulated (ie., one tailed H A). The test statistic for the hypothesis test is the sample or observed correlation coefficient ‘r’ obtained from a sample. The sampling distribution of ‘ r ’ is approximated by a ‘ t ’ distribution with n –2 Degrees of Freedom (df). The formula to compute ‘ t cal ’ value is given by: t = r S E (r) Equation 2.14 where, r = Sample correlation coefficient, Standard Error (SE) of ‘ r ’ = 1 − r 2 n − 2 n = number of observations...

  • Choosing and Using Statistics
    eBook - ePub

    ...Test the data to see if they follow a normal distribution. Consider the alternatives. 2 The statistical significance of correlation is not a good guide to the real significance of the correlation. With large sample sizes the value of r required to achieve statistical significance (i.e. to show that there is some relationship between the two variables) is rather low. It is perhaps better to use the value of r 2 as an indicator of the real significance as this value shows the amount of variation in one variable explained by the other. An example A marine biologist working on Adélie penguins (Pygoscelis adeliae) has measured the sizes of birds forming pairs. The measure used is the length of a bone in the leg which is known, from previous studies, to be a good indication of size. It is measured to the nearest 0.1 mm. The null hypothesis is that male size is not correlated with female size. Unfortunately data could only be collected from six pairs. Pair Female Male 1 17.1 16.5 2 18.5 17.4 3 19.7 17.3 4 16.2 16.8 5 21.3 19.5 6 19.6 18.3 It is assumed that both variables are normally distributed. This data set is a little small to test this although a larger sample of the population might be more suitable (see ‘Do frequency distributions differ?’ in the previous chapter, page 72). In this case the null hypothesis is rejected as there is a significant positive correlation between male and female size indicating that there is positive, assortative mating in this species. The value of r is 0.88 and r 2 is 0.77 indicating that 77% of the variation in the size of one sex is explained by the size of the other. SPSS This is one of the easiest tests to carry out in SPSS. Input the data in two columns and add appropriate column labels. The cases (pairs in the example) do not require a separate column. From the ‘Analyze’ menu, select ‘Correlate’ then ‘Bivariate…’...

  • Quantitative and Statistical Research Methods
    eBook - ePub
    • William E. Martin, Krista D. Bridgmon(Authors)
    • 2012(Publication Date)
    • Jossey-Bass
      (Publisher)

    ...Chapter 2 LOGICAL STEPS OF CONDUCTING QUANTITATIVE RESEARCH: HYPOTHESIS-TESTING PROCESS LEARNING OBJECTIVES Understand the logic and purpose of the hypothesis-testing process in scientific research. Identify the components and application of alternative and null hypotheses. Examine the meaning of alpha level and commonly used criterion levels of alpha (α) used in research. Explore the elements used to choose an appropriate statistic for use to test a null hypothesis. Understand decision rules associated with rejecting and failing to reject a null hypothesis. Realize the importance of including effect size and confidence interval information to further clarify making a decision concerning the null hypothesis. The hypothesis-testing process is a logical sequence of steps to conduct the statistical analyses in a quantitative research study. Indeed, hypothesis testing is the most widely used statistical tool in scientific research (Salsburg, 2001, p. 114). However, we must remember that no method, including obtaining statistical results from hypothesis testing, is the absolute final answer to a research problem. As Snedecor and Cochran (1967) stated, “But the basic ideas in statistics assist us in thinking clearly about the problem, provide some guidance about the conditions that must be satisfied if sound inferences are to be made, and enable us to detect many inferences that have not good logical foundation” (p. 3). HYPOTHESIS-TESTING PROCESS There are six steps of the hypothesis-testing process that provide the procedure for conducting the statistical analyses used in this book. Descriptions and key concepts are discussed for each hypothesis step. 1. Establish the alternative (research) hypothesis (H a). An alternative (research) hypothesis (H a) is a speculative statement about the relations between two or more variables used in a quantitative research study (Kerlinger & Lee, 2000)...

  • Medical Statistics
    eBook - ePub

    Medical Statistics

    A Textbook for the Health Sciences

    • Stephen J. Walters, Michael J. Campbell, David Machin(Authors)
    • 2020(Publication Date)
    • Wiley-Blackwell
      (Publisher)

    ...6 Hypothesis Testing, P ‐values and Statistical Inference 6.1 Introduction 6.2 The Null Hypothesis 6.3 The Main Steps in Hypothesis Testing 6.4 Using Your P-value to Make a Decision About Whether to Reject, or Not Reject, Your Null Hypothesis 6.5 Statistical Power 6.6 One-sided and Two-sided Tests 6.7 Confidence Intervals (CIs) 6.8 Large Sample Tests for Two Independent Means or Proportions 6.9 Issues with P-values 6.10 Points When Reading the Literature 6.11 Exercises Summary The main aim of statistical analysis is to use the information gained from a sample of individuals to make inferences or form judgements about the parameters (e.g. the mean) of a population of interest. This chapter will discuss two of the basic approaches to statistical analysis: estimation (with confidence intervals (CIs)) and hypothesis testing (with P‐ values). The concepts of the null hypothesis, statistical significance, the use of statistical tests, P‐ values and their relationship to CIs are introduced. The difficulties with the use and mis‐interpretation of P ‐values are discussed. 6.1 Introduction We have seen that, in sampling from a population which can be assumed to have a Normal distribution, the sample mean can be regarded as estimating the corresponding population mean μ. Similarly, s 2 estimates the population variance, σ 2. We therefore describe the distribution of the population with the information given by the sample statistics and s 2. More generally, in comparing two populations, perhaps the population of subjects exposed to a particular hazard and the population of those who were not, two samples are taken, and their respective summary statistics calculated...