Hybrid Frequentist/Bayesian Power and Bayesian Power in Planning Clinical Trials provides a practical introduction to unconditional approaches to planning randomised clinical trials, particularly aimed at drug development in the pharmaceutical industry. This book is aimed at providing guidance to practitioners in using average power, assurance and related concepts. This book brings together recent research and sets them in a consistent framework and provides a fresh insight into how such methods can be used.
Features:
A focuson normal theory linking average power, expected power, predictive power, assurance, conditional Bayesian power and Bayesian power.
Extensions of the concepts to binomial, and time-to-event outcomes and non-inferiority trials
An investigation into the upper bound on average power, assurance and Bayesian power based on the prior probability of a positive treatment effect
Application of assurance to a series of trials in a development program and an introduction of the assurance of an individual trial conditional on the positive outcome of an earlier trial in the program, or to the successful outcome of an interim analysis
Prior distribution of power and sample size
Extension of the basic approach to proof-of-concept trials with dual success criteria
Investigation of the connection betweenconditional and predictive power at an interim analysis and power and assurance
Introduction of the idea of surety in sample sizing of clinical trials based on the width of the confidence intervals for the treatment effect, and an unconditional version.
Trusted byĀ 375,005 students
Access to over 1.5 million titles for a fair monthly price.
The power of a test is both a conditional and predictive concept. Conditional on the assumed statistical model, the alternative hypothesis and other parameters of the model, which may or may not be nuisance parameters, the outcome of an experiment is predicted and the proportion of predictions giving a āsignificant resultā at a predetermined level is the power. The power of an experiment will therefore only achieve its nominal level if the conditionally assumed model and alternative hypothesis are true.
In this book, it is my intention to review unconditional (absolute) approaches, essentially Bayesian in construct, which can be used both in the planning phase of a clinical trial and during an interim analysis to adjust the sample size or in the extreme case to halt the study for futility. These approaches account for the current uncertainty in our knowledge of the treatment effect and have variously been called the strength, the expected power, the average power (AP), the predictive power (PP) and the assurance of the test.
In the experimental sciences, received wisdom has it that there are four factors that drive a statistical design, some, but not all, of which are in the direct control of the experimenter. These are:
An appropriate null hypothesis of no treatment effect;
The probability of committing a type I error (rejecting the null hypothesis when true), α, the significance level;
An alternative hypothesis representing a treatment effect magnitude that is of interest or one that is expected to be achieved ā variously termed the clinically relevant difference (Lachin, 1977) or minimally clinically important difference (MCID) (Chuang-Stein et al., 2011a);
A sample size per experimental arm to give the probability of committing a type II error (not rejecting the null hypothesis when the alternative hypothesis is true), β.
For illustrative purposes, throughout this book, we will assume a simple two treatment experiment in which n1 patients are randomised to each arm. The difference in sample means, is assumed to have a normal distribution
(1.1)
in which Ī“ is the true difference in population means and we assume that the variance Ļ2 is known. With these assumptions, the sample size, n1, to test the null hypothesis Ho : Ī“ = 0 against the alternative hypothesis HA : Ī“ = Ī“0 is usually given by the formula
(1.2)
In all cases, we will be considering a one-sided test, but this is not a restrictive assumption.
Although this approach is not appropriate in all circumstances, the central limit theorem will often allow it to be used, at least following a suitable transformation. In clinical trials, Spiegelhalter et al. (2004) suggest it is reasonable to assume in many cases that after m āeffective observationsā relevant to a treatment effect Ļ on a suitable scale, a summary statistic ym exists with approximately a normal likelihood
Table 1.1 illustrates four examples and shows how the definition of m differs from case to case. We will show how this approach can be used in simple scenarios and that more complex models can also be dealt with.
Table 1.1 Normal Approximations to Standard Likelihoods
Distribution
Treatment Effect
Variance - V(ym)
Definition of m
Ļ2
Normal
ym = difference in means
Sample size per group
2Ļ2
Binary
ym = log (odds ratio)
Total # events
4
Poisson count
ym = log (rate ratio)
Total count
4
Survival
ym = log (hazard ratio)
Total # events
4
For many, the concept of the power of a test was introduced by Neyman and Pearson (1933a) both to define tests of specific hypotheses and as a measure for comparing different tests of the same hypothesis, for a fixed type I error. However, in truth, the basic idea had appeared four years earlier in work by Pearson and AdyanthÄya (1929) on the robustness of Studentās t-test to non-normal symmetrical distributions or skew distributions and independently considered by Dodge and Romig (1929) in their introduction of consumer risks and producer risks in acceptance sampling. The type-I error corresponds to the producerās risk that consumers reject a good product or service indicated by the null hypothesis. In other words, a producer introduces a good product and in doing so, he/she takes a risk that consumers will reject it. In contrast, the type II error corresponds to the consumerās risk for not rejecting a possibly worthless product or service indicated by the null hypothesis.
Neyman and Pearsonās proposal had little immediate impact on the design of individual clinical trials. As Campbell (2013) has commented, āeven exemplary clinical trials done in the early 1940s did not use statistical arguments to justify their Sample sizesā, including the Medical Research Council trial of streptomycin in the treatment of tuberculosis (Marshall et al., 1948). It was not until the early 1960s that power calculations began to appear in reports of clinical trials, one of the earliest being Zubrod et al. (1960).
In psychological research, Cohen (1962) was the first to seriously study the power of studies. He found amongst 70 studies from the Journal of Abnormal and Social Psychology that the median power of a medium effect size (ES), defined as the treatment difference divided by the pooled standard deviation of 0.5, was 0.48. This led to the recommendation that āinvestigators use larger sample sizes than they customarily doā, not least because āthe chance of obtaining a significant result was about that of tossing a head with a fair coinā (Cohen, 1992). Later, Cohen produced a āpower handbookā for psychologists āto solve the problemā (Cohen, 1969). One aspect of his work was the identification of the ratio of the type II to type I error rates as a measure of the relative importance of a type I error compared to a type II error. For example, in many studies, the type I error is set at 5% and the type II error at 20% which Cohen says implies that āmistaken rejection of the null hypothesis is considered four times as serious as mistaken acceptanceā.
Recently, there has been increased interest in investigating optimal type I and type II errors when the objective is to minimise the weighted sum of errors in which the weights represent the relative importance of the errors (Mudge et al., 2012; Grieve, 2015). Grieve showed that for a fixed sample size, the optimal type I and type II error rates depend upon the non-centrality parameter (NCP), which is related to Cohenās ES. Figure 1.1 displays the optimal error rates as a function of the NCP of a normal, two-arm comparative study, with known variance when type I errors are 4 times more important than type II errors. To illustrate, suppose that the NCP, takes the value 2 and we believe that type I errors are 4 times more important than type II errors. From Figure 1.1 we can read off that the optimal type I (two-sided) error is 0.090 (2Ć0.045) and the corresponding optimal type II error is 0.380 and their ratio is not 4:1. The optimal values are dependent on the value of the NCP and change as the NCP changes. For example, if the NCP takes the value 1.5 the optimal two-sided type I error is 0.094 and the optimal type II error is 0.569. Going further, Grieve also considered the inverse problem. For what relative importance are type I (two-sided) and type II error rates of 0.05 and 0.2, respectively, optimal? He showed that the solution is independent of the NCP. A two-sided type I error of 0.05 and a type II error of 0.2 are optimal for a relative importance of 4.79, 20% larger than the nominal ratio of 4. Walley and Grieve (2021) extend the analytic results developed by Grieve (2015) to studies for which the primary analysis is planned to be Bayesian. In the context of this book in which we concentrate on two-arm trials, Walley and Grieve (2021) consider examples in which a prior is available for the treatment effect and when an informative prior is available for the response mean in the control arm with a vague prior on the treatment effect which is related to Walley et al. (2015) and Lim et al. (2018).
Figure1.1 Optimised type I/type II errors as a function of the NCP, if type I errors are 4 times more important than type II errors.
In 1978, Freiman et al. (1978) investigated 71 ānegativeā trials taken mainly from the New England Journal of Medicine, The Lancet and the Journal of the American Medical Association restricting their attention to studies with a binar...
Table of contents
Cover Page
Half Title page
Series Page
Title Page
Copyright Page
Dedication
Contents
List of Figures
List of Tables
Preface
Acknowledgements
Author
List of Acronyms
1 Introduction
2 All Power Is Conditional Unless It's Absolute
3 Assurance
4 Average Power in Non-Normal Settings
5 Bayesian Power
6 Prior Distributions of Power and Sample Size
7 Interim Predictions
8 Case Studies in Simulation
9 Decision Criteria in Proof-of-Concept Trials
10 Surety and Assurance in Estimation
References
Appendix 1 Evaluation of a Double Normal Integral
Appendix 2 Besag's Candidate Formula
Index
Frequently asked questions
Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline
Perlego offers two plans: Essential and Complete
Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.5M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1.5 million books across 990+ topics, weāve got you covered! Learn about our mission
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud
Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere ā even offline. Perfect for commutes or when youāre on the go. Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app
Yes, you can access Hybrid Frequentist/Bayesian Power and Bayesian Power in Planning Clinical Trials by Andrew P. Grieve in PDF and/or ePUB format, as well as other popular books in Medicina & Probabilidad y estadĆstica. We have over 1.5 million books available in our catalogue for you to explore.