Calculating a plausible effect size

Get an effect size that should be detected reliably (based on expected mean differences and what we know about the SD in this kind of study)

library(pwr)

## Warning: Paket 'pwr' wurde unter R Version 4.2.3 erstellt

pwr.t.test(d=(-0.2-(-0.5))/1.6,
           power=0.8,
           sig.level=0.05,
           type="two.sample",
           alternative="greater")

## 
##      Two-sample t test power calculation 
## 
##               n = 352.3972
##               d = 0.1875
##       sig.level = 0.05
##           power = 0.8
##     alternative = greater
## 
## NOTE: n is number in *each* group

This yields \(N~= 712\) (\(n~= 356\)), 89 in each of the 8 total conditions. This is the sample size if there was only one look at the data.

A small effect of \(d~= 0.19\) can be detected with 80% power in a one-sided (directed) two-sample t test if the sample size is \(N~= 712\).

Note, however, that this sample is to small if a sequential testing strategy is applied. This is because of the Type-I error control in such an analysis, which leads to a reduction in test power. The final necessary total sample size will thus be conducted below.

Planning with Sequential Testing

However, the present study aimed to employ a sequential testing strategy with one planned interim look after 70% of the final sample size.

For more information, see this website.

What are the critical p-values in this sequential analysis?

library(rpact)

## Warning: Paket 'rpact' wurde unter R Version 4.2.3 erstellt

design <- getDesignGroupSequential(
  typeOfDesign = "asP", 
  informationRates = c(0.7, 1), 
  alpha = 0.05, 
  sided = 1)
design$stageLevels * 1

## [1] 0.03948640 0.02670694

The critical (on-sided) p-value at the interim look is \(.039\), and the critical p-value at the final look is \(.0267\).

Since these p-value adjustments lead to p-values smaller than \(0.05\) (the sig. value used in the initial power planning above), this leads to a (mild) reduction of statistical power. This can be compensated, however, by increasing the planned sample size accordingly.

This can be done this way:

library(rpact)
seq_design <- getDesignGroupSequential(
  informationRates = c(0.7, 1), 
  typeOfDesign = "asP",
  sided = 1,
  alpha = 0.05,
  beta = 0.2
  )

# Compute the sample size we need
power_res_seq <- getSampleSizeMeans(
  design = seq_design,
  groups = 2,
  alternative = 0.1875, 
  stDev = 1, 
  allocationRatioPlanned = 1,
  normalApproximation = FALSE)

power_res_seq$maxNumberOfSubjects

## [1] 788.9624

To compensate for the power loss due to sequential testing with one interim look at \(0.7N\), a total sample size of \(N~= 789\) is needed. This was rounded to \(N~= 800\) (\(n~= 100\) per condition).

Determine the N for the interim look

Given this information, at what point (i.e., at what fraction of the total N of 800) must the interim look take place?

interim_N <- 0.7 * 800
interim_N

## [1] 560

Interim look takes place at \(N~= 560\) (\(n~= 70\) subjects in each of the 8 conditions)

Sample Size Planning

Calculating a plausible effect size

Planning with Sequential Testing

Determine the N for the interim look