All opinions expressed are those of the presenter and do not represent any organization.
Some slides need to be scrolled down to see the full content.
May 2022
All opinions expressed are those of the presenter and do not represent any organization.
Some slides need to be scrolled down to see the full content.
“FDA does not require use of any specific software for statistical analyses, and statistical software is not explicitly discussed in Title 21 of the Code of Federal Regulations [e.g., in 21CFR part 11]. However, the software package(s) used for statistical analyses should be fully documented in the submission, including version and build identification.”
As an organization, we need to ensure compliance and reduce the risk of using R and R packages in regulatory deliverables.
gsDesign
: an R package for group sequential design under proportional hazards.simtrial
, gsDesign2
, and gsdmvn
: experimental R packages for group sequential design under non-proportional hazards“Sponsors should provide the software programs used to create all ADaM datasets and generate tables and figures associated with primary and secondary efficacy analyses. Furthermore, sponsors should submit software programs used to generate additional information included in Section 14 CLINICAL STUDIES of the Prescribing Information (PI)26 if applicable. The specific software utilized should be specified in the ADRG. The main purpose of requesting the submission of these programs is to understand the process by which the variables for the respective analyses were created and to confirm the analysis algorithms. Sponsors should submit software programs in ASCII text format; however, executable file extensions should not be used.”
Although FDA did not expect submitted R code is executable, sponsor shall enhance reproducibility.
We share the same philosophy described in Section 1.1 of the R Packages book and quote here.
Tools:
pkglite
: represent and exchange R package source code as text files.cleanslate
(under internal validation): create portable R environments.Bookdown: https://r4csr.org/
pkglite
reimagines the way to represent R packages.
library("pkglite") "/path/to/pkg/" %>% collate(file_ectd(), file_auto("inst/")) %>% pack() pack( "/path/to/pkg1/" %>% collate(file_ectd()), "/path/to/pkg2/" %>% collate(file_ectd()), output = "/path/to/pkglite.txt" ) "/path/to/pkglite.txt" %>% unpack(output = "/path/to/output/", install = TRUE)
.Rproj
, .Rprofile
, .Renviron
)library("cleanslate") "portable-project/" %>% use_project(repo = "https://url/snapshot/2021-11-20/") %>% use_rprofile() %>% use_renviron() %>% use_r_version(version = "4.1.1") %>% use_rtools(version = "rtools40")
Within a regulatory R environment:
As statisticians, we use tidyverse, r2rtf, and internal tools to define the mock-up table, listing and figure (TLFs) for statistical analysis of a clinical trial.
As programmers, we use tidyverse, r2rtf, and internal tools to develop and/or validate analysis results based on mock-up TLFs.
As statisticians/programmers, we use pkglite
and internal tools to prepare proprietary R packages and analysis R scripts for eCTD submission packages.
As an internal/external reviewers, we use cleanslate
to re-construct a portable environment (if required) to reproduce analysis results.
More details: https://r4csr.org/
We recommended to use R package structure to organize standard tools, analysis projects, and Shiny apps.
More details: https://r4csr.org/project-folder.html
More details: https://r4csr.org/project-management.html
Diao, Guoqing, Guanghan F Liu, Donglin Zeng, Yilong Zhang, Gregory Golm, Joseph F Heyse, and Joseph G Ibrahim. 2020. “Efficient Multiple Imputation for Sensitivity Analysis of Recurrent Events Data with Informative Censoring.” Statistics in Biopharmaceutical Research, 1–9.
Gao, Fei, Guanghan F Liu, Donglin Zeng, Lei Xu, Bridget Lin, Guoqing Diao, Gregory Golm, Joseph F Heyse, and Joseph G Ibrahim. 2017. “Control-Based Imputation for Sensitivity Analyses in Informative Censoring for Recurrent Event Data.” Pharmaceutical Statistics 16 (6): 424–32.
Lachin, John M, and Mary A Foulkes. 1986. “Evaluation of Sample Size and Power for Analyses of Survival with Allowance for Nonuniform Patient Entry, Losses to Follow-up, Noncompliance, and Stratification.” Biometrics, 507–19.
Liu, Siyi, Shu Yang, Yilong Zhang, et al. 2021. “Multiply Robust Estimators in Longitudinal Studies with Missing Data Under Control-Based Imputation.” arXiv Preprint arXiv:2112.06000.