Clinical trials are only as trustworthy as the analytical decisions made before data are unblinded. Pre-specification — committing to hypotheses, endpoints, and statistical methods in writing before any outcome data are examined — is the single most important protection against inadvertent or deliberate bias in trial reporting. Yet it remains one of the most commonly misunderstood aspects of trial methodology.
A trial dataset containing hundreds of variables offers enormous scope for exploratory analysis. If an investigator searches freely across subgroups, endpoints, and time points, a statistically significant result will almost inevitably emerge — not because of a genuine treatment effect, but because of chance. This practice, sometimes described as HARKing (Hypothesising After Results are Known) or p-hacking, produces findings that appear credible but systematically overestimate treatment benefit and fail to replicate.
The consequences are serious. When post-hoc findings enter clinical practice — as they frequently do — patients may receive treatments that are ineffective or unsafe. Regulatory agencies and journals have progressively tightened requirements for pre-specification precisely because the historical literature is littered with false positives generated by flexible analysis.
"The strength of a clinical inference is directly proportional to the degree to which it was anticipated before the data were examined."
Pre-specification operates at several levels, each requiring explicit commitment before database lock:
Key distinction: Exploratory analyses conducted after unblinding are valid and scientifically valuable — but they must be clearly labelled as hypothesis-generating, not confirmatory. The error is not in exploring data; it is in presenting post-hoc findings as if they were pre-planned.
The SAP is a standalone document — separate from the protocol — that is finalised and date-stamped before the trial database is locked. At KCLEAGENICS MEDICAL, our SAPs are prepared by an independent biostatistician and reviewed by the trial steering committee. They are routinely shared with the Data Safety Monitoring Board (DSMB) and held on file for regulatory inspection.
A well-constructed SAP should specify:
Subgroup analyses are one of the highest-risk areas for spurious findings. A trial that pre-specifies five subgroup analyses at a two-sided alpha of 0.05 has a substantially elevated probability of producing at least one false positive by chance alone — and this probability increases dramatically with each additional subgroup examined.
Our approach is to pre-specify a small number of subgroup analyses that have strong prior biological justification, adjust for multiplicity where the results will be presented as confirmatory, and present all other subgroup analyses as clearly exploratory with appropriately scaled confidence intervals. Subgroup-by-treatment interaction tests are always reported alongside the main subgroup result.
All KCLEAGENICS MEDICAL interventional trials are prospectively registered on ClinicalTrials.gov before first participant enrolment, as required by FDAAA 801. The registered protocol includes the primary endpoint and key secondary endpoints. Any subsequent protocol amendments — including changes to endpoints — are recorded and publicly visible in the trial registration history. This audit trail makes it possible for readers, journal editors, and regulators to identify any discrepancy between the pre-registered analysis plan and the final published analysis.
Where we publish, we adhere to CONSORT reporting standards and provide the SAP as a supplementary document. We believe transparency in reporting is inseparable from the scientific integrity of the trial itself.
If you are considering participation in a KCLEAGENICS MEDICAL trial as a co-investigator or site principal investigator, you should expect to receive a copy of the SAP before site initiation. We encourage investigators to review it carefully and raise any questions before the database is locked. Understanding what the primary analysis is testing — and what it is not — is essential for interpreting the final results accurately and communicating them to your patients.
For more on our trial methodology and current studies, see our Research Portfolio or contact our research team directly.
Adaptive Platform Trial Design: A Practical Guide
For CliniciansBiomarker-Driven Patient Selection in Precision Oncology
For CliniciansMetabolic Risk and Haematological Malignancy
Subscribe to our monthly Clinical Intelligence Briefing