分享

FDA重磅发布《Clinical Trial Endpoints Approval of Cance...

 42ZB432 2019-01-31


I. INTRODUCTION 
 This guidance provides recommendations to applicants on endpoints for cancer clinical trials submitted to the Food and Drug Administration (FDA) to support effectiveness claims in new drug applications (NDAs), biologics license applications (BLAs), or supplemental applications.2  It also provides background information and discusses general regulatory principles.  The endpoints discussed in this guidance are for drugs to treat patients with an existing cancer.3  This guidance does not address endpoints for drugs to prevent or decrease the incidence of cancer.
 
This guidance is a revision of the final guidance of the same title that published in May 2007.  This guidance replaces the May 2007 guidance of the same title.   
 
In general, FDA’s guidance documents do not establish legally enforceable responsibilities.  Instead, guidances describe the Agency’s current thinking on a topic and should be viewed only as recommendations, unless specific regulatory or statutory requirements are cited.  The use of the word should in Agency guidances means that something is suggested or recommended, but not required.  

延伸阅读:

 

II. BACKGROUND
 Clinical trial endpoints serve different purposes. In conventional oncology drug development, early phase clinical trials evaluate safety and identify evidence of biological drug activity, such as tumor shrinkage.  Endpoints for later phase efficacy studies commonly evaluate whether a drug provides a clinical benefit such as prolongation of survival or an improvement in symptoms.  The following sections discuss the general regulatory requirements for efficacy and how they have influenced endpoint selection for the approval of cancer drugs.  Later sections describe these endpoints in more detail and discuss whether they might serve as measures of disease activity or clinical benefit in various clinical settings.
 A. Statutory and Regulatory Requirements for Effectiveness
 The requirement that new drugs show effectiveness is based on a 1962 amendment to the Federal Food, Drug, and Cosmetic Act (FD&C Act).  This law requires substantial evidence of effectiveness and specifies that this evidence must be derived from adequate and well-controlled clinical investigations.  Similarly, the Public Health Service Act requires biological products to be safe, pure, and potent.4  Clinical benefits that have supported drug approval have included important clinical outcomes (e.g., increased survival, symptomatic improvement) but have also included effects on surrogate endpoints known to predict clinical benefit (e.g., blood pressure). 
 
The accelerated approval regulations (21 CFR part 314, subpart H and 21 CFR part 601, subpart E), promulgated in 1992, allow use of additional endpoints for approval of drugs or biological products that are intended to treat serious or life-threatening diseases and that generally demonstrate an improvement over available therapy or provide therapy where none exists.  In this setting, the FDA may grant approval based on an effect on a surrogate endpoint that is reasonably likely to predict clinical benefit.5  Such surrogate endpoints are less wellestablished than those surrogate endpoints for traditional approvals, such as blood pressure for cardiovascular disease.  FDA may also grant accelerated approval based on an effect on a clinical endpoint that can be measured earlier than irreversible morbidity or mortality that is reasonably likely to predict an effect on irreversible morbidity or mortality or other clinical benefit (i.e., an intermediate clinical endpoint).6  A drug is approved under the accelerated approval regulations on condition that the manufacturer conducts clinical studies to verify and describe the actual clinical benefit.7  If the postmarketing studies fail to demonstrate clinical benefit or if the applicant does not demonstrate due diligence in conducting the required studies, among other reasons, FDA may withdraw approval of the drug or indication.8  In the following discussion, the term traditional approval denotes the longstanding route of drug approval based on the demonstration of clinical benefit or an effect on a surrogate endpoint  known to predict clinical benefit.  That term is distinguished from accelerated approval, which is associated with use of a surrogate endpoint or intermediate clinical endpoint that is reasonably likely to predict clinical benefit to support drug approval.
 
The evidence needed to establish effectiveness is discussed in the guidance for industry Providing Clinical Evidence of Effectiveness for Human Drug and Biological Products and the guidance for industry FDA Approval of New Cancer Treatment Uses for Marketed Drug and Biological Products.  In many cases, at least two adequate and well-controlled clinical investigations are needed.  In certain cases, evidence from a single adequate and well-controlled clinical investigation, with confirmatory evidence, can be sufficient (e.g., in cases in which a single multicenter study provides highly reliable and statistically strong evidence of an important clinical benefit).  FDA may consider data from one adequate and well-controlled clinical investigation and confirmatory evidence as substantial evidence, if FDA determines that such data and evidence are sufficient to establish effectiveness.9  For example, for drugs approved for treatment of patients with a specific stage of a particular malignancy, evidence from one adequate and well-controlled clinical investigation with confirmatory evidence may be sufficient to support an efficacy supplement for treatment of a different stage of the same cancer.
 B. Endpoints Supporting Past Approvals in Oncology
 For traditional approval, applicants show direct evidence of clinical benefit or improvement in a surrogate endpoint known to predict clinical benefit.  In oncology, survival improvement is considered an appropriate measure of clinical benefit.  In addition, sponsors have used other endpoints for cancer drug approval.  In the 1970s, the FDA usually approved cancer drugs based on objective response rate (ORR), determined by tumor assessments from radiological tests or physical examinations.  In the early 1980s, after discussion with the Oncologic Drugs Advisory Committee (ODAC), the FDA determined that cancer drug approval should be based on more direct evidence of clinical benefit, such as improvement in survival, improvement in a patient’s quality of life, improved physical functioning, or improved tumor-related symptoms. These benefits may not always be predicted by, or correlate with, ORR.
 
Over time, larger improvements in tumor reduction and delay in tumor growth have been seen, and tumor measurement endpoints have been used to support both traditional and accelerated approval.  Improvement in disease-free survival (DFS) has supported drug approval in selected adjuvant settings, in which a large proportion of patients were expected to have cancer symptoms at the time of recurrence.  Durable complete response has supported traditional approval in leukemia, where complete response (CR) is associated with less infection, bleeding, and blood product support.  A large improvement in progression-free survival (PFS) or high, substantiated durable ORR has been used to support traditional approval in select malignancies, but magnitude of effect, relief of tumor-related symptoms, and drug toxicity should also be considered when making the approval decision.10,11,12  For example, randomized trials for hormonal drugs for breast cancer and a single arm trial of a drug for ROS1-positive metastatic non-small cell lung cancer have used ORR as an endpoint supporting traditional approval. Improvement in tumor-related symptoms in conjunction with an improved ORR and adequate response duration has supported traditional approval in several clinical settings.
 
Surrogate endpoints for accelerated approval must be reasonably likely to predict clinical benefit (FD&C Act § 506(c)(1)(A); 21 CFR part 314, subpart H; and 21 CFR part 601, subpart E).  While durable ORR has been used as a traditional approval endpoint in some circumstances, ORR has also been the most commonly used surrogate endpoint in support of accelerated approval.  Tumor response is widely accepted by oncologists in guiding cancer treatments.  Because ORR is directly attributable to drug effect, single-arm trials conducted in patients with refractory tumors where no available therapy exists provide an accurate assessment of ORR.  Whether tumor measures such as ORR or PFS are used as an accelerated approval or traditional approval endpoint will depend on the disease context and the magnitude of the effect, among other factors (see sections III.B.2 and 4).
 
 
III. GENERAL ENDPOINT CONSIDERATIONS
 This section provides an overview of general issues in cancer drug development.  A discussion of commonly used cancer endpoints is followed by a discussion of pertinent issues in cancer clinical trial design using these endpoints (see the guidances referenced in footnote 3 regarding non-small cell lung cancer and high-risk early-stage breast cancer for more detailed discussion for these diseases).  The endpoints that are discussed in this section include overall survival, endpoints based on tumor assessments (e.g., DFS, event-free survival (EFS), ORR, CR, time to progression (TTP), and PFS), endpoints involving symptom assessment, blood or body fluidbased biomarkers, and emerging endpoints.  Tables 1 and 2 provide a comparison of endpoints in cancer drug approval.  Many issues relating to the proper analysis of efficacy endpoints are addressed in the ICH guidance for industry E9 Statistical Principles for Clinical Trials.  Recommendations regarding the use of placebos and blinding in randomized controlled clinical trials are described in the draft guidance for industry Hematologic Malignancy and Oncologic Disease:  Considerations for Use of Placebos and Blinding in Randomized Clinical Trials for Drug Product Development.13  Cancer patient and caregiver experiences provide unique insights that contribute to important patient preference information for identifying relevant clinical trial endpoints to ultimately inform medical product development programs that best meet patient needs.  Recommendations on how stakeholders can collect and submit patient experience data14 and other relevant information from patients and caregivers for medical product development and regulatory decision making are described in the draft guidance for industry, FDA staff, and other stakeholders Patient-Focused Drug Development: Collecting Comprehensive and Representative Input.15  

Table 1.  A Comparison of Important Cancer Approval Endpoints
 As noted in the table, several oncology endpoints can serve different purposes (i.e., clinical endpoint that represents clinical benefit for traditional approval, surrogate endpoint to support traditional approval, surrogate endpoint to support accelerated approval) based on the specific context of use.  The determination is based on the specific diseases and is highly dependent upon factors such as effect size, effect duration, depth of response (e.g., number of CRs), available therapy, disease setting, location of disease, the clinical consequences of delaying or preventing disease progression or delaying administration of more toxic therapies, and the risk-benefit relationship.  See text for details.  See section V regarding recommendations for obtaining FDA feedback on endpoints and protocol design.

A. Overall Survival
 Overall survival is defined as the time from randomization until death from any cause and is measured in the intent-to-treat population.  Survival is considered the most reliable cancer endpoint, and when studies can be conducted to adequately assess survival, it is usually the preferred endpoint.  This endpoint is precise and easy to measure, documented by the date of    death.  Bias is not a factor in endpoint measurement.  Survival improvement should be analyzed as a risk-benefit analysis to assess clinical benefit.
 
Overall survival should be evaluated in randomized controlled studies.  Data derived from externally controlled trials are seldom reliable for time-to-event endpoints, including overall survival.  Apparent differences in outcome between external controls and current treatment groups can arise from differences other than drug treatment, including patient selection, improved imaging techniques, or improved supportive care.  Randomized studies minimize the effect of these known and unknown differences by providing a direct outcome comparison.  Demonstration of a statistically significant improvement in overall survival can be considered to be clinically significant if the toxicity profile is acceptable and has often supported new drug approval.
 
Difficulties in performing and analyzing survival studies include long follow-up periods in large trials and subsequent cancer therapy potentially confounding survival analysis.
 B. Endpoints Based on Tumor Assessments
 This section discusses several endpoints that are based on tumor assessments.  In many cancer types, radiographic tumor assessments directly measure components of the disease, and tumor measures commonly trigger treatment decisions in clinical practice.  Therefore, tumor measurebased endpoints are considered more clinically relevant than other biomarkers.  Tumor measurebased endpoints may support either traditional (as a clinical endpoint that represents clinical benefit or a surrogate endpoint for traditional approval) or accelerated approval and include DFS (and EFS), ORR, CR, TTP, and PFS. 
 
Tumor assessment endpoint selection should include two judgments.  First, a determination of whether the endpoint may support either accelerated approval or traditional approval should be ascertained.  Second, the endpoint should be evaluated for the potential of bias or uncertainty. Drug applications using studies that rely on tumor assessment endpoints as sole evidence of efficacy may need confirmatory evidence from a second trial.  Accuracy in measuring tumors can differ among tumor settings.  Tumor measurements used in ORR determinations can be imprecise in locations where there is a lack of demarcated margins (e.g., pleural or peritoneal mesothelioma, pancreatic cancer, brain tumors).
 
When the primary study endpoint is based on tumor measurements (e.g., PFS or ORR), tumor assessments generally should be verified by central reviewers blinded to study treatments (see Appendix) to ascertain lack of assessment bias.  Additional details regarding data collection are listed in the guidance for industry Clinical Trial Endpoints for the Approval of Non-Small Cell Lung Cancer Drugs and Biologics, Appendix A.16  Alternatively, a random sample-based blinded central review auditing approach could be used with a detailed auditing plan prespecified including a strategy to detect potential assessment bias.  If bias cannot be excluded based upon the audit, a blinded central review of all radiographic images will be necessary.17  Sponsors should seek FDA input prior to conducting an audit based central review.  Centralized independent verification of tumor assessment endpoints (especially for PFS or DFS) may not be necessary when randomized trials are blinded (unless the adverse event profile would substantially unblind the trial in practice) or effect sizes are robust in large randomized trials where sensitivity analysis supports lack of investigator bias.
 
1. Disease-Free Survival (and Event-Free Survival)
 
Generally, DFS is defined as the time from randomization until disease recurrence or death from any cause.  The most frequent use of this endpoint is in the adjuvant setting after definitive surgery or radiotherapy.  DFS also can be an important endpoint when a large percentage of patients achieve CRs with chemotherapy.  Although overall survival is a conventional endpoint for most adjuvant settings, DFS can be an important endpoint in situations where survival may be prolonged, making an overall survival endpoint impractical.  An endpoint that is similar to DFS but is differentiated from it in that randomization takes place before definitive surgery or radiotherapy in the adjuvant setting is EFS.  For the purpose of this guidance, EFS is defined as time from randomization to any of the following events: progression of disease that precludes surgery, local or distant recurrence, or death due to any cause.  DFS has been the primary basis of approval for adjuvant breast cancer hormonal therapy, adjuvant colon cancer, and adjuvant cytotoxic breast cancer therapy, whereas EFS is an appropriate endpoint for the evaluation of neoadjuvant breast cancer therapy given prior to definitive surgery.  Compared with standard cytotoxic therapies, hormonal therapies carry minimal side effects and thus a favorable riskbenefit relationship.  Treatment effect measured by DFS or EFS can be a surrogate endpoint to support accelerated approval, a surrogate endpoint to support traditional approval, or it can represent direct clinical benefit based on the specific disease, context of use, magnitude of the effect, the disease setting, available therapy, and the risk-benefit relationship.  In December 2003, the ODAC consensus was DFS prolongation represented clinical benefit if the magnitude of this benefit outweighed the toxicity of the adjuvant treatment.  In May 2004, the ODAC recommended that DFS be considered a clinical endpoint for colon cancer drugs in the surgical adjuvant setting.18  DFS has been used for traditional approval for diseases such as breast cancer, colorectal cancer, gastrointestinal stromal tumors, melanoma, and renal cell carcinoma. 
 
Important considerations in evaluating DFS or EFS as a potential endpoint include the estimated size of the treatment effect and proven benefits of standard therapies.  The protocol should  carefully delineate both the definition of DFS or EFS and the schedule for follow-up assessments and visits.  Unscheduled assessments can occur for many reasons and differences between study arms in the frequency, timing, or reason for unscheduled assessments can introduce bias.  Bias can be minimized by blinding patients and investigators to the treatment assignments, as appropriate. 
 
Application of the definition of DFS or EFS in a study can be complicated, particularly when deaths are noted without prior tumor progression documentation.  These events can be scored either as disease recurrences or as censored events.  Although all methods for statistical analysis of deaths have some limitations, considering deaths from all causes as recurrences can minimize bias.  DFS or EFS can be overestimated using this definition, especially in patients who die after a long period without observation.  Bias can be introduced if the frequency of long-term follow-up visits is dissimilar between the study arms or if dropouts are not random because of toxicity.  Some analyses count cancer-related deaths as DFS or EFS events and censor noncancer deaths.  This method can introduce bias in the attribution of the cause of death.  Furthermore, any method that censors observations on patients, whether at death or at the last visit, assumes that the patients with censored observations have the same risk of recurrence as patients with noncensored observations who have not yet experienced the event.
 
2. Objective Response Rate
 
ORR is defined as the proportion of patients with tumor size reduction of a predefined amount and for a minimum time period.  Response duration usually is measured from the time of initial response until documented tumor progression.  Generally, the FDA has defined ORR as the sum of partial responses plus CRs.  When defined in this manner, ORR is a direct measure of a drug antitumor activity, which can be evaluated in a single-arm study.  Stable disease should not be a component of ORR.  Stable disease can reflect the natural history of disease, whereas tumor reduction is a direct therapeutic effect.  Also, stable disease can be more accurately assessed by TTP or PFS analysis (see section III.B.4).  If available, standardized criteria should be used to ascertain response.  A variety of response criteria have been considered appropriate (e.g., revised RECIST guideline (version 1.1))19  The response criteria should be predefined in the protocol before the start of the study.  The significance of ORR is assessed by its magnitude and duration, and the percentage of CRs.  Treatment effect measured by ORR can be a surrogate endpoint to support accelerated approval, a surrogate endpoint to support traditional approval, or it can represent direct clinical benefit based on the specific disease, context of use, magnitude of the effect, the number of CRs, the durability of response, the disease setting, the location of the tumors, available therapy, and the risk-benefit relationship. 
 
3. Complete Response 
 
CR is defined as no detectable evidence of tumor.  CR is generally measured through imaging studies (e.g., CT scans) or through histopathologic assessment (e.g., bone marrow biopsy or breast cancer resection specimens).  Treatment effect measured by CR can be a surrogate endpoint to support accelerated approval, a surrogate endpoint to support traditional approval, or it can represent direct clinical benefit based on the specific disease, context of use, magnitude of the effect, effect duration, disease setting, location of disease, available therapy, and the risk-benefit relationship.  For hematologic malignancies such as acute leukemia, CR has been used as a clinical endpoint for traditional approval.  For early high-risk breast cancer, pathologic CR (pCR) has been used as a surrogate endpoint for accelerated approval. 
 
4. Time to Progression and Progression-Free Survival
 
TTP and PFS have served as primary endpoints for drug approval.  TTP is defined as the time from randomization until objective tumor progression; TTP does not include deaths.  PFS is defined as the time from randomization until objective tumor progression or death, whichever occurs first.  The precise definition of tumor progression is important and should be carefully detailed in the protocol.
 
Compared with TTP, PFS is the preferred regulatory endpoint.  PFS includes deaths and thus can be a better correlate to overall survival.  In TTP analysis, death events are censored, either at the time of death or at an earlier visit representing informative censoring (nonrandom pattern of loss from the study).  PFS assumes that death events are randomly related to tumor progression.
 
PFS can reflect tumor growth and be assessed before the determination of a survival benefit.  Its determination is not confounded by subsequent therapy.  For a given sample size, the magnitude of effect on PFS can be larger than the effect on overall survival.  Data are usually insufficient to allow a robust evaluation of the correlation between effects on overall survival and PFS.  Cancer trials are often small, and proven survival benefits of existing drugs are generally modest.  Treatment effect measured by PFS can be a surrogate endpoint to support accelerated approval, a surrogate endpoint to support traditional approval, or it can represent direct clinical benefit based on the specific disease, context of use, magnitude of the effect, the disease setting, location of metastatic sites, available therapy, the risk-benefit relationship, and the clinical consequences of delaying or preventing progression in key disease sites (e.g., delay of new lesions in the brain or spine) or delaying administration of more toxic therapies.
 
It is important to carefully define tumor progression criteria in the protocol.  There are no standard regulatory criteria for defining progression.  Applicants have used a variety of different criteria, including the RECIST criteria.  The broad outline presented in most published PFS criteria should be supplemented with additional details in the protocol and statistical analysis plan (SAP).  
 
Because progression data can be collected from multiple sources (including physical exams at unscheduled visits and radiological scans of various types) and at different times, data collection for each assessment visit should be limited to a specified short time interval around the scheduled visit.  Difficulties can arise in determining the event date and censoring date when data are collected over a prolonged time period.  We recommend assigning the progression date to the earliest time when any progression is observed without prior missing assessments and censoring at the date when the last radiological assessment determined a lack of progression. The guidance for industry Clinical Trial Endpoints for the Approval of Non-Small Cell Lung Cancer Drugs and Biologics, Appendices C and D provides a set of tables for potential analyses of PFS that can be used for primary or sensitivity analyses.  Plans for PFS data collection and analysis should be discussed with FDA at end-of-phase 2 meetings.
 5. Time to Treatment Failure
 
Time to treatment failure (TTF) is defined as a composite endpoint measuring time from randomization to discontinuation of treatment for any reason, including disease progression, treatment toxicity, and death.  TTF is generally not recommended as a regulatory endpoint for new molecular entity drug approval.   
 C. Endpoints Involving Symptom Assessment
 Symptomatic improvement is considered a clinical benefit.  FDA drug approvals have used patient symptom assessments and/or physical signs representing symptomatic improvement (e.g., weight gain, decreased effusion) as an important efficacy endpoint.  For the improvement of signs and symptoms or other clinical outcome assessments (COA) to be used as efficacy endpoints to support cancer drug approval, symptoms should be assessed that are due to cancer rather than drug toxicity to the extent possible.  
 
1. Specific Symptom Endpoints
 Symptom improvement/palliation is a direct measure of clinical benefit rather than a surrogate endpoint.  A decrease in the severity of cancer symptoms has been used to support traditional approval of anti-cancer agents where anti-tumor activity has also been demonstrated.  The use of a symptom palliation endpoint requires that the population be symptomatic at baseline, which can be problematic in many cancer trials where patients can often be asymptomatic at baseline.  This endpoint can also be subject to open label response bias, the magnitude of which is not well described.  
 
Time to progression of cancer symptoms, an endpoint similar to TTP, is also a direct measure of clinical benefit rather than a potential surrogate endpoint.  As discussed earlier, problems in measuring progression (e.g., missing assessments) also exist in evaluating time to symptomatic progression.  Because few cancer trials are blinded, symptom assessments can also be subject to response bias.  A delay between tumor progression and the onset of cancer symptoms can occur.  Often alternative treatments are initiated before achieving the symptom endpoint, confounding this analysis.  In addition, tumor symptoms can be difficult to differentiate from drug toxicity.
 
A composite symptom endpoint or symptom scale should have components of similar clinical importance and an analysis of the contribution of the components should be submitted with the primary analysis of the overall composite endpoint.  An example is the composite symptom scale that includes several important symptoms of myelofibrosis, the myelofibrosis symptom assessment form.20  An example of a composite endpoint for clinical events exists for patients with cancer metastases to the skeleton.  Drugs have been approved based on delaying the time to skeletal-related events defined as pathological fractures, radiation therapy to bone, surgery to bone, and spinal cord compression.
 
 Selection of the appropriate population can be critical for documenting symptom benefit. Patients symptomatic at study baseline can be evaluated with a categorical symptom response analysis.  In asymptomatic patients at baseline, a time-to-first-symptom analysis can be used, although this approach can be limited by intermittent missing data and discontinuation of the study drug prior to symptomatic progression.  
 
2. Problems Encountered with Symptom Data
 
Missing data and infrequent assessments can complicate the evaluation of symptom data particularly for time to deterioration analyses.  Procedures should be put in place to maximize completion rate during trial conduct and the statistical analysis plan should outline how missing data will be handled.  Data collection to support multiple symptom endpoints should be addressed prospectively regarding multiple hypotheses testing and the necessary statistical adjustments should be specified in the SAP.  Additional information on the use of patientreported outcomes can be found in the guidance for industry Patient Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims.    
 D. Blood or Body Fluid-Based Biomarkers
 Generally, biomarkers assayed from blood or body fluids have not served as primary endpoints for cancer drug approval, although paraprotein levels measured in blood and urine have been used as part of myeloma response criteria, and durable major molecular response is a surrogate endpoint used for traditional approval for therapies in chronic myelogenous leukemia. 
 
The FDA has accepted blood-based markers as elements of a composite endpoint.  The occurrence of certain clinical events (a significant decrease in performance status, or bowel obstruction) in conjunction with marked increases in CA-125 was considered progression in ovarian cancer patients.  In addition, blood-based biomarkers can be useful in identifying prognostic factors and in selection of patients and stratification factors to be considered in study designs.
 E. Emerging Endpoints
 In addition to the endpoints discussed in this section, FDA recognizes that advances in science are facilitating development of oncology products, which may also result in the identification of additional endpoints that may be used to support approval of oncology products.  As examples, minimal residual disease21 has been used as a surrogate endpoint for accelerated approval for a therapy in acute lymphoblastic leukemia and metastasis-free survival has been used as a clinical endpoint for traditional approval for a therapy in non-metastatic castration-resistant prostate cancer.  If a sponsor is planning to use an emerging endpoint in its clinical development program, we recommend discussing such use with the applicable FDA Division or Office prior to initiating a trial (see section V).

IV. CLINICAL TRIAL DESIGN AND ANALYSIS CONSIDERATIONS
 Per 21 CFR 314.126, the FDA approves drugs based on substantial evidence of efficacy from “adequate and well-controlled investigations,” as described in the regulations.  Studies must allow a valid comparison to a control and must provide a quantitative assessment of the drug’s effect.22  The most reliable method for demonstrating efficacy is to show a statistically significant improvement in a clinically meaningful endpoint in randomized controlled trials (i.e., superiority).  The following sections discuss other approaches that may be indicated when a randomized controlled trial demonstrating superiority is not feasible or ethical.  If a sponsor is planning to use one of these approaches, we recommend discussing such use with the applicable FDA Division or Office prior to initiating a trial (see section V).
 A. Single-Arm Studies
 In settings where there is no available therapy and where major tumor regressions can be presumed to be attributed to the tested drug, the FDA has sometimes accepted ORR and response duration observed in single-arm studies as substantial evidence supporting accelerated approval.  Response rates have been used in settings such as acute leukemia for traditional approval where CRs have been associated with decreased transfusion requirements, decrease in infections, and increased survival.  Single-arm trials do not adequately characterize time-toevent endpoints such as overall survival, DFS (and EFS), TTP, or PFS.  Because of variability in the natural history of many forms of cancer, a randomized study is necessary to evaluate time-to-event endpoints.
 B. Randomized Studies Designed to Demonstrate Noninferiority
 A noninferiority (NI) trial should demonstrate the new drug’s effectiveness by showing that the new drug is not less effective than a standard regimen (the active control) by a prespecified amount (noninferiority margin).23  This noninferiority margin should be a clinically acceptable loss that is not larger than the effect of the active control drug.  The standard regimen should have a well-characterized clinical benefit (survival benefit).  If the new drug is inferior to the active control by more than the noninferiority margin, it will be presumed to be ineffective.
 
NI trials rely on externally controlled (historical) data to establish the active control’s treatment effect size.  In cancer trials, this effect frequently has not been adequately characterized.  NI trials also rely on constancy assumption.  This assumption includes that the active-control effect has remained constant between the externally controlled study and the current study.  This assumes constancy of patient population characteristics, supportive care measures, and evaluation techniques between the current trial and the externally controlled data from which the active-control effect was derived.  The estimated size of the active-control’s treatment effect should be based on a comprehensive meta-analysis of externally controlled studies.  These studies should reliably reproduce the active-control effect compared with placebo arm.

Difficulties in conducting NI trials include the estimation of active-control effect and the determination of amount of effect (NI margin) to be retained.  NI trials usually involve large sample sizes compared with superiority trials and involve replication of clinical trial results. Furthermore, subsequent therapies and crossover to the active-control arm can confound any NI analysis.  NI trials with endpoints other than overall survival and ORR are problematic.
 C. Trial Designs for Radiotherapy Protectants and Chemotherapy Protectants
 Radiotherapy protectants and chemotherapy protectants are drugs designed to ameliorate the toxicities of therapies.  These trials usually have two objectives.  The first is to assess the amelioration of cancer treatment toxicity.  The second objective is to determine whether anticancer activity is compromised by the protectant.  The second objective usually examines earlier endpoints; for example, ORR or PFS, rather than overall survival.
 D. Clinical Trial Design Considerations
 The methodology for assessing, measuring, and analyzing the endpoint(s) should be detailed in the protocol and SAP.  
 
Visits and radiological assessments, when applicable, should be symmetric between the two study arms to prevent systematic bias.  The FDA and the applicant should agree prospectively on the following items and the applicant should finalize the items before the initiation of the study, to the extent possible:
 
· The study design · The definition of progression · The SAP · The methodology for handling missing data and censoring methods · The data to be recorded on the case report form (CRF) · The operating procedures of an independent endpoint review committee (IRC), if applicable (see Appendix)
 E. Clinical Trial Analysis Issues
 Missing data can complicate endpoint analysis.  For endpoints based on tumor assessments, the protocol should define an adequate assessment visit for each patient (i.e., a visit when all scheduled tumor assessments have been done).  The analysis plan should outline a comparison of the adequacy of follow-up in each treatment arm.  Methodology for analyzing incomplete and/or missing follow-up visits and censoring methods should be specified in the protocol.  The analysis plan should specify the primary analysis and one or more sensitivity analyses to evaluate the robustness of the results.  Although any analyses with missing data can be problematic, the results can be strengthened by similar results in both the primary and the sensitivity analyses.  When applicable, the evaluation should include the number of deaths in patients who have been lost to follow-up for a prolonged time period.  For example, an imbalance in such deaths could bias a PFS measurement by overestimating PFS in the treatment arm with less follow-up.
 V. CONCLUSION
 Although the general principles outlined in this guidance should help applicants select endpoints for marketing applications, we recommend that applicants meet with the FDA before submitting protocols intended to support NDA or BLA marketing applications.  The FDA will ensure that these meetings include a multidisciplinary FDA team of oncologists, statisticians, clinical pharmacologists, and external expert consultants as needed.  Applicants can submit protocols after these meetings and request a special protocol assessment that provides confirmation of the acceptability of endpoints and protocol design to support drug marketing applications.24  Ultimately, of course, marketing approval depends not only on the design of clinical trials, but on FDA review of the results and data from all studies in the drug marketing application.

APPENDIX: INDEPENDENT REVIEW OF TUMOR ENDPOINTS
 Clinical trial results that support drug approval should be verifiable by applicants and the Food and Drug Administration (FDA).  Objective response rate (ORR) determined in single-arm studies can be evaluated by reviewing a limited number of images.  When drug approval is based on measurement of progression-free survival (PFS), careful planning can minimize bias and enable the applicant and the FDA to verify results.  An independent endpoint review committee (IRC) can minimize bias in radiographic interpretation of the radiological findings and independent adjudication of assessments.  A clearly written plan of the charter outlining the IRC function and process (independent review charter) should be in place before initiation of the study.  The plan should describe the assurance of the committee’s independence and procedure for collection, storage, and transportation of the results.  The charter also should include the resolution of differences in interpretation and incorporation of clinical data in the final interpretation of data and audit procedures.  The use of an IRC is discussed further in the guidance for industry Developing Medical Imaging Drug and Biological Products, Part 3: Design, Analysis, and Interpretation of Clinical Studies.
 
  

    本站是提供个人知识管理的网络存储空间,所有内容均由用户发布,不代表本站观点。请注意甄别内容中的联系方式、诱导购买等信息,谨防诈骗。如发现有害或侵权内容,请点击一键举报。
    转藏 分享 献花(0

    0条评论

    发表

    请遵守用户 评论公约

    类似文章 更多