Why Relevance and Eligibility Aren't Enough: The Case for Evidence-Informed Clinical Trial Ranking

Moving from "you might qualify" to "here's what the evidence says"

March 3, 2026

TL;DR: Most clinical trial matching tools stop at relevance and eligibility screening. We tested a benchmark cancer patient case and found that TrialGPT-only ranking would have shown the wrong trials first—including a diagnostic imaging study ranked #1. Evidence-informed ranking surfaced novel therapies as more promising options.

The Problem with Matching Without Evidence

Clinical trial matching has come a long way. AI systems like TrialGPT can now read a patient's medical record, assess relevance, and determine which trials they're eligible for with impressive accuracy.

But here's the uncomfortable truth: relevance and eligibility aren't enough.

A patient might match dozens of relevant trials, but that doesn't mean those trials are equally good choices. Consider:

A diagnostic imaging study that won't provide any treatment
A trial testing a drug with poor results in similar tumors
A Phase I dose-finding study with no proven efficacy
A trial for a related but different cancer type

All of these might pass relevance and eligibility screening. But are they worth considering? That requires understanding the scientific evidence—something patient-trial matching alone can't tell you.

Case Study: Spinal Astrocytoma

To demonstrate this gap, we ran a benchmark patient case through both TrialGPT matching and evidence-informed ranking.

Patient Profile

Age/Sex: 45-year-old male
Diagnosis: Anaplastic astrocytoma of the thoracolumbar spine (unresectable)
Complications: Severe lower extremity weakness, urinary retention
Prior treatments:
- Radiation therapy (T10-L1)
- 11 cycles of temozolomide
- Irinotecan + bevacizumab (Avastin)

This is a heavily pretreated patient with a rare spinal cord tumor. Finding the right trial could be life-changing—or the wrong choice could waste precious time.

This case is from the TREC 2021 Clinical Trials Track, a benchmark dataset used to evaluate clinical trial matching systems.

We searched ClinicalTrials.gov (as of March 3, 2026) and found 23 active interventional trials for anaplastic astrocytoma—a typical number for a rare tumor type. After running them through eligibility screening using TrialGPT (an NIH-developed AI system for patient-trial matching), 10 trials passed the threshold.

Now comes the critical question: How should we rank these 10 trials?

TrialGPT-Only Ranking: What Eligibility Matching Shows

If we rank purely by TrialGPT score (which combines trial relevance and criteria matching, but not scientific evidence about treatments), here's what we get (showing top 7 by TrialGPT score):

Rank	Trial	TrialGPT Score	Worth Considering?
#1	NCT06241391 - PET/CT Imaging Study	86	No (diagnostic)
#2	NCT01269853 - Bevacizumab Intra-arterial	84	Yes
#3	NCT06264388 - DB107 Gene Therapy	81	Yes
#4	NCT05698524 - Temodar + Abexinostat	80	Maybe
#5	NCT04541082 - ONC206 Phase I	79	Maybe
#6	NCT06975332 - Alpha Emitter Targeted Therapy	54	Maybe
#7	NCT02800486 - Cetuximab Intra-arterial	51	Yes

Problem: The #1 trial by TrialGPT is NOT a good choice (it's a diagnostic study, not treatment). A patient following this ranking would see a non-therapeutic option first.

What's Wrong with the Top-Ranked Trials?

#1: NCT06241391 — PET/CT Imaging Study

TrialGPT: 86 ClinTrialFinder: 35

The problem: This isn't a treatment trial at all—it's a diagnostic imaging study using a radiotracer. The patient would undergo a scan but receive no therapeutic benefit.

Why it wasn't filtered earlier: ClinicalTrials.gov classifies this as "interventional" (patients receive a radiotracer injection), so it passes the interventional study filter. But "interventional" doesn't always mean "therapeutic."

Eligibility screening sees "glioma patient" and matches. Evidence analysis sees "no treatment offered" and correctly drops the score.

#4: NCT05698524 — Temodar + Abexinostat

TrialGPT: 80 ClinTrialFinder: 62

The nuance: This patient already received 11 cycles of temozolomide (Temodar)—and progressed. While the trial combines it with a novel HDAC inhibitor (Abexinostat), the primary drug is one the patient already failed.

Evidence analysis gave a moderate score (62) recognizing the novel combination, but not as high as trials with completely new mechanisms.

A Borderline Case: Phase I Trials

#5: NCT04541082 — ONC206 Phase I

TrialGPT: 79 ClinTrialFinder: 65

The nuance: This Phase I trial scored 65—just above our 60 threshold. Evidence analysis noted "Phase I design limits efficacy certainty" and "no proven benefit yet," but recognized the novel mechanism.

For a heavily pretreated patient who has exhausted standard options, Phase I trials with novel mechanisms can be reasonable choices. This is a judgment call, not a clear-cut wrong answer. The evidence helps patients understand the trade-off: promising safety profile, but unproven efficacy.

Evidence-Informed Ranking: How Context Changes the Picture

When we incorporate scientific evidence—checking published literature, understanding drug mechanisms, and considering the patient's treatment history—the ranking changes:

Rank	Trial	ClinTrialFinder Score	TrialGPT Rank
#1	NCT01269853 - Bevacizumab Intra-arterial	75	was #2
#2	NCT06264388 - DB107 Gene Therapy	72	was #3
#3	NCT02800486 - Cetuximab Intra-arterial	70	was #7
#4	NCT04541082 - ONC206 Phase I	65	was #5

Result: The top 4 recommended trials are all genuinely good options with novel mechanisms. Notably, the diagnostic study (NCT06241391) that TrialGPT ranked #1 is correctly excluded from recommendations. And the Cetuximab trial (#7 by TrialGPT) was promoted to #3 by evidence analysis.

Why These Trials Rank Higher

#1: NCT01269853 — Bevacizumab Intra-arterial

TrialGPT: 84 ClinTrialFinder: 75

Why it's promising: While the patient previously received systemic bevacizumab (Avastin), this trial uses a novel intra-arterial delivery method—injecting directly into cerebral arteries. This approach can achieve much higher drug concentrations at the tumor site than systemic administration.

Evidence analysis recognized this as a meaningfully different approach worth considering.

#3: NCT02800486 — Cetuximab Intra-arterial

TrialGPT: 51 ClinTrialFinder: 70

Why TrialGPT scored it low: The trial requires "pathology confirmed EGFR overexpression" (not mentioned in patient record) and "Karnofsky ≥60%" (patient has severe weakness and urinary retention, suggesting lower performance status). These eligibility concerns dropped the score to 51.

Why evidence analysis boosted it: The treatment mechanism—EGFR-targeted therapy via intra-arterial delivery with re-irradiation—has moderate evidence supporting efficacy in gliomas. For a patient who has exhausted standard options, this novel approach may be worth discussing with their doctor, even if eligibility requires clarification.

This illustrates how the two scores complement each other: TrialGPT flags real eligibility concerns, while evidence analysis recognizes therapeutic potential.

Summary: How Evidence Changed Rankings

Metric	TrialGPT-Only	Evidence-Informed
#1 ranked trial is worth considering	No (diagnostic study)	Yes (novel therapy)
Diagnostic study correctly filtered	No (ranked #1)	Yes (score dropped to 35)
Cetuximab trial rank	#7 (low visibility)	#3 (promoted by evidence)

How Evidence-Informed Ranking Works

Evidence-informed ranking goes beyond "does this patient match the criteria" to ask:

Is this a treatment trial? Diagnostic, observational, and sample-collection studies are filtered appropriately.
Has the patient already tried this approach? Offering failed drug classes again is deprioritized.
What does the literature say? Drugs with promising efficacy data in similar tumors are prioritized.
Is this a novel mechanism? For heavily pretreated patients, new approaches may offer more hope than incremental improvements.
What's the risk/benefit profile? Phase, study design, and known side effects are considered.

Behind the Scores: The Drug Evidence

To show exactly how evidence affects ranking, here's what the system found for each drug when searching PubMed and Perplexity:

Trial	Drug	Evidence Source	Key Finding	Score Impact
NCT06241391	Ga-68 PSMA	PubMed	Diagnostic imaging only—not a treatment. "No direct evidence for treatment efficacy."	86 → 35
NCT01269853	Bevacizumab IA	PubMed	Intra-arterial delivery achieves higher tumor concentrations than systemic. Novel delivery method despite prior Avastin use.	84 → 75
NCT02800486	Cetuximab IA	Perplexity	Moderate evidence for EGFR-targeted therapy in gliomas. Innovative intra-arterial delivery with re-irradiation.	51 → 70
NCT06264388	DB107-RRV + FC	Perplexity	Gene therapy targeting DGM7+ patients (20-30% of HGG). Limited efficacy data available.	81 → 72
NCT04541082	ONC206	Perplexity	No clinical trial results for anaplastic astrocytoma. Phase I ongoing, preclinical promise only.	79 → 65
NCT05698524	Temodar + Abexinostat	Perplexity	Novel HDAC inhibitor combination, but patient already failed Temozolomide. Moderate novelty.	80 → 62

            The pattern: Evidence analysis doesn't just check if a drug exists—it asks "is this drug likely to help this specific patient?" A diagnostic tracer, a failed drug, and a preclinical compound all look different through the lens of published evidence.
        

Implications for Clinical Trial Matching

This case study illustrates a fundamental limitation of matching systems that rely solely on relevance and eligibility criteria:

TrialGPT answers: "Is this trial relevant, and could this patient enroll?"

Evidence-informed ranking answers: "Is this trial worth considering for this patient?"

For patients with complex treatment histories—especially in oncology—the difference is critical. The "best" trial isn't always the one with the highest eligibility match. It's the one most worth considering given the patient's specific situation.

Conclusion

AI-powered eligibility screening is a major advance. But stopping there leaves patients with a list of "maybe eligible" trials without guidance on which ones actually make sense for their situation.

Evidence-informed ranking closes this gap by incorporating scientific literature, treatment mechanisms, and clinical context. In our test case, it surfaced novel therapies as promising options—trials that eligibility-only ranking had ranked lower.

For cancer patients searching for clinical trials, the question shouldn't just be "what am I eligible for?" It should be "what's worth considering given my situation?"

That's the question evidence-informed matching tries to answer.

This analysis was conducted using ClinTrialFinder, an evidence-informed clinical trial matching tool. The patient case is from the TREC 2021 Clinical Trials benchmark dataset.

Try ClinTrialFinder

← Back to Blog