mation within token constraints. Longer responses do not necessarily improve accuracy or retrieve better information. As Schulte mentions, we designed our prompts to understand the implications for patient self-education, so our straightforward, broad prompts were appropriate for this intended use.1 Additional prompt engineering might improve results; however, it would not represent the intended use that we were studying.

The best way to improve LLM factuality remains an open question, and prompt engineering is not necessarily the best approach. Prompt design is currently more of an art than a science and can be challenging, time-consuming, and costly.2,3 In the future, automated approaches to normalizing prompts and soft prompting may alleviate some of these issues.4,5 Developing more specialized clinical language models, linking to vetted knowledge sources, and fact-checking across different LLM instances are just a few other potential avenues toward more factual question-answering systems.

Robustness to minor alterations in prompts is a major issue in evaluating these LLMs and poses safety concerns. This is further compounded by a lack of transparency from OpenAI regarding the data and methods used to train and evaluate their models, and by the fact that model weights and settings may be updated without notice when the models are accessed via the browser interface. Transparency and reproducibility will be key to effective and safe implementation. In our study, the LLM was accessed only via the application programming interface,6 allowing greater control over settings and reliable reporting of which model was used. In addition, we make all of our data, including prompts and scores, publicly available, alongside clear definitions of the criteria used in our evaluation.1

We propose that these steps become standard for studies evaluating LLMs, ensuring transparent and reproducible evaluation that will form the foundation for safe clinical implementation. We believe our study contributes to the growing body of research on LLM performance in medicine by investigating both their strengths and their deficiencies. Understanding this totality is the only way to integrate these models into clinical practice meaningfully and safely.