369 research outputs found
Improving the Prognostic Ability through Better Use of Standard Clinical Data - The Nottingham Prognostic Index as an Example
Background Prognostic factors and prognostic models play a key role in medical
research and patient management. The Nottingham Prognostic Index (NPI) is a
well-established prognostic classification scheme for patients with breast
cancer. In a very simple way, it combines the information from tumor size,
lymph node stage and tumor grade. For the resulting index cutpoints are
proposed to classify it into three to six groups with different prognosis. As
not all prognostic information from the three and other standard factors is
used, we will consider improvement of the prognostic ability using suitable
analysis approaches. Methods and Findings Reanalyzing overall survival data of
1560 patients from a clinical database by using multivariable fractional
polynomials and further modern statistical methods we illustrate suitable
multivariable modelling and methods to derive and assess the prognostic
ability of an index. Using a REMARK type profile we summarize relevant steps
of the analysis. Adding the information from hormonal receptor status and
using the full information from the three NPI components, specifically
concerning the number of positive lymph nodes, an extended NPI with improved
prognostic ability is derived. Conclusions The prognostic ability of even one
of the best established prognostic index in medicine can be improved by using
suitable statistical methodology to extract the full information from standard
clinical data. This extended version of the NPI can serve as a benchmark to
assess the added value of new information, ranging from a new single clinical
marker to a derived index from omics data. An established benchmark would also
help to harmonize the statistical analyses of such studies and protect against
the propagation of many false promises concerning the prognostic value of new
measurements. Statistical methods used are generally available and can be used
for similar analyses in other diseases
Return to work after COVID-19 infection – A Danish nationwide registry study
OBJECTIVES: This study aimed to explore return to work after COVID-19 and how disease severity affects this. STUDY DESIGN: This is a Nationwide Danish registry–based cohort study using a retrospective follow-up design. METHODS: Patients with a first-time positive SARS-CoV-2 polymerase chain reaction test between 1 January 2020 and 30 May 2020, including 18–64 years old, 30-day survivors, and available to the workforce at the time of the first positive test were included. Admission types (i.e. no admission, admission to non–intensive care unit [ICU] department and admission to ICU) and return to work was investigated using Cox regression standardised to the age, sex, comorbidity and education-level distribution of all included subjects with estimates at 3 months from positive test displayed. RESULTS: Among the 7466 patients included in the study, 81.9% (6119/7466) and 98.4% (7344/7466) returned to work within 4 weeks and 6 months, respectively, with 1.5% (109/7466) not returning. Of the patients admitted, 72.1% (627/870) and 92.6% (805/870) returned 1 month and 6 months after admission to the hospital, with 6.6% (58/870) not returning within 6 months. Of patients admitted to the ICU, 36% (9/25) did not return within 6 months. Patients with an admission had a lower chance of return to work 3 months from positive test (relative risk [RR] 0.95, 95% confidence interval [CI] 0.94–0.96), with the lowest chance in patients admitted to an ICU department (RR 0.54, 95% CI 0.35–0.72). Female sex, older age, and comorbidity were associated with a lower chance of returning to work. CONCLUSION: Hospitalised patients with COVID-19 infection have a lower chance of returning to work with potential implications for postinfection follow-up and rehabilitation
Incorporating pathway information into boosting estimation of high-dimensional risk prediction models
Abstract Background There are several techniques for fitting risk prediction models to high-dimensional data, arising from microarrays. However, the biological knowledge about relations between genes is only rarely taken into account. One recent approach incorporates pathway information, available, e.g., from the KEGG database, by augmenting the penalty term in Lasso estimation for continuous response models. Results As an alternative, we extend componentwise likelihood-based boosting techniques for incorporating pathway information into a larger number of model classes, such as generalized linear models and the Cox proportional hazards model for time-to-event data. In contrast to Lasso-like approaches, no further assumptions for explicitly specifying the penalty structure are needed, as pathway information is incorporated by adapting the penalties for single microarray features in the course of the boosting steps. This is shown to result in improved prediction performance when the coefficients of connected genes have opposite sign. The properties of the fitted models resulting from this approach are then investigated in two application examples with microarray survival data. Conclusion The proposed approach results not only in improved prediction performance but also in structurally different model fits. Incorporating pathway information in the suggested way is therefore seen to be beneficial in several ways.</p
Nonlinear joint models for individual dynamic prediction of risk of death using Hamiltonian Monte Carlo: application to metastatic prostate cancer
Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
Cardiovascular Risk Associated with Interactions among Polymorphisms in Genes from the Renin-Angiotensin, Bradykinin, and Fibrinolytic Systems
Vascular fibrinolytic balance is maintained primarily by interplay of tissue plasminogen activator (t-PA) and plasminogen activator inhibitor type 1 (PAI-1). Previous research has shown that polymorphisms in genes from the renin-angiotensin (RA), bradykinin, and fibrinolytic systems affect plasma concentrations of both t-PA and PAI-1 through a set of gene-gene interactions. In the present study, we extend this finding by exploring the effects of polymorphisms in genes from these systems on incident cardiovascular disease, explicitly examining two-way interactions in a large population-based study
Regularized Regression Incorporating Network Information: Simultaneous Estimation of Covariate Coefficients and Connection Signs
We develop an algorithm that incorporates network information into regression settings. It simultaneously estimates the covariate coefficients and the signs of the network connections (i.e. whether the connections are of an activating or of a repressing type). For the coefficient estimation steps an additional penalty is set on top of the lasso penalty, similarly to Li and Li (2008). We develop a fast implementation for the new method based on coordinate descent. Furthermore, we show how the new methods can be applied to time-to-event data. The new method yields good results in simulation studies concerning sensitivity and specificity of non-zero covariate coefficients, estimation of network connection signs, and prediction performance. We also apply the new method to two microarray time-to-event data sets from patients with ovarian cancer and diffuse large B-cell lymphoma. The new method performs very well in both cases. The main application of this new method is of biomedical nature, but it may also be useful in other fields where network data is available
Boosting the discriminatory power of sparse survival models via optimization of the concordance index and stability selection
Background
When constructing new biomarker or gene signature scores for time-to-event outcomes, the underlying aims are to develop a discrimination model that helps to predict whether patients have a poor or good prognosis and to identify the most influential variables for this task. In practice, this is often done fitting Cox models. Those are, however, not necessarily optimal with respect to the resulting discriminatory power and are based on restrictive assumptions. We present a combined approach to automatically select and fit sparse discrimination models for potentially high-dimensional survival data based on boosting a smooth version of the concordance index (C-index). Due to this objective function, the resulting prediction models are optimal with respect to their ability to discriminate between patients with longer and shorter survival times. The gradient boosting algorithm is combined with the stability selection approach to enhance and control its variable selection properties.
Results
The resulting algorithm fits prediction models based on the rankings of the survival times and automatically selects only the most stable predictors. The performance of the approach, which works best for small numbers of informative predictors, is demonstrated in a large scale simulation study: C-index boosting in combination with stability selection is able to identify a small subset of informative predictors from a much larger set of non-informative ones while controlling the per-family error rate. In an application to discover biomarkers for breast cancer patients based on gene expression data, stability selection yielded sparser models and the resulting discriminatory power was higher than with lasso penalized Cox regression models.
Conclusion
The combination of stability selection and C-index boosting can be used to select small numbers of informative biomarkers and to derive new prediction rules that are optimal with respect to their discriminatory power. Stability selection controls the per-family error rate which makes the new approach also appealing from an inferential point of view, as it provides an alternative to classical hypothesis tests for single predictor effects. Due to the shrinkage and variable selection properties of statistical boosting algorithms, the latter tests are typically unfeasible for prediction models fitted by boosting
Prognostic Value of Three Different Methods of MGMT Promoter Methylation Analysis in a Prospective Trial on Newly Diagnosed Glioblastoma
Hypermethylation in the promoter region of the MGMT gene encoding the DNA repair protein O6-methylguanine-DNA methyltransferase is among the most important prognostic factors for patients with glioblastoma and predicts response to treatment with alkylating agents like temozolomide. Hence, the MGMT status is widely determined in most clinical trials and frequently requested in routine diagnostics of glioblastoma. Since various different techniques are available for MGMT promoter methylation analysis, a generally accepted consensus as to the most suitable diagnostic method remains an unmet need. Here, we assessed methylation-specific polymerase chain reaction (MSP) as a qualitative and semi-quantitative method, pyrosequencing (PSQ) as a quantitative method, and methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) as a semi-quantitative method in a series of 35 formalin-fixed, paraffin-embedded glioblastoma tissues derived from patients treated in a prospective clinical phase II trial that tested up-front chemoradiotherapy with dose-intensified temozolomide (UKT-05). Our goal was to determine which of these three diagnostic methods provides the most accurate prediction of progression-free survival (PFS). The MGMT promoter methylation status was assessable by each method in almost all cases (n = 33/35 for MSP; n = 35/35 for PSQ; n = 34/35 for MS-MLPA). We were able to calculate significant cut-points for the continuous methylation signals at each CpG site analysed by PSQ (range, 11.5 to 44.9%) and at one CpG site assessed by MS-MLPA (3.6%) indicating that a dichotomisation of continuous methylation data as a prerequisite for comparative survival analyses is feasible. Our results show that, unlike MS-MLPA, MSP and PSQ provide a significant improvement of predicting PFS compared with established clinical prognostic factors alone (likelihood ratio tests: p<0.001). Conclusively, taking into consideration prognostic value, cost effectiveness and ease of use, we recommend pyrosequencing for analyses of MGMT promoter methylation in high-throughput settings and MSP for clinical routine diagnostics with low sample numbers
Integrating multiple molecular sources into a clinical risk prediction signature by extracting complementary information
- …
