6 research outputs found
Development of a Structured Query Language and Natural Language Processing Algorithm to Identify Lung Nodules in a Cancer Centre
Importance: The stratification of indeterminate lung nodules is a growing problem, but the burden of lung nodules on healthcare services is not well-described. Manual service evaluation and research cohort curation can be time-consuming and potentially improved by automation.Objective: To automate lung nodule identification in a tertiary cancer centre.Methods: This retrospective cohort study used Electronic Healthcare Records to identify CT reports generated between 31st October 2011 and 24th July 2020. A structured query language/natural language processing tool was developed to classify reports according to lung nodule status. Performance was externally validated. Sentences were used to train machine-learning classifiers to predict concerning nodule features in 2,000 patients.Results: 14,586 patients with lung nodules were identified. The cancer types most commonly associated with lung nodules were lung (39%), neuro-endocrine (38%), skin (35%), colorectal (33%) and sarcoma (33%). Lung nodule patients had a greater proportion of metastatic diagnoses (45 vs. 23%, p &lt; 0.001), a higher mean post-baseline scan number (6.56 vs. 1.93, p &lt; 0.001), and a shorter mean scan interval (4.1 vs. 5.9 months, p &lt; 0.001) than those without nodules. Inter-observer agreement for sentence classification was 0.94 internally and 0.98 externally. Sensitivity and specificity for nodule identification were 93 and 99% internally, and 100 and 100% at external validation, respectively. A linear-support vector machine model predicted concerning sentence features with 94% accuracy.Conclusion: We have developed and validated an accurate tool for automated lung nodule identification that is valuable for service evaluation and research data acquisition.</jats:p
Recovering from COVID-19: lessons learnt from an intensive secondary care follow-up service.
In response to the first COVID-19 surge in 2020, secondary care outpatient services were rapidly reconfigured to provide specialist review for disease sequelae. At our institution, comprising hospitals across three sites in London, we initially implemented a COVID-19 follow-up pathway that was in line with expert opinion at the time but more intensive than initial clinical guidelines suggested. We retrospectively evaluated the resource requirements for this service, which supported 526 patients from April 2020 to October 2020. At the 6-week review, 193/403 (47.9%) patients reported persistent breathlessness, 46/336 (13.7%) desaturated on exercise testing, 167/403 (41.4%) were discharged from COVID-19-related secondary care services and 190/403 (47.1%) needed 12-week follow-up. At the 12-week review, 113/309 (36.6%) patients reported persistent breathlessness, 30/266 (11.3%) desaturated on exercise testing and 150/309 (48.5%) were discharged from COVID-19-related secondary care services. Referrals were generated to multiple medical specialties, particularly respiratory subspecialties. Our analysis allowed us to justify rationalising and streamlining provisions for subsequent COVID-19 waves while reassured that opportunities for early intervention were not being missed
Data_Sheet_1_Development of a Structured Query Language and Natural Language Processing Algorithm to Identify Lung Nodules in a Cancer Centre.PDF
Importance: The stratification of indeterminate lung nodules is a growing problem, but the burden of lung nodules on healthcare services is not well-described. Manual service evaluation and research cohort curation can be time-consuming and potentially improved by automation.Objective: To automate lung nodule identification in a tertiary cancer centre.Methods: This retrospective cohort study used Electronic Healthcare Records to identify CT reports generated between 31st October 2011 and 24th July 2020. A structured query language/natural language processing tool was developed to classify reports according to lung nodule status. Performance was externally validated. Sentences were used to train machine-learning classifiers to predict concerning nodule features in 2,000 patients.Results: 14,586 patients with lung nodules were identified. The cancer types most commonly associated with lung nodules were lung (39%), neuro-endocrine (38%), skin (35%), colorectal (33%) and sarcoma (33%). Lung nodule patients had a greater proportion of metastatic diagnoses (45 vs. 23%, p Conclusion: We have developed and validated an accurate tool for automated lung nodule identification that is valuable for service evaluation and research data acquisition.</p
A radiomics-based decision support tool improves lung cancer diagnosis in combination with the Herder score in large lung nodules
BACKGROUND: Large lung nodules (≥15 mm) have the highest risk of malignancy, and may exhibit important differences in phenotypic or clinical characteristics to their smaller counterparts. Existing risk models do not stratify large nodules well. We aimed to develop and validate an integrated segmentation and classification pipeline, incorporating deep-learning and traditional radiomics, to classify large lung nodules according to cancer risk. METHODS: 502 patients from five U.K. centres were recruited to the large-nodule arm of the retrospective LIBRA study between July 2020 and April 2022. 838 CT scans were used for model development, split into training and test sets (70% and 30% respectively). An nnUNet model was trained to automate lung nodule segmentation. A radiomics signature was developed to classify nodules according to malignancy risk. Performance of the radiomics model, termed the large-nodule radiomics predictive vector (LN-RPV), was compared to three radiologists and the Brock and Herder scores. FINDINGS: 499 patients had technically evaluable scans (mean age 69 ± 11, 257 men, 242 women). In the test set of 252 scans, the nnUNet achieved a DICE score of 0.86, and the LN-RPV achieved an AUC of 0.83 (95% CI 0.77–0.88) for malignancy classification. Performance was higher than the median radiologist (AUC 0.75 [95% CI 0.70–0.81], DeLong p = 0.03). LN-RPV was robust to auto-segmentation (ICC 0.94). For baseline solid nodules in the test set (117 patients), LN-RPV had an AUC of 0.87 (95% CI 0.80–0.93) compared to 0.67 (95% CI 0.55–0.76, DeLong p = 0.002) for the Brock score and 0.83 (95% CI 0.75–0.90, DeLong p = 0.4) for the Herder score. In the international external test set (n = 151), LN-RPV maintained an AUC of 0.75 (95% CI 0.63–0.85). 18 out of 22 (82%) malignant nodules in the Herder 10–70% category in the test set were identified as high risk by the decision-support tool, and may have been referred for earlier intervention. INTERPRETATION: The model accurately segments and classifies large lung nodules, and may improve upon existing clinical models. FUNDING: This project represents independent research funded by: 1) Royal Marsden Partners Cancer Alliance, 2) the 10.13039/100016916Royal Marsden Cancer Charity, 3) the 10.13039/501100000272National Institute for Health Research (NIHR) Biomedical Research Centre at the Royal Marsden NHS Foundation Trust and The Institute of Cancer Research, London, 4) the 10.13039/501100000272National Institute for Health Research (NIHR) Biomedical Research Centre at 10.13039/501100000761Imperial College London, 5) 10.13039/501100000289Cancer Research UK (C309/A31316)
Symptomatic, biochemical and radiographic recovery in patients with COVID-19
BackgroundThe symptoms, radiography, biochemistry and healthcare utilisation of patients with COVID-19 following discharge from hospital have not been well described.MethodsRetrospective analysis of 401 adult patients attending a clinic following an index hospital admission or emergency department attendance with COVID-19. Regression models were used to assess the association between characteristics and persistent abnormal chest radiographs or breathlessness.Results75.1% of patients were symptomatic at a median of 53 days post discharge and 72 days after symptom onset and chest radiographs were abnormal in 47.4%. Symptoms and radiographic abnormalities were similar in PCR-positive and PCR-negative patients. Severity of COVID-19 was significantly associated with persistent radiographic abnormalities and breathlessness. 18.5% of patients had unscheduled healthcare visits in the 30 days post discharge.ConclusionsPatients with COVID-19 experience persistent symptoms and abnormal blood biomarkers with a gradual resolution of radiological abnormalities over time. These findings can inform patients and clinicians about expected recovery times and plan services for follow-up of patients with COVID-19.</jats:sec
