Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.rights.licenseReconocimiento 4.0 Internacional. (CC BY)-
dc.contributor.authorRolando, Matíases
dc.contributor.authorRaggio, Victores
dc.contributor.authorNaya, Hugoes
dc.contributor.authorCagnina, Leticiaes
dc.contributor.authorSpangenberg, Lucíaes
dc.date.accessioned2025-06-11T17:24:40Z-
dc.date.available2025-06-11T17:24:40Z-
dc.date.issued2025-02-
dc.identifier.urihttps://hdl.handle.net/20.500.12381/4061-
dc.description.abstractRare diseases (RDs) are a group of pathologies that individually affect less than 1 in 2000 people but collectively impact around 7% of the world's population. Most of them affect children, are chronic and progressive, and have no specific treatment. RD patients face diagnostic challenges, with an average diagnosis time of 5 years, multiple specialist visits, and invasive procedures. This 'diagnostic odyssey' can be detrimental to their health. Machine learning (ML) has the potential to improve healthcare by providing more personalized and accurate patient management, diagnoses, and in some cases, treatments. Leveraging the MIMIC-III database and additional medical notes from different sources such as in-house data, PubMed and chatGPT, we propose a labeled dataset for early RD detection in hospital settings. Applying various supervised ML methods, including logistic regression, decision trees, support vector machine (SVM), deep learning methods (LSTM and CNN), and Transformers (BERT), we validated the use of the proposed resource, achieving 92.7% F-measure and a 96% AUC using SVM. These findings highlight the potential of ML in redirecting RD patients towards more accurate diagnostic pathways and presents a corpus that can be used for future development and refinements.es
dc.description.sponsorshipAgencia Nacional de Investigación e Innovaciónes
dc.language.isoenges
dc.publisherNature Portfolioes
dc.rightsAcceso abierto*
dc.sourceScientific Reportses
dc.subjectaprendizaje automáticoes
dc.subjecthistorias clínicases
dc.titleA labeled medical records corpus for the timely detection of rare diseases using machine learning approacheses
dc.typeArtículoes
dc.subject.aniiCiencias Naturales y Exactas
dc.subject.aniiCiencias de la Computación e Información
dc.subject.aniiCiencias de la Información y Bioinformática
dc.identifier.aniiFSS_X_2022_1_173209es
dc.type.versionPublicadoes
dc.identifier.doi10.1038/s41598-025-90450-0-
dc.anii.institucionresponsableInstitut Pasteur de Montevideoes
dc.anii.subjectcompleto//Ciencias Naturales y Exactas/Ciencias de la Computación e Información/Ciencias de la Información y Bioinformáticaes
Aparece en las colecciones: Institut Pasteur de Montevideo

Archivos en este ítem:
archivo  Descripción Tamaño Formato
41598_2025_Article_90450.pdfDescargar articulo1.5 MBAdobe PDF

Las obras en REDI están protegidas por licencias Creative Commons.
Por más información sobre los términos de esta publicación, visita: Reconocimiento 4.0 Internacional. (CC BY)