Journal articles

Identification of relations between risk factors and their pathologies or health conditions by mining scientific literature.

Abstract : Risk factor discovery and prevention is an active research field within the biomedical domain. Although abundant information on risk factors exists in bibliographical databases and on several websites, accessing it may be difficult. Methods from Natural Language Processing and Information Extraction can make this information easier to access. Specifically, we present a procedure for analyzing massive amounts of scientific literature and for detecting linguistically marked associations between pathologies and risk factors. This approach allowed us to extract over 22,000 risk factors and their associated pathologies. The evaluations performed showed that (1) over 88% of the risk factors extracted for coronary heart disease are correct, (2) the associated pathologies, when they could be compared to MeSH indexing, are correct in about 70% of cases, and (3) links between risk factors and their pathologies are seldom recorded in existing terminologies.

https://hal-riip.archives-ouvertes.fr/pasteur-00606238
Contributor : Mariella Botta
Submitted on : Tuesday, July 5, 2011 - 5:45:18 PM
Last modification on : Wednesday, August 19, 2020 - 11:18:03 AM
Long-term archiving on : Sunday, December 4, 2016 - 4:56:26 AM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed indefinitely.


Citation

Thierry Hamon, Martin Graña, Víctor Raggio, Natalia Grabar, Hugo Naya. Identification of relations between risk factors and their pathologies or health conditions by mining scientific literature. Studies in Health Technology and Informatics, IOS Press, 2010, 160 (Pt 2), pp.964-8. ⟨10.1111/j.1567-1364.2008.00361.x⟩. ⟨pasteur-00606238⟩
