Loading...
Loading...

Go to the content (press return)

Building a Spanish/Catalan health records corpus with very sparse protected information labelled

Author
Medina, S.; Turmo, J.
Type of activity
Presentation of work at congresses
Name of edition
11th International Conference on Language Resources and Evaluation
Date of publication
2018
Presentation's date
2018-05-07
Book of congress proceedings
LREC 2018: Workshop MultilingualBIO: Multilingual Biomedical Text Processing: proceedings
First page
1
Last page
7
Project funding
Semantic graph extraction from textual health histories
Repository
http://hdl.handle.net/2117/124710 Open in new window
URL
http://www.elra.info/en/ Open in new window
Abstract
Electronic Health Records (EHR) are an important resource for the research and study of diseases, treatments and symptoms. However, due to data protection laws, information that could potentially compromise privacy must be anonymized before making use of them. Thus, the identification of these pieces of information is mandatory. This identification is usually performed by linguistic models built from EHRs corpora in which Protected Health Information (PHI) has been previously annotated. Neverthe...
Citation
Medina, S., Turmo, J. Building a Spanish/Catalan health records corpus with very sparse protected information labelled. A: International Conference on Language Resources and Evaluation. "LREC 2018: Workshop MultilingualBIO: Multilingual Biomedical Text Processing: proceedings". 2018, p. 1-7.
Keywords
Anonymization, Health Records, Sparse
Group of research
GPLN - Natural Language Processing Group
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications

Participants

Attachments