A textual dataset of de-identified health records in Spanish and Catalan for medical entity recognition and anonymization
Advancing clinical natural language processing systems is crucial to exploiting the wealth of textual data contained in medical records. Diverse data sources, in different languages and from different sites, are required to represent global health services...