Medical and Legal Domains

Natural Language Processing techniques usually need of some kind of adaptation when they are used in specific domains such as the medical and legal domains.

In the health domain we started collaborating in 2010 with the Galdakao-Usansolo hospital with the aim of improving the encoding of their health records with the International Classification of Diseases (ICD). Thereafter, and always in order to benefit patients' care we have worked on different areas: detection of adverse drug reactions, recognition of clinical named entities (drug, disorder, body-part names...) and lately in electronic health record (EHR) machine translation. We have mainly worked with EHRs in Spanish but have also made use of medical corpora in English, and what is very important for us, we made important steps bringing Basque to the health services. In this sense, we automatically translated the huge medical lexicon named SNOMED-CT from English into Basque and we obtained our first results in neural machine translation between Basque and Spanish.

In collaboration with some hospitals from the Basque Sanitary System (Osakidetza) we worked in the OSAKU, DETEAMI, EXTRECM and PROSAMED research projects and we achieved a contract from Osakidetza to automatically translate ICD-10 to Basque.

 

In the legal domain we were contracted by Minsait to collaborate in Prótagoras, an internal project that aims to build NLU system for Spanish. We focused in the legal domain, in which the back-office generates huge amount of administrative documentation that needs to be stored and organized very fast to enable an efficient decision-taking. 

In the project we presented a solution for a real case solution of information extraction of notarial deeds, and fullfill the objective of alleviating the costly manual effort. Thus, without using any labeled data, we developed systems that are able to extracts the main attributes of the properties described in the documents (locality, street name, property number, and the price, among others) as well as the information about the owners of the property (e.g. names and surnames, ownership type, etc.).

Pages