Extraction of the Relations among Significant Pharmacological Entities in Russian-Language Reviews of Internet Users on Medications

Selivanov, A.; Moloshnikov, I.; Rybka, R.; Gryaznov, A.; Sboev, A.; Сбоев, Александр Георгиевич

Date

2022

Authors

Selivanov, A.

Moloshnikov, I.

Rybka, R.

Gryaznov, A.

Sboev, A.

Сбоев, Александр Георгиевич

Organizational units

Organizational unit

Institute of Nuclear Physics and Technologies (Институт ядерной физики и технологий, ИЯФиТ)

The institute's goal and development strategy is to create and develop a world-class scientific and educational center in nuclear physics and technologies, radiation materials science, elementary particle physics, astrophysics, and cosmophysics.

Abstract

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. Nowadays, the analysis of digital media aimed at predicting society’s reaction to particular events and processes is a task of great significance. Internet sources contain a large amount of meaningful information for a range of domains, such as marketing, author profiling, social situation analysis, and healthcare. In healthcare, this information is useful for pharmacovigilance purposes, including the re-profiling of medications. Analyzing these sources requires the development of automatic natural language processing methods. These methods, in turn, require text datasets with complex annotation, including information about named entities and the relations between them. As the analysis of the relevant literature shows, Russian-language datasets with annotated entity relations are scarce, and none have existed so far in the medical domain. This paper presents the first Russian-language textual corpus where entities have labels of different contexts within a single text, so that related entities share a common context. Therefore, this corpus is suitable for the task of extracting relations between entities belonging to the medical domain. Our second contribution is a method for the automated extraction of entity relations in Russian-language texts using the XLM-RoBERTa language model additionally pretrained on Russian drug review texts. The proposed method is compared with other machine learning methods to estimate its efficiency. The method yields state-of-the-art accuracy in extracting the following relationship types: ADR–Drugname, Drugname–Diseasename, Drugname–SourceInfoDrug, Diseasename–Indication. As shown on the presented subcorpus of the Russian Drug Review Corpus, the method achieves a mean F1-score of 80.4% (estimated with cross-validation, averaged over the four relationship types). This result is 3.6% higher than that of the existing RuBERT language model and 21.77% higher than that of basic ML classifiers.

Citation

Extraction of the Relations among Significant Pharmacological Entities in Russian-Language Reviews of Internet Users on Medications / Selivanov, A. [et al.] // Big Data and Cognitive Computing. - 2022. - 6. - № 1. - 10.3390/bdcc6010010

URI

https://www.doi.org/10.3390/bdcc6010010
https://www.scopus.com/record/display.uri?eid=2-s2.0-85123786018&origin=resultslist
http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000776828400001
https://openrepository.mephi.ru/handle/123456789/28677

Collections

Publications
