Adverse Drug Reaction Concept Normalization in Russian-Language Reviews of Internet Users

Sboev, A.; Rybka, R.; Gryaznov, A.; Moloshnikov, I.; Sboeva, S.; Rylkov, G.; Selivanov, A.; Сбоев, Александр Георгиевич

dc.contributor.author	Sboev, A.
dc.contributor.author	Rybka, R.
dc.contributor.author	Gryaznov, A.
dc.contributor.author	Moloshnikov, I.
dc.contributor.author	Sboeva, S.
dc.contributor.author	Rylkov, G.
dc.contributor.author	Selivanov, A.
dc.contributor.author	Сбоев, Александр Георгиевич
dc.date.accessioned	2024-12-25T12:29:14Z
dc.date.available	2024-12-25T12:29:14Z
dc.date.issued	2022
dc.description.abstract	Mapping pharmaceutically significant entities in natural-language text to standardized terms or concepts is a key task in developing systems for pharmacovigilance, marketing, and the use of drugs outside their scope of application. This work estimates the accuracy of mapping adverse reaction mentions to concepts from the Medical Dictionary for Regulatory Activities (MedDRA), where the mentions are extracted from reviews of pharmaceutical products written by Russian-speaking Internet users (the normalization task). The proposed solution is based on two neural network models: the first encodes concepts, and the second encodes mentions. Both are pre-trained language models, but the second is additionally tuned for the normalization task using both the Russian Drug Reviews (RDRS) corpus and a set of open English-language corpora automatically translated into Russian. This additional tuning increases the accuracy of normalizing adverse drug reaction mentions by 3% on the RDRS corpus. The resulting accuracy of mapping adverse reaction mentions to MedDRA preferred terms in RDRS is 70.9% F1-micro. The paper analyzes the factors that affect accuracy on this task by comparing the RDRS and CSIRO Adverse Drug Event Corpus (CADEC) corpora. It shows that the composition of the MedDRA concepts and the number of examples for each concept play a key role in solving the task. The proposed model shows a comparable accuracy of 87.5% F1-micro on a subsample of the RDRS and CADEC datasets with the same set of MedDRA preferred terms.
dc.identifier.citation	Adverse Drug Reaction Concept Normalization in Russian-Language Reviews of Internet Users / Sboev, A. [et al.] // Big Data and Cognitive Computing. - 2022. - Vol. 6, No. 4. - DOI: 10.3390/bdcc6040145
dc.identifier.doi	10.3390/bdcc6040145
dc.identifier.uri	https://www.doi.org/10.3390/bdcc6040145
dc.identifier.uri	https://www.scopus.com/record/display.uri?eid=2-s2.0-85144594217&origin=resultslist
dc.identifier.uri	http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000900468100001
dc.identifier.uri	https://openrepository.mephi.ru/handle/123456789/27872
dc.relation.ispartof	Big Data and Cognitive Computing
dc.title	Adverse Drug Reaction Concept Normalization in Russian-Language Reviews of Internet Users
dc.type	Article
dspace.entity.type	Publication
oaire.citation.issue	4
oaire.citation.volume	6
relation.isAuthorOfPublication	fc2d63d7-5260-41ba-a952-0420c8848b13
relation.isAuthorOfPublication.latestForDiscovery	fc2d63d7-5260-41ba-a952-0420c8848b13
relation.isOrgUnitOfPublication	ba0b4738-e6bd-4285-bda5-16ab2240dbd1
relation.isOrgUnitOfPublication.latestForDiscovery	ba0b4738-e6bd-4285-bda5-16ab2240dbd1
