Publication:
The Two-Stage Algorithm for Extraction of the Significant Pharmaceutical Named Entities and Their Relations in the Russian-Language Reviews on Medications on Base of the XLM-RoBERTa Language Model

dc.contributor.authorMoloshnikov, I.
dc.contributor.authorSelivanov, A.
dc.contributor.authorRylkov, G.
dc.contributor.authorRybka, R.
dc.contributor.authorSboev, A.
dc.contributor.authorСбоев, Александр Георгиевич
dc.date.accessioned2024-12-26T10:53:03Z
dc.date.available2024-12-26T10:53:03Z
dc.date.issued2022
dc.description.abstract© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.The Internet contains a large amount of heterogeneous information, the extraction and structuring of which is currently a relevant task. This is especially relevant for tasks of social importance, in particular the analysis of the experience of using pharmaceutical products. In this paper, we propose a two-step sequential algorithm for extracting named entities and the relationships between them. Its creation was made possible by the availability of a marked-up corpus of Internet users’ reviews of medicines (Russian Drug Review Corpus). The basis of the algorithm is the language model XLM-RoBERTa-sag, which is pre-trained on a large corpus of unlabeled texts of reviews. The developed algorithm achieves the accuracy of identifying related entities: 71.6 and relations: 80.5, which is the first estimate of the accuracy of the solution of the considered problem on the Russian-language drug review texts.
dc.format.extentС. 463-471
dc.identifier.citationThe Two-Stage Algorithm for Extraction of the Significant Pharmaceutical Named Entities and Their Relations in the Russian-Language Reviews on Medications on Base of the XLM-RoBERTa Language Model / Moloshnikov, I. [et al.] // Studies in Computational Intelligence. - 2022. - 1032 SCI. - P. 463-471. - 10.1007/978-3-030-96993-6_51
dc.identifier.doi10.1007/978-3-030-96993-6_51
dc.identifier.urihttps://www.doi.org/10.1007/978-3-030-96993-6_51
dc.identifier.urihttps://www.scopus.com/record/display.uri?eid=2-s2.0-85127627366&origin=resultslist
dc.identifier.urihttp://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000833484200051
dc.identifier.urihttps://openrepository.mephi.ru/handle/123456789/28967
dc.relation.ispartofStudies in Computational Intelligence
dc.titleThe Two-Stage Algorithm for Extraction of the Significant Pharmaceutical Named Entities and Their Relations in the Russian-Language Reviews on Medications on Base of the XLM-RoBERTa Language Model
dc.typeConference Paper
dspace.entity.typePublication
oaire.citation.volume1032 SCI
relation.isAuthorOfPublicationfc2d63d7-5260-41ba-a952-0420c8848b13
relation.isAuthorOfPublication.latestForDiscoveryfc2d63d7-5260-41ba-a952-0420c8848b13
relation.isOrgUnitOfPublicationba0b4738-e6bd-4285-bda5-16ab2240dbd1
relation.isOrgUnitOfPublication.latestForDiscoveryba0b4738-e6bd-4285-bda5-16ab2240dbd1
Файлы
Коллекции