Multisource Speech Analysis for Speaker Recognition

Sorokin, V. N.; Leonov, A. S.; Леонов, Александр Сергеевич

dc.contributor.author	Sorokin, V. N.
dc.contributor.author	Leonov, A. S.
dc.contributor.author	Леонов, Александр Сергеевич
dc.date.accessioned	2024-11-21T08:15:18Z
dc.date.available	2024-11-21T08:15:18Z
dc.date.issued	2019
dc.description.abstract	© 2019, Pleiades Publishing, Ltd. Speaker recognition performance is compared across several voice-source models on a comprehensive speech database. Inverse problems of recovering the source from vowel segments of speech are solved on the basis of a special speech-production model and a set of voice-source models (A-source, piecewise-linear source, nonparametric source, and a source found by the spectral relation method). In the first stage, we select the pulses for which the relative residual between the segmented pulse and its theoretical analog, computed with the speech-production model, is less than 0.25. For the selected pulses, a posteriori estimates of the determination error are computed and the final selection is performed: only pulses whose a posteriori error estimate is below the accepted level of 0.3 are kept for recognition. In the parameter space of each source model, a statistical model is built for each speaker and recognition is performed. For speaker recognition based on a single vowel, the mean error is approximately 66% for the piecewise-linear source, 61% for the spectral relation method, and 33% for the A-source.
dc.format.extent	pp. 181-193
dc.identifier.citation	Sorokin, V. N. Multisource Speech Analysis for Speaker Recognition / Sorokin, V.N., Leonov, A.S. // Pattern Recognition and Image Analysis. - 2019. - 29. - № 1. - P. 181-193. - 10.1134/S1054661818040260
dc.identifier.doi	10.1134/S1054661818040260
dc.identifier.uri	https://www.doi.org/10.1134/S1054661818040260
dc.identifier.uri	https://www.scopus.com/record/display.uri?eid=2-s2.0-85065019248&origin=resultslist
dc.identifier.uri	http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000705650100018
dc.identifier.uri	https://openrepository.mephi.ru/handle/123456789/17907
dc.relation.ispartof	Pattern Recognition and Image Analysis
dc.title	Multisource Speech Analysis for Speaker Recognition
dc.type	Article
dspace.entity.type	Publication
oaire.citation.issue	1
oaire.citation.volume	29
relation.isAuthorOfPublication	c6afce80-8607-4201-989c-678043775779
relation.isAuthorOfPublication.latestForDiscovery	c6afce80-8607-4201-989c-678043775779
relation.isOrgUnitOfPublication	d19559ab-04cd-486a-ae8e-f40ccd36a1a6
relation.isOrgUnitOfPublication.latestForDiscovery	d19559ab-04cd-486a-ae8e-f40ccd36a1a6
