Publication:
Efficient Exact Algorithm for Count Distinct Problem

dc.contributor.authorGolov, N.
dc.contributor.authorBruskin, S.
dc.contributor.authorFilatov, A.
dc.date.accessioned2024-11-21T11:35:16Z
dc.date.available2024-11-21T11:35:16Z
dc.date.issued2019
dc.description.abstract© 2019, Springer Nature Switzerland AG.This paper describes and analyses optimization approaches, which make possible the exact calculation of millions of hierarchical count distinct measures over hundreds of billions data rows. Described approach evolved for several years, in parallel with the growth of tasks from a fast growing internet company, and was finally implemented as a PEAPM (Pipelined Exact Accumulation for Paralleled Measures) algorithm. Current version of an algorithm outputs exact values (not estimates), works in a single thread, in minutes using a general commodity hardware, and requires volume of RAM equal to the doubled size of required measures.
dc.format.extentС. 67-77
dc.identifier.citationGolov, N. Efficient Exact Algorithm for Count Distinct Problem / Golov, N., Bruskin, S., Filatov, A. // Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - 2019. - 11661 LNCS. - P. 67-77. - 10.1007/978-3-030-26831-2_5
dc.identifier.doi10.1007/978-3-030-26831-2_5
dc.identifier.urihttps://www.doi.org/10.1007/978-3-030-26831-2_5
dc.identifier.urihttps://www.scopus.com/record/display.uri?eid=2-s2.0-85071414798&origin=resultslist
dc.identifier.urihttp://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=Alerting&SrcApp=Alerting&DestApp=WOS_CPL&DestLinkType=FullRecord&UT=WOS:000555272600005
dc.identifier.urihttps://openrepository.mephi.ru/handle/123456789/18634
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.titleEfficient Exact Algorithm for Count Distinct Problem
dc.typeConference Paper
dspace.entity.typePublication
oaire.citation.volume11661 LNCS
Файлы
Коллекции