Ecole d'ingénieur et centre de recherche en Sciences du numérique

PaMPa-HD: a parallel MapReduce-based frequent pattern miner for high-dimensional data

Apiletti, Daniele; Baralis, Elena; Cerquitelli, Tania; Garza, Paolo; Pulvirenti, Fabio

HDM 2015, 3rd International Workshop on High Dimensional Data Mining, In conjunction with the IEEE International Conference on Data Mining (IEEE ICDM 2015), 14 November 2015, Atlantic City, NJ, USA

Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and provides the ability to discover hidden correlations in transactional datasets. The explosion of Big Data is leading to new parallel and distributed approaches. Unfortunately, most of them are designed to cope with low-dimensional datasets, whereas no distributed highdimensional frequent closed itemset mining algorithms exists. This work introduces PaMPa-HD, a parallel MapReduce-based frequent closed itemset mining algorithm for high-dimensional datasets, based on Carpenter. The experimental results, performed on both real and synthetic datasets, show the efficiency and scalability of PaMPa-HD.

Document Bibtex

Titre:PaMPa-HD: a parallel MapReduce-based frequent pattern miner for high-dimensional data
Ville:Atlantic City
Département:Data Science
Eurecom ref:4748
Copyright: © 2015 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Bibtex: @inproceedings{EURECOM+4748, year = {2015}, title = {{P}a{MP}a-{HD}: a parallel {M}ap{R}educe-based frequent pattern miner for high-dimensional data}, author = {{A}piletti, {D}aniele and {B}aralis, {E}lena and {C}erquitelli, {T}ania and {G}arza, {P}aolo and {P}ulvirenti, {F}abio}, booktitle = {{HDM} 2015, 3rd {I}nternational {W}orkshop on {H}igh {D}imensional {D}ata {M}ining, {I}n conjunction with the {IEEE} {I}nternational {C}onference on {D}ata {M}ining ({IEEE} {ICDM} 2015), 14 {N}ovember 2015, {A}tlantic {C}ity, {NJ}, {USA} }, address = {{A}tlantic {C}ity, {\'{E}}{TATS}-{UNIS}}, month = {11}, url = {} }
Voir aussi: