Skip to Main content Skip to Navigation
Journal articles

What to expect from a set of itemsets?

Abstract : Dealing with redundancy is one of the main challenges in frequency based data mining and itemset mining in particular. To tackle this issue in the most objective possible way, we introduce the theoretical bases of a new probabilistic concept: Mutual constrained independence (MCI). Thanks to this notion, we describe a MCI model for the frequencies of all itemsets which is the least binding in terms of model hypotheses defined by the knowledge of the frequencies of some of the itemsets. We provide a method for computing MCI models based on algebraic geometry. We establish the link between MCI models and a class of MaxEnt models which has already known to be used in pattern mining. As such, our research presents further insight on the nature of such models and an entirely novel approach for computing them.
Complete list of metadata
Contributor : Thomas Delacroix-Sadighiyan Connect in order to contact the contributor
Submitted on : Friday, March 4, 2022 - 7:57:36 AM
Last modification on : Saturday, September 24, 2022 - 12:04:06 PM


Files produced by the author(s)



T. Delacroix, Philippe Lenca, S. Lallich. What to expect from a set of itemsets?. Information Sciences, Elsevier, 2022, 593, pp.314-340. ⟨10.1016/j.ins.2021.12.115⟩. ⟨hal-03594213⟩



Record views


Files downloads