TY - GEN
T1 - The price of clustering in bin-packing with applications to bin-packing with delays
AU - Azar, Yossi
AU - Van Stee, Rob
AU - Emek, Yuval
AU - Vainstein, Danny
N1 - Publisher Copyright:
© 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2019/6/17
Y1 - 2019/6/17
N2 - One of the most significant algorithmic challenges in the “big data era” is handling instances that are too large to be processed by a single machine. The common practice in this regard is to partition the massive problem instance into smaller ones and process each one of them separately. In some cases, the solutions for the smaller instances are later on assembled into a solution for the whole instance, but in many cases this last stage cannot be pursued (e.g., because it is too costly, because of locality issues, or due to privacy considerations). Motivated by this phenomenon, we consider the following natural combinatorial question: Given a bin-packing instance (namely, a set of items with sizes in (0, 1] that should be packed into unit capacity bins) I and a partition {Ii}i of I into clusters, how large is the ratio Íi Opt(Ii)/Opt(I), where Opt(J) denotes the optimal number of bins into which the items in J can be packed? In this paper, we investigate the supremum of this ratio over all instances I and partitions {Ii}i, referred to as the bin-packing price of clustering (PoC). It is trivial to observe that if each cluster contains only one tiny item (and hence, Opt(Ii) = 1), then the PoC is unbounded. On the other hand, a relatively straightforward argument shows that under the constraint that Opt(Ii) ≥ 2, the PoC is 2. Our main challenge was to determine whether the PoC drops below 2 when Opt(Ii) > 2. In addition, one may hope that limk→∞ PoC(k) = 1, where PoC(k) denotes the PoC under the restriction to clusters Ii with Opt(Ii) ≥ k. We resolve the former question affirmatively and the latter one negatively: Our main results are that PoC(k) ≤ 1.951 for any k ≥ 3 and limk→∞ PoC(k) = 1.691 . . . Moreover, the former bound cannot be significantly improved as PoC(3) > 1.933. In addition to the immediate contribution of this combinatorial result to “big data” kind of applications, it turns out that it is useful also for an interesting online problem called bin-packing with delays.
AB - One of the most significant algorithmic challenges in the “big data era” is handling instances that are too large to be processed by a single machine. The common practice in this regard is to partition the massive problem instance into smaller ones and process each one of them separately. In some cases, the solutions for the smaller instances are later on assembled into a solution for the whole instance, but in many cases this last stage cannot be pursued (e.g., because it is too costly, because of locality issues, or due to privacy considerations). Motivated by this phenomenon, we consider the following natural combinatorial question: Given a bin-packing instance (namely, a set of items with sizes in (0, 1] that should be packed into unit capacity bins) I and a partition {Ii}i of I into clusters, how large is the ratio Íi Opt(Ii)/Opt(I), where Opt(J) denotes the optimal number of bins into which the items in J can be packed? In this paper, we investigate the supremum of this ratio over all instances I and partitions {Ii}i, referred to as the bin-packing price of clustering (PoC). It is trivial to observe that if each cluster contains only one tiny item (and hence, Opt(Ii) = 1), then the PoC is unbounded. On the other hand, a relatively straightforward argument shows that under the constraint that Opt(Ii) ≥ 2, the PoC is 2. Our main challenge was to determine whether the PoC drops below 2 when Opt(Ii) > 2. In addition, one may hope that limk→∞ PoC(k) = 1, where PoC(k) denotes the PoC under the restriction to clusters Ii with Opt(Ii) ≥ k. We resolve the former question affirmatively and the latter one negatively: Our main results are that PoC(k) ≤ 1.951 for any k ≥ 3 and limk→∞ PoC(k) = 1.691 . . . Moreover, the former bound cannot be significantly improved as PoC(3) > 1.933. In addition to the immediate contribution of this combinatorial result to “big data” kind of applications, it turns out that it is useful also for an interesting online problem called bin-packing with delays.
KW - Bin packing
KW - Online algorithms
KW - Price of clustering
UR - http://www.scopus.com/inward/record.url?scp=85068669644&partnerID=8YFLogxK
U2 - 10.1145/3323165.3323180
DO - 10.1145/3323165.3323180
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85068669644
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 1
EP - 10
BT - SPAA 2019 - Proceedings of the 31st ACM Symposium on Parallelism in Algorithms and Architectures
T2 - 31st ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2019
Y2 - 22 June 2019 through 24 June 2019
ER -