Many distributed systems face the problem of load imbalance between machines. With the advent of Big Data, large datasets whose values are often highly skewed are produced by heterogeneous sources to be often processed in real time. Thus, it is necessary to be able to adapt to the variations of size/content/source of the incoming data. In this...
-
September 23, 2015 (v1)PublicationUploaded on: March 25, 2023
-
December 9, 2014 (v1)Conference paper
Storing highly skewed data in a distributed system has become a very frequent issue, in particular with the emergence of semantic web and Big Data. This often leads to biased data dissemination among nodes. Addressing load imbalance is necessary, especially to minimize response time and avoid workload being handled by only one or few nodes. Our...
Uploaded on: March 25, 2023 -
2018 (v1)Journal article
Hash functions are at the heart of data insertion and retrieval in DHT-based overlays. However, a standard hash function destroys the natural ordering of data. To perform efficient range queries processing, more and more systems opt for an order-preserving hash function to place data. Unlike a standard hash function, this technique cannot...
Uploaded on: December 4, 2022 -
2017 (v1)Journal article
Hash functions are at the heart of data insertion and retrieval in DHT-based overlays. However, a standard hash function destroys the natural ordering of data. To perform efficient range queries processing, in a minimum number of hops, more and more systems opt for an order-preserving hash function to place data. Unlike a standard hash...
Uploaded on: February 28, 2023 -
June 25, 2012 (v1)Conference paper
Stocker des informations du web sémantique implique d'être capable de pouvoir potentiellement gérer de très importants volumes de données. D'où le besoin d'opter pour une solution forcément distribuée, entre autres de type pair-à-pair, pour pouvoir passer à l'échelle. Un système de stockage RDF réparti requiert de mettre en place un algorithme...
Uploaded on: December 3, 2022 -
June 25, 2012 (v1)Conference paper
Stocker des informations du web sémantique implique d'être capable de pouvoir potentiellement gérer de très importants volumes de données. D'où le besoin d'opter pour une solution forcément distribuée, entre autres de type pair-à-pair, pour pouvoir passer à l'échelle. Un système de stockage RDF réparti requiert de mettre en place un algorithme...
Uploaded on: October 11, 2023 -
October 22, 2014 (v1)Conference paper
Real world datasets are known to be highly skewed, often leading to an important load imbalance issue for distributed systems managing them. To address this issue, there exist almost as many load balancing strategies as there are different systems. When designing a scalable distributed system geared towards handling large amounts of...
Uploaded on: March 25, 2023 -
August 14, 2015 (v1)Journal article
Distributed systems for big data management very often face the problem of load imbalance among nodes. To address this issue, there exist almost as many load balancing strategies as there are different systems. When designing a scalable distributed system geared towards handling large amounts of information, it is often not so easy to...
Uploaded on: February 28, 2023 -
July 11, 2014 (v1)Report
Many structured Peer-to-Peer systems for data management face the problem of load imbalance. To address this issue, there exist almost as many load balancing strategies as there are different systems. Besides, the proposed solutions are often coupled to their own API, making it difficult to port a scheme from a system to another. In this...
Uploaded on: March 25, 2023