Published 2013
| Version v1
Publication
HFSP: Size-based scheduling for Hadoop
Description
Size-based scheduling with aging has, for long, been recognized as an effective approach to guarantee fairness and near-optimal system response times. We present HFSP, a scheduler introducing this technique to a real, multi-server, complex and widely used system such as Hadoop. Size-based scheduling requires a priori job size information, which is not available in Hadoop: HFSP builds such knowledge by estimating it on-line during job execution. Our experiments, which are based on realistic workloads generated via a standard benchmarking suite, pinpoint at a significant decrease in system response times with respect to the widely used Hadoop Fair scheduler, and show that HFSP is largely tolerant to job size estimation errors. © 2013 IEEE.
Additional details
- URL
- http://hdl.handle.net/11567/1070998
- URN
- urn:oai:iris.unige.it:11567/1070998
- Origin repository
- UNIGE