טכניון מכון טכנולוגי לישראל
הטכניון מכון טכנולוגי לישראל - בית הספר ללימודי מוסמכים  
M.Sc Thesis
M.Sc StudentZlotnik Alexander
SubjectEfficient Use of Geographically Spread Cloud
Resources
DepartmentDepartment of Computer Science
Supervisor Professor Dan Raz
Full Thesis textFull thesis text - English Version


Abstract

With the expansion of cloud computing, more and more global services are offered from several locations around the globe. Naturally, the demand for cloud services in each geographical location changes over time depending on the time of the day. Thus, when one data center (say in the east coast of the US) experiences peak daily load, other data centers (say in Europe) experience lower daily loads. Our research addresses the efficiency of load sharing between geographically spread cloud resources. We observe that despite the network latency, for several common services it is very beneficial to share the load across two or more data centers, each located in a different time zone.


We rigorously analyze a simple setting in which customers can be redirected between two servers, each experiencing a different local load. We show that a threshold-based load sharing scheme, in which jobs are redirected when the load exceeds some threshold, is significantly more efficient than a static load sharing scheme, where jobs are redirected independently of the current state. Our load sharing techniques can reduce the average service time by 40% during peak demand in typical service scenarios. Looking at the same result from a different perspective, we show that (in the same setting) deploying our threshold based load sharing scheme on a geographically dispersed system can provide similar user experience with 15%-20% less resources.


To further validate our approach, we deployed Wikipedia instances on Amazon EC2 both in Europe and in the US and tested our techniques using real Wikimedia access logs. Our results show that threshold-based load sharing between the US and Europe, achieves an improvement of up to 32% in average service time over these logs.