Performance management via MPC for Web services in cloud

Singh, Durgesh; Mukherjee, Joydeep; Saikrishna, P. S.; Pasumarthy, Ramkrishna; Krishnamurthy, Diwakar

doi:10.23919/ACC.2018.8430989

Performance management via MPC for Web services in cloud

Date Issued

09-08-2018

Author(s)

Singh, Durgesh

Mukherjee, Joydeep

Saikrishna, P. S.

Pasumarthy, Ramkrishna

Indian Institute of Technology, Madras

Krishnamurthy, Diwakar

DOI

10.23919/ACC.2018.8430989

Abstract

Web services are increasingly being deployed on cloud platforms. Due to their interactive nature, Web services need to ensure fast response times to their end users. Unfortunately, the performance of a Web service can suffer due to a sudden surge in incoming traffic. Furthermore, a cloud-based service can also incur performance degradation due to interference, i.e., contention among services in the cloud platform for shared resources. Such issues motivate the need for automated runtime performance management solutions that ensure response time goals are continuously met. This paper explores a control theoretic approach called Model Predictive Control (MPC) for realizing such a solution. MPC is based on an optimization formulation, which lends itself well to expressing multiple constraints related to response time performance and the amount of resources, e.g., number of virtual machines (VMs), available to a Web service. We outline the design and operation of an MPC controller that governs the scale out and scale in of VMs while adhering to operator-specified thresholds for mean response time and the number of VMs available. Using a realistic Web service testbed, we show that the controller is able to satisfy the specified response time constraint even when the service is subjected to workload surges and interference.

Volume

2018-June

Options

Performance management via MPC for Web services in cloud