Go Back Research Article November, 2022

DEVELOPING AN EFFICIENT SCHEDULING MODEL FOR CLOUD-BASED AI TRAINING USING PREDICTIVE WORKLOAD PATTERNS

Abstract

Cloud-based AI training introduces unique challenges in managing computing resources efficiently under dynamic workloads. Predictive workload models can anticipate resource needs and optimize scheduling, reducing latency and energy consumption. This paper proposes a predictive scheduling model integrating LSTM-based workload forecasting and dynamic resource allocation tailored for AI training in cloud environments. We evaluate its performance against conventional models using real trace data, showing significant improvements in resource utilization and training throughput.

Keywords

AI Training Cloud Computing Predictive Scheduling Workload Modeling LSTM
Document Preview
Download PDF
Details
Volume 2
Issue 2
Pages 1-9
ISSN 4739-1529