Go Back Research Article August, 2023

GENERATIVE AI AND LLM OPTIMIZING TECHNIQUES FOR DEVELOPING COST EFFECTIVE ENTERPRISE APPLICATIONS

Abstract

Generative AI usage has increased exponentially since start of the year and has created tremendous opportunities from startups to large enterprises. As more and more LLMs are released for research and commercial use, it becomes complex for enterprises to adopt the LLMs either using a managed service offering or even hosting it in-house as the cost is extremely high. This paper will focus on helping companies to optimize LLM, provide examples of use cases and solutions on fine tuning, cost optimizations, hosting LLM models internally in Kubernetes to solve data privacy, security and governance risks.

Keywords

generative ai llm enterprise applications ai hosting llms llm in kubernetes quantization llm optimization pruning llama cost optimization
Document Preview
Download PDF
Details
Volume 2
Issue 1
Pages 70-81
ISSN 4867-9994