Paper Title

GENERATIVE AI AND LLM OPTIMIZING TECHNIQUES FOR DEVELOPING COST EFFECTIVE ENTERPRISE APPLICATIONS

Authors

Amreth Chandrasehar

Keywords

generative ai
llm
enterprise applications
ai
hosting llms
llm in kubernetes
quantization
llm optimization
pruning
llama
cost optimization

Article Type

Research Article

Journal

International Journal of Artificial Intelligence & Applications

Issue

Volume : 2 | Issue : 1 | Page No : 70-81

Published On

August, 2023

Downloads

FULL PDF

CITATION

COPY LINK

Abstract

Generative AI usage has increased exponentially since start of the year and has created tremendous opportunities from startups to large enterprises. As more and more LLMs are released for research and commercial use, it becomes complex for enterprises to adopt the LLMs either using a managed service offering or even hosting it in-house as the cost is extremely high. This paper will focus on helping companies to optimize LLM, provide examples of use cases and solutions on fine tuning, cost optimizations, hosting LLM models internally in Kubernetes to solve data privacy, security and governance risks.