GENERATIVE AI AND LLM OPTIMIZING TECHNIQUES FOR DEVELOPING COST EFFECTIVE ENTERPRISE APPLICATIONS
Abstract
Generative AI usage has increased exponentially since start of the year and has created tremendous opportunities from startups to large enterprises. As more and more LLMs are released for research and commercial use, it becomes complex for enterprises to adopt the LLMs either using a managed service offering or even hosting it in-house as the cost is extremely high. This paper will focus on helping companies to optimize LLM, provide examples of use cases and solutions on fine tuning, cost optimizations, hosting LLM models internally in Kubernetes to solve data privacy, security and governance risks.
Keywords
generative ai
llm
enterprise applications
ai
hosting llms
llm in kubernetes
quantization
llm optimization
pruning
llama
cost optimization
Document Preview
Download PDF
https://scholar9.com/publication-detail/generative-ai-and-llm-optimizing-techniques-for-de--34016
Details
Volume
2
Issue
1
Pages
70-81
ISSN
4867-9994
Amreth Chandrasehar
"GENERATIVE AI AND LLM OPTIMIZING TECHNIQUES FOR DEVELOPING COST EFFECTIVE ENTERPRISE APPLICATIONS".
International Journal of Artificial Intelligence & Applications,
vol: 2,
No. 1
Aug. 2023, pp: 70-81,
https://scholar9.com/publication-detail/generative-ai-and-llm-optimizing-techniques-for-de--34016