Back to Top

Paper Title

GENERATIVE AI AND LLM OPTIMIZING TECHNIQUES FOR DEVELOPING COST EFFECTIVE ENTERPRISE APPLICATIONS

Keywords

  • generative ai
  • llm
  • enterprise applications
  • ai
  • hosting llms
  • llm in kubernetes
  • quantization
  • llm optimization
  • pruning
  • llama
  • cost optimization

Article Type

Research Article

Issue

Volume : 2 | Issue : 1 | Page No : 70-81

Published On

August, 2023

Downloads

Abstract

Generative AI usage has increased exponentially since start of the year and has created tremendous opportunities from startups to large enterprises. As more and more LLMs are released for research and commercial use, it becomes complex for enterprises to adopt the LLMs either using a managed service offering or even hosting it in-house as the cost is extremely high. This paper will focus on helping companies to optimize LLM, provide examples of use cases and solutions on fine tuning, cost optimizations, hosting LLM models internally in Kubernetes to solve data privacy, security and governance risks.

View more >>

Uploded Document Preview