Gespeichert in:
Bibliographische Detailangaben
Beteilige Person: Subramanian, Shreyas (VerfasserIn)
Format: Elektronisch E-Book
Sprache:Englisch
Veröffentlicht: Hoboken, New Jersey John Wiley & Sons, Incorporated [2024]
Schriftenreihe:Tech Today
Links:https://ebookcentral.proquest.com/lib/hsansbach/detail.action?docID=31246938
Abstract:Cover -- Contents At A Glance -- Title Page -- Copyright Page -- Dedication Page -- About the Author -- About the Technical Editor -- Contents -- Introduction -- GenAI Applications and Large Language Models -- Importance of Cost Optimization -- Challenges and Opportunities -- Micro Case Studies -- OpenAI: Leading the Way -- Hugging Face: Open-Source Community Building -- Bloomberg GPT: LLMs in Large Commercial Institutions -- Who Is This Book For? -- Summary -- Chapter 1 Introduction -- Overview of GenAI Applications and Large Language Models -- The Rise of Large Language Models -- Neural Networks, Transformers, and Beyond -- GenAI vs. LLMs: What's the Difference? -- The Three-Layer GenAI Application Stack -- The Infrastructure Layer -- The Model Layer -- The Application Layer -- Paths to Productionizing GenAI Applications -- Sample LLM-Powered Chat Application -- The Importance of Cost Optimization -- Cost Assessment of the Model Inference Component -- Cost Assessment of the Vector Database Component -- Benchmarking Setup and Results -- Other Factors to Consider -- Cost Assessment of the Large Language Model Component -- Summary -- Chapter 2 Tuning Techniques for Cost Optimization -- Fine-Tuning and Customizability -- Basic Scaling Laws You Should Know -- Parameter-Efficient Fine-Tuning Methods -- Adapters Under the Hood -- Prompt Tuning -- Prefix Tuning -- P-tuning -- IA3 -- Low-Rank Adaptation -- Cost and Performance Implications of PEFT Methods -- Summary -- Chapter 3 Inference Techniques for Cost Optimization -- Introduction to Inference Techniques -- Prompt Engineering -- Impact of Prompt Engineering on Cost -- Estimating Costs for Other Models -- Clear and Direct Prompts -- Adding Qualifying Words for Brief Responses -- Breaking Down the Request -- Example of Using Claude for PII Removal -- Conclusion -- Providing Context.
Umfang:1 Online-Ressource (xxv, 190 Seiten) Illustrationen
ISBN:9781394240746