Gespeichert in:
Beteilige Person: | |
---|---|
Format: | Elektronisch E-Book |
Sprache: | Englisch |
Veröffentlicht: |
Hoboken, New Jersey
John Wiley & Sons, Incorporated
[2024]
|
Schriftenreihe: | Tech Today
|
Links: | https://ebookcentral.proquest.com/lib/hsansbach/detail.action?docID=31246938 |
Abstract: | Cover -- Contents At A Glance -- Title Page -- Copyright Page -- Dedication Page -- About the Author -- About the Technical Editor -- Contents -- Introduction -- GenAI Applications and Large Language Models -- Importance of Cost Optimization -- Challenges and Opportunities -- Micro Case Studies -- OpenAI: Leading the Way -- Hugging Face: Open-Source Community Building -- Bloomberg GPT: LLMs in Large Commercial Institutions -- Who Is This Book For? -- Summary -- Chapter 1 Introduction -- Overview of GenAI Applications and Large Language Models -- The Rise of Large Language Models -- Neural Networks, Transformers, and Beyond -- GenAI vs. LLMs: What's the Difference? -- The Three-Layer GenAI Application Stack -- The Infrastructure Layer -- The Model Layer -- The Application Layer -- Paths to Productionizing GenAI Applications -- Sample LLM-Powered Chat Application -- The Importance of Cost Optimization -- Cost Assessment of the Model Inference Component -- Cost Assessment of the Vector Database Component -- Benchmarking Setup and Results -- Other Factors to Consider -- Cost Assessment of the Large Language Model Component -- Summary -- Chapter 2 Tuning Techniques for Cost Optimization -- Fine-Tuning and Customizability -- Basic Scaling Laws You Should Know -- Parameter-Efficient Fine-Tuning Methods -- Adapters Under the Hood -- Prompt Tuning -- Prefix Tuning -- P-tuning -- IA3 -- Low-Rank Adaptation -- Cost and Performance Implications of PEFT Methods -- Summary -- Chapter 3 Inference Techniques for Cost Optimization -- Introduction to Inference Techniques -- Prompt Engineering -- Impact of Prompt Engineering on Cost -- Estimating Costs for Other Models -- Clear and Direct Prompts -- Adding Qualifying Words for Brief Responses -- Breaking Down the Request -- Example of Using Claude for PII Removal -- Conclusion -- Providing Context. |
Umfang: | 1 Online-Ressource (xxv, 190 Seiten) Illustrationen |
ISBN: | 9781394240746 |
Internformat
MARC
LEADER | 00000nam a22000001c 4500 | ||
---|---|---|---|
001 | BV049699093 | ||
003 | DE-604 | ||
005 | 00000000000000.0 | ||
007 | cr|uuu---uuuuu | ||
008 | 240528s2024 xx a||| o|||| 00||| eng d | ||
020 | |a 9781394240746 |9 978-1-394-24074-6 | ||
035 | |a (ZDB-30-PQE)31246938 | ||
035 | |a (OCoLC)1437869844 | ||
035 | |a (DE-599)KEP102336784 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-1102 | ||
100 | 1 | |a Subramanian, Shreyas |e Verfasser |4 aut | |
245 | 1 | 0 | |a Large Language Model-Based Solutions |b How to Deliver Value with Cost-Effective Generative AI Applications |c Shreyas Subramanian |
264 | 1 | |a Hoboken, New Jersey |b John Wiley & Sons, Incorporated |c [2024] | |
300 | |a 1 Online-Ressource (xxv, 190 Seiten) |b Illustrationen | ||
336 | |b txt |2 rdacontent | ||
337 | |b c |2 rdamedia | ||
338 | |b cr |2 rdacarrier | ||
490 | 0 | |a Tech Today | |
520 | 3 | |a Cover -- Contents At A Glance -- Title Page -- Copyright Page -- Dedication Page -- About the Author -- About the Technical Editor -- Contents -- Introduction -- GenAI Applications and Large Language Models -- Importance of Cost Optimization -- Challenges and Opportunities -- Micro Case Studies -- OpenAI: Leading the Way -- Hugging Face: Open-Source Community Building -- Bloomberg GPT: LLMs in Large Commercial Institutions -- Who Is This Book For? -- Summary -- Chapter 1 Introduction -- Overview of GenAI Applications and Large Language Models -- The Rise of Large Language Models -- Neural Networks, Transformers, and Beyond -- GenAI vs. LLMs: What's the Difference? -- The Three-Layer GenAI Application Stack -- The Infrastructure Layer -- The Model Layer -- The Application Layer -- Paths to Productionizing GenAI Applications -- Sample LLM-Powered Chat Application -- The Importance of Cost Optimization -- Cost Assessment of the Model Inference Component -- Cost Assessment of the Vector Database Component -- Benchmarking Setup and Results -- Other Factors to Consider -- Cost Assessment of the Large Language Model Component -- Summary -- Chapter 2 Tuning Techniques for Cost Optimization -- Fine-Tuning and Customizability -- Basic Scaling Laws You Should Know -- Parameter-Efficient Fine-Tuning Methods -- Adapters Under the Hood -- Prompt Tuning -- Prefix Tuning -- P-tuning -- IA3 -- Low-Rank Adaptation -- Cost and Performance Implications of PEFT Methods -- Summary -- Chapter 3 Inference Techniques for Cost Optimization -- Introduction to Inference Techniques -- Prompt Engineering -- Impact of Prompt Engineering on Cost -- Estimating Costs for Other Models -- Clear and Direct Prompts -- Adding Qualifying Words for Brief Responses -- Breaking Down the Request -- Example of Using Claude for PII Removal -- Conclusion -- Providing Context. | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe |z 9781394240722 |
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe, EPUB |z 9781394240739 |
912 | |a ZDB-30-PQE | ||
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-035041535 | |
966 | e | |u https://ebookcentral.proquest.com/lib/hsansbach/detail.action?docID=31246938 |l DE-1102 |p ZDB-30-PQE |q FAN_Einzelkauf_2024 |x Aggregator |3 Volltext |
Datensatz im Suchindex
_version_ | 1818992014931263488 |
---|---|
any_adam_object | |
author | Subramanian, Shreyas |
author_facet | Subramanian, Shreyas |
author_role | aut |
author_sort | Subramanian, Shreyas |
author_variant | s s ss |
building | Verbundindex |
bvnumber | BV049699093 |
collection | ZDB-30-PQE |
ctrlnum | (ZDB-30-PQE)31246938 (OCoLC)1437869844 (DE-599)KEP102336784 |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03124nam a22003371c 4500</leader><controlfield tag="001">BV049699093</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">cr|uuu---uuuuu</controlfield><controlfield tag="008">240528s2024 xx a||| o|||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781394240746</subfield><subfield code="9">978-1-394-24074-6</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ZDB-30-PQE)31246938</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1437869844</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP102336784</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-1102</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Subramanian, Shreyas</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Large Language Model-Based Solutions</subfield><subfield code="b">How to Deliver Value with Cost-Effective Generative AI Applications</subfield><subfield code="c">Shreyas Subramanian</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, New Jersey</subfield><subfield code="b">John Wiley & Sons, Incorporated</subfield><subfield code="c">[2024]</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (xxv, 190 Seiten)</subfield><subfield code="b">Illustrationen</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Tech Today</subfield></datafield><datafield tag="520" ind1="3" ind2=" "><subfield code="a">Cover -- Contents At A Glance -- Title Page -- Copyright Page -- Dedication Page -- About the Author -- About the Technical Editor -- Contents -- Introduction -- GenAI Applications and Large Language Models -- Importance of Cost Optimization -- Challenges and Opportunities -- Micro Case Studies -- OpenAI: Leading the Way -- Hugging Face: Open-Source Community Building -- Bloomberg GPT: LLMs in Large Commercial Institutions -- Who Is This Book For? -- Summary -- Chapter 1 Introduction -- Overview of GenAI Applications and Large Language Models -- The Rise of Large Language Models -- Neural Networks, Transformers, and Beyond -- GenAI vs. LLMs: What's the Difference? -- The Three-Layer GenAI Application Stack -- The Infrastructure Layer -- The Model Layer -- The Application Layer -- Paths to Productionizing GenAI Applications -- Sample LLM-Powered Chat Application -- The Importance of Cost Optimization -- Cost Assessment of the Model Inference Component -- Cost Assessment of the Vector Database Component -- Benchmarking Setup and Results -- Other Factors to Consider -- Cost Assessment of the Large Language Model Component -- Summary -- Chapter 2 Tuning Techniques for Cost Optimization -- Fine-Tuning and Customizability -- Basic Scaling Laws You Should Know -- Parameter-Efficient Fine-Tuning Methods -- Adapters Under the Hood -- Prompt Tuning -- Prefix Tuning -- P-tuning -- IA3 -- Low-Rank Adaptation -- Cost and Performance Implications of PEFT Methods -- Summary -- Chapter 3 Inference Techniques for Cost Optimization -- Introduction to Inference Techniques -- Prompt Engineering -- Impact of Prompt Engineering on Cost -- Estimating Costs for Other Models -- Clear and Direct Prompts -- Adding Qualifying Words for Brief Responses -- Breaking Down the Request -- Example of Using Claude for PII Removal -- Conclusion -- Providing Context.</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">9781394240722</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe, EPUB</subfield><subfield code="z">9781394240739</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-PQE</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-035041535</subfield></datafield><datafield tag="966" ind1="e" ind2=" "><subfield code="u">https://ebookcentral.proquest.com/lib/hsansbach/detail.action?docID=31246938</subfield><subfield code="l">DE-1102</subfield><subfield code="p">ZDB-30-PQE</subfield><subfield code="q">FAN_Einzelkauf_2024</subfield><subfield code="x">Aggregator</subfield><subfield code="3">Volltext</subfield></datafield></record></collection> |
id | DE-604.BV049699093 |
illustrated | Illustrated |
indexdate | 2024-12-20T20:19:26Z |
institution | BVB |
isbn | 9781394240746 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-035041535 |
oclc_num | 1437869844 |
open_access_boolean | |
owner | DE-1102 |
owner_facet | DE-1102 |
physical | 1 Online-Ressource (xxv, 190 Seiten) Illustrationen |
psigel | ZDB-30-PQE ZDB-30-PQE FAN_Einzelkauf_2024 |
publishDate | 2024 |
publishDateSearch | 2024 |
publishDateSort | 2024 |
publisher | John Wiley & Sons, Incorporated |
record_format | marc |
series2 | Tech Today |
spelling | Subramanian, Shreyas Verfasser aut Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications Shreyas Subramanian Hoboken, New Jersey John Wiley & Sons, Incorporated [2024] 1 Online-Ressource (xxv, 190 Seiten) Illustrationen txt rdacontent c rdamedia cr rdacarrier Tech Today Cover -- Contents At A Glance -- Title Page -- Copyright Page -- Dedication Page -- About the Author -- About the Technical Editor -- Contents -- Introduction -- GenAI Applications and Large Language Models -- Importance of Cost Optimization -- Challenges and Opportunities -- Micro Case Studies -- OpenAI: Leading the Way -- Hugging Face: Open-Source Community Building -- Bloomberg GPT: LLMs in Large Commercial Institutions -- Who Is This Book For? -- Summary -- Chapter 1 Introduction -- Overview of GenAI Applications and Large Language Models -- The Rise of Large Language Models -- Neural Networks, Transformers, and Beyond -- GenAI vs. LLMs: What's the Difference? -- The Three-Layer GenAI Application Stack -- The Infrastructure Layer -- The Model Layer -- The Application Layer -- Paths to Productionizing GenAI Applications -- Sample LLM-Powered Chat Application -- The Importance of Cost Optimization -- Cost Assessment of the Model Inference Component -- Cost Assessment of the Vector Database Component -- Benchmarking Setup and Results -- Other Factors to Consider -- Cost Assessment of the Large Language Model Component -- Summary -- Chapter 2 Tuning Techniques for Cost Optimization -- Fine-Tuning and Customizability -- Basic Scaling Laws You Should Know -- Parameter-Efficient Fine-Tuning Methods -- Adapters Under the Hood -- Prompt Tuning -- Prefix Tuning -- P-tuning -- IA3 -- Low-Rank Adaptation -- Cost and Performance Implications of PEFT Methods -- Summary -- Chapter 3 Inference Techniques for Cost Optimization -- Introduction to Inference Techniques -- Prompt Engineering -- Impact of Prompt Engineering on Cost -- Estimating Costs for Other Models -- Clear and Direct Prompts -- Adding Qualifying Words for Brief Responses -- Breaking Down the Request -- Example of Using Claude for PII Removal -- Conclusion -- Providing Context. Erscheint auch als Druck-Ausgabe 9781394240722 Erscheint auch als Online-Ausgabe, EPUB 9781394240739 |
spellingShingle | Subramanian, Shreyas Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications |
title | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications |
title_auth | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications |
title_exact_search | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications |
title_full | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications Shreyas Subramanian |
title_fullStr | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications Shreyas Subramanian |
title_full_unstemmed | Large Language Model-Based Solutions How to Deliver Value with Cost-Effective Generative AI Applications Shreyas Subramanian |
title_short | Large Language Model-Based Solutions |
title_sort | large language model based solutions how to deliver value with cost effective generative ai applications |
title_sub | How to Deliver Value with Cost-Effective Generative AI Applications |
work_keys_str_mv | AT subramanianshreyas largelanguagemodelbasedsolutionshowtodelivervaluewithcosteffectivegenerativeaiapplications |