Saved in:
Main Author: | |
---|---|
Format: | Electronic eBook |
Language: | English |
Published: |
Hoboken, New Jersey
JOHN WILEY
2024
|
Subjects: | |
Links: | https://learning.oreilly.com/library/view/-/9781394240722/?ar |
Summary: | Learn to build cost-effective apps using Large Language Models In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: Effective strategies to address the challenge of the high computational cost associated with LLMs Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject. |
Physical Description: | 1 Online-Ressource |
ISBN: | 9781394240746 1394240740 9781394240722 |
Staff View
MARC
LEADER | 00000nam a22000002c 4500 | ||
---|---|---|---|
001 | ZDB-30-ORH-102563640 | ||
003 | DE-627-1 | ||
005 | 20240429114545.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240429s2024 xx |||||o 00| ||eng c | ||
020 | |a 9781394240746 |c electronic bk. |9 978-1-394-24074-6 | ||
020 | |a 1394240740 |c electronic bk. |9 1-394-24074-0 | ||
020 | |a 9781394240722 |9 978-1-394-24072-2 | ||
035 | |a (DE-627-1)102563640 | ||
035 | |a (DE-599)KEP102563640 | ||
035 | |a (ORHE)9781394240722 | ||
035 | |a (DE-627-1)102563640 | ||
040 | |a DE-627 |b ger |c DE-627 |e rda | ||
041 | |a eng | ||
082 | 0 | |a 006.3/5 |2 23/eng/20240416 | |
100 | 1 | |a Subramanian, Shreyas |e VerfasserIn |4 aut | |
245 | 1 | 0 | |a LARGE LANGUAGE MODEL-BASED SOLUTIONS |b how to deliver value with cost-effective generative AI applications |c Shreyas Subramanian |
264 | 1 | |a Hoboken, New Jersey |b JOHN WILEY |c 2024 | |
300 | |a 1 Online-Ressource | ||
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Learn to build cost-effective apps using Large Language Models In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: Effective strategies to address the challenge of the high computational cost associated with LLMs Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject. | ||
650 | 0 | |a Natural language generation (Computer science) | |
650 | 0 | |a Artificial intelligence |x Computer programs | |
650 | 4 | |a Génération automatique de texte | |
650 | 4 | |a Intelligence artificielle ; Logiciels | |
776 | 1 | |z 1394240724 | |
776 | 0 | 8 | |i Erscheint auch als |n Druck-Ausgabe |z 1394240724 |
966 | 4 | 0 | |l DE-91 |p ZDB-30-ORH |q TUM_PDA_ORH |u https://learning.oreilly.com/library/view/-/9781394240722/?ar |m X:ORHE |x Aggregator |z lizenzpflichtig |3 Volltext |
912 | |a ZDB-30-ORH | ||
951 | |a BO | ||
912 | |a ZDB-30-ORH | ||
049 | |a DE-91 |
Record in the Search Index
DE-BY-TUM_katkey | ZDB-30-ORH-102563640 |
---|---|
_version_ | 1835903249603887104 |
adam_text | |
any_adam_object | |
author | Subramanian, Shreyas |
author_facet | Subramanian, Shreyas |
author_role | aut |
author_sort | Subramanian, Shreyas |
author_variant | s s ss |
building | Verbundindex |
bvnumber | localTUM |
collection | ZDB-30-ORH |
ctrlnum | (DE-627-1)102563640 (DE-599)KEP102563640 (ORHE)9781394240722 |
dewey-full | 006.3/5 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.3/5 |
dewey-search | 006.3/5 |
dewey-sort | 16.3 15 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03038nam a22004092c 4500</leader><controlfield tag="001">ZDB-30-ORH-102563640</controlfield><controlfield tag="003">DE-627-1</controlfield><controlfield tag="005">20240429114545.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">240429s2024 xx |||||o 00| ||eng c</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781394240746</subfield><subfield code="c">electronic bk.</subfield><subfield code="9">978-1-394-24074-6</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1394240740</subfield><subfield code="c">electronic bk.</subfield><subfield code="9">1-394-24074-0</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781394240722</subfield><subfield code="9">978-1-394-24072-2</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)102563640</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP102563640</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ORHE)9781394240722</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)102563640</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.3/5</subfield><subfield code="2">23/eng/20240416</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Subramanian, Shreyas</subfield><subfield code="e">VerfasserIn</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">LARGE LANGUAGE MODEL-BASED SOLUTIONS</subfield><subfield code="b">how to deliver value with cost-effective generative AI applications</subfield><subfield code="c">Shreyas Subramanian</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Hoboken, New Jersey</subfield><subfield code="b">JOHN WILEY</subfield><subfield code="c">2024</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Learn to build cost-effective apps using Large Language Models In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: Effective strategies to address the challenge of the high computational cost associated with LLMs Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Natural language generation (Computer science)</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Artificial intelligence</subfield><subfield code="x">Computer programs</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Génération automatique de texte</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intelligence artificielle ; Logiciels</subfield></datafield><datafield tag="776" ind1="1" ind2=" "><subfield code="z">1394240724</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">1394240724</subfield></datafield><datafield tag="966" ind1="4" ind2="0"><subfield code="l">DE-91</subfield><subfield code="p">ZDB-30-ORH</subfield><subfield code="q">TUM_PDA_ORH</subfield><subfield code="u">https://learning.oreilly.com/library/view/-/9781394240722/?ar</subfield><subfield code="m">X:ORHE</subfield><subfield code="x">Aggregator</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">BO</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield></datafield></record></collection> |
id | ZDB-30-ORH-102563640 |
illustrated | Not Illustrated |
indexdate | 2025-06-25T12:16:16Z |
institution | BVB |
isbn | 9781394240746 1394240740 9781394240722 |
language | English |
open_access_boolean | |
owner | DE-91 DE-BY-TUM |
owner_facet | DE-91 DE-BY-TUM |
physical | 1 Online-Ressource |
psigel | ZDB-30-ORH TUM_PDA_ORH ZDB-30-ORH |
publishDate | 2024 |
publishDateSearch | 2024 |
publishDateSort | 2024 |
publisher | JOHN WILEY |
record_format | marc |
spelling | Subramanian, Shreyas VerfasserIn aut LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications Shreyas Subramanian Hoboken, New Jersey JOHN WILEY 2024 1 Online-Ressource Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Learn to build cost-effective apps using Large Language Models In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning. The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find: Effective strategies to address the challenge of the high computational cost associated with LLMs Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject. Natural language generation (Computer science) Artificial intelligence Computer programs Génération automatique de texte Intelligence artificielle ; Logiciels 1394240724 Erscheint auch als Druck-Ausgabe 1394240724 |
spellingShingle | Subramanian, Shreyas LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications Natural language generation (Computer science) Artificial intelligence Computer programs Génération automatique de texte Intelligence artificielle ; Logiciels |
title | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications |
title_auth | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications |
title_exact_search | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications |
title_full | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications Shreyas Subramanian |
title_fullStr | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications Shreyas Subramanian |
title_full_unstemmed | LARGE LANGUAGE MODEL-BASED SOLUTIONS how to deliver value with cost-effective generative AI applications Shreyas Subramanian |
title_short | LARGE LANGUAGE MODEL-BASED SOLUTIONS |
title_sort | large language model based solutions how to deliver value with cost effective generative ai applications |
title_sub | how to deliver value with cost-effective generative AI applications |
topic | Natural language generation (Computer science) Artificial intelligence Computer programs Génération automatique de texte Intelligence artificielle ; Logiciels |
topic_facet | Natural language generation (Computer science) Artificial intelligence Computer programs Génération automatique de texte Intelligence artificielle ; Logiciels |
work_keys_str_mv | AT subramanianshreyas largelanguagemodelbasedsolutionshowtodelivervaluewithcosteffectivegenerativeaiapplications |