Saved in:
Other Authors: | , , , , , , , , , , |
---|---|
Format: | Electronic Video |
Language: | English |
Published: |
[Sebastopol, California]
O'Reilly Media, Inc.
2024
|
Edition: | [First edition]. |
Subjects: | |
Links: | https://learning.oreilly.com/library/view/-/0642572057336/?ar |
Summary: | While large language models are groundbreaking tools for automating everyday text-based tasks such as text summarization, translation, and generation, we've also seen the emergence of more complex generative AI models that can process and output different types of data, such as images, audio, and even video. Multimodal AI models, such as GPT-4, are capable of working across different data formats, for example, to generate speech from text, text from images, or text from audio. By combining different modalities, multimodal AI can interact with humans in more natural, intuitive ways, mimicking how humans perceive and understand the world around them. The possibilities from processing inputs more holistically and providing more intuitive outputs are already nudging us closer to true artificial general intelligence. |
Item Description: | Online resource; title from title details screen (O'Reilly, viewed October 1, 2024) |
Physical Description: | 1 online resource (1 video file (4 hr., 11 min.)) sound, color. |
Staff View
MARC
LEADER | 00000cgm a22000002c 4500 | ||
---|---|---|---|
001 | ZDB-30-ORH-108527123 | ||
003 | DE-627-1 | ||
005 | 20241107103346.0 | ||
006 | m o | | | ||
007 | cr uuu---uuuuu | ||
008 | 241001s2024 xx ||| |o o ||eng c | ||
035 | |a (DE-627-1)108527123 | ||
035 | |a (DE-599)KEP108527123 | ||
035 | |a (ORHE)0642572057336 | ||
035 | |a (DE-627-1)108527123 | ||
040 | |a DE-627 |b ger |c DE-627 |e rda | ||
041 | |a eng | ||
082 | 0 | |a 006.3 |2 23/eng/20241001 | |
245 | 0 | 0 | |a AI superstream |b multimodal generative AI |c Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly |
250 | |a [First edition]. | ||
264 | 1 | |a [Sebastopol, California] |b O'Reilly Media, Inc. |c 2024 | |
300 | |a 1 online resource (1 video file (4 hr., 11 min.)) |b sound, color. | ||
336 | |a zweidimensionales bewegtes Bild |b tdi |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Online resource; title from title details screen (O'Reilly, viewed October 1, 2024) | ||
520 | |a While large language models are groundbreaking tools for automating everyday text-based tasks such as text summarization, translation, and generation, we've also seen the emergence of more complex generative AI models that can process and output different types of data, such as images, audio, and even video. Multimodal AI models, such as GPT-4, are capable of working across different data formats, for example, to generate speech from text, text from images, or text from audio. By combining different modalities, multimodal AI can interact with humans in more natural, intuitive ways, mimicking how humans perceive and understand the world around them. The possibilities from processing inputs more holistically and providing more intuitive outputs are already nudging us closer to true artificial general intelligence. | ||
650 | 0 | |a Artificial intelligence | |
650 | 4 | |a Intelligence artificielle | |
650 | 4 | |a artificial intelligence | |
650 | 4 | |a Instructional films | |
650 | 4 | |a Nonfiction films | |
650 | 4 | |a Internet videos | |
650 | 4 | |a Films de formation | |
650 | 4 | |a Films autres que de fiction | |
650 | 4 | |a Vidéos sur Internet | |
700 | 1 | |a Chang, Susan Shu |e RednerIn |4 spk | |
700 | 1 | |a Gandhi, Rikin |e RednerIn |4 spk | |
700 | 1 | |a Pai, Suhas |e RednerIn |4 spk | |
700 | 1 | |a Alam, Nahid |e RednerIn |4 spk | |
700 | 1 | |a Susevski, Anthony |e RednerIn |4 spk | |
700 | 1 | |a Betlen, Andrei |e RednerIn |4 spk | |
700 | 1 | |a Iyer, Shekhar |e RednerIn |4 spk | |
700 | 1 | |a Gao, Jingying |e RednerIn |4 spk | |
700 | 1 | |a Barth, Antje |e RednerIn |4 spk | |
700 | 1 | |a Aldughayem, Omar |e RednerIn |4 spk | |
700 | 1 | |a Fregly, Chris |e RednerIn |4 spk | |
710 | 2 | |a O'Reilly (Firm), |e Verlag |4 pbl | |
966 | 4 | 0 | |l DE-91 |p ZDB-30-ORH |q TUM_PDA_ORH |u https://learning.oreilly.com/library/view/-/0642572057336/?ar |m X:ORHE |x Aggregator |z lizenzpflichtig |3 Volltext |
912 | |a ZDB-30-ORH | ||
935 | |c vide | ||
951 | |a BO | ||
912 | |a ZDB-30-ORH | ||
049 | |a DE-91 |
Record in the Search Index
DE-BY-TUM_katkey | ZDB-30-ORH-108527123 |
---|---|
_version_ | 1833357133001785344 |
adam_text | |
any_adam_object | |
author2 | Chang, Susan Shu Gandhi, Rikin Pai, Suhas Alam, Nahid Susevski, Anthony Betlen, Andrei Iyer, Shekhar Gao, Jingying Barth, Antje Aldughayem, Omar Fregly, Chris |
author2_role | spk spk spk spk spk spk spk spk spk spk spk |
author2_variant | s s c ss ssc r g rg s p sp n a na a s as a b ab s i si j g jg a b ab o a oa c f cf |
author_facet | Chang, Susan Shu Gandhi, Rikin Pai, Suhas Alam, Nahid Susevski, Anthony Betlen, Andrei Iyer, Shekhar Gao, Jingying Barth, Antje Aldughayem, Omar Fregly, Chris |
building | Verbundindex |
bvnumber | localTUM |
collection | ZDB-30-ORH |
ctrlnum | (DE-627-1)108527123 (DE-599)KEP108527123 (ORHE)0642572057336 |
dewey-full | 006.3 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 006 - Special computer methods |
dewey-raw | 006.3 |
dewey-search | 006.3 |
dewey-sort | 16.3 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
edition | [First edition]. |
format | Electronic Video |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03019cgm a22005892c 4500</leader><controlfield tag="001">ZDB-30-ORH-108527123</controlfield><controlfield tag="003">DE-627-1</controlfield><controlfield tag="005">20241107103346.0</controlfield><controlfield tag="006">m o | | </controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">241001s2024 xx ||| |o o ||eng c</controlfield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)108527123</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP108527123</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ORHE)0642572057336</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)108527123</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.3</subfield><subfield code="2">23/eng/20241001</subfield></datafield><datafield tag="245" ind1="0" ind2="0"><subfield code="a">AI superstream</subfield><subfield code="b">multimodal generative AI</subfield><subfield code="c">Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">[First edition].</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">[Sebastopol, California]</subfield><subfield code="b">O'Reilly Media, Inc.</subfield><subfield code="c">2024</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 online resource (1 video file (4 hr., 11 min.))</subfield><subfield code="b">sound, color.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">zweidimensionales bewegtes Bild</subfield><subfield code="b">tdi</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Online resource; title from title details screen (O'Reilly, viewed October 1, 2024)</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">While large language models are groundbreaking tools for automating everyday text-based tasks such as text summarization, translation, and generation, we've also seen the emergence of more complex generative AI models that can process and output different types of data, such as images, audio, and even video. Multimodal AI models, such as GPT-4, are capable of working across different data formats, for example, to generate speech from text, text from images, or text from audio. By combining different modalities, multimodal AI can interact with humans in more natural, intuitive ways, mimicking how humans perceive and understand the world around them. The possibilities from processing inputs more holistically and providing more intuitive outputs are already nudging us closer to true artificial general intelligence.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Artificial intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Intelligence artificielle</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">artificial intelligence</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Instructional films</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Nonfiction films</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Internet videos</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Films de formation</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Films autres que de fiction</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Vidéos sur Internet</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Chang, Susan Shu</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Gandhi, Rikin</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Pai, Suhas</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Alam, Nahid</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Susevski, Anthony</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Betlen, Andrei</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Iyer, Shekhar</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Gao, Jingying</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Barth, Antje</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Aldughayem, Omar</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Fregly, Chris</subfield><subfield code="e">RednerIn</subfield><subfield code="4">spk</subfield></datafield><datafield tag="710" ind1="2" ind2=" "><subfield code="a">O'Reilly (Firm),</subfield><subfield code="e">Verlag</subfield><subfield code="4">pbl</subfield></datafield><datafield tag="966" ind1="4" ind2="0"><subfield code="l">DE-91</subfield><subfield code="p">ZDB-30-ORH</subfield><subfield code="q">TUM_PDA_ORH</subfield><subfield code="u">https://learning.oreilly.com/library/view/-/0642572057336/?ar</subfield><subfield code="m">X:ORHE</subfield><subfield code="x">Aggregator</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="935" ind1=" " ind2=" "><subfield code="c">vide</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">BO</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield></datafield></record></collection> |
id | ZDB-30-ORH-108527123 |
illustrated | Not Illustrated |
indexdate | 2025-05-28T09:46:50Z |
institution | BVB |
language | English |
open_access_boolean | |
owner | DE-91 DE-BY-TUM |
owner_facet | DE-91 DE-BY-TUM |
physical | 1 online resource (1 video file (4 hr., 11 min.)) sound, color. |
psigel | ZDB-30-ORH TUM_PDA_ORH ZDB-30-ORH |
publishDate | 2024 |
publishDateSearch | 2024 |
publishDateSort | 2024 |
publisher | O'Reilly Media, Inc. |
record_format | marc |
spelling | AI superstream multimodal generative AI Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly [First edition]. [Sebastopol, California] O'Reilly Media, Inc. 2024 1 online resource (1 video file (4 hr., 11 min.)) sound, color. zweidimensionales bewegtes Bild tdi rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Online resource; title from title details screen (O'Reilly, viewed October 1, 2024) While large language models are groundbreaking tools for automating everyday text-based tasks such as text summarization, translation, and generation, we've also seen the emergence of more complex generative AI models that can process and output different types of data, such as images, audio, and even video. Multimodal AI models, such as GPT-4, are capable of working across different data formats, for example, to generate speech from text, text from images, or text from audio. By combining different modalities, multimodal AI can interact with humans in more natural, intuitive ways, mimicking how humans perceive and understand the world around them. The possibilities from processing inputs more holistically and providing more intuitive outputs are already nudging us closer to true artificial general intelligence. Artificial intelligence Intelligence artificielle artificial intelligence Instructional films Nonfiction films Internet videos Films de formation Films autres que de fiction Vidéos sur Internet Chang, Susan Shu RednerIn spk Gandhi, Rikin RednerIn spk Pai, Suhas RednerIn spk Alam, Nahid RednerIn spk Susevski, Anthony RednerIn spk Betlen, Andrei RednerIn spk Iyer, Shekhar RednerIn spk Gao, Jingying RednerIn spk Barth, Antje RednerIn spk Aldughayem, Omar RednerIn spk Fregly, Chris RednerIn spk O'Reilly (Firm), Verlag pbl |
spellingShingle | AI superstream multimodal generative AI Artificial intelligence Intelligence artificielle artificial intelligence Instructional films Nonfiction films Internet videos Films de formation Films autres que de fiction Vidéos sur Internet |
title | AI superstream multimodal generative AI |
title_auth | AI superstream multimodal generative AI |
title_exact_search | AI superstream multimodal generative AI |
title_full | AI superstream multimodal generative AI Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly |
title_fullStr | AI superstream multimodal generative AI Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly |
title_full_unstemmed | AI superstream multimodal generative AI Susan Shu Chang, Rikin Gandhi, Suhas Pai, Nahid Alam, Anthony Susevski, Andrei Betlen, Shekhar Iyer, Jingying Gao, Antje Barth, Omar Aldughayem, Chris Fregly |
title_short | AI superstream |
title_sort | ai superstream multimodal generative ai |
title_sub | multimodal generative AI |
topic | Artificial intelligence Intelligence artificielle artificial intelligence Instructional films Nonfiction films Internet videos Films de formation Films autres que de fiction Vidéos sur Internet |
topic_facet | Artificial intelligence Intelligence artificielle artificial intelligence Instructional films Nonfiction films Internet videos Films de formation Films autres que de fiction Vidéos sur Internet |
work_keys_str_mv | AT changsusanshu aisuperstreammultimodalgenerativeai AT gandhirikin aisuperstreammultimodalgenerativeai AT paisuhas aisuperstreammultimodalgenerativeai AT alamnahid aisuperstreammultimodalgenerativeai AT susevskianthony aisuperstreammultimodalgenerativeai AT betlenandrei aisuperstreammultimodalgenerativeai AT iyershekhar aisuperstreammultimodalgenerativeai AT gaojingying aisuperstreammultimodalgenerativeai AT barthantje aisuperstreammultimodalgenerativeai AT aldughayemomar aisuperstreammultimodalgenerativeai AT freglychris aisuperstreammultimodalgenerativeai AT oreillyfirm aisuperstreammultimodalgenerativeai |