Serverless ETL and Analytics with AWS Glue: Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features
Build efficient data lakes that can scale to virtually unlimited size using AWS Glue Key Features Learn to work with AWS Glue to overcome typical implementation challenges in data lakes Create and manage serverless ETL pipelines that can scale to manage big data Written by AWS Glue community members...
Gespeichert in:
Beteilige Person: | |
---|---|
Weitere beteiligte Personen: | , , , , |
Format: | Elektronisch E-Book |
Sprache: | Englisch |
Veröffentlicht: |
Birmingham
Packt Publishing, Limited
2022
|
Schlagwörter: | |
Links: | https://learning.oreilly.com/library/view/-/9781800564985/?ar |
Zusammenfassung: | Build efficient data lakes that can scale to virtually unlimited size using AWS Glue Key Features Learn to work with AWS Glue to overcome typical implementation challenges in data lakes Create and manage serverless ETL pipelines that can scale to manage big data Written by AWS Glue community members, this practical guide shows you how to implement AWS Glue in no time Book Description Organizations these days have gravitated toward services such as AWS Glue that undertake undifferentiated heavy lifting and provide serverless Spark, enabling you to create and manage data lakes in a serverless fashion. This guide shows you how AWS Glue can be used to solve real-world problems along with helping you learn about data processing, data integration, and building data lakes. Beginning with AWS Glue basics, this book teaches you how to perform various aspects of data analysis such as ad hoc queries, data visualization, and real-time analysis using this service. It also provides a walk-through of CI/CD for AWS Glue and how to shift left on quality using automated regression tests. You'll find out how data security aspects such as access control, encryption, auditing, and networking are implemented, as well as getting to grips with useful techniques such as picking the right file format, compression, partitioning, and bucketing. As you advance, you'll discover AWS Glue features such as crawlers, Lake Formation, governed tables, lineage, DataBrew, Glue Studio, and custom connectors. The concluding chapters help you to understand various performance tuning, troubleshooting, and monitoring options. By the end of this AWS book, you'll be able to create, manage, troubleshoot, and deploy ETL pipelines using AWS Glue. What you will learn Apply various AWS Glue features to manage and create data lakes Use Glue DataBrew and Glue Studio for data preparation Optimize data layout in cloud storage to accelerate analytics workloads Manage metadata including database, table, and schema definitions Secure your data during access control, encryption, auditing, and networking Monitor AWS Glue jobs to detect delays and loss of data Integrate Spark ML and SageMaker with AWS Glue to create machine learning models Who this book is for This book is for ETL developers, data engineers, and data analysts who want to understand how AWS Glue can help you solve your business problems. Basic knowledge of AWS data services is assumed. |
Beschreibung: | Print version record |
Umfang: | 1 Online-Ressource (435 Seiten) |
ISBN: | 1800562551 9781800562554 |
Internformat
MARC
LEADER | 00000cam a22000002 4500 | ||
---|---|---|---|
001 | ZDB-30-ORH-082178232 | ||
003 | DE-627-1 | ||
005 | 20240228121750.0 | ||
007 | cr uuu---uuuuu | ||
008 | 221012s2022 xx |||||o 00| ||eng c | ||
020 | |a 1800562551 |9 1-80056-255-1 | ||
020 | |a 9781800562554 |c electronic bk. |9 978-1-80056-255-4 | ||
035 | |a (DE-627-1)082178232 | ||
035 | |a (DE-599)KEP082178232 | ||
035 | |a (ORHE)9781800564985 | ||
035 | |a (DE-627-1)082178232 | ||
040 | |a DE-627 |b ger |c DE-627 |e rda | ||
041 | |a eng | ||
082 | 0 | |a 004.6782 |2 23/eng/20220907 | |
100 | 1 | |a Pathak, Vishal |e VerfasserIn |4 aut | |
245 | 1 | 0 | |a Serverless ETL and Analytics with AWS Glue |b Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features |c Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur |
264 | 1 | |a Birmingham |b Packt Publishing, Limited |c 2022 | |
300 | |a 1 Online-Ressource (435 Seiten) | ||
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Print version record | ||
520 | |a Build efficient data lakes that can scale to virtually unlimited size using AWS Glue Key Features Learn to work with AWS Glue to overcome typical implementation challenges in data lakes Create and manage serverless ETL pipelines that can scale to manage big data Written by AWS Glue community members, this practical guide shows you how to implement AWS Glue in no time Book Description Organizations these days have gravitated toward services such as AWS Glue that undertake undifferentiated heavy lifting and provide serverless Spark, enabling you to create and manage data lakes in a serverless fashion. This guide shows you how AWS Glue can be used to solve real-world problems along with helping you learn about data processing, data integration, and building data lakes. Beginning with AWS Glue basics, this book teaches you how to perform various aspects of data analysis such as ad hoc queries, data visualization, and real-time analysis using this service. It also provides a walk-through of CI/CD for AWS Glue and how to shift left on quality using automated regression tests. You'll find out how data security aspects such as access control, encryption, auditing, and networking are implemented, as well as getting to grips with useful techniques such as picking the right file format, compression, partitioning, and bucketing. As you advance, you'll discover AWS Glue features such as crawlers, Lake Formation, governed tables, lineage, DataBrew, Glue Studio, and custom connectors. The concluding chapters help you to understand various performance tuning, troubleshooting, and monitoring options. By the end of this AWS book, you'll be able to create, manage, troubleshoot, and deploy ETL pipelines using AWS Glue. What you will learn Apply various AWS Glue features to manage and create data lakes Use Glue DataBrew and Glue Studio for data preparation Optimize data layout in cloud storage to accelerate analytics workloads Manage metadata including database, table, and schema definitions Secure your data during access control, encryption, auditing, and networking Monitor AWS Glue jobs to detect delays and loss of data Integrate Spark ML and SageMaker with AWS Glue to create machine learning models Who this book is for This book is for ETL developers, data engineers, and data analysts who want to understand how AWS Glue can help you solve your business problems. Basic knowledge of AWS data services is assumed. | ||
630 | 2 | 0 | |a Amazon Web Services |
650 | 0 | |a Cloud computing | |
650 | 0 | |a Web services | |
650 | 0 | |a Application software |x Development | |
650 | 4 | |a Infonuagique | |
650 | 4 | |a Services Web | |
650 | 4 | |a Logiciels d'application ; Développement | |
650 | 4 | |a Application software ; Development | |
650 | 4 | |a Cloud computing | |
650 | 4 | |a Web services | |
700 | 1 | |a Vajiraya, Subramanya |e MitwirkendeR |4 ctb | |
700 | 1 | |a Sekiyama, Noritaka |e MitwirkendeR |4 ctb | |
700 | 1 | |a Tanaka, Tomohiro |e MitwirkendeR |4 ctb | |
700 | 1 | |a Quiroga, Albert |e MitwirkendeR |4 ctb | |
700 | 1 | |a Gaur, Ishan |e MitwirkendeR |4 ctb | |
966 | 4 | 0 | |l DE-91 |p ZDB-30-ORH |q TUM_PDA_ORH |u https://learning.oreilly.com/library/view/-/9781800564985/?ar |m X:ORHE |x Aggregator |z lizenzpflichtig |3 Volltext |
912 | |a ZDB-30-ORH | ||
912 | |a ZDB-30-ORH | ||
951 | |a BO | ||
912 | |a ZDB-30-ORH | ||
049 | |a DE-91 |
Datensatz im Suchindex
DE-BY-TUM_katkey | ZDB-30-ORH-082178232 |
---|---|
_version_ | 1821494819240804352 |
adam_text | |
any_adam_object | |
author | Pathak, Vishal |
author2 | Vajiraya, Subramanya Sekiyama, Noritaka Tanaka, Tomohiro Quiroga, Albert Gaur, Ishan |
author2_role | ctb ctb ctb ctb ctb |
author2_variant | s v sv n s ns t t tt a q aq i g ig |
author_facet | Pathak, Vishal Vajiraya, Subramanya Sekiyama, Noritaka Tanaka, Tomohiro Quiroga, Albert Gaur, Ishan |
author_role | aut |
author_sort | Pathak, Vishal |
author_variant | v p vp |
building | Verbundindex |
bvnumber | localTUM |
collection | ZDB-30-ORH |
ctrlnum | (DE-627-1)082178232 (DE-599)KEP082178232 (ORHE)9781800564985 |
dewey-full | 004.6782 |
dewey-hundreds | 000 - Computer science, information, general works |
dewey-ones | 004 - Computer science |
dewey-raw | 004.6782 |
dewey-search | 004.6782 |
dewey-sort | 14.6782 |
dewey-tens | 000 - Computer science, information, general works |
discipline | Informatik |
format | Electronic eBook |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>04372cam a22005292 4500</leader><controlfield tag="001">ZDB-30-ORH-082178232</controlfield><controlfield tag="003">DE-627-1</controlfield><controlfield tag="005">20240228121750.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">221012s2022 xx |||||o 00| ||eng c</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1800562551</subfield><subfield code="9">1-80056-255-1</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781800562554</subfield><subfield code="c">electronic bk.</subfield><subfield code="9">978-1-80056-255-4</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)082178232</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP082178232</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ORHE)9781800564985</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)082178232</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">004.6782</subfield><subfield code="2">23/eng/20220907</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Pathak, Vishal</subfield><subfield code="e">VerfasserIn</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Serverless ETL and Analytics with AWS Glue</subfield><subfield code="b">Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features</subfield><subfield code="c">Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham</subfield><subfield code="b">Packt Publishing, Limited</subfield><subfield code="c">2022</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (435 Seiten)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Print version record</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Build efficient data lakes that can scale to virtually unlimited size using AWS Glue Key Features Learn to work with AWS Glue to overcome typical implementation challenges in data lakes Create and manage serverless ETL pipelines that can scale to manage big data Written by AWS Glue community members, this practical guide shows you how to implement AWS Glue in no time Book Description Organizations these days have gravitated toward services such as AWS Glue that undertake undifferentiated heavy lifting and provide serverless Spark, enabling you to create and manage data lakes in a serverless fashion. This guide shows you how AWS Glue can be used to solve real-world problems along with helping you learn about data processing, data integration, and building data lakes. Beginning with AWS Glue basics, this book teaches you how to perform various aspects of data analysis such as ad hoc queries, data visualization, and real-time analysis using this service. It also provides a walk-through of CI/CD for AWS Glue and how to shift left on quality using automated regression tests. You'll find out how data security aspects such as access control, encryption, auditing, and networking are implemented, as well as getting to grips with useful techniques such as picking the right file format, compression, partitioning, and bucketing. As you advance, you'll discover AWS Glue features such as crawlers, Lake Formation, governed tables, lineage, DataBrew, Glue Studio, and custom connectors. The concluding chapters help you to understand various performance tuning, troubleshooting, and monitoring options. By the end of this AWS book, you'll be able to create, manage, troubleshoot, and deploy ETL pipelines using AWS Glue. What you will learn Apply various AWS Glue features to manage and create data lakes Use Glue DataBrew and Glue Studio for data preparation Optimize data layout in cloud storage to accelerate analytics workloads Manage metadata including database, table, and schema definitions Secure your data during access control, encryption, auditing, and networking Monitor AWS Glue jobs to detect delays and loss of data Integrate Spark ML and SageMaker with AWS Glue to create machine learning models Who this book is for This book is for ETL developers, data engineers, and data analysts who want to understand how AWS Glue can help you solve your business problems. Basic knowledge of AWS data services is assumed.</subfield></datafield><datafield tag="630" ind1="2" ind2="0"><subfield code="a">Amazon Web Services</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Cloud computing</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Web services</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Application software</subfield><subfield code="x">Development</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Infonuagique</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Services Web</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Logiciels d'application ; Développement</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Application software ; Development</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Cloud computing</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Web services</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Vajiraya, Subramanya</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Sekiyama, Noritaka</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Tanaka, Tomohiro</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Quiroga, Albert</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Gaur, Ishan</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="966" ind1="4" ind2="0"><subfield code="l">DE-91</subfield><subfield code="p">ZDB-30-ORH</subfield><subfield code="q">TUM_PDA_ORH</subfield><subfield code="u">https://learning.oreilly.com/library/view/-/9781800564985/?ar</subfield><subfield code="m">X:ORHE</subfield><subfield code="x">Aggregator</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">BO</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield></datafield></record></collection> |
id | ZDB-30-ORH-082178232 |
illustrated | Not Illustrated |
indexdate | 2025-01-17T11:20:26Z |
institution | BVB |
isbn | 1800562551 9781800562554 |
language | English |
open_access_boolean | |
owner | DE-91 DE-BY-TUM |
owner_facet | DE-91 DE-BY-TUM |
physical | 1 Online-Ressource (435 Seiten) |
psigel | ZDB-30-ORH TUM_PDA_ORH ZDB-30-ORH |
publishDate | 2022 |
publishDateSearch | 2022 |
publishDateSort | 2022 |
publisher | Packt Publishing, Limited |
record_format | marc |
spelling | Pathak, Vishal VerfasserIn aut Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur Birmingham Packt Publishing, Limited 2022 1 Online-Ressource (435 Seiten) Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Print version record Build efficient data lakes that can scale to virtually unlimited size using AWS Glue Key Features Learn to work with AWS Glue to overcome typical implementation challenges in data lakes Create and manage serverless ETL pipelines that can scale to manage big data Written by AWS Glue community members, this practical guide shows you how to implement AWS Glue in no time Book Description Organizations these days have gravitated toward services such as AWS Glue that undertake undifferentiated heavy lifting and provide serverless Spark, enabling you to create and manage data lakes in a serverless fashion. This guide shows you how AWS Glue can be used to solve real-world problems along with helping you learn about data processing, data integration, and building data lakes. Beginning with AWS Glue basics, this book teaches you how to perform various aspects of data analysis such as ad hoc queries, data visualization, and real-time analysis using this service. It also provides a walk-through of CI/CD for AWS Glue and how to shift left on quality using automated regression tests. You'll find out how data security aspects such as access control, encryption, auditing, and networking are implemented, as well as getting to grips with useful techniques such as picking the right file format, compression, partitioning, and bucketing. As you advance, you'll discover AWS Glue features such as crawlers, Lake Formation, governed tables, lineage, DataBrew, Glue Studio, and custom connectors. The concluding chapters help you to understand various performance tuning, troubleshooting, and monitoring options. By the end of this AWS book, you'll be able to create, manage, troubleshoot, and deploy ETL pipelines using AWS Glue. What you will learn Apply various AWS Glue features to manage and create data lakes Use Glue DataBrew and Glue Studio for data preparation Optimize data layout in cloud storage to accelerate analytics workloads Manage metadata including database, table, and schema definitions Secure your data during access control, encryption, auditing, and networking Monitor AWS Glue jobs to detect delays and loss of data Integrate Spark ML and SageMaker with AWS Glue to create machine learning models Who this book is for This book is for ETL developers, data engineers, and data analysts who want to understand how AWS Glue can help you solve your business problems. Basic knowledge of AWS data services is assumed. Amazon Web Services Cloud computing Web services Application software Development Infonuagique Services Web Logiciels d'application ; Développement Application software ; Development Vajiraya, Subramanya MitwirkendeR ctb Sekiyama, Noritaka MitwirkendeR ctb Tanaka, Tomohiro MitwirkendeR ctb Quiroga, Albert MitwirkendeR ctb Gaur, Ishan MitwirkendeR ctb |
spellingShingle | Pathak, Vishal Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features Amazon Web Services Cloud computing Web services Application software Development Infonuagique Services Web Logiciels d'application ; Développement Application software ; Development |
title | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features |
title_auth | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features |
title_exact_search | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features |
title_full | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur |
title_fullStr | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur |
title_full_unstemmed | Serverless ETL and Analytics with AWS Glue Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features Vishal Pathak, Subramanya Vajiraya, Moritaka Sekiyama, Tomohiro Tanaka, Albert Quiroga, Ishan Gaur |
title_short | Serverless ETL and Analytics with AWS Glue |
title_sort | serverless etl and analytics with aws glue your comprehensive reference guide to learning about aws glue and its features |
title_sub | Your Comprehensive Reference Guide to Learning about AWS Glue and Its Features |
topic | Amazon Web Services Cloud computing Web services Application software Development Infonuagique Services Web Logiciels d'application ; Développement Application software ; Development |
topic_facet | Amazon Web Services Cloud computing Web services Application software Development Infonuagique Services Web Logiciels d'application ; Développement Application software ; Development |
work_keys_str_mv | AT pathakvishal serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures AT vajirayasubramanya serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures AT sekiyamanoritaka serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures AT tanakatomohiro serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures AT quirogaalbert serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures AT gaurishan serverlessetlandanalyticswithawsglueyourcomprehensivereferenceguidetolearningaboutawsglueanditsfeatures |