Verfügbarkeit: Modern data engineering with Apache Spark

Modern data engineering with Apache Spark: a hands-on guide for building mission-critical streaming applications

Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Beteilige Person:	Haines, Scott (VerfasserIn)
Format:	Buch
Sprache:	Englisch
Veröffentlicht:	Berkeley, CA Apress [2022]
Ausgabe:	1st ed
Schlagwörter:	bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) Hardcover, Softcover / Informatik, EDV/Programmiersprachen
Zusammenfassung:	Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and
Umfang:	xxv, 585 Seiten Diagramme 1149 grams
ISBN:	9781484274514

Internformat

MARC


LEADER	00000nam a2200000 c 4500
001	BV048203252
003	DE-604
005	20220622
007	t\|
008	220506s2022 xx \|\|\|\| \|\|\|\| 00\|\|\| eng d
020			\|a 9781484274514 \|c pbk \|9 978-1-4842-7451-4
024	3		\|a 9781484274514
035			\|a (OCoLC)1334031132
035			\|a (DE-599)BVBBV048203252
040			\|a DE-604 \|b ger \|e rda
041	0		\|a eng
049			\|a DE-29T
100	1		\|a Haines, Scott \|4 aut
245	1	0	\|a Modern data engineering with Apache Spark \|b a hands-on guide for building mission-critical streaming applications \|c Scott Haines
250			\|a 1st ed
264		1	\|a Berkeley, CA \|b Apress \|c [2022]
264		4	\|c © 2022
300			\|a xxv, 585 Seiten \|b Diagramme \|c 1149 grams
336			\|b txt \|2 rdacontent
337			\|b n \|2 rdamedia
338			\|b nc \|2 rdacarrier
520			\|a Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload.
520			\|a This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications.
520			\|a You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and
650		4	\|a bicssc
650		4	\|a bisacsh
650		4	\|a Quantitative research
650		4	\|a Database management
650		4	\|a Data mining
650		4	\|a Java (Computer program language)
653			\|a Hardcover, Softcover / Informatik, EDV/Programmiersprachen
776	0	8	\|i Erscheint auch als \|n Online-Ausgabe \|z 978-1-4842-7452-1
943	1		\|a oai:aleph.bib-bvb.de:BVB01-033584207

Datensatz im Suchindex

_version_	1818989428927889408
any_adam_object
author	Haines, Scott
author_facet	Haines, Scott
author_role	aut
author_sort	Haines, Scott
author_variant	s h sh
building	Verbundindex
bvnumber	BV048203252
ctrlnum	(OCoLC)1334031132 (DE-599)BVBBV048203252
edition	1st ed
format	Book
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03634nam a2200421 c 4500</leader><controlfield tag="001">BV048203252</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220622 </controlfield><controlfield tag="007">t\|</controlfield><controlfield tag="008">220506s2022 xx \|\|\|\| \|\|\|\| 00\|\|\| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781484274514</subfield><subfield code="c">pbk</subfield><subfield code="9">978-1-4842-7451-4</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9781484274514</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1334031132</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV048203252</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Haines, Scott</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Modern data engineering with Apache Spark</subfield><subfield code="b">a hands-on guide for building mission-critical streaming applications</subfield><subfield code="c">Scott Haines</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1st ed</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berkeley, CA</subfield><subfield code="b">Apress</subfield><subfield code="c">[2022]</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">© 2022</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xxv, 585 Seiten</subfield><subfield code="b">Diagramme</subfield><subfield code="c">1149 grams</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and </subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Quantitative research</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Database management</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Java (Computer program language)</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Hardcover, Softcover / Informatik, EDV/Programmiersprachen</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-4842-7452-1</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-033584207</subfield></datafield></record></collection>
id	DE-604.BV048203252
illustrated	Not Illustrated
indexdate	2024-12-20T19:38:20Z
institution	BVB
isbn	9781484274514
language	English
oai_aleph_id	oai:aleph.bib-bvb.de:BVB01-033584207
oclc_num	1334031132
open_access_boolean
owner	DE-29T
owner_facet	DE-29T
physical	xxv, 585 Seiten Diagramme 1149 grams
publishDate	2022
publishDateSearch	2022
publishDateSort	2022
publisher	Apress
record_format	marc
spelling	Haines, Scott aut Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines 1st ed Berkeley, CA Apress [2022] © 2022 xxv, 585 Seiten Diagramme 1149 grams txt rdacontent n rdamedia nc rdacarrier Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) Hardcover, Softcover / Informatik, EDV/Programmiersprachen Erscheint auch als Online-Ausgabe 978-1-4842-7452-1
spellingShingle	Haines, Scott Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language)
title	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications
title_auth	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications
title_exact_search	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications
title_full	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines
title_fullStr	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines
title_full_unstemmed	Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines
title_short	Modern data engineering with Apache Spark
title_sort	modern data engineering with apache spark a hands on guide for building mission critical streaming applications
title_sub	a hands-on guide for building mission-critical streaming applications
topic	bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language)
topic_facet	bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language)
work_keys_str_mv	AT hainesscott moderndataengineeringwithapachesparkahandsonguideforbuildingmissioncriticalstreamingapplications

Verfügbarkeit

MARC

Datensatz im Suchindex

‌