Modern data engineering with Apache Spark: a hands-on guide for building mission-critical streaming applications
Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey...
Gespeichert in:
Beteilige Person: | |
---|---|
Format: | Buch |
Sprache: | Englisch |
Veröffentlicht: |
Berkeley, CA
Apress
[2022]
|
Ausgabe: | 1st ed |
Schlagwörter: | |
Zusammenfassung: | Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and |
Umfang: | xxv, 585 Seiten Diagramme 1149 grams |
ISBN: | 9781484274514 |
Internformat
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV048203252 | ||
003 | DE-604 | ||
005 | 20220622 | ||
007 | t| | ||
008 | 220506s2022 xx |||| |||| 00||| eng d | ||
020 | |a 9781484274514 |c pbk |9 978-1-4842-7451-4 | ||
024 | 3 | |a 9781484274514 | |
035 | |a (OCoLC)1334031132 | ||
035 | |a (DE-599)BVBBV048203252 | ||
040 | |a DE-604 |b ger |e rda | ||
041 | 0 | |a eng | |
049 | |a DE-29T | ||
100 | 1 | |a Haines, Scott |4 aut | |
245 | 1 | 0 | |a Modern data engineering with Apache Spark |b a hands-on guide for building mission-critical streaming applications |c Scott Haines |
250 | |a 1st ed | ||
264 | 1 | |a Berkeley, CA |b Apress |c [2022] | |
264 | 4 | |c © 2022 | |
300 | |a xxv, 585 Seiten |b Diagramme |c 1149 grams | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
520 | |a Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. | ||
520 | |a This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. | ||
520 | |a You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and | ||
650 | 4 | |a bicssc | |
650 | 4 | |a bisacsh | |
650 | 4 | |a Quantitative research | |
650 | 4 | |a Database management | |
650 | 4 | |a Data mining | |
650 | 4 | |a Java (Computer program language) | |
653 | |a Hardcover, Softcover / Informatik, EDV/Programmiersprachen | ||
776 | 0 | 8 | |i Erscheint auch als |n Online-Ausgabe |z 978-1-4842-7452-1 |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-033584207 |
Datensatz im Suchindex
_version_ | 1818989428927889408 |
---|---|
any_adam_object | |
author | Haines, Scott |
author_facet | Haines, Scott |
author_role | aut |
author_sort | Haines, Scott |
author_variant | s h sh |
building | Verbundindex |
bvnumber | BV048203252 |
ctrlnum | (OCoLC)1334031132 (DE-599)BVBBV048203252 |
edition | 1st ed |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03634nam a2200421 c 4500</leader><controlfield tag="001">BV048203252</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20220622 </controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">220506s2022 xx |||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781484274514</subfield><subfield code="c">pbk</subfield><subfield code="9">978-1-4842-7451-4</subfield></datafield><datafield tag="024" ind1="3" ind2=" "><subfield code="a">9781484274514</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)1334031132</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV048203252</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-29T</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Haines, Scott</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Modern data engineering with Apache Spark</subfield><subfield code="b">a hands-on guide for building mission-critical streaming applications</subfield><subfield code="c">Scott Haines</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">1st ed</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berkeley, CA</subfield><subfield code="b">Apress</subfield><subfield code="c">[2022]</subfield></datafield><datafield tag="264" ind1=" " ind2="4"><subfield code="c">© 2022</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">xxv, 585 Seiten</subfield><subfield code="b">Diagramme</subfield><subfield code="c">1149 grams</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. </subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and </subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bicssc</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">bisacsh</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Quantitative research</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Database management</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Data mining</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Java (Computer program language)</subfield></datafield><datafield tag="653" ind1=" " ind2=" "><subfield code="a">Hardcover, Softcover / Informatik, EDV/Programmiersprachen</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Online-Ausgabe</subfield><subfield code="z">978-1-4842-7452-1</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-033584207</subfield></datafield></record></collection> |
id | DE-604.BV048203252 |
illustrated | Not Illustrated |
indexdate | 2024-12-20T19:38:20Z |
institution | BVB |
isbn | 9781484274514 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-033584207 |
oclc_num | 1334031132 |
open_access_boolean | |
owner | DE-29T |
owner_facet | DE-29T |
physical | xxv, 585 Seiten Diagramme 1149 grams |
publishDate | 2022 |
publishDateSearch | 2022 |
publishDateSort | 2022 |
publisher | Apress |
record_format | marc |
spelling | Haines, Scott aut Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines 1st ed Berkeley, CA Apress [2022] © 2022 xxv, 585 Seiten Diagramme 1149 grams txt rdacontent n rdamedia nc rdacarrier Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey beginning with the basics of data ingestion, processing, and transformation, and ending up with an entire local data platform running Apache Spark, Apache Zeppelin, Apache Kafka, Redis, MySQL, Minio (S3), and Apache Airflow.Apache Spark applications solve a wide range of data problems from traditional data loading and processing to rich SQL-based analysis as well as complex machine learning workloads and even near real-time processing of streaming data. Spark fits well as a central foundation for any data engineering workload. This book will teach you to write interactive Spark applications using Apache Zeppelin notebooks, write and compile reusable applications and modules, and fully test both batch and streaming. You will also learn to containerize your applications using Docker and run and deploy your Spark applications using a variety of tools such as Apache Airflow, Docker and Kubernetes.Reading this book will empower you to take advantage of Apache Spark to optimize your data pipelines and teach you to craft modular and testable Spark applications. You will create and deploy mission-critical streaming spark applications in a low-stress environment that paves the way for your own path to production.What You Will Learn - Simplify data transformation with Spark Pipelines and Spark SQL- Bridge data engineering with machine learning- Architect modular data pipeline applications- Build reusable application components and libraries- Containerize your Spark applications for consistency and reliability- Use Docker and Kubernetes to deploy your Spark applications- Speed up application experimentation using Apache Zeppelin and Docker- Understand serializable structured data and data contracts - Harness effective strategies for optimizing data in your data lakes- Build end-to-end Spark structured streaming applications using Redis and Apache Kafka- Embrace testing for your batch and streaming applications- Deploy and monitor your Spark applications Who This Book Is ForProfessional software engineers who want to take their current skills and bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) Hardcover, Softcover / Informatik, EDV/Programmiersprachen Erscheint auch als Online-Ausgabe 978-1-4842-7452-1 |
spellingShingle | Haines, Scott Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) |
title | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications |
title_auth | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications |
title_exact_search | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications |
title_full | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines |
title_fullStr | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines |
title_full_unstemmed | Modern data engineering with Apache Spark a hands-on guide for building mission-critical streaming applications Scott Haines |
title_short | Modern data engineering with Apache Spark |
title_sort | modern data engineering with apache spark a hands on guide for building mission critical streaming applications |
title_sub | a hands-on guide for building mission-critical streaming applications |
topic | bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) |
topic_facet | bicssc bisacsh Quantitative research Database management Data mining Java (Computer program language) |
work_keys_str_mv | AT hainesscott moderndataengineeringwithapachesparkahandsonguideforbuildingmissioncriticalstreamingapplications |