Availability: ADVANCED ANALYTICS WITH PYSPARK | Technische Universität München

Bibliographic Details
Main Authors:	Tandon, Akash (Author), Owen, Sean (Author), Wills, Josh (Author), Ryza, Sandy (Author), Laserson, Uri 1983- (Author)
Format:	Electronic eBook
Language:	English
Published:	[Place of publication not identified] O'REILLY MEDIA 2022
Subjects:	SPARK (Electronic resource); Python (Computer program language); Data mining; Python (Langage de programmation); Exploration de données (Informatique)
Links:	https://learning.oreilly.com/library/view/-/9781098103644/?ar
Summary:	The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques, including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. Familiarize yourself with Spark's programming model and ecosystem. Learn general approaches in data science. Examine complete implementations that analyze large public datasets. Discover which machine learning tools make sense for particular problems. Explore code that can be adapted to many uses.
Physical Description:	1 online resource
ISBN:	9781098103620 1098103629

Staff View

MARC


LEADER	00000cam a22000002c 4500
001	ZDB-30-ORH-063078163
003	DE-627-1
005	20240228121722.0
007	cr uuu---uuuuu
008	210427s2022 xx \|\|\|\|\|o 00\| \|\|eng c
020			\|a 9781098103620 \|c electronic bk. \|9 978-1-0981-0362-0
020			\|a 1098103629 \|c electronic bk. \|9 1-0981-0362-9
035			\|a (DE-627-1)063078163
035			\|a (DE-599)KEP063078163
035			\|a (ORHE)9781098103644
040			\|a DE-627 \|b ger \|c DE-627 \|e rda
041			\|a eng
082	0		\|a 006.3/12 \|2 23/eng/20220621
100	1		\|a Tandon, Akash \|e VerfasserIn \|4 aut
245	1	0	\|a ADVANCED ANALYTICS WITH PYSPARK \|b patterns for learning from data at scale using Python and Spark \|c Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills
264		1	\|a [Erscheinungsort nicht ermittelbar] \|b O'REILLY MEDIA \|c 2022
300			\|a 1 online resource
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
520			\|a The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques, including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. Familiarize yourself with Spark's programming model and ecosystem. Learn general approaches in data science. Examine complete implementations that analyze large public datasets. Discover which machine learning tools make sense for particular problems. Explore code that can be adapted to many uses.
630	2	0	\|a SPARK (Electronic resource)
650		0	\|a Python (Computer program language)
650		0	\|a Data mining
650		4	\|a SPARK (Electronic resource)
650		4	\|a Python (Langage de programmation)
650		4	\|a Exploration de données (Informatique)
650		4	\|a Data mining
650		4	\|a Python (Computer program language)
700	1		\|a Owen, Sean \|e VerfasserIn \|4 aut
700	1		\|a Wills, Josh \|e VerfasserIn \|4 aut
700	1		\|a Ryza, Sandy \|e VerfasserIn \|4 aut
700	1		\|a Laserson, Uri \|d 1983- \|e VerfasserIn \|4 aut
776	1		\|z 1098103653
776	0	8	\|i Erscheint auch als \|n Druck-Ausgabe \|z 1098103653
966	4	0	\|l DE-91 \|p ZDB-30-ORH \|q TUM_PDA_ORH \|u https://learning.oreilly.com/library/view/-/9781098103644/?ar \|m X:ORHE \|x Aggregator \|z lizenzpflichtig \|3 Volltext
912			\|a ZDB-30-ORH
951			\|a BO
049			\|a DE-91

Record in the Search Index

DE-BY-TUM_katkey	ZDB-30-ORH-063078163
_version_	1833357041752604672
adam_text
any_adam_object
author	Tandon, Akash Owen, Sean Wills, Josh Ryza, Sandy Laserson, Uri 1983-
author_facet	Tandon, Akash Owen, Sean Wills, Josh Ryza, Sandy Laserson, Uri 1983-
author_role	aut aut aut aut aut
author_sort	Tandon, Akash
author_variant	a t at s o so j w jw s r sr u l ul
building	Verbundindex
bvnumber	localTUM
collection	ZDB-30-ORH
ctrlnum	(DE-627-1)063078163 (DE-599)KEP063078163 (ORHE)9781098103644
dewey-full	006.3/12
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	006 - Special computer methods
dewey-raw	006.3/12
dewey-search	006.3/12
dewey-sort	16.3 212
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Electronic eBook
id	ZDB-30-ORH-063078163
illustrated	Not Illustrated
indexdate	2025-05-28T09:45:23Z
institution	BVB
isbn	9781098103620 1098103629
language	English
open_access_boolean
owner	DE-91 DE-BY-TUM
owner_facet	DE-91 DE-BY-TUM
physical	1 online resource
psigel	ZDB-30-ORH TUM_PDA_ORH ZDB-30-ORH
publishDate	2022
publishDateSearch	2022
publishDateSort	2022
publisher	O'REILLY MEDIA
record_format	marc
spelling	Tandon, Akash VerfasserIn aut ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills [Erscheinungsort nicht ermittelbar] O'REILLY MEDIA 2022 1 online resource Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. Familiarize yourself with Spark's programming model and ecosystem Learn general approaches in data science Examine complete implementations that analyze large public datasets Discover which machine learning tools make sense for particular problems Explore code that can be adapted to many uses. SPARK (Electronic resource) Python (Computer program language) Data mining Python (Langage de programmation) Exploration de données (Informatique) Owen, Sean VerfasserIn aut Wills, Josh VerfasserIn aut Ryza, Sandy VerfasserIn aut Laserson, Uri 1983- VerfasserIn aut 1098103653 Erscheint auch als Druck-Ausgabe 1098103653
spellingShingle	Tandon, Akash Owen, Sean Wills, Josh Ryza, Sandy Laserson, Uri 1983- ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark SPARK (Electronic resource) Python (Computer program language) Data mining Python (Langage de programmation) Exploration de données (Informatique)
title	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark
title_auth	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark
title_exact_search	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark
title_full	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills
title_fullStr	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills
title_full_unstemmed	ADVANCED ANALYTICS WITH PYSPARK patterns for learning from data at scale using Python and Spark Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills
title_short	ADVANCED ANALYTICS WITH PYSPARK
title_sort	advanced analytics with pyspark patterns for learning from data at scale using python and spark
title_sub	patterns for learning from data at scale using Python and Spark
topic	SPARK (Electronic resource) Python (Computer program language) Data mining Python (Langage de programmation) Exploration de données (Informatique)
topic_facet	SPARK (Electronic resource) Python (Computer program language) Data mining Python (Langage de programmation) Exploration de données (Informatique)
work_keys_str_mv	AT tandonakash advancedanalyticswithpysparkpatternsforlearningfromdataatscaleusingpythonandspark AT owensean advancedanalyticswithpysparkpatternsforlearningfromdataatscaleusingpythonandspark AT willsjosh advancedanalyticswithpysparkpatternsforlearningfromdataatscaleusingpythonandspark AT ryzasandy advancedanalyticswithpysparkpatternsforlearningfromdataatscaleusingpythonandspark AT lasersonuri advancedanalyticswithpysparkpatternsforlearningfromdataatscaleusingpythonandspark
