Verfügbarkeit: Applied data science using Pyspark | Technische Universität München

Gespeichert in:

Bibliographische Detailangaben
Beteilige Person:	Kakarla, Ramcharan (VerfasserIn)
Weitere beteiligte Personen:	Krishnan, Sundar (MitwirkendeR), Alla, Sridhar (MitwirkendeR)
Format:	Elektronisch E-Book
Sprache:	Englisch
Veröffentlicht:	Berkeley, CA Apress 2021
Schlagwörter:	Big data Machine learning Python (Computer program language) Parallel processing (Electronic computers) Données volumineuses Apprentissage automatique Python (Langage de programmation) Parallélisme (Informatique) Computer software
Links:	https://learning.oreilly.com/library/view/-/9781484265000/?ar
Zusammenfassung:	Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines. By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets. You will: Build an end-to-end predictive model Implement multiple variable selection techniques Operationalize models Master multiple algorithms and implementations.
Beschreibung:	Gradient Descent. - Includes index. - Print version record
Umfang:	1 Online-Ressource (427 pages)
ISBN:	9781484265000 1484265009 1484265017

Internformat

MARC


LEADER	00000cam a22000002c 4500
001	ZDB-30-ORH-061075728
003	DE-627-1
005	20240228121241.0
007	cr uuu---uuuuu
008	210118s2021 xx \|\|\|\|\|o 00\| \|\|eng c
020			\|a 9781484265000 \|c electronic bk. \|9 978-1-4842-6500-0
020			\|a 1484265009 \|c electronic bk. \|9 1-4842-6500-9
020			\|a 1484265017 \|9 1-4842-6501-7
035			\|a (DE-627-1)061075728
035			\|a (DE-599)KEP061075728
035			\|a (ORHE)9781484265000
035			\|a (DE-627-1)061075728
040			\|a DE-627 \|b ger \|c DE-627 \|e rda
041			\|a eng
072		7	\|a U. \|2 bicssc
072		7	\|a COM000000 \|2 bisacsh
082	0		\|a 005.7 \|2 23
082	0		\|a 004 \|2 23
100	1		\|a Kakarla, Ramcharan \|e VerfasserIn \|4 aut
245	1	0	\|a Applied data science using Pyspark \|b learn the end-to-end predictive model-building cycle \|c Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla
264		1	\|a Berkeley, CA \|b Apress \|c 2021
300			\|a 1 Online-Ressource (427 pages)
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Gradient Descent. - Includes index. - Print version record
520			\|a Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines. By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets. You will: Build an end-to-end predictive model Implement multiple variable selection techniques Operationalize models Master multiple algorithms and implementations.
650		0	\|a Big data
650		0	\|a Machine learning
650		0	\|a Python (Computer program language)
650		0	\|a Parallel processing (Electronic computers)
650		4	\|a Données volumineuses
650		4	\|a Apprentissage automatique
650		4	\|a Python (Langage de programmation)
650		4	\|a Parallélisme (Informatique)
650		4	\|a Python (Computer program language)
650		4	\|a Parallel processing (Electronic computers)
650		4	\|a Big data
650		4	\|a Computer software
650		4	\|a Machine learning
700	1		\|a Krishnan, Sundar \|e MitwirkendeR \|4 ctb
700	1		\|a Alla, Sridhar \|e MitwirkendeR \|4 ctb
776	1		\|z 9781484264997
776	0	8	\|i Erscheint auch als \|n Druck-Ausgabe \|z 9781484264997
966	4	0	\|l DE-91 \|p ZDB-30-ORH \|q TUM_PDA_ORH \|u https://learning.oreilly.com/library/view/-/9781484265000/?ar \|m X:ORHE \|x Aggregator \|z lizenzpflichtig \|3 Volltext
912			\|a ZDB-30-ORH
912			\|a ZDB-30-ORH
951			\|a BO
912			\|a ZDB-30-ORH
049			\|a DE-91

Datensatz im Suchindex

DE-BY-TUM_katkey	ZDB-30-ORH-061075728
_version_	1835903151638577152
adam_text
any_adam_object
author	Kakarla, Ramcharan
author2	Krishnan, Sundar Alla, Sridhar
author2_role	ctb ctb
author2_variant	s k sk s a sa
author_facet	Kakarla, Ramcharan Krishnan, Sundar Alla, Sridhar
author_role	aut
author_sort	Kakarla, Ramcharan
author_variant	r k rk
building	Verbundindex
bvnumber	localTUM
collection	ZDB-30-ORH
ctrlnum	(DE-627-1)061075728 (DE-599)KEP061075728 (ORHE)9781484265000
dewey-full	005.7 004
dewey-hundreds	000 - Computer science, information, general works
dewey-ones	005 - Computer programming, programs, data, security 004 - Computer science
dewey-raw	005.7 004
dewey-search	005.7 004
dewey-sort	15.7
dewey-tens	000 - Computer science, information, general works
discipline	Informatik
format	Electronic eBook
fullrecord	<?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>03800cam a22006012c 4500</leader><controlfield tag="001">ZDB-30-ORH-061075728</controlfield><controlfield tag="003">DE-627-1</controlfield><controlfield tag="005">20240228121241.0</controlfield><controlfield tag="007">cr uuu---uuuuu</controlfield><controlfield tag="008">210118s2021 xx \|\|\|\|\|o 00\| \|\|eng c</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781484265000</subfield><subfield code="c">electronic bk.</subfield><subfield code="9">978-1-4842-6500-0</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1484265009</subfield><subfield code="c">electronic bk.</subfield><subfield code="9">1-4842-6500-9</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">1484265017</subfield><subfield code="9">1-4842-6501-7</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)061075728</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)KEP061075728</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(ORHE)9781484265000</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-627-1)061075728</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-627</subfield><subfield code="b">ger</subfield><subfield code="c">DE-627</subfield><subfield code="e">rda</subfield></datafield><datafield tag="041" ind1=" " ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">U.</subfield><subfield code="2">bicssc</subfield></datafield><datafield tag="072" ind1=" " ind2="7"><subfield code="a">COM000000</subfield><subfield code="2">bisacsh</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">005.7</subfield><subfield code="2">23</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">004</subfield><subfield code="2">23</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Kakarla, Ramcharan</subfield><subfield code="e">VerfasserIn</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Applied data science using Pyspark</subfield><subfield code="b">learn the end-to-end predictive model-building cycle</subfield><subfield code="c">Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berkeley, CA</subfield><subfield code="b">Apress</subfield><subfield code="c">2021</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">1 Online-Ressource (427 pages)</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="a">Text</subfield><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="a">Computermedien</subfield><subfield code="b">c</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="a">Online-Ressource</subfield><subfield code="b">cr</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="500" ind1=" " ind2=" "><subfield code="a">Gradient Descent. - Includes index. - Print version record</subfield></datafield><datafield tag="520" ind1=" " ind2=" "><subfield code="a">Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines. By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets. You will: Build an end-to-end predictive model Implement multiple variable selection techniques Operationalize models Master multiple algorithms and implementations.</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Big data</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Machine learning</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Python (Computer program language)</subfield></datafield><datafield tag="650" ind1=" " ind2="0"><subfield code="a">Parallel processing (Electronic computers)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Données volumineuses</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Apprentissage automatique</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Python (Langage de programmation)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Parallélisme (Informatique)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Python (Computer program language)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Parallel processing (Electronic computers)</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Big data</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Computer software</subfield></datafield><datafield tag="650" ind1=" " ind2="4"><subfield code="a">Machine learning</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Krishnan, Sundar</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Alla, Sridhar</subfield><subfield code="e">MitwirkendeR</subfield><subfield code="4">ctb</subfield></datafield><datafield tag="776" ind1="1" ind2=" "><subfield code="z">9781484264997</subfield></datafield><datafield tag="776" ind1="0" ind2="8"><subfield code="i">Erscheint auch als</subfield><subfield code="n">Druck-Ausgabe</subfield><subfield code="z">9781484264997</subfield></datafield><datafield tag="966" ind1="4" ind2="0"><subfield code="l">DE-91</subfield><subfield code="p">ZDB-30-ORH</subfield><subfield code="q">TUM_PDA_ORH</subfield><subfield code="u">https://learning.oreilly.com/library/view/-/9781484265000/?ar</subfield><subfield code="m">X:ORHE</subfield><subfield code="x">Aggregator</subfield><subfield code="z">lizenzpflichtig</subfield><subfield code="3">Volltext</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="951" ind1=" " ind2=" "><subfield code="a">BO</subfield></datafield><datafield tag="912" ind1=" " ind2=" "><subfield code="a">ZDB-30-ORH</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-91</subfield></datafield></record></collection>
id	ZDB-30-ORH-061075728
illustrated	Not Illustrated
indexdate	2025-06-25T12:14:43Z
institution	BVB
isbn	9781484265000 1484265009 1484265017
language	English
open_access_boolean
owner	DE-91 DE-BY-TUM
owner_facet	DE-91 DE-BY-TUM
physical	1 Online-Ressource (427 pages)
psigel	ZDB-30-ORH TUM_PDA_ORH ZDB-30-ORH
publishDate	2021
publishDateSearch	2021
publishDateSort	2021
publisher	Apress
record_format	marc
spelling	Kakarla, Ramcharan VerfasserIn aut Applied data science using Pyspark learn the end-to-end predictive model-building cycle Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla Berkeley, CA Apress 2021 1 Online-Ressource (427 pages) Text txt rdacontent Computermedien c rdamedia Online-Ressource cr rdacarrier Gradient Descent. - Includes index. - Print version record Discover the capabilities of PySpark and its application in the realm of data science. This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade. Applied Data Science Using PySpark is divided unto six sections which walk you through the book. In section 1, you start with the basics of PySpark focusing on data manipulation. We make you comfortable with the language and then build upon it to introduce you to the mathematical functions available off the shelf. In section 2, you will dive into the art of variable selection where we demonstrate various selection techniques available in PySpark. In section 3, we take you on a journey through machine learning algorithms, implementations, and fine-tuning techniques. We will also talk about different validation metrics and how to use them for picking the best models. Sections 4 and 5 go through machine learning pipelines and various methods available to operationalize the model and serve it through Docker/an API. In the final section, you will cover reusable objects for easy experimentation and learn some tricks that can help you optimize your programs and machine learning pipelines. By the end of this book, you will have seen the flexibility and advantages of PySpark in data science applications. This book is recommended to those who want to unleash the power of parallel computing by simultaneously working with big datasets. You will: Build an end-to-end predictive model Implement multiple variable selection techniques Operationalize models Master multiple algorithms and implementations. Big data Machine learning Python (Computer program language) Parallel processing (Electronic computers) Données volumineuses Apprentissage automatique Python (Langage de programmation) Parallélisme (Informatique) Computer software Krishnan, Sundar MitwirkendeR ctb Alla, Sridhar MitwirkendeR ctb 9781484264997 Erscheint auch als Druck-Ausgabe 9781484264997
spellingShingle	Kakarla, Ramcharan Applied data science using Pyspark learn the end-to-end predictive model-building cycle Big data Machine learning Python (Computer program language) Parallel processing (Electronic computers) Données volumineuses Apprentissage automatique Python (Langage de programmation) Parallélisme (Informatique) Computer software
title	Applied data science using Pyspark learn the end-to-end predictive model-building cycle
title_auth	Applied data science using Pyspark learn the end-to-end predictive model-building cycle
title_exact_search	Applied data science using Pyspark learn the end-to-end predictive model-building cycle
title_full	Applied data science using Pyspark learn the end-to-end predictive model-building cycle Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla
title_fullStr	Applied data science using Pyspark learn the end-to-end predictive model-building cycle Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla
title_full_unstemmed	Applied data science using Pyspark learn the end-to-end predictive model-building cycle Ramcharan Kakarla, Sundar Krishnan, Sridhar Alla
title_short	Applied data science using Pyspark
title_sort	applied data science using pyspark learn the end to end predictive model building cycle
title_sub	learn the end-to-end predictive model-building cycle
topic	Big data Machine learning Python (Computer program language) Parallel processing (Electronic computers) Données volumineuses Apprentissage automatique Python (Langage de programmation) Parallélisme (Informatique) Computer software
topic_facet	Big data Machine learning Python (Computer program language) Parallel processing (Electronic computers) Données volumineuses Apprentissage automatique Python (Langage de programmation) Parallélisme (Informatique) Computer software
work_keys_str_mv	AT kakarlaramcharan applieddatascienceusingpysparklearntheendtoendpredictivemodelbuildingcycle AT krishnansundar applieddatascienceusingpysparklearntheendtoendpredictivemodelbuildingcycle AT allasridhar applieddatascienceusingpysparklearntheendtoendpredictivemodelbuildingcycle

Verfügbarkeit

‌

Online lesen