Big Data Processing Using Spark in Cloud

The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding o...

Full description

Bibliographic Details
Corporate Author: SpringerLink (Online service)
Other Authors: Mittal, Mamta (Editor, http://id.loc.gov/vocabulary/relators/edt), Balas, Valentina E. (Editor, http://id.loc.gov/vocabulary/relators/edt), Goyal, Lalit Mohan (Editor, http://id.loc.gov/vocabulary/relators/edt), Kumar, Raghvendra (Editor, http://id.loc.gov/vocabulary/relators/edt)
Format: Electronic eBook
Language:English
Published: Singapore : Springer Singapore : Imprint: Springer, 2019.
Edition:1st ed. 2019.
Series:Studies in Big Data, 43
Subjects:
Online Access:Full Text via HEAL-Link
LEADER 04050nam a2200529 4500
001 978-981-13-0550-4
003 DE-He213
005 20190617194007.0
007 cr nn 008mamaa
008 180616s2019 si | s |||| 0|eng d
020 |a 9789811305504  |9 978-981-13-0550-4 
024 7 |a 10.1007/978-981-13-0550-4  |2 doi 
040 |d GrThAP 
050 4 |a QA76.9.B45 
072 7 |a UN  |2 bicssc 
072 7 |a COM021000  |2 bisacsh 
072 7 |a UN  |2 thema 
082 0 4 |a 005.7  |2 23 
245 1 0 |a Big Data Processing Using Spark in Cloud  |h [electronic resource] /  |c edited by Mamta Mittal, Valentina E. Balas, Lalit Mohan Goyal, Raghvendra Kumar. 
250 |a 1st ed. 2019. 
264 1 |a Singapore :  |b Springer Singapore :  |b Imprint: Springer,  |c 2019. 
300 |a XIII, 264 p. 89 illus., 62 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Studies in Big Data,  |x 2197-6503 ;  |v 43  
505 0 |a Concepts of Big Data and Apache Spark -- Big Data Analysis in Cloud and Machine Learning -- Security Issues and Challenges related to Big Data -- Big Data Security Solutions in Cloud -- Data Science and Analytics -- Big Data Technologies -- Data Analysis with Casandra and Spark -- Spin up the Spark Cluster -- Learn Scala -- IO for Spark -- Processing with Spark -- Spark Data Frames and Spark SQL -- Machine Learning and Advanced Analytics -- Parallel Programming with Spark -- Distributed Graph Processing with Spark -- Real Time Processing with Spark -- Spark in Real World -- Case Studies. . 
520 |a The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements big data's immutable nature, and solves it with lazy evaluation, cacheable and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark data frames, machine learning using Mllib, graph analytics using Graph X and real-time processing with Apache Kafka, AWS Kenisis, and Azure Event Hub. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines the interaction with current big data tools, with Spark being the core processing layer for all types of data. The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments. 
650 0 |a Big data. 
650 0 |a Computer security. 
650 1 4 |a Big Data.  |0 http://scigraph.springernature.com/things/product-market-codes/I29120 
650 2 4 |a Systems and Data Security.  |0 http://scigraph.springernature.com/things/product-market-codes/I28060 
650 2 4 |a Big Data/Analytics.  |0 http://scigraph.springernature.com/things/product-market-codes/522070 
700 1 |a Mittal, Mamta.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Balas, Valentina E.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Goyal, Lalit Mohan.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Kumar, Raghvendra.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9789811305498 
776 0 8 |i Printed edition:  |z 9789811305511 
776 0 8 |i Printed edition:  |z 9789811344480 
830 0 |a Studies in Big Data,  |x 2197-6503 ;  |v 43  
856 4 0 |u https://doi.org/10.1007/978-981-13-0550-4  |z Full Text via HEAL-Link 
912 |a ZDB-2-ENG 
950 |a Engineering (Springer-11647)