Beginning Apache Pig Big Data Processing Made Easy /

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to d...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριος συγγραφέας: Vaddeman, Balaswamy (Συγγραφέας)
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Berkeley, CA : Apress : Imprint: Apress, 2016.
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 03723nam a22004455i 4500
001 978-1-4842-2337-6
003 DE-He213
005 20161210094237.0
007 cr nn 008mamaa
008 161210s2016 xxu| s |||| 0|eng d
020 |a 9781484223376  |9 978-1-4842-2337-6 
024 7 |a 10.1007/978-1-4842-2337-6  |2 doi 
040 |d GrThAP 
100 1 |a Vaddeman, Balaswamy.  |e author. 
245 1 0 |a Beginning Apache Pig  |h [electronic resource] :  |b Big Data Processing Made Easy /  |c by Balaswamy Vaddeman. 
264 1 |a Berkeley, CA :  |b Apress :  |b Imprint: Apress,  |c 2016. 
300 |a XXIII, 274 p. 69 illus., 35 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
505 0 |a Chapter 1 - Introduction -- Chapter 2 - Data types -- Chapter 3 - Grunt -- Chapter 4 - Introduction to Pig Latin -- Chapter 5 - Joins and Functions -- Chapter 6 - Pig Latin using Oozie -- Chapter 7 - Introduction to HCatalog -- Chapter 8 - Submitting Pig jobs using Hue -- Chapter 9 - Role of Pig in Apache Falcon -- Chapter 10 - Macros -- Chapter 11 - User defined Functions -- Chapter 12 - Writing your own eval and Filter Functions -- Chapter 13 - Writing your own Load and Store Functions -- Chapter 14 - Know Your Pig latin scripts -- Chapter 15 - Data formats -- Chapter 16 - Optimization -- Chapter 17 - Other Hadoop tools -- Appendix A - Builtin Functions -- Appendix B - Apache Pig in Apache Ambari -- Appendix C - HBaseStorage and ORCSTorage options. 
520 |a Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications. The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools. You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance. What You Will Learn • Use all the features of Apache Pig • Integrate Apache Pig with other tools • Extend Apache Pig • Optimize Pig Latin code • Solve different use cases for Pig Latin Who This Book Is For All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators. 
650 0 |a Computer science. 
650 0 |a Data structures (Computer science). 
650 0 |a Database management. 
650 0 |a Data mining. 
650 0 |a Information storage and retrieval. 
650 1 4 |a Computer Science. 
650 2 4 |a Open Source. 
650 2 4 |a Database Management. 
650 2 4 |a Data Storage Representation. 
650 2 4 |a Data Mining and Knowledge Discovery. 
650 2 4 |a Information Storage and Retrieval. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9781484223369 
856 4 0 |u http://dx.doi.org/10.1007/978-1-4842-2337-6  |z Full Text via HEAL-Link 
912 |a ZDB-2-CWD 
950 |a Professional and Applied Computing (Springer-12059)