Beginning Apache Pig Big Data Processing Made Easy /

Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to d...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Κύριος συγγραφέας:	Vaddeman, Balaswamy (Συγγραφέας)
Συγγραφή απο Οργανισμό/Αρχή:	SpringerLink (Online service)
Μορφή:	Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:	English
Έκδοση:	Berkeley, CA : Apress : Imprint: Apress, 2016.
Θέματα:	Computer science. Data structures (Computer science). Database management. Data mining. Information storage and retrieval. Computer Science. Open Source. Database Management. Data Storage Representation. Data Mining and Knowledge Discovery. Information Storage and Retrieval.
Διαθέσιμο Online:	Full Text via HEAL-Link


LEADER	03723nam a22004455i 4500
001	978-1-4842-2337-6
003	DE-He213
005	20161210094237.0
007	cr nn 008mamaa
008	161210s2016 xxu\| s \|\|\|\| 0\|eng d
020			\|a 9781484223376 \|9 978-1-4842-2337-6
024	7		\|a 10.1007/978-1-4842-2337-6 \|2 doi
040			\|d GrThAP
100	1		\|a Vaddeman, Balaswamy. \|e author.
245	1	0	\|a Beginning Apache Pig \|h [electronic resource] : \|b Big Data Processing Made Easy / \|c by Balaswamy Vaddeman.
264		1	\|a Berkeley, CA : \|b Apress : \|b Imprint: Apress, \|c 2016.
300			\|a XXIII, 274 p. 69 illus., 35 illus. in color. \|b online resource.
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
347			\|a text file \|b PDF \|2 rda
505	0		\|a Chapter 1 - Introduction -- Chapter 2 - Data types -- Chapter 3 - Grunt -- Chapter 4 - Introduction to Pig Latin -- Chapter 5 - Joins and Functions -- Chapter 6 - Pig Latin using Oozie -- Chapter 7 - Introduction to HCatalog -- Chapter 8 - Submitting Pig jobs using Hue -- Chapter 9 - Role of Pig in Apache Falcon -- Chapter 10 - Macros -- Chapter 11 - User defined Functions -- Chapter 12 - Writing your own eval and Filter Functions -- Chapter 13 - Writing your own Load and Store Functions -- Chapter 14 - Know Your Pig latin scripts -- Chapter 15 - Data formats -- Chapter 16 - Optimization -- Chapter 17 - Other Hadoop tools -- Appendix A - Builtin Functions -- Appendix B - Apache Pig in Apache Ambari -- Appendix C - HBaseStorage and ORCSTorage options.
520			\|a Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications. The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools. You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance. What You Will Learn • Use all the features of Apache Pig • Integrate Apache Pig with other tools • Extend Apache Pig • Optimize Pig Latin code • Solve different use cases for Pig Latin Who This Book Is For All levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators.
650		0	\|a Computer science.
650		0	\|a Data structures (Computer science).
650		0	\|a Database management.
650		0	\|a Data mining.
650		0	\|a Information storage and retrieval.
650	1	4	\|a Computer Science.
650	2	4	\|a Open Source.
650	2	4	\|a Database Management.
650	2	4	\|a Data Storage Representation.
650	2	4	\|a Data Mining and Knowledge Discovery.
650	2	4	\|a Information Storage and Retrieval.
710	2		\|a SpringerLink (Online service)
773	0		\|t Springer eBooks
776	0	8	\|i Printed edition: \|z 9781484223369
856	4	0	\|u http://dx.doi.org/10.1007/978-1-4842-2337-6 \|z Full Text via HEAL-Link
912			\|a ZDB-2-CWD
950			\|a Professional and Applied Computing (Springer-12059)

Beginning Apache Pig Big Data Processing Made Easy /

Παρόμοια τεκμήρια