Mondadori Store

Trova Mondadori Store

Benvenuto
Accedi o registrati

lista preferiti

Per utilizzare la funzione prodotti desiderati devi accedere o registrarti

Vai al carrello
 prodotti nel carrello

Totale  articoli

0,00 € IVA Inclusa

Learn Apache Pig Full

Hoang Tran
pubblicato da hoang tran

Prezzo online:
0,00

What is Apache Pig?

Apache Pig is an abstraction over MapReduce. It is a tool/platform which is used to analyze larger sets of data representing them as data flows. Pig is generally used with Hadoop; we can perform all the data manipulation operations in Hadoop using Apache Pig.

To write data analysis programs, Pig provides a high-level language known as Pig Latin. This language provides various operators using which programmers can develop their own functions for reading, writing, and processing data.

To analyze data using Apache Pig, programmers need to write scripts using Pig Latin language. All these scripts are internally converted to Map and Reduce tasks. Apache Pig has a component known as Pig Engine that accepts the Pig Latin scripts as input and converts those scripts into MapReduce jobs.

Why Do We Need Apache Pig?

Programmers who are not so good at Java normally used to struggle working with Hadoop, especially while performing any MapReduce tasks. Apache Pig is a boon for all such programmers.

Using Pig Latin, programmers can perform MapReduce tasks easily without having to type complex codes in Java.
Apache Pig uses multi-query approach, thereby reducing the length of codes. For example, an operation that would require you to type 200 lines of code (LoC) in Java can be easily done by typing as less as just 10 LoC in Apache Pig. Ultimately Apache Pig reduces the development time by almost 16 times.
Pig Latin is SQL-like language and it is easy to learn Apache Pig when you are familiar with SQL.
Apache Pig provides many built-in operators to support data operations like joins, filters, ordering, etc. In addition, it also provides nested data types like tuples, bags, and maps that are missing from MapReduce.

Features of Pig

Apache Pig comes with the following features

Rich set of operators It provides many operators to perform operations like join, sort, filer, etc.
Ease of programming Pig Latin is similar to SQL and it is easy to write a Pig script if you are good at SQL.
Optimization opportunities The tasks in Apache Pig optimize their execution automatically, so the programmers need to focus only on semantics of the language.
Extensibility Using the existing operators, users can develop their own functions to read, process, and write data.
UDF's Pig provides the facility to create User-defined Functions in other programming languages such as Java and invoke or embed them in Pig Scripts.
Handles all kinds of data Apache Pig analyzes all kinds of data, both structured as well as unstructured. It stores the results in HDFS.

Dettagli down

Generi Politica e Società » Comunicazione e Media » Internet, tecnologie e società , Informatica e Web » Linguaggi e Applicazioni » Database » Programmazione e sviluppo del software

Editore Hoang Tran

Formato Ebook (senza DRM)

Pubblicato 04/12/2019

Lingua Inglese

EAN-13 1230003596319

0 recensioni dei lettori  media voto 0  su  5

Scrivi una recensione per "Learn Apache Pig Full"

Learn Apache Pig Full
 

Accedi o Registrati  per aggiungere una recensione

usa questo box per dare una valutazione all'articolo: leggi le linee guida
torna su Torna in cima