Archivo de la categoría: Hadoop

Starting points on Pig

After diving on the web about Pig and how to go deeply as contributor, I found the following links that may be useful for future contributors in order to have a roughly idea of the architecture: Contributor Guide: here is … Sigue leyendo

Publicado en Hadoop, Pig, Varios | Etiquetado , , | Deja un comentario

Generating files to Eclipse

Based on the instructions of how to set up Pig into Eclipse environment [here] is necessary to execute through terminal: ant clean eclipse-files But, due to clean parameter, some folders are deleted and the generation is not complete. After some … Sigue leyendo

Publicado en Hadoop, Pig, Varios | Etiquetado , , | Deja un comentario

What is Pig?

Just to make more clear, I added the description of Pig available: Pig provides an engine for executing data flows in parallel on Hadoop. It includes a language, Pig Latin, for expressing these data flows. Pig Latin includes operators for many … Sigue leyendo

Publicado en Hadoop, Pig, Varios | Etiquetado , , , , | Deja un comentario

Hadoop – Pig (Intro)

  Well, after many years of producing a lot of information everywhere, it is necessary to create some sort of processing massive data for customers. But that’s not a easy task, actually it depends on feasible resources to use in … Sigue leyendo

Publicado en Hadoop, Varios | Etiquetado , , , , | Deja un comentario