Archivo de la categoría: Hadoop
Starting points on Pig
After diving on the web about Pig and how to go deeply as contributor, I found the following links that may be useful for future contributors in order to have a roughly idea of the architecture: Contributor Guide: here is … Sigue leyendo
Generating files to Eclipse
Based on the instructions of how to set up Pig into Eclipse environment [here] is necessary to execute through terminal: ant clean eclipse-files But, due to clean parameter, some folders are deleted and the generation is not complete. After some … Sigue leyendo
What is Pig?
Just to make more clear, I added the description of Pig available: Pig provides an engine for executing data flows in parallel on Hadoop. It includes a language, Pig Latin, for expressing these data flows. Pig Latin includes operators for many … Sigue leyendo
Hadoop – Pig (Intro)
Well, after many years of producing a lot of information everywhere, it is necessary to create some sort of processing massive data for customers. But that’s not a easy task, actually it depends on feasible resources to use in … Sigue leyendo