Process your data with Apache Pig

Posted by Scott_Ruecker on Feb 28, 2012 6:26 AM EDT
IBM/developerWorks; By M. Tim Jones

Apache Pig is a high-level procedural language for querying large, semi-structured data sets using Hadoop and the MapReduce platform. Pig simplifies the use of Hadoop by allowing SQL-like queries against a distributed data set. Explore the language behind Pig and discover its use in a simple Hadoop cluster.
