SquareCog's SquareBlog

Pig, HBase, Hadoop, and Twitter: HUG talk slides

Posted in programming by squarecog on May 20, 2010

I presented tonight at the Bay Area Hadoop User Group, talking briefly about Twitter’s use of Hadoop and Pig. Here are the slides:

View this document on Scribd

GROUP operator in Apache Pig

Posted in programming by squarecog on May 11, 2010

I’ve been doing a fair amount of helping people get started with Apache Pig. One common stumbling block is the GROUP operator. Although familiar, as it serves a similar function to SQL’s GROUP operator, it is just different enough in the Pig Latin language to be confusing. Hopefully this brief post will shed some light on what exactly is going on.
(more…)

Tagged with: , ,
%d bloggers like this: