If you are an RDBMS person that is used to joining data from tables, you will be happy to know that this capability exists in Pig. Below is a very simple example. grunt> fs -cat /people 1,1,steve howard 2,1,becky howard…
Category: Pig
Pig script to group URL requests in JBOSS
As we move towards an enterprise data analytics platform, I take every opportunity I can to come up with simple jobs in Hadoop, Hive, and Pig. Below is one I ran in Pig that groups the top 50 URL requests…