DataFu: Open Source Apache Pig UDFs by LinkedIn
Here's a taste of what you can do with DataFu: Run PageRank on a large number of independent graphs. Perform set operations such as intersect and union. Compute the haversine distance between two points on the globe. Create an assertion on input data which will cause the script to fail if the condition is not met. Perform various operations on bags such as append a tuple, prepend a tuple, concatenate bags, generate unordered pairs, etc.
tags:pig,linkedin,nosql releases
via NoSQL databases
Post a Comment