skip to main | skip to sidebar
  • Home
  • mindstorms
  • About
  • NoSQL
  • Think Differently Big
  • Profile
  • Wishlist
  • RSS

NoSQL: Flatten Entire HBase Column Families With Pig and Python UDFs

Alex Popescu | 2013/02/11 | bookmark | email

Flatten Entire HBase Column Families With Pig and Python UDFs

Chase Seibert:

tags:hbase,pig,udf,python,column store,bigtable,hadoop,bigdata,mapreduce

via NoSQL databases

Related Posts

    Post a Comment

    Older Post Newer Post Home

    mindstorms

    Software and web architectures, cloud computing and a flavor of tech startup entrepreneurship through the eyes of Alex Popescu.

    About me: Software architect, Web Aficionado, Cloud Computing Fanboy, Geek Entrepreneur, Speaker, Co-founder and CTO of InfoQ.com, Writing also about NoSQL on the myNoSQL blog

    Latest comments

    Loading...

    think differently big

    Loading...

    Tag Cloud Sphere ▼

    Follow Alex on Twitter ▼

      follow me on Twitter

      Daily Cloud Stream ▼

      Loading...

      Show more articles

        • NoSQL: One Database to Rule Them All?
        • NoSQL: The History of Hadoop Changed the World
        • NoSQL: Rackspace Buys MongoDB Hosting Provider Obj...
        • NoSQL: Some Interesting Facts, Sorry FUD About Had...
        • NoSQL: Spring for Apache Hadoop 1.0 Goes GA: Wrapp...
        • NoSQL: Apache Pig Goes 0.11
        • NoSQL: Cloudera Pissed Off
        • NoSQL: What Makes Amazon Redshift Faster Than Hive?
        • NoSQL: Big Data at Torbit: Custom MapReduce-like S...
        • NoSQL: MarkLogic's New (Aggressive) Voice
        • NoSQL: Redshift Performance & Cost at Airbnb
        • NoSQL: Project Rhino: Enhanced Data Protection for...
        • NoSQL: Hadoop - the Foundation for Change
        • NoSQL: Redis - Pick the Right Data Structure
        • NoSQL: An Overview of Cascading
        • NoSQL: An Overview of Scalding
        • NoSQL: Which Big Data Company Has the World's Bigg...
        • NoSQL: Creating a Simple Bloom Filter in Python
        • NoSQL: Using Apache ZooKeeper to Build Distributed...
        • NoSQL: Integrating MongoDB and Hadoop: Why & How
        • NoSQL: The State of CouchDB - Jan Lehnardt's Comment
        • NoSQL: Big Data: 3 Questions to Ask When Comparing...
        • NoSQL: The State of CouchDB With Comments
        • NoSQL: A Data Store Independent of Consistency Mod...
        • NoSQL: Are These the World's Most Innovative Big D...
        • NoSQL: Vague Goals Seed Big Data Failures
        • NoSQL: CouchDB Future Feature List
        • NoSQL: Amazon Preparing 'Disruptive' Big Data AWS ...
        • NoSQL: Why Can't RDBM Cluster the Way NoSQL Does? ...
        • NoSQL: Hortonworks: The Fastest Path to Innovation...
        • NoSQL: Counting Triangles Smarter (Or How to Beat ...
        • NoSQL: Hadoop Silos Need Integration
        • NoSQL: What Is a Data Fabric Architecture?
        • NoSQL: Inside Cloudera Impala: Runtime Code Genera...
        • NoSQL: Hortonworks and Community Driven Hadoop
        • NoSQL: MySQL 5.6 vs. MariaDB 10.0
        • NoSQL: Amazon Redshift - Now Broadly Available
        • NoSQL: CouchDB In-Browser JavaScript Debugger
        • NoSQL: VMware Sues Hortonworks
        • NoSQL: From SimpleDB to Cassandra: Data Migration ...
        • NoSQL: What's New and Upcoming in HDFS
        • NoSQL: 12 Data Best Practices
        • NoSQL: HDFS Paper: HARDFS - Hardening HDFS With Se...
        • NoSQL: The Family of MapReduce and Large Scale Dat...
        • NoSQL: Data Deduplication Tactics With HDFS and Ma...
        • NoSQL: NoSQL on MySQL: Stating the Obvious
        • NoSQL: The exhaustive list of other NoSQL vendors ...
        • NoSQL: MySQL 5.6 - What's New
        • NoSQL: Reactions to MySQL 5.6: Couchbase
        • NoSQL: DataStax's Reaction to MySQL 5.6: Oracle’s ...
        • NoSQL: Choose Your Database - CRUD
        • NoSQL: A Human-Readable Jackrabbit Persistence Man...
        • NoSQL: HDFS Puzzle: How to Remove 3 Disks From 3 D...
        • NoSQL: Handling Growth With Postgres: 5 Tips From ...
        • NoSQL: The Origins of 'Big Data': An Etymological ...
        • NoSQL: How Do I Freaking Scale Oracle?
        • NoSQL: Writing Hive UDFs With Java - a Tutorial
        • NoSQL: Playing With Hadoop Pig
        • NoSQL: Using Hadoop Pig With MongoDB
        • NoSQL: Flatten Entire HBase Column Families With P...
        • NoSQL: NoSQL Bug Fix Releases: Redis 2.6.10 and Ra...
        • NoSQL: Deploying Riak on EC2 - What to Pick?
        • NoSQL: WANdisco Hadoop 2 Distribution
        • NoSQL: MongoDB 2.4 Highlights
        • NoSQL: Can Anyone Use the Name Hadoop?
        • NoSQL: The Structure of Big Data
        • NoSQL: Sportingbet Decides Against Hadoop to Analy...
        • NoSQL: MongoDB Is Still Broken by Design 5-0
        • NoSQL: 10gen: MongoDB’s Fault Tolerance Is Not Bro...
        • NoSQL: Counterpoint: Why Some Hadoop Adapters Make...
        • NoSQL: Why Database-To-Hadoop Connectors Are Funda...
        • NoSQL: Hadoop and Vertica: Using the Right Tools f...
        • NoSQL: Setting Up a Riak Dev Cluster on OS X Mount...
        • NoSQL: NoSQL and Big Data Money News
        • NoSQL: Hadoop at Yahoo!: 2013 Update
        • NoSQL: Apache HBase Internals: Locking and Multive...
        • NoSQL: Some Hadoop Name Dropping in the News
        • NoSQL: Introducing Highly Available Transactions: ...
        • NoSQL: A Quick Guide to Hadoop Map-Reduce Frameworks
        • NoSQL: Twitter and Their Cascading Libraries for D...
        • NoSQL: A Guide to Python Frameworks for Hadoop
        • NoSQL: NoSQL Hosting: Redis and RavenDB
        • NoSQL: Neo4j Interviews: The Vision, the Business ...
        • NoSQL: Proposed Fedora 19 Feature: Replace MySQL W...
        • NoSQL: Michael Stonebraker's New Data Company Rais...
        • NoSQL: MongoDB Fault Tolerance - Broken by Design
        • NoSQL: When Data Is Worthless - Give MongoDB What ...
        • NoSQL: Why the Days Are Numbered for Hadoop as We ...
        • NoSQL: 3 MapReduce and Hadoop Links: Secondary Sor...
        • NoSQL: Joyent Solution for Hadoop Is About Speed
        • NoSQL: How to Plan for Big Data: Waterfall vs Agile
        • NoSQL: Using Treetop and Neo4j Cypher to Simulate ...
        • NoSQL: Main Features of In-Memory Data Grids

        Tags

        Archive

        myNoSQL a NoSQL blog featuring the best daily NoSQL news, articles and links covering all major NoSQL projects and following closely all things related to NoSQL ecosystem. Everything you need and want to know about NoSQL

        .
        • Alex Popescu @ LinkedIn
        • Alex Popescu @ Twitter
        • Alex Popescu @ Facebook
        • Alex Popescu @ FriendFeed
        • Alex Popescu @ Indenti.ca
        • Alex Popescu @ Disqus
        • Alex Popescu: Think Big Differenty @ Tumblr
        • Alex Popescu: A Lifestream of Differently Big Thoughts @ Soup.io