skip to main | skip to sidebar
  • Home
  • mindstorms
  • About
  • NoSQL
  • Think Differently Big
  • Profile
  • Wishlist
  • RSS
  • NoSQL: NHibernate Membase caching provider

    NHibernate Membase caching provider

    I use NHibernate for some of my projects and being able to use a Membase cache provider for NHibernates second level cache would be very nice. With help from Christian Dalager and my self, such a library is now available. Take a look here https://bitbucket.org/ovesen/membasecacheprovider

    tags:Membase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Cloudera’s Distribution for Apache Hadoop version 3 Beta 4

    Cloudera's Distribution for Apache Hadoop version 3 Beta 4

    New version of Cloudera's Hadoop distro — complete release notes available here :

    tags:Hadoop,HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Using CouchDB with Clojure

    Using CouchDB with Clojure

    All CouchDB basic features explained using Clojure:

    tags:CouchDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The Market for Online Privacy Heats Up

    The Market for Online Privacy Heats Up

    WSJ:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MapReducing Big Data with Riak and Luwak

    MapReducing Big Data with Riak and Luwak

    The recording of Basho's webinar on Riak Map/Reduce and Luwak:

    tags:Riak,MapReduce

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: CouchDB Video Tutorial: Get friendly with CouchDB

    CouchDB Video Tutorial: Get friendly with CouchDB

    One hour long video tutorial about CouchDB:

    tags:couchdb

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Fast, asynchronous analytics with MongoDB

    Fast, asynchronous analytics with MongoDB

    We needed to do simple analytics on OpenGovernment, but not of the Google Analytics variety. We needed each object in the system to have view count aggregates that we could show in real time on the page, and we needed to be able to pull top ten lists and stuff.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The New Business Intelligence

    The New Business Intelligence

    Excellent visualization of BI processes on InfoWorld :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MarkLogic Needs to Harness the NoSQL Movement

    MarkLogic Needs to Harness the NoSQL Movement

    MarkLogic can create a new class of license. Call it the "MarkLogic Server Lite" or something like that (I don't care). It should be well trimmed down from the Standard and Enterprise editions. No geospatial support, no entity enrichment, no compartment security, etc. Put a database size limit of 1 GB and only allow it to run on 1 CPU. And make it completely free to use, even on commercial projects.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Self-descriptive hypermedia in Riak

    Self-descriptive hypermedia in Riak

    Step 1: Make sure you check the original post . Scroll down to the bottom. Step 2: Make sure you return here . Step 3: Riak supports storing meta information for values and you can access it through a Link header format. Scratch that. Riak supports Web Linking .

    tags:Riak

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Recipe for a Distributed Realtime Tweet Search System

    Recipe for a Distributed Realtime Tweet Search System

    Ingredients :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Distributed Systems: The Phi Accrual Failure Detector Paper

    Distributed Systems: The Phi Accrual Failure Detector Paper

    Abstract:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: About Watson

    About Watson

    Watson is powered by 10 racks of IBM Power 750 servers running Linux, and uses 15 terabytes of RAM, 2,880 processor cores and is capable of operating at 80 teraflops. Watson was written in mostly Java but also significant chunks of code are written C++ and Prolog, all components are deployed and integrated using UIMA. Watson contains state-of-the-art parallel processing capabilities that allow it to run multiple hypotheses – around one million calculations – at the same time.

    tags:Hadoop

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: 4 Data Deduplication Matching Algorithms in F#

    4 Data Deduplication Matching Algorithms in F#

    There are four common matching algorithms used for attribute deduplication processes: absolute match, partial match, Soundex and lookup match. These algorithms can be run against the data and, once the percentage match score is computed, you can decide whether to discard or store the data. As an exercise, I've implemented these four matching algorithms using F# pattern matching and asynchronous programming features to quickly calculate the aggregate match score.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Pig Latin and JSON on Amazon Elastic Map Reduce

    Pig Latin and JSON on Amazon Elastic Map Reduce

    In order to not have to learn everything about setting up Hadoop and still have the ability to leverage the power of Hadoop's distributed data processing framework and not have to learn how to write map reduce jobs and … (this could go on for a while so I'll just stop here). For all these reasons, I choose to use Amazon's Elastic Map infrastructure and Pig. I will talk you through how I was able to do all this [take my log data stored on S3 (which is in compressed JSON format) and run queries against it] with a little help from the Pig community and a lot of late nights. I will also provide an example Pig script detailing a little about how I deal with my logs (which are admittedly slightly abnormal).

    tags:MapReduce

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: How to Navigate the New World of Database Technologies

    How to Navigate the New World of Database Technologies

    Billy Bosworth:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Continuous Deployment for Data, Not Just Services

    Continuous Deployment for Data, Not Just Services

    Great advise:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MongoDB: How to Find the Size of a Document

    MongoDB: How to Find the Size of a Document

    Answer for the question in the title:

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Google Could Make Data Marketplaces Actually Useful

    Google Could Make Data Marketplaces Actually Useful

    Paul Miller about the possible big data marketplaces evolution thanks to Google's Public Data Explorer and Dataset Publishing Language (DSPL) :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Rewriting the Redis Twitter Clone

    Rewriting the Redis Twitter Clone

    The Redis Twitter clone app is showing its age:

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MongoDB Auto-Sharding

    MongoDB Auto-Sharding

    I know this will sound like bashing MongoDB. But I've already said it a couple of times before: MongoDB scaling looks complicated.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Amazon EBS vs SSD: Price, Performance, QoS

    Amazon EBS vs SSD: Price, Performance, QoS

    Check the numbers and run your own tests. But their results are striking:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Redis 2.2: An Optimization Release

    Redis 2.2: An Optimization Release

    Salvatore Sanfilippo summarizes the new Redis release in the Hacker News thread :

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Storing Part of Riak Object Value in Memory

    Storing Part of Riak Object Value in Memory

    The disadvantage of Riak's Key Filter approach is that you end up with highly domain-specific keys, which can be hard to reference, especially if you need to update keys to allow querying new aspects of the data: If you need to change your existing keys, references to these keys needs to be updated too. This is hard to do atomically when you have a key-value store like Riak. Even worse, if data changes you need to update the key, and – again – the pointers to the key, if you have any.

    tags:Riak

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Why startups should not choose NoSQL

    Why startups should not choose NoSQL

    So to summarize – don't sacrifice flexibility and ease of work for some fictional "trillions of petabytes". If it happens that you need to handle huge amounts of data, it will be in a way that you will be able to restructure your data model. And at a point when you will know what questions you want to ask.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Cloudata: New Open Source BigTable Implementation

    Cloudata: New Open Source BigTable Implementation

    Cloudata is the third open source implementation of Google's BigTable paper, after HBase and Hypertable. There's already an 1.0 version even if the Github project page is listing just a couple of commits.

    tags:BigTable

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MongoDB and Site Analytics

    MongoDB and Site Analytics

    Within a few hours, using Sinatra and the MongoDB Ruby driver, I had a little prototype working. Each hit was a single MongoDB operation, an upsert based on the host, with year, month, day, and hour information stored in nested hashes. The nested hashes were updated in the operation using $inc. It did not do much, but it was pretty cool.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: 4 Database Technologies for Large Scale Data

    4 Database Technologies for Large Scale Data

    Park Kieun (CUBRID Cluster Architect) gives an introduction to 4 large scale database technologies:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: How to Maintain a Set in Memcached

    How to Maintain a Set in Memcached

    Could you imagine a solution for storing a set into memcached satisfying these requirements:

    tags:memcached

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Amazon SimpleDB, Google Megastore & CAP

    Amazon SimpleDB, Google Megastore & CAP

    Nati Shalom (Gigaspaces) pulls out a couple of references from James Hamilton's posts [1] on Amazon SimpleDB and Google Megastore consistency model concluding:

    tags:SimpleDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: HBase Internals: Visualizing HBase Flushes And Compactions

    HBase Internals: Visualizing HBase Flushes And Compactions

    Outerthought folks [1] have put together a great visual and technical explanation of HBase flushes and compaction for usual scenarios like insert, delete, multi-column families, multi-regions:

    tags:HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Vector Maps: Introducing a New Data Structure

    Vector Maps: Introducing a New Data Structure

    Inspired by CouchDB's and Riak's Multi-Version Concurrency Control usage, Kresten Krab Thorup is proposing a new data structure: vector maps : an in-memory vector clock versioned map [1] :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Three Reasons ActiveSphere Loves CouchDB

    Three Reasons ActiveSphere Loves CouchDB

    The three main reasons why ActiveSphere folks have been using CouchDB:

    tags:CouchDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Project Voldermort and Terrastore: Key-Value vs Document Stores

    Project Voldermort and Terrastore: Key-Value vs Document Stores

    It is an apples to oranges comparison, but it underlines, from a beginner perspective, the major differences between a pure key-value store (Project Voldemort) and a document database (Terrastore):

    tags:Terrastore

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Pig Latin Adds Macros as Part of Becoming Turing Complete

    Pig Latin Adds Macros as Part of Becoming Turing Complete

    Since direct integration of data flow and control flow is neither reasonable nor desirable, a heuristic is needed to productively combine the two. […] Compared to an approach that integrates control flow and data flow, such as PL/SQL, embedding in an existing scripting language is a much lower development and maintenance effort. It will also be much easier for users, who will be able to use existing development tools (IDEs, debuggers, etc.) to work with their scripts.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Improvements to Hadoop Availability

    Improvements to Hadoop Availability

    Six areas to improve Hadoop availability when dealing with common scenarios like host maintenance, configuration changes, software upgrades, host failures:

    tags:Hadoop

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Data Analysis Tools Survey Results

    Data Analysis Tools Survey Results

    I've always wondered what tools are used by data scientists to dig useful information out of the big data and create beauty out of it .

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Jeopardy Goes to Hadoop

    Jeopardy Goes to Hadoop

    Did you know that Hadoop was the knowledge base behind the Watson supercomputer ? I didn't:

    tags:Hadoop

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Scaling with Cassandra

    Scaling with Cassandra

    Peter Schuller's Scaling with Apache Cassadra recorded at Oredev:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: 15 Views of a Node Link Graph: An Information Visualization Portfolio

    15 Views of a Node Link Graph: An Information Visualization Portfolio

    An information visualization (a.k.a. infovis ) classic:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Linked Open Data Star Scheme

    Linked Open Data Star Scheme

    While writing quite a bit lately about Big Data marketplaces, I thought it would be worth mentioning Tim Berners-Lee 5-start deployment scheme for Linked Open Data:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: BackType's ElephantDB

    BackType's ElephantDB

    I didn't know BackType's ElephantDB is open source and available on GitHub , same as their Cascalog the Clojure based query language for Hadoop .

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Cloudant about Couchbase Announcement

    Cloudant about Couchbase Announcement

    Alan Hoffman of Cloudant, the CouchDB hosting providers and creators of the BigCouch scalable CouchDB solution :

    tags:CouchDB,Couchbase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Better support for short strings in Neo4j

    Better support for short strings in Neo4j

    When importing the OpenStreetMap data for Germany, with this feature in place we found that the DynamicStringStore was now 80 smaller than before! Not only that but time for reading and writing strings had improved by at least 25 ! (the benchmark I got this from creates nodes and relationships as well, so pure string operations is probably even faster)

    tags:Neo4j

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: HSearch: NoSQL Search Engine Built on HBase

    HSearch: NoSQL Search Engine Built on HBase

    Cassandra has Lucandra Solandra , Riak has Riak Search , HBase has HSearch

    tags:HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Current and Future Big Data Warehouse

    Current and Future Big Data Warehouse

    Custom build BigData frameworks like Teradata and VLDB implementations from Oracle that are proprietary frameworks designed to deal with large datasets. These frameworks are still very relational in orientation and are not designed to work with unstructured data sets. Data Warehouse Appliances like Oracle's Exadata. This introduces the concept of DW-in-a-box where the entire framework needed for a typical DW implementation (the Hardware, Software Framework in terms of data store and Advanced Analytical tools) are all vertically integrated and provided by the same vendor as a packaged solution. Open Source NoSQL-oriented Big Data Frameworks such as Hadoop and Cassandra. These frameworks implement advanced analytical and mining algorithms such as Map/Reduce and are designed to be installed on commodity hardware for an MPP architecture with huge Master/Slave clusters. They are very good at dealing with vast amounts of unstructured, text-oriented information. Commercial Big Data Frameworks like AsterData and GreenPlum, which follow the same paradigm of MPP infrastructures but have implemented their own add-ons such as SQL-MR and other optimizations for faster analytics.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Who Needs Big Data Marketplaces?

    Who Needs Big Data Marketplaces?

    Who could start benefiting right away from big data marketplaces:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The Next Generation of Apache Hadoop MapReduce

    The Next Generation of Apache Hadoop MapReduce

    I'm not sure how many companies have already hit this limit, but Yahoo! is showing again its Hadoop leadership:

    tags:Hadoop,MapReduce

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Convore Usage of Redis Pub/Sub

    Convore Usage of Redis Pub/Sub

    Eric Florenzano describing the architecture of the newly launched Convore website:

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Hadoop I/O: Sequence, Map, Set, Array, BloomMap Files

    Hadoop I/O: Sequence, Map, Set, Array, BloomMap Files

    Some of Hadoop essential persistent data stuctures explained ( SequenceFile , MapFile , etc.):

    tags:Hadoop

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: What Does Big Data Mean to Infrastructure Professionals?

    What Does Big Data Mean to Infrastructure Professionals?

    Big data means the amount of data you're working with today will look trivial within five years. Huge amounts of data will be kept longer and have way more value than today's archived data. Business people will covet a new breed of alpha geeks. You will need new skills around data science, new types of programming, more math and statistics skills and data hackers…lots of data hackers. You are going to have to develop new techniques to access, secure, move, analyze, process, visualize and enhance data; in near real time. You will be minimizing data movement wherever possible by moving function to the data instead of data to function. You will be leveraging or inventing specialized capabilities to do certain types of processing- e.g. early recognition of images or content types – so you can do some processing close to the head. The cloud will become the compute and storage platform for big data which will be populated by mobile devices and social networks. Metadata management will become increasingly important. You will have opportunities to separate data from applications and create new data products. You will need orders of magnitude cheaper infrastructure that emphasizes bandwidth, not iops and data movement and efficient metadata management. You will realize sooner or later that data and your ability to exploit it is going to change your business, social and personal life; permanently.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Facebook Messages: FOSDEM NoSQL Event

    Facebook Messages: FOSDEM NoSQL Event

    From this year's FOSDEM, Facebook talking about their messaging platform:

    tags:HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Sones Hires New CEO to Increase Sales and Expand Partner Program

    Sones Hires New CEO to Increase Sales and Expand Partner Program

    According to TechCrunch Europe , Sones, producers of the GraphDB graph database, has hired a new CEO to focus on increasing sales and expanding their partner programs. This only weeks after Sones has announced a new round of funding .

    tags:Sones

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Redis at Digg: Story View Counts

    Redis at Digg: Story View Counts

    Digg just rolled out a new feature, cummulative page event counters (page views plus clicks), that is using Redis as its underlying solution.

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Hadoop and Membase Case Study: AOL Advertising Architecture

    Hadoop and Membase Case Study: AOL Advertising Architecture

    Combining Hadoop and Membase to solve these challenges:

    tags:Couchbase,Hadoop,Membase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: A Semantic Triple Store Built on MongoDB

    A Semantic Triple Store Built on MongoDB

    An interesting semantic triple store data modeling exercise with MongoDB:

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: HP Acquires Vertica

    HP Acquires Vertica

    The official announcement of HP's acquisition of Vertica is here . And I think this is also explaining the disappearing of the last article Michael Stonebraker published on Vertica blog about 6 criteria for real column stores .

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Cassandra Releases: Two Minor Upgrades

    Cassandra Releases: Two Minor Upgrades

    Cassandra has pushed out two new minor releases, first of them, 0.7.1, featuring a couple of performance improvements and new features, and the second, 0.7.2, fixing a critical bug in the 0.7.1 release.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Two Definitions for Big Data

    Two Definitions for Big Data

    Not sure I've got the rest of the post , but really liked these two definitions of big data :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The NoSQL Dogma

    The NoSQL Dogma

    Whats the point? MySQL is a proven technology, its engines are stable and well known, and scalable ( Yep modern forks for clouds are already rolling one example is drizzle); so I can't find any point in shifting to a technology thats a new born, adds more calories to my code, and then brag about it just because its a buzz word! Untill there is no genuine reason to have a noSQL its just a trendy statement!

    tags:mapreduce

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: What hardware makes a good MongoDB Server?

    What hardware makes a good MongoDB Server?

    With MongoDB what you want is RAM. And then some more RAM. Buying RAM can't hurt.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: What hardware makes a good MongoDB Server?

    What hardware makes a good MongoDB Server?

    With MongoDB what you want is RAM. And then some more RAM. Buying RAM can't hurt.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Big Data Marketplace: Windows Azure Marketplace DataMarket

    Big Data Marketplace: Windows Azure Marketplace DataMarket

    Just another big data marketplace , this time on Windows Azure :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: On Sharding Graph Databases

    On Sharding Graph Databases

    We can help to maintain a balanced graph by applying domain-specific knowledge to place nodes on shards; we can use insert-time algorithms to help us select the most appropriate shard to place a node; and we can use re-balancing algorithms periodically (sort of like a graph defrag) to help maintain a good node-to-shard mapping at runtime. We can also keep heuristic information on which relationships are traversed most frequently and optimise around those.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: NoSQL Benchmark Source Code Available

    NoSQL Benchmark Source Code Available

    Code of the NoSQL benchmark I've mentioned a couple of days ago — the one comparing Cassandra 0.6.10, HBase 0.20.6, MongoDB 1.6.5, Riak 0.14.0 with some weird results — is now available on GitHub ).

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: InfiniteGraph 1.1 Released with New Indexing Options

    InfiniteGraph 1.1 Released with New Indexing Options

    A new version of InfiniteGraph, the graph database from Objectivity, has been released with a new indexing solution offering improved performance for indexing, data imports, and lookups.

    tags:InfiniteGraph

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Couchbase: Code Collaboration Already Started

    Couchbase: Code Collaboration Already Started

    From an engineering standpoint, we have just assembled an amazing group of people. Within days of setting about it, there was plenty of collaboration at the code level between the two groups. Note that code commit from Dale Harvey of CouchOne improving Membase code was on February 4. That's a full four days before we announced the merger and a mere 5 days after he'd come over to the U.S. to meet the rest of the team.

    tags:Couchbase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: NoSQL databases: Maslow's Law Applied to NoSQL Needs

    NoSQL databases: Maslow's Law Applied to NoSQL Needs

    Adrian Cockcroft [1] transposing the Maslow law to NoSQL systems:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: BigData: Digital vs Nature

    BigData: Digital vs Nature

    Compared to Mother Nature we are humble apprentices. […] Despite our gargantuan digital growth, the DNS in a single humna body still stores far more information — and a single human brain computes far more calculations — than all the technology on Earth.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: HBase Internals: HFile Explained

    HBase Internals: HFile Explained

    Matteo Bertozzi explains the HBase HFile :

    tags:HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Advantages of Riak over CouchBase

    Advantages of Riak over CouchBase

    On Quora:

    tags:Riak

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Basho Raises Funding for Expanding Market Reach

    Basho Raises Funding for Expanding Market Reach

    The exciting news from yesterday are not over yet. Even if the announcement is not official, it already reached press [1] : Basho, creators of the Riak key-value store and Riak Search, the first full text indexing solution integrated with a NoSQL database, have raised in an up round an additional $7.5mil from a private equity firm and Trifork AS, a Danish systems integrator.

    tags:Riak

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Document Databases: A "new" definition

    Document Databases: A "new" definition

    A new very bad definition for document databases:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The Future of SQL-Developers and SQL-server DBAs

    The Future of SQL-Developers and SQL-server DBAs

    On the SQL Server 2008 forum :

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Reconstructing Linked Data and Graph Databases

    Reconstructing Linked Data and Graph Databases

    ReadWriteWeb has published a very interesting story of a project presented at last week's Strata conference aiming to reconstruct linked data based on public data sources like Flickr and OpenStreetMap using a somehow classical"fuzzy matching" approach.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Simple Ruby Workers with MongoDB and Resque

    Simple Ruby Workers with MongoDB and Resque

    After some research, we decided to use resque from github, but to adapt it to use mongodb instead of redis. The advantage here is that we already have a significant investment in mongodb, so we would not be introducing a new type of server to our infrastructure. Mongodb also has some features that redis does not, and we used those to build some interesting new features into resque.

    tags:MongoDB,Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: NoSQL databases, Quest Software, and Toad for Cloud

    NoSQL databases, Quest Software, and Toad for Cloud

    I wrote a couple of times about Quest Software's Toad for Cloud [1] , the free Eclipse tool that allows connecting to NoSQL databases and working with data in a tabular, SQLish format. But having in mind Quest's business, there was something bugging me about Quest's initiative: why would they make these tools? Even more, why would they make them available for free?

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Unit Testing Django with a NoSQL Backend

    Unit Testing Django with a NoSQL Backend

    However, what if you are using a NoSQL database backend like MongoDB, Cassandra, CouchDB or something similar and you aren't using the Django ORM? How do you handle setting up and tearing down the database environments?

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: NuvolaBase: OrientDB in the Cloud

    NuvolaBase: OrientDB in the Cloud

    Another interesting announcement coming out today is NuvolaBase , the OrientDB in the cloud. The information about the service is very scarce on its website , so except the different account plans I couldn't find out much. I hope to hear more about it from Luca Garulli, the creator of OrientDB and the guy behind NuvolaBase.

    tags:OrientDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: NoSQL Companies Merger: Membase and CouchOne Becoming Couchbase

    NoSQL Companies Merger: Membase and CouchOne Becoming Couchbase

    Late last night I've received an email asking if I could jump in a phone call to hear some exciting news. I had to say pass as it was extremely late, but replied back: "Is it a big client acquisition or M&A?".

    tags:Membase,CouchDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: YCBS Benchmark Results for Cassandra, HBase, MongoDB, Riak

    YCBS Benchmark Results for Cassandra, HBase, MongoDB, Riak

    A recent slide deck presenting results of the YCBS benchmark run against the latest versions of Cassandra (0.6.10), HBase (0.20.6), MongoDB (1.6.5), and Riak (0.14.0):

    tags:HBase,MongoDB,Riak

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: HBase and Bloom Filters

    HBase and Bloom Filters

    Lars George and Nicolas Spiegelberg — both HBase committers, Nicolas also being the guy implementing HBase Bloom filters — explaining the pros (and cons) of using Bloom filters in HBase:

    tags:HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: noSQL is what you need 99% of the cases

    noSQL is what you need 99% of the cases

    Let me put a disclaimer before I carry on. There might be situations where noSQL's solution are absolutely illogical to use. But I firmly believe that if you are using a database for storing Ruby's or PHP's object into a database to eventually retrieve them later, you fit in my 99% case scenario.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Redis: One Page Command References

    Redis: One Page Command References

    When Redis has relaunched its website , I've mentioned how awesome the commands page looks.

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Rainbird: Twitter's ZooKeeper + Cassandra Based Realtime Analytics Solution

    Rainbird: Twitter's ZooKeeper + Cassandra Based Realtime Analytics Solution

    Kevin Weil [1] presented Twitter's ZooKeeper and Cassandra based solution for realtime analytics named Rainbird at Strata 2011:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The Backstory of Yahoo and Hadoop

    The Backstory of Yahoo and Hadoop

    We currently have nearly 100 people working on Apache Hadoop and related projects, such as Pig, ZooKeeper, Hive, Howl, HBase and Oozie. Over the last 5 years, we've invested nearly 300 person-years into these projects. […] Today Yahoo runs on over 40,000 Hadoop machines (>300k cores). They are used by over a thousand regular users from our science and development teams. Hadoop is at the center of our research in search, advertising, spam detection, personalization and many other topics.

    tags:Hadoop,HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Leaderboards using Redis: A How-To Guide

    Leaderboards using Redis: A How-To Guide

    At the studio I had discussed with colleagues the possibility of using Redis, an advanced key-value storage engine, for leaderboards. In less than an hour, I had the set of Redis commands using their sorted set data type (a set of data that is sorted based on an associated "score") to perform operations on leaderboards such as: Retrieving general information about a leaderboard such as total members or total pages Adding or removing members from a leaderboard Retrieving information about a member in the leaderboard such as their rank or score Updating score information for a member in the leaderboard Retrieving an arbitrary page of leaders from the leaderboard Retrieving the leaders around a given member in a leaderboard, also known as an "Around Me" leaderboard Retrieving information for an arbitrary set of members in a leaderboard, e.g. How do my friends compare against me?

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: An introduction to the Hadoop Distributed File System

    An introduction to the Hadoop Distributed File System

    An excellent article covering:

    tags:Hadoop,HBase

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Sones Receives Investment

    Sones Receives Investment

    This is not news anymore, but Sones, producers of GraphDB, have raised an undisclosed amount of additional funding. I guess things will get a bit hotter in the graph database space where there are already a few quite interesting competitors.

    tags:Sones

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: No SQL and Big Data from a Business Intelligence & Data Warehousing Perspective

    No SQL and Big Data from a Business Intelligence & Data Warehousing Perspective

    "No SQL" and Big data appearing in Rick Sherman's list of overhyped trends in BI and Data warehousing:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: CouchDB as Email Store

    CouchDB as Email Store

    If you ever get bored of your GMail/Yahoo!/Hotmail email or your local client, then you could start storing email into CouchDB . Next you could develop a CouchApp for it, add tags and views to emulate folders/GMail labels, replicate to your mobile or other machine, etc. The Reddit community hasn't appreciated the idea though.

    tags:CouchDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Writes Performance: B+Tree, LSM Tree, Fractal Tree

    Writes Performance: B+Tree, LSM Tree, Fractal Tree

    Two excellent comments on Quora explaining the differences in the performance of write operation when using 3 different data structures: B+trees, Log-Structured Merge trees, and Fractal trees:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Domain-model persistence with Morphia and MongoDB

    Domain-model persistence with Morphia and MongoDB

    Morphia is an Apache-licensed Google Code project that lets you persist, retrieve, delete, and query POJOs stored as documents in MongoDB. Morphia accomplishes this by providing a set of annotations and a wrapper around the Mongo Java driver. Morphia is conceptually similar to object-relational mappers such as Java Persistence API (JPA) or Java Data Objects (JDO) implementations. In this article, I'll show how to use Morphia with a Java domain model mapped to MongoDB.

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: New Tools in the NoSQL and Big Data Market

    New Tools in the NoSQL and Big Data Market

    DataStax ( ex-Riptano ) announced yesterday their tool for managing including sophisticated visualizations of the cluster, comprehensive management and configuration, monitoring and operating enterprise Cassandra applications named OpsCenter.

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: Yahoo is Focusing on Apache Hadoop discontinues “The Yahoo Distribution of Hadoop”

    Yahoo is Focusing on Apache Hadoop discontinues "The Yahoo Distribution of Hadoop"

    This is big:

    tags:Hadoop

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: The Provenance of Data, Data Branding and "Big Data" Hype

    The Provenance of Data, Data Branding and "Big Data" Hype

    Others are definitely better at naming things than I am:

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: MongoDB vs Clustrix Performance Comparison

    MongoDB vs Clustrix Performance Comparison

    This made some rounds yesterday. And it got some long comments on both Hacker News and Reddit .

    tags:MongoDB

    via NoSQL databases

    Read post...
    bookmark | email | comments
  • NoSQL: New Redis Libraries

    New Redis Libraries

    Probably because Silver was released by a publisher , this new open source library got some press today:

    tags:Redis

    via NoSQL databases

    Read post...
    bookmark | email | comments
Home

mindstorms

Software and web architectures, cloud computing and a flavor of tech startup entrepreneurship through the eyes of Alex Popescu.

About me: Software architect, Web Aficionado, Cloud Computing Fanboy, Geek Entrepreneur, Speaker, Co-founder and CTO of InfoQ.com, Writing also about NoSQL on the myNoSQL blog

Latest comments

Loading...

think differently big

Loading...

Tag Cloud Sphere ▼

Follow Alex on Twitter ▼

    follow me on Twitter

    Daily Cloud Stream ▼

    Loading...

    Show more articles

      • NoSQL: Where Does Xeround Fit In The Cap Theorem?
      • NoSQL: Vertica and Hadoop for Big Data
      • NoSQL: NoSQL Tutorial: Setting Up a Hadoop Cluster...
      • NoSQL: 5 Key Elements for a Firehose Data System
      • NoSQL: MongoDB Tips and Tricks: More Reads Make Fo...
      • NoSQL: History of Couch Projects
      • NoSQL: Powered by Hadoop and Hive: Budgeting for s...
      • NoSQL: MongoDB Replica Sets and Sharding for GridF...
      • NoSQL: PHP and MongoDB Tutorial
      • NoSQL: Hybrid Word Aligned Bitmaps: Why are column...
      • NoSQL: Analysts' Predictions for Hadoop Market
      • NoSQL: MapReduce With Hadoop: What Happens During ...
      • NoSQL: NoSQL Books: Riak Handbook and the Little R...
      • NoSQL: MoreSQL: No More NoSQL
      • NoSQL: Measuring User Retention With Hadoop and Hi...
      • NoSQL: The History of NoSQL: This Was Not Our Tech...
      • NoSQL: Big Data Is More Than Hadoop
      • NoSQL: Mavuno: A Hadoop-Based Text Mining Toolkit
      • NoSQL: Using Amazon Elastic MapReduce With DynamoD...
      • NoSQL: 12 Hadoop Vendors to Watch in 2012
      • NoSQL: More Details About Apache HBase 0.92.0
      • NoSQL: Google Research: Let's Make TCP Faster
      • NoSQL: A Cost Analysis of DynamoDB for Tarsnap
      • NoSQL: Apache Hadoop 1.0 Doesn’t Clear Up Trunks a...
      • NoSQL: Introducing Amazon DynamoDB Slidesdeck
      • NoSQL: Latest NoSQL Releases: HBase 0.92, DataStax...
      • NoSQL: Solr Index Replication at Etsy: From HTTP t...
      • NoSQL: Jelastic Database Marketshare: MySQL, Mongo...
      • NoSQL: MapR's Map-Reduce Ready Disitributed File S...
      • NoSQL: Couchbase: Clarifying Confusions in 5 Bulle...
      • NoSQL: Couchbase Server 1.8 Released, Rebranding a...
      • NoSQL: NoSQL Tutorials: Getting Started With Cassa...
      • NoSQL: NoSQL Databases Configuration Management
      • NoSQL: Key-Value Stores, Document Databases, and C...
      • NoSQL: Countandra: Cassandra-Based Hierarchical Di...
      • NoSQL: Will Amazon DynamoDB Be a Game Changer?
      • NoSQL: The State of NoSQL in 2012
      • NoSQL: Reinforcing Couchbase's Commitment to Open ...
      • NoSQL: CouchDB: A Season Finale
      • NoSQL: Auto Scaling in the Amazon Cloud: Netflix's...
      • NoSQL: RainStor Big Data Analytics on Hadoop Promi...
      • NoSQL: Basho: Congratulations, Amazon!
      • NoSQL: Pros and Cons of Using MapReduce With Distr...
      • NoSQL: Using MongoDB Replica Sets With Node.js on ...
      • NoSQL: Amazon DynamoDB: NoSQL in the Cloud
      • NoSQL: Amazon’s DynamoDB Shows Hardware as Means t...
      • NoSQL: Introducing Amazon DynamoDB Video
      • NoSQL: Cassandra and Amazon DynamoDB Comparison
      • NoSQL: Notes About Amazon DynamoDB
      • NoSQL: The Little Known Secret of Redis
      • NoSQL: NHibernate Membase caching provider
      • NoSQL: Cloudera’s Distribution for Apache Hadoop v...
      • NoSQL: Using CouchDB with Clojure
      • NoSQL: The Market for Online Privacy Heats Up
      • NoSQL: MapReducing Big Data with Riak and Luwak
      • NoSQL: CouchDB Video Tutorial: Get friendly with C...
      • NoSQL: Fast, asynchronous analytics with MongoDB
      • NoSQL: The New Business Intelligence
      • NoSQL: MarkLogic Needs to Harness the NoSQL Moveme...
      • NoSQL: Self-descriptive hypermedia in Riak
      • NoSQL: Recipe for a Distributed Realtime Tweet Sea...
      • NoSQL: Distributed Systems: The Phi Accrual Failur...
      • NoSQL: About Watson
      • NoSQL: 4 Data Deduplication Matching Algorithms in...
      • NoSQL: Pig Latin and JSON on Amazon Elastic Map Re...
      • NoSQL: How to Navigate the New World of Database T...
      • NoSQL: Continuous Deployment for Data, Not Just Se...
      • NoSQL: MongoDB: How to Find the Size of a Document...
      • NoSQL: Google Could Make Data Marketplaces Actuall...
      • NoSQL: Rewriting the Redis Twitter Clone
      • NoSQL: MongoDB Auto-Sharding
      • NoSQL: Amazon EBS vs SSD: Price, Performance, QoS
      • NoSQL: Redis 2.2: An Optimization Release
      • NoSQL: Storing Part of Riak Object Value in Memory...
      • NoSQL: Why startups should not choose NoSQL
      • NoSQL: Cloudata: New Open Source BigTable Implemen...
      • NoSQL: MongoDB and Site Analytics
      • NoSQL: 4 Database Technologies for Large Scale Dat...
      • NoSQL: How to Maintain a Set in Memcached
      • NoSQL: Amazon SimpleDB, Google Megastore & CAP
      • NoSQL: HBase Internals: Visualizing HBase Flushes ...
      • NoSQL: Vector Maps: Introducing a New Data Structu...
      • NoSQL: Three Reasons ActiveSphere Loves CouchDB
      • NoSQL: Project Voldermort and Terrastore: Key-Valu...
      • NoSQL: Pig Latin Adds Macros as Part of Becoming T...
      • NoSQL: Improvements to Hadoop Availability
      • NoSQL: Data Analysis Tools Survey Results
      • NoSQL: Jeopardy Goes to Hadoop
      • NoSQL: Scaling with Cassandra
      • NoSQL: 15 Views of a Node Link Graph: An Informati...
      • NoSQL: Linked Open Data Star Scheme
      • NoSQL: BackType's ElephantDB
      • NoSQL: Cloudant about Couchbase Announcement
      • NoSQL: Better support for short strings in Neo4j
      • NoSQL: HSearch: NoSQL Search Engine Built on HBase...
      • NoSQL: Current and Future Big Data Warehouse
      • NoSQL: Who Needs Big Data Marketplaces?
      • NoSQL: The Next Generation of Apache Hadoop MapRed...
      • NoSQL: Convore Usage of Redis Pub/Sub
      • NoSQL: Hadoop I/O: Sequence, Map, Set, Array, Bloo...
      • NoSQL: What Does Big Data Mean to Infrastructure P...
      • NoSQL: Facebook Messages: FOSDEM NoSQL Event
      • NoSQL: Sones Hires New CEO to Increase Sales and E...
      • NoSQL: Redis at Digg: Story View Counts
      • NoSQL: Hadoop and Membase Case Study: AOL Advertis...
      • NoSQL: A Semantic Triple Store Built on MongoDB
      • NoSQL: HP Acquires Vertica
      • NoSQL: Cassandra Releases: Two Minor Upgrades
      • NoSQL: Two Definitions for Big Data
      • NoSQL: The NoSQL Dogma
      • NoSQL: What hardware makes a good MongoDB Server?
      • NoSQL: What hardware makes a good MongoDB Server?
      • NoSQL: Big Data Marketplace: Windows Azure Marketp...
      • NoSQL: On Sharding Graph Databases
      • NoSQL: NoSQL Benchmark Source Code Available
      • NoSQL: InfiniteGraph 1.1 Released with New Indexin...
      • NoSQL: Couchbase: Code Collaboration Already Start...
      • NoSQL: NoSQL databases: Maslow's Law Applied to No...
      • NoSQL: BigData: Digital vs Nature
      • NoSQL: HBase Internals: HFile Explained
      • NoSQL: Advantages of Riak over CouchBase
      • NoSQL: Basho Raises Funding for Expanding Market R...
      • NoSQL: Document Databases: A "new" definition
      • NoSQL: The Future of SQL-Developers and SQL-server...
      • NoSQL: Reconstructing Linked Data and Graph Databa...
      • NoSQL: Simple Ruby Workers with MongoDB and Resque...
      • NoSQL: NoSQL databases, Quest Software, and Toad f...
      • NoSQL: Unit Testing Django with a NoSQL Backend
      • NoSQL: NuvolaBase: OrientDB in the Cloud
      • NoSQL: NoSQL Companies Merger: Membase and CouchOn...
      • NoSQL: YCBS Benchmark Results for Cassandra, HBase...
      • NoSQL: HBase and Bloom Filters
      • NoSQL: noSQL is what you need 99% of the cases
      • NoSQL: Redis: One Page Command References
      • NoSQL: Rainbird: Twitter's ZooKeeper + Cassandra B...
      • NoSQL: The Backstory of Yahoo and Hadoop
      • NoSQL: Leaderboards using Redis: A How-To Guide
      • NoSQL: An introduction to the Hadoop Distributed F...
      • NoSQL: Sones Receives Investment
      • NoSQL: No SQL and Big Data from a Business Intelli...
      • NoSQL: CouchDB as Email Store
      • NoSQL: Writes Performance: B+Tree, LSM Tree, Fract...
      • NoSQL: Domain-model persistence with Morphia and M...
      • NoSQL: New Tools in the NoSQL and Big Data Market
      • NoSQL: Yahoo is Focusing on Apache Hadoop disconti...
      • NoSQL: The Provenance of Data, Data Branding and "...
      • NoSQL: MongoDB vs Clustrix Performance Comparison
      • NoSQL: New Redis Libraries

      Tags

      Archive

      myNoSQL a NoSQL blog featuring the best daily NoSQL news, articles and links covering all major NoSQL projects and following closely all things related to NoSQL ecosystem. Everything you need and want to know about NoSQL

      .
      • Alex Popescu @ LinkedIn
      • Alex Popescu @ Twitter
      • Alex Popescu @ Facebook
      • Alex Popescu @ FriendFeed
      • Alex Popescu @ Indenti.ca
      • Alex Popescu @ Disqus
      • Alex Popescu: Think Big Differenty @ Tumblr
      • Alex Popescu: A Lifestream of Differently Big Thoughts @ Soup.io