The Social Graph Challenge
Nati Shalom (Gigaspaces) describes a solution to solving a large scale graph problem:
via NoSQL databases
Nati Shalom (Gigaspaces) describes a solution to solving a large scale graph problem:
via NoSQL databases
I'm hearing quite often lately of Redis PUB/SUB replacing real queuing systems. Here is an example application:
tags:Redis
via NoSQL databases
The CMSWire commented list of Hadoop-related solutions:
tags:Hadoop
via NoSQL databases
What's the point in writing a new dbm style DBMS when so many exist already? Surely there is nothing new that anyone can add to this long line of projects?
via NoSQL databases
So; what are the goals? to operate as a fully-functional Redis client (obviously) to be thread-safe and non-blocking to support implicit database switching to help with multi-tenancy scenarios to be on-par with redis-sharp on like scenarios (i.e. a complete request/response cycle) to allow absolute minimum cost fire-and-forget usage (for when you don't care what the reply is, and errors will be handled separately) to allow use as a "future" – i.e request some data from Redis and start some other work while it is on the wire, and merge in the Redis reply when available to allow use with callbacks for when you need the reply, but not necessarily as part of the current request to allow C# 5 continuation usage (aka async/await) to allow fully pipelined usage – i.e. issue 200 requests before we've even got the first response to allow fully multiplexed usage – i.e. it must handle meshing the responses from different callers on different threads and on different databases but on the same connection back to the originator
tags:Redis
via NoSQL databases
For a long time CouchDB listed BBC as one of its largest users — you can read more about it Enda Farrell's talk about CouchDB at BBC and CouchDB case study . At QCon London 2011 , Matthew Wall [1] talked about Guardian.co.uk usage of MongoDB. InfoQ hasn't published the video of the presentation yet, so here's Graham Tackley's slides:
via NoSQL databases
The CouchDB with a Riak backend didn't get too far and since then we've got BigCouch . But the opposite combination seems to be interesting too.
via NoSQL databases
There are several cases in which it might make sense to tailor your app's content based on a user's physical location. But asking them directly is a bit of a pain. Luckily, it's extremely simple to find a user's location knowing only something you will always know about a visitor: their IP address. Today I'll walk you through how to use IPs to geolocate your visitors in a Rails application using Geokit and MongoDB's geospatial indexing with MongoMapper.
tags:MongoDB
via NoSQL databases
A paper by Andrew J.Brust. Abstract:
via NoSQL databases
In a post describing different ways to measure adoption of open source projects, Max Schireson (President 10gen) named Couchbase's products (CouchDB, Membase), Cassandra, HBase, and Riak as main MongoDB competitors.
tags:MongoDB
via NoSQL databases
Tim Lossen's presentation about the evolution of wooga's architecture (Facebook games) from using sharded MySQL to a polyglot persistence solution based on master/slave Redis and master/slave MySQL, with pluses and minuses:
tags:Redis
via NoSQL databases
Gil Elbaz (Factual)
tags:BigData
via NoSQL databases
Last week, Neo Technology has released the 1.3 version of their graph database Neo4j. The technical aspects of the release have been covered in this blog post . Briefly:
tags:Neo4j
via NoSQL databases
Moving Federal Gov analytics from MySQL to Hadoop and Hive:
tags:Hadoop
via NoSQL databases
VMWare's Cloud Foundry has the potential to become the preferred PaaS solution. It bundles together a set of services that it took years for other PaaS providers (Google App Engine, Microsoft Azure) to offer. And it seems that Cloud Foundry has much less (or none at all) vendor lock in [1] .
via NoSQL databases
Given all this I did some benchmarks and as expected the NOSQL community was hurt […]
via NoSQL databases
From the Wikibon blog infographic about data science and the data scientist:
via NoSQL databases
Wouter de Bie (systems developer for AdAction):
via NoSQL databases
The question I think about a lot is what changes will cloud computing bring to the database industry. Will cloud be the platform change that ushers in a new generation of database technologies, and if so which ones and how sweeping will the change be? It is both intellectually interesting and one of the largest external factors effecting the company I run (10gen, the company that builds mongoDB; yes, this is an obvious source of bias for me in this post
via NoSQL databases
Adrian Cockcroft [1] has a great explanation of the impact of multi-tenancy on cloud storage performance. The connection with NoSQL databases is not necessarily in the Amazon EBS and SSD Price, Performance, QoS comparison , but:
via NoSQL databases
This AMI does the following: installs Cassandra 0.7.4 on a Ubuntu 10.10 image configures emphemeral disks in raid0, if applicable ( EBS is a bad fit for Cassandra configures Cassandra to use the root volume for the commitlog and the ephemeral disks for data files configures Cassandra to use the local interface for intra-cluster communication configures all Cassandra nodes with the same seed for gossip discovery
tags:Cassandra
via NoSQL databases
Quora thread listing over 50 sources of open large datasets. Combined with datasciencetoolkit.org and NoSQL databases these may lead to interesting experiments. Remember BigData doesn't necessarily need big budgets .
tags:BigData
via NoSQL databases
Twitter DBA Lead at Twitter, Jeremy Cole 's talk about MySQL at Twitter from MySQL CE 2011:
tags:BigData
via NoSQL databases
From a must read Bloomberg article about BigData being used mainly for ads optimization:
via NoSQL databases
There's a bit of CouchDB in the project:
tags:CouchDB
via NoSQL databases
Last night, Mark Phillips [1] sent me the following message:
tags:Riak
via NoSQL databases
'Geolocation' seems to be the best dish being served today. Every web-portal, every mobile app wants to be sensitive to a persons location. Everyone wants to see information that is 'relative' or location sensitive. Whether its a deal portal, travel portal, social network – giving users information that is relevant to their location bring not only a personalized touch but also keeps tuned in to the portal.
tags:MongoDB
via NoSQL databases
Nedis is a (partial) redis implementation written with node. Primarily for fun, however as our team grows larger, and as we add more non-technical team members over at LearnBoost I figured it would be nice help prevent the need for compiling development dependencies.
tags:Redis
via NoSQL databases
Jeff Jonas (IBM):
tags:BigData
via NoSQL databases
The combination of Hadoop and Solr makes it easy to crunch lots of data and then quickly serve up the results via a fast, flexible search & query API. Because Solr supports query-style requests, it's suitable as a NoSQL replacement for traditional databases in many situations, especially when the size of the data exceeds what is reasonable with a typical RDBMS.
tags:Hadoop
via NoSQL databases
Today, we are proud to introduce Iris Couch, a Couchbase spin-off and your new one-stop shop for hosted CouchDB. We are transferring the Couchbase hosting business to Iris Couch – a company founded and operated by the former Couchbase hosting team – and Iris Couch will focus solely on providing the rock-solid hosting service you deserve.
via NoSQL databases
VMWare's acquisitions at work:
via NoSQL databases
Todd Hoff about NoSQL and Cloud at Netflix :
via NoSQL databases
We have been testing MongoDB 1.8.0 in two different environments. The first, a quite powerful testing server and the second, a "lab-cluster" build from tiny Proxmox -instances
tags:MongoDB
via NoSQL databases
Even if not focused on NoSQL, the videos from the Surge conference are covering very interesting aspects related to scalability. Here are a couple of examples:
via NoSQL databases
The news today is brought to you by Milind Bhandarkar :
via NoSQL databases
Predicting operational growth by monitoring the correct metrics:
tags:memcached
via NoSQL databases
Kristina Chodorow answering the question why should I use MongoDB :
tags:MongoDB
via NoSQL databases
From someone that tried CouchDB and now is evaluating other document databases:
via NoSQL databases
Two minor updates for MongoDB (1.8.1) and Redis (2.2.4). From the official announcement for MongoDB 1.8.1:
via NoSQL databases
From Loraine Lawson interview with Jeff Erhardt [1] .
via NoSQL databases
In the blue corner we have IBM with Netezza as analytic database, Cognos for BI, and SPSS for predictive analytics. In the green corner we have EMC with Greenplum and the partnership with SAS [1] . And in the open source corner we have Hadoop and R.
via NoSQL databases
Consider, for instance, the makers of Stride Rite shoes, who imagined in the book that when you went to the shoe store, a wireless device would measure your foot and the way you walk. It would select your shoe size, produce your shoe from parts the store had on-site, and share data about your foot with supply chain vendors, non-competing retailers, health-care providers and medical researchers.
tags:BigData
via NoSQL databases
If you are one of the many people out there searching for database solutions, we have found five companies that are providing answers to all your database problems in unique and innovative ways. Akiban Clustrix Hadapt NimbusDB RethinkDB
via NoSQL databases
Not sure why it took me so long to mention Erik Meijer and Gavin Bierman paper: "A Co-Relational Model of Data for Large Shared Data Banks":
via NoSQL databases
Following a tweet from Nathan Marz :
via NoSQL databases
Given all the complication, should we search over key value storage? It depends on nature of your problem domain. However if you do need to search over key value storage, it's not such a bad idea either :)
via NoSQL databases
Apixio uses Hadoop and Pig for analysing medical records and Cassandra for serving seach queries. All production machines are Amazon EC2 instances.
via NoSQL databases
Gil Hildebrand (Squidoo):
tags:Cassandra
via NoSQL databases
Remember last week's discussion about administering and scaling up a Riak cluster on Amazon EC2 ? Reid Draper created a Python script to launch a 100-node Riak cluster:
tags:Riak
via NoSQL databases
Michael Olson [1] about origins of BigData in an interview on ODBMS Industry Watch :
via NoSQL databases
For data scientists and not only: RStudio , the R IDE that runs on all major platforms or alongside R on a server and being accessible through a browser. Free and Open.
via NoSQL databases
Brendan W. McAdams has two posts — here and here — about the updates of MapReduce in MongoDB 1.8 and using Java and respectively Scala Casbah to working with MapReduce:
via NoSQL databases
James Phillips' [1] version of what lead to NoSQL:
via NoSQL databases
John Webster reporting his learnings from the Structure Big Data event:
tags:BigData
via NoSQL databases
Daniel Abadi's [1] reasons for backing his HadoopDB research with the Hadapt startup :
via NoSQL databases
Jason Monberg (MarkLogic) distills what he heard at the last BigData conferences into four trends:
tags:BigData
via NoSQL databases
Coda Hale and Ryan Kennedy [1] presented recently about Riak and Scala usage at Yammer providing details about choosing Riak and sharing some of the leassons learned while using Riak for building Streamie.
tags:Riak
via NoSQL databases
From thechangelog podcast :
tags:MongoDB
via NoSQL databases
Rob Karel:
tags:BigData
via NoSQL databases
Cloudant's BigCouch database let the team keep up with a steady flow of data so it could process and analyze it, then share it with the various stakeholders in near-real-time. The team was changing the data about 20 times per day and writing complex workflows to process it, two tasks that fall into BigCouch's wheelhouse. The database has a built-in MapReduce engine to enable writing and processing the workflows, and it allows for secondary indices, which users can populate with new data from their MapReduce jobs and query very quickly.
via NoSQL databases
Carl and Richard talk to Oren Eini, aka Ayende Rahein, about RavenDB. RavenDB is a NoSQL JSON document database. Oren explains how he came to the realization that he needed to build his own data store, and the advantages of document databases over relational databases. Is SQL dead? Not hardly, but RavenDB is an interesting addition to your data solution!
tags:RavenDB
via NoSQL databases
Even if the name of this TED talk is "The birth of a word", I would have called it anything from the future of data science, extreme data analysis, and brilliant informatio visualization. Anyway, it is a must see:
via NoSQL databases
About me: Software architect, Web Aficionado, Cloud Computing Fanboy, Geek Entrepreneur, Speaker, Co-founder and CTO of InfoQ.com, Writing also about NoSQL on the myNoSQL blog
