NOSQL

Wonder if you should switch from your RDBMS to Apache Hadoop: Don't!

August 26, 2013
Hadoop, NOSQL, rdbms

Last weekend I spent a lot of fun time at FrOSCon* in Sankt Augustin - always great to catch up with friends in the open source space. As always there were quite a few talks on NoSQL and Hadoop, but also really solid advice on tuning your system for stuff like MySQL (including a side note on PostgreSQL and Oracle) from Kristian Köhntopp. ...

Apache Sling and Jackrabbit event coming to Berlin

July 12, 2012
NOSQL, sling, Berlin, jackrabbit, Apache, Event

Interested in Apache Sling and/or Apache Jackrabbit? Then you might be interested in hearing that from September 26th to 28th there will be an event in town on these two topics: adaptTo(). It is mainly organised by Adobe but labeled as a community event, meaning that a number of active community members will attend the conference. From their website: In late September 2012 Berlin will become the global heart beat for developers working on the Adobe CQ technical stack. ...

Apache Hadoop Get Together - Hand over

November 2, 2011
Scaling, NOSQL, Apache Hadoop Get Together, Hadoop, Lucene, Berlin, Get Together

Apache Hadoop receives lots of attention from large US corporations who are using the project to scale their data processing pipelines: “Facebook uses Hadoop and Hive extensively to process large data sets. […]” (Ashish Thusoo, Engineering Manager at Facebook), “Hadoop is a key ingredient in allowing LinkedIn to build many of our most computationally difficult features […]” (Jay Kreps, Principal Engineer, LinkedIn), “Hadoop enables [Twitter] to store, process, and derive insights from our data in ways that wouldn’t otherwise be possible. ...

Devoxx – Day 2 HBase

December 9, 2010
adobe, NOSQL, twitter, General, Mahout, facebook, Hacking, hbase, Devoxx

Devoxx featured several interesting case studies of how HBase and Hadoop can be used to scale data analysis back ends as well as data serving front ends. Twitter: Dmitry Ryaboy from Twitter explained how to scale high-load, large-data systems using Cassandra. Looking at the sheer number of tweets generated each day, it becomes obvious that this site cannot be run with a system like MySQL alone. ...

Devoxx – University – Cassandra, HBase

December 6, 2010
hbase, Cassandra, Devoxx, NOSQL, General

During the morning session FIXME Ellison gave an introduction to the distributed NoSQL database Cassandra. Being generally based on the Dynamo paper from Amazon, the key-value store distributes key/value pairs across nodes according to a consistent hashing scheme. Nodes can be added dynamically, making the system well suited for elastic scaling. In contrast to Dynamo, Cassandra can be tuned for the required consistency level. The system is tuned for storing moderately sized key/value pairs. ...
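To make the consistent hashing idea from this excerpt a bit more concrete, here is a minimal sketch in Python. It is not Cassandra's actual implementation: the class name `HashRing`, the node names and the choice of MD5 as hash function are assumptions made purely for illustration. Each node is placed on a ring at the hash of its name, and a key is stored on the first node found clockwise from the key's own hash.

```python
import bisect
import hashlib


def ring_position(value: str) -> int:
    """Map a string to a position on the ring (MD5 is an illustrative choice)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Hypothetical minimal consistent-hash ring, one position per node."""

    def __init__(self, nodes=()):
        self._positions = []  # sorted ring positions
        self._nodes = []      # node names, parallel to _positions
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str) -> None:
        # Insert the node at its ring position, keeping both lists sorted.
        pos = ring_position(node)
        idx = bisect.bisect_left(self._positions, pos)
        self._positions.insert(idx, pos)
        self._nodes.insert(idx, node)

    def node_for(self, key: str) -> str:
        # Walk clockwise from the key's position to the next node,
        # wrapping around to the first node at the end of the ring.
        if not self._nodes:
            raise ValueError("ring has no nodes")
        idx = bisect.bisect_right(self._positions, ring_position(key))
        return self._nodes[idx % len(self._nodes)]


ring = HashRing(["node-a", "node-b", "node-c"])
keys = [f"user:{i}" for i in range(1000)]
before = {k: ring.node_for(k) for k in keys}

ring.add_node("node-d")  # elastic scaling: add a node at runtime
after = {k: ring.node_for(k) for k in keys}

# Only keys that now belong to the new node change their owner.
moved = [k for k in keys if before[k] != after[k]]
print(f"{len(moved)} of {len(keys)} keys moved to node-d")
```

The point of the design is visible in the last lines: adding a node only takes over keys from its neighbour on the ring instead of reshuffling everything, which is what makes adding nodes dynamically cheap. Real systems typically place each node at many ring positions to even out the load, which this sketch leaves out.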

Devoxx University – MongoDB, Mahout

December 5, 2010
Mahout, mongodb, NOSQL

The second tutorial was given by Roger Bodamer on MongoDB. MongoDB concentrates on being horizontally scalable by avoiding joins and complex, multi-document transactions. It supports a new data model that allows for flexible, changeable “schemas”. The exact data layout is determined by the types of operations you expect for your application and by the access patterns (reading vs. writing data; types of updates and types of queries). ...

Devoxx Antwerp

December 3, 2010
Java, NOSQL, antwerp, General, Mahout, Software Foundation, Devoxx

With 3000 attendees Devoxx is the largest Java community conference world-wide. Each year in autumn it takes place in Antwerp, Belgium, in recent years in the Metropolis cinema. This year the conference tickets were sold out long before the doors opened. The focus of the presentations is mainly on enterprise Java, featuring talks by the famous Joshua Bloch, Mark Reinhold and others on new features of the upcoming JDK release as well as intricacies of the Java programming language itself. ...

Apache Mahout at Apache Con NA

October 15, 2010
Lucene, NOSQL, Hadoop, Mahout, ApacheConNA, Apache Con, Software Foundation

The upcoming Apache Con NA, taking place in Atlanta, will feature several tracks relevant to users of Apache Mahout, Lucene and Hadoop: There will be a full track on Hadoop as well as one on NoSQL on Wednesday, featuring talks on the framework itself, Pig and Hive as well as presentations from users on special use cases and on their way of getting the system to production. ...

NoSQL summer Berlin - this evening

August 11, 2010
Science, Berlin, Freetime, NOSQL

This evening the next NoSQL summer Berlin (organised by Tim Lossen) is meeting at Café Schoenbrunn in Volkspark Friedrichshain to discuss the paper on Amazon’s Dynamo, “Dynamo: Amazon’s Highly Available Key-value Store”. The group is planning to meet at 19:30 for some beer and discussions on the publication.

My highly subjective Berlin Buzzwords recap

June 13, 2010
Lucene, NOSQL, Hadoop, General, Mahout, Berlin Buzzwords, Software Foundation, Get Together

Last November I innocently asked Grant what it would take to make him give a talk in Berlin. The only requirement, he told me, was that I’d have to pay for his flight. About eight months later we had Berlin Buzzwords - a conference all around the topics of scalability, data storage and search. With Simon Willnauer, Uwe Schindler, Michael Busch, Robert Muir, Grant Ingersoll, Andrzej Bialecki and many others we had quite a few Lucene people in town. ...