General

Apache Mahout in Amsterdam

January 25, 2011
JTeam, Apache Mahout, amsterdam, General

Apache Mahout in Amsterdam # On February 7th there will be an Apache Mahout meetup in Amsterdam kindly organised by JTeam. There will be two presentations - one by myself on classification with Apache Mahout as well as a second one by Frank Scholten on clustering with Apache Mahout. Time: 18:00 Location: Frederiksplein 1, 1017XK Amsterdam, The Netherlands Looking forward to a few days in Amsterdam.

FOSDEM II 2011

January 23, 2011
Apache Mahout, Fosdem, Event, General

FOSDEM II 2011 # It’s already sort of a nice little tradition for me to spend the first weekend in February in Brussels for FOSDEM. This year I am particulary happy that there will be a Data Analytics Dev Room at FOSDEM. A huge Thanks to @ogrisel and @nmaillot who have done most of the heavy lifting of getting the schedule in place. Looking forward to an interesting Cloud Track, to meeting Peter Hintjens who is going to give a talk on 0MQ, the DevOps presentation and lots of very interesting DevRooms. ...

O'Reilly Strata Conference

January 22, 2011
Apache Mahout, Strata, General, Mahout, Speaker, California

O’Reilly Strata Conference # Title: O’Reilly Strata Conference Location: Santa Clara Link out: Click here<br />Description: Early next February O’Reilly is planning to put on a very interesting conference on the topic of data analysis and the business of generating value from raw digital data. I’m really glad to have received the acceptance notification for my presentation and travel sponsorship from the DICODE project. So see you in Santa Clara. ...

Devoxx – Day one – Java, Performance and Devops

December 15, 2010
Java, devops, General, performance, optimisation, Devoxx

Devoxx – Day one – Java, Performance and Devops # In his keynote Mark Reinhold provided some information on the very interesting features to be included in the Java 7 release. Generics will be easier to declare with the diamond operator. Nested try-finally constructs that are nowadays needed to safely close resources will no longer be necessary – their will be the option of implementing a Closeable interface supporting a method close() that get’s called whenever objects of that class’s type go out of scope. ...

Apache Lunch Devoxx

December 11, 2010
apache lunch, antwerp, Software Foundation, Devoxx, General

Apache Lunch Devoxx # On Twitter I suggested to host an Apache dinner during Devoxx. Matthias Wesendorf of Apache MyFaces was so kind to take up the discussion carrying it over to the Apache community mailing-list. It quickly turned out that there was quite some interest with several members and committers attending Devoxx. We scheduled the meetup for Friday after the conference during lunch time. I pinged a few Apache related people I knew would attend the conference (being a speaker and a committer at some Apache project almost certainly resulted in getting a ping). ...

Devoxx – Day three

December 10, 2010
lombok, Java, Devoxx, General

Devoxx – Day three # The panel discussion on the future of Java was driven by visitor submitted and voted questions on the current state and future of Java. The general take-aways for me included the clear statement that the TCK will never be made available to the ASF. The promise of Oracle to continue supporting the Java community and remaining active in the JCP. There was some discussion on whether coming Java versions should be backwards-incompatible. ...

Devoxx – Day 2 HBase

December 9, 2010
adobe, NOSQL, twitter, General, Mahout, facebook, Hacking, hbase, Devoxx

Devoxx – Day 2 HBase # Devoxx featured several interesting case studies of how HBase and Hadoop can be used to scale data analysis back ends as well as data serving front ends. Twitter Dmitry Ryaboy from Twitter explained how to scale high load and large data systems using Cassandra. Looking at the sheer amount of tweets generated each day it becomes obvious that with a system like MySQL alone this site cannot be run. ...

Devoxx – Day two – Hadoop and HBase

December 8, 2010
hbase, Hadoop, Devoxx, General

Devoxx – Day two – Hadoop and HBase # In his session on the current state of Hadoop Tom went into a little more detail not only on the features released in the latest release or on the roadmap for upcoming releases (including Kerberos based security, append support, warm standby namenode and others). He also gave a very interesting view on the current Hadoop ecosystem. More and more projects are currently being created that either extend Hadoop or are built on top of Hadoop. ...

Devoxx – University – Cassandra, HBase

December 6, 2010
hbase, Cassandra, Devoxx, NOSQL, General

Devoxx – University – Cassandra, HBase # During the morning session FIXME Ellison gave an introduction to the distributed NoSQL database Cassandra. Being generally based on the Dynamo paper from Amazon the key-value store distributes key/value pairs according to a consistent hashing schema. Nodes can be added dynamically making the system well suited for elastic scaling. In contrast to Dynamo, Cassandra can be tuned for the required consistency level. The system is tuned for storing moderately sized key/value pairs. ...