January 25, 2011
Apache Mahout in Amsterdam # On February 7th there will be an Apache Mahout meetup in Amsterdam, kindly organised by JTeam. There will be two presentations: one by me on classification with Apache Mahout and a second by Frank Scholten on clustering with Apache Mahout.
Time: 18:00
Location: Frederiksplein 1, 1017XK Amsterdam, The Netherlands
Looking forward to a few days in Amsterdam.
January 23, 2011
FOSDEM II 2011 # It’s already sort of a nice little tradition for me to spend the first weekend in February in Brussels for FOSDEM. This year I am particularly happy that there will be a Data Analytics Dev Room at FOSDEM. A huge thanks to @ogrisel and @nmaillot, who have done most of the heavy lifting of getting the schedule in place.
Looking forward to an interesting Cloud Track, to meeting Pieter Hintjens, who is going to give a talk on 0MQ, to the DevOps presentation and to lots of very interesting DevRooms.
...
January 22, 2011
O’Reilly Strata Conference # Title: O’Reilly Strata Conference
Location: Santa Clara
Link out: Click here
Description: Early next February O’Reilly is planning to put on a very interesting conference on the topic of data analysis and the business of generating value from raw digital data. I’m really glad to have received the acceptance notification for my presentation and travel sponsorship from the DICODE project. So see you in Santa Clara.
...
December 15, 2010
Devoxx – Day one – Java, Performance and Devops # In his keynote Mark Reinhold provided some information on the very interesting features to be included in the Java 7 release. Generics will be easier to declare with the diamond operator. Nested try-finally constructs that are nowadays needed to safely close resources will no longer be necessary: there will be the option of implementing a Closeable interface supporting a method close() that gets called whenever objects of that class’s type go out of scope.
...
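The two Java 7 features mentioned in the keynote summary can be sketched as follows. This is a minimal illustration, not code from the talk: the class `Java7Features` and the resource `LoggingResource` are hypothetical names invented for the example. In the released Java 7 the feature shipped as try-with-resources, which works with any `java.lang.AutoCloseable` (of which `java.io.Closeable` is a sub-interface), and `close()` is invoked when the `try` block ends.

```java
import java.util.ArrayList;
import java.util.List;

public class Java7Features {

    // Hypothetical resource used only for illustration; it records when it is
    // opened and when close() runs.
    static class LoggingResource implements AutoCloseable {
        private final String name;
        private final List<String> log;

        LoggingResource(String name, List<String> log) {
            this.name = name;
            this.log = log;
            log.add("open " + name);
        }

        @Override
        public void close() {
            log.add("close " + name);
        }
    }

    static List<String> run() {
        // Diamond operator: the type argument is inferred from the declaration.
        List<String> log = new ArrayList<>();

        // try-with-resources: close() is called automatically when the try
        // block ends, in reverse order of declaration, even if the body throws.
        try (LoggingResource first = new LoggingResource("first", log);
             LoggingResource second = new LoggingResource("second", log)) {
            log.add("work");
        }
        return log;
    }

    public static void main(String[] args) {
        // prints [open first, open second, work, close second, close first]
        System.out.println(run());
    }
}
```

Without this construct, each resource would need its own nested try-finally block to guarantee that `close()` runs.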
December 12, 2010
Teddy in Antwerp # While at Devoxx, Teddy went into the city and took a few pictures of the Grote Markt, the Haven and the main train station.
December 11, 2010
Apache Lunch Devoxx # On Twitter I suggested hosting an Apache dinner during Devoxx. Matthias Wesendorf of Apache MyFaces was kind enough to take up the discussion, carrying it over to the Apache community mailing list. It quickly turned out that there was quite some interest, with several members and committers attending Devoxx. We scheduled the meetup for Friday after the conference during lunch time.
I pinged a few Apache related people I knew would attend the conference (being a speaker and a committer at some Apache project almost certainly resulted in getting a ping).
...
December 10, 2010
Devoxx – Day three # The panel discussion on the future of Java was driven by visitor-submitted and voted questions on the current state and future of Java. The general take-aways for me included the clear statement that the TCK will never be made available to the ASF, and Oracle's promise to continue supporting the Java community and to remain active in the JCP.
There was some discussion on whether coming Java versions should be backwards-incompatible.
...
December 9, 2010
Devoxx – Day 2 HBase # Devoxx featured several interesting case studies of how HBase and Hadoop can be used to scale data analysis back ends as well as data serving front ends.
Twitter
Dmitry Ryaboy from Twitter explained how to scale high-load and large-data systems using Cassandra. Looking at the sheer number of tweets generated each day, it becomes obvious that this site cannot be run on a system like MySQL alone.
...
December 8, 2010
Devoxx – Day two – Hadoop and HBase # In his session on the current state of Hadoop, Tom went into a little more detail on the features in the latest release and on the roadmap for upcoming releases (including Kerberos-based security, append support, warm standby namenode and others).
He also gave a very interesting view on the current Hadoop ecosystem. More and more projects are currently being created that either extend Hadoop or are built on top of Hadoop.
...
December 6, 2010
Devoxx – University – Cassandra, HBase # During the morning session FIXME Ellison gave an introduction to the distributed NoSQL database Cassandra. Being generally based on the Dynamo paper from Amazon, the key-value store distributes key/value pairs according to a consistent hashing scheme. Nodes can be added dynamically, making the system well suited for elastic scaling. In contrast to Dynamo, Cassandra can be tuned for the required consistency level. The system is tuned for storing moderately sized key/value pairs.
...
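To make the consistent hashing idea more concrete, here is a minimal sketch in Java. It is not Cassandra's actual implementation: the class `ConsistentHashRing`, the use of MD5 and the virtual node count are all assumptions chosen for illustration. Nodes and keys are hashed onto the same ring, and each key belongs to the first node found clockwise from its position.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Map;
import java.util.TreeMap;

public class ConsistentHashRing {
    // Ring positions mapped to node names, kept sorted by position.
    private final TreeMap<Long, String> ring = new TreeMap<>();
    // Number of ring positions per node (an illustrative choice).
    private final int virtualNodes;

    public ConsistentHashRing(int virtualNodes) {
        this.virtualNodes = virtualNodes;
    }

    // Adding a node places several virtual positions for it on the ring.
    public void addNode(String node) {
        for (int i = 0; i < virtualNodes; i++) {
            ring.put(hash(node + "#" + i), node);
        }
    }

    public void removeNode(String node) {
        for (int i = 0; i < virtualNodes; i++) {
            ring.remove(hash(node + "#" + i));
        }
    }

    // A key is owned by the first node at or after its hash, wrapping around
    // to the start of the ring if there is none.
    public String nodeFor(String key) {
        if (ring.isEmpty()) {
            throw new IllegalStateException("no nodes in ring");
        }
        Map.Entry<Long, String> entry = ring.ceilingEntry(hash(key));
        return (entry != null ? entry : ring.firstEntry()).getValue();
    }

    // First 8 bytes of an MD5 digest, interpreted as a long.
    static long hash(String s) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5")
                    .digest(s.getBytes(StandardCharsets.UTF_8));
            long h = 0;
            for (int i = 0; i < 8; i++) {
                h = (h << 8) | (digest[i] & 0xff);
            }
            return h;
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        ConsistentHashRing ring = new ConsistentHashRing(100);
        ring.addNode("node-a");
        ring.addNode("node-b");
        ring.addNode("node-c");
        System.out.println("user:42 is stored on " + ring.nodeFor("user:42"));
    }
}
```

The property that makes this attractive for elastic scaling is that adding a node only moves keys onto the new node; every other key stays where it was.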