November 15, 2009
December Apache Hadoop Get Together @ Berlin # As announced at ApacheCon US, the next Apache Hadoop Get Together Berlin is scheduled for December 2009.
When: Wednesday December 16, 2009 at 5:00pm Where: newthinking store, Tucholskystr. 48, Berlin
As always there will be slots of 20 minutes each for talks on your Hadoop topic. After each talk there will be plenty of time for discussion. You can order drinks directly at the bar in the newthinking store.
October 29, 2009
Apache Con US - Program up # The final program is available for download. The schedule is packed with interesting talks on Hadoop, Lucene, Tomcat, httpd, web services and OSGi. For those less tech-savvy there is a business track explaining how best to use open source software in an enterprise environment. There is also a community track explaining what makes open source projects successful.
Looking forward to seeing you in Oakland.
October 28, 2009
Lucene 2.9 White Paper # Lucid recently published a white paper that explains the changes and improvements in the new 2.9 release. It is interesting for anyone who is thinking about upgrading to the new Lucene version or who generally wants to know what is going on at Lucene.
October 6, 2009
Lucene 2.9 @ Heise # After last week’s Hadoop Get Together, Heise published an in-depth article on the changes and improvements that come with the latest Lucene 2.9 release.
Thanks to Simon Willnauer for helping me write this article and patiently explaining several new features. Thanks also to Uwe Schindler for kindly proof-reading the article before it was sent out to Heise.
September 9, 2009
Mahout@TU WS 09/10 # There is going to be a project/seminar course at TU Berlin on Apache Mahout. The goal is for students to learn how to work on a free software project, interact with the community and build production-ready software.
Students will be given several potential tasks, ranging from optimizing existing implementations and implementing new algorithms to (depending on their prior knowledge) improving, scaling and parallelizing existing algorithms.
September 9, 2009
GSoC at Mahout # GSoC 2009 is about to finish: Final evaluations are through, most of the code submitted by Mahout’s students has been committed to svn, code samples are on their way to Google.
In Mahout, we had three students joining the project: Robin worked on an HBase-based Naive Bayes extension and on frequent itemset discovery, David contributed a distributed LDA implementation, and Deneche worked on a Random Forest implementation.
September 9, 2009
First NoSQL Meetup in Germany # On October 22nd, 2009 the first NoSQL Meetup Germany is going to take place at the newthinking store in Berlin.
Please submit your presentation proposals by September 22nd; accepted speakers will be notified soon after.
If you would like to sponsor the event, feel free to contact us. We would be very happy to provide videos after the event and free drinks for everyone during the event.
September 4, 2009
Apache Con drawing closer # In November I will be traveling to Oakland. For me it is the first Apache Con US ever, and the first Apache Con at which I will be giving a talk in one of the main tracks:
I will be presenting Apache Mahout, giving an overview of the project and of our current status, and explaining which problems can be solved with the current implementation. The talk will conclude with an outlook on upcoming tasks and features our users can expect in the near future.
August 24, 2009
Inglourious Basterds # This evening I went to the Odeon cinema in Berlin Schöneberg. It is a pretty traditional, old-fashioned and very lovely cinema that specialises in showing non-dubbed, original versions of movies.
Showing the great movie Inglourious Basterds, the cinema was completely sold out today. Fortunately we were able to grab some of the last tickets.
Just in case the entrance seemed familiar to those who have attended a Mahout presentation in the recent past - a picture of the Odeon usually visualises one part of my motivation on the Mahout slides ;)
August 24, 2009
Apache Hadoop Event Blog # As Apache Hadoop becomes ever more popular both in industry and in research, user groups, conferences and hacking days are being scheduled around the world. The goal of the event calendar blog is to provide a common space for organizers to announce their events and for potential participants to look for new conferences.