November 19, 2009
Moving from Fast to Solr # Sesat has published a nice in-depth report on why to move from Fast to Solr. The article also includes a description of the steps taken to move over as well as several statistics:
On a related topic, the following article details where Apple is using Lucene/Solr to power its search. Spoiler: Look at Spotlight, their desktop search, as well as the iTunes search with about 800 QPS.
November 19, 2009
ApacheCon Oakland Roundup # Two weeks ago ApacheCon US 2009 ended in Oakland, California. Shane published a set of links to articles that contain information on what happened at ApacheCon. Some of them are officially published by the Apache PRC project, others are write-ups by individuals on which talks they attended and which topics they considered particularly interesting.
November 18, 2009
Mahout 0.2 released # Apache Mahout 0.2 has been released and is now available for public download at
Up-to-date Maven artifacts can be found in the Apache repository at ent/repositories/releases/org/apache/mahout/
Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable machine learning algorithm implementations under the Apache license.
Mahout is a machine learning library meant to scale: Scale in terms of community to support anyone interested in using machine learning.
November 16, 2009
Open Source Expo 09 # I spent last Sunday and the following Monday at Open Source Expo Karlsruhe - co-located with web-tech and php-conference organized by the Software-and-Support Verlag. Together with Simon Willnauer I ran the Lucene/Mahout booth at the expo.
So far the conference is still very small (about 400 visitors) compared to free software community events. However, the focus was set to be more on professional users; accordingly, several projects showed that free software can be used successfully for various business use cases.
November 16, 2009
Apache Con US Wrap Up # Some weeks ago I attended ApacheCon US 09 in Oakland, California. In the meantime, videos of one of the sessions have been published online:
You can find a wrap-up of the most prominent topics at the conference at heise (unfortunately German-only).
By far the largest topics at the conference:
Lucene - there was a meetup with over 100 attendees as well as two main tracks with Lucene-focused talks.
November 15, 2009
December Apache Hadoop Get Together @ Berlin # As announced at ApacheCon US, the next Apache Hadoop Get Together Berlin is scheduled for December 2009.
When: Wednesday December 16, 2009 at 5:00pm Where: newthinking store, Tucholskystr. 48, Berlin
As always, there will be slots of 20 minutes each for talks on your Hadoop topic. After each talk there will be a lot of time to discuss. You can order drinks directly at the bar in the newthinking store.
November 4, 2009
Lucene Meetup Oakland # Though it is pretty late in the evening, the room is packed with some 100 people. Most of them are Solr or pure Lucene Java users. There are quite a few Lucene committers at the meetup from all over the world. Several have even heard about Mahout - some have even used it :)
Some introductory questions on index sizes and query volume: 1 million documents seems pretty standard for Lucene deployments - several people run 10 million as well.
November 3, 2009
Hadoop Get Together Berlin @ Apache Con US Barcamp # This is my first real day at ApacheCon US 2009. I arrived yesterday afternoon and was kept awake by three Lucene committers until midnight: “Otherwise you will have very bad jet lag”… Admittedly it did work out: I slept like a baby until about 8:00 a.m. the next morning and am not that tired today.
Today the Hackathon, the trainings and BarCamp Apache happen in parallel.
October 29, 2009
Apache Hadoop Get Together Berlin # Title: Apache Hadoop Get Together Berlin
Location: newthinking store, Tucholskystr. 48, Berlin Mitte
Description: The upcoming Apache Hadoop Get Together Berlin will feature four talks by people explaining how they put Hadoop to good use in their enterprise. A table at Cafe Aufsturz is already booked. Talks will be announced late next week.
Start Time: 17:00
Date: 2009-12-16
October 29, 2009
Open Source Expo # Title: Open Source Expo
Location: Karlsruhe
Description: There will be a booth at Open Source Expo introducing interested visitors to the Apache projects Lucene and Mahout. Of course we are also happy to answer any questions on the ASF in general.
Start Date: 2009-11-15
End Date: 2009-11-16