Get Together

Lucene 2.9 @ Heise

October 6, 2009
Lucene, press, Hadoop, Get Together, Software Foundation

After last week’s Hadoop Get Together, Heise published an in-depth article on the changes and improvements that come with the latest Lucene 2.9 release. Thanks to Simon Willnauer for helping me write this article and patiently explaining several new features. Thanks also to Uwe Schindler for kindly proofreading the article before it was sent to Heise.

Slides are up

September 30, 2009
Slides, Berlin, Hadoop, Get Together

The slides for yesterday’s talks just arrived. They are available online: Isabel Drost: Brief introduction. Thorsten Schuett: Solving puzzles with map reduce. Thilo Goetz: An introduction to jaql. Uwe Schindler: Lucene 2.9 developments. Videos will be online early next week.

Apache Hadoop Get Together Berlin

September 29, 2009
Get Together

The Get Together started just a few minutes ago. The room is packed with more than 35 people this time. This is the first Hadoop Get Together in Berlin that will be recorded on video. Thanks to Martin from newthinking for doing the recording and post-processing, and to Cloudera for sponsoring the videos. The first talk was given by Thorsten Schuett on solving puzzles with map reduce. ...

Upcoming: Apache Hadoop Get Together Berlin

September 23, 2009
Lucene, Berlin, Hadoop, Get Together

This is a friendly reminder that the next Apache Hadoop Get Together takes place next week on Tuesday, 29th of September at newthinking store (Tucholskystr. 48, Berlin). Thorsten Schuett, Solving Puzzles with MapReduce. Thilo Götz, Text analytics on jaql. Uwe Schindler, Lucene 2.9 Developments. Big thanks go to newthinking store for providing the venue for free and to Cloudera for sponsoring videos of the talks. ...

September 2009 Hadoop Get Together Berlin

August 17, 2009
JAQL, Hadoop, Software Foundation, Lucene, Event, Get Together

The newthinking store Berlin is hosting the Hadoop Get Together user group meeting. It features talks on Hadoop, Lucene, Solr, UIMA, katta, Mahout and various other projects that deal with making large amounts of data accessible and processable. The event brings together leaders from the developer and user communities. The speakers present projects that build on top of Hadoop and case studies of applications being built and deployed on Hadoop. ...

Lucene slides online

June 30, 2009
Lucene, Get Together, General

The slides of the Lucene talk at the last Apache Hadoop Get Together Berlin are available online: Lucene Slides. Especially interesting to me are the last few slides, which detail both index size and machine setup: The installation is running on two standard PCs with 2 dual-core processors (usual speed, bought in January 2008 for about 4000 Euro). They have 32 GB RAM, of which 24 GB are used as a ramdisk for the index. ...

Data serialization

June 26, 2009
Avro, Data Serialization, General, Protocol Buffers, Etch, Get Together, Thrift

XML, JSON and others are currently standard data exchange formats. Their main benefit is being human-readable while still structured enough to be easily parsed by programs. Their problems are overhead in size and parsing time. In addition, at least XML is not really as human-readable as it could be. Binary formats are an alternative. Yet those often are not platform independent (either C++ or Java or Python bindings) or are not upgradable (what if your boss comes along and wants you to add yet another field? ...
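To make the size trade-off concrete, here is a minimal sketch using only the Python standard library. The record, its field names and the fixed binary layout are made up for illustration; none of this is the API of Avro, Thrift, Protocol Buffers or Etch. It encodes the same record once as JSON and once as a hand-rolled fixed-layout binary blob.

```python
import json
import struct

# Hypothetical example record; the field names are illustrative only.
record = {"id": 42, "temperature": 21.5, "active": True}

# Text encoding: self-describing and human-readable, but field names
# and punctuation are repeated in every single message.
json_bytes = json.dumps(record).encode("utf-8")

# Hand-rolled binary encoding with a fixed layout (an assumption for this
# sketch): little-endian, unsigned 32-bit int, 64-bit float, 1-byte bool.
LAYOUT = struct.Struct("<Id?")
binary_bytes = LAYOUT.pack(record["id"], record["temperature"], record["active"])


def decode(blob):
    """Decode a blob written with LAYOUT back into a dict."""
    rec_id, temperature, active = LAYOUT.unpack(blob)
    return {"id": rec_id, "temperature": temperature, "active": active}


if __name__ == "__main__":
    print("JSON bytes:  ", len(json_bytes))
    print("binary bytes:", len(binary_bytes))
    print("round trip ok:", decode(binary_bytes) == record)
```

The binary form is much smaller, but the layout lives only in the code: adding a field changes the format string, and readers built against the old layout can no longer decode the new messages. That is the upgradability problem in its simplest form.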

March 2009 Hadoop Get Together Berlin

March 7, 2009
Hadoop, Get Together, Software Foundation

Since last summer, newthinking store Berlin has been hosting a Hadoop Meetup every quarter. The scope of these user group meetings is not limited to Hadoop projects; they also cover the technologies needed for storing, processing and searching large amounts of data. The meeting last Thursday featured a talk by Lars George on his experiences using HBase in customer projects as early as 2007. ...