October 22, 2009
Videos are up # As of yesterday the videos of the last Apache Hadoop Get Together Berlin are available online.
Th anks to the speakers for providing insight in their projects and thanks to Cloudera for sponsoring the videos.
The next meetup will be announced soon - three talks have already been proposed. In addition, StudiVZ offered to sponsor video taping of the next Get Together. Looking forward to seeing you in Berlin in December.
...
October 6, 2009
Lucene 2.9 @ Heise # After last week’s Hadoop Get Together heise published an in-depth article on the changes and improvements that come with the latest Lucene 2.9 release.
Thanks to Simon Willnauer for helping me write this article and patiently explaining several new features. Thanks also to Uwe Schindler for kindly proof-reading the article before it was sent out to Heise.
October 4, 2009
Getting Hadoop trunk up and running from source # Having told Thilo about the possibility to write Hadoop jobs in Python with Dumbo, we spent some time getting Dumbo 0.21 up and running over the past weekend. The first option the wiki proposes is to take a pre-0.21 release and patch that to work with the current Dumbo release. The second option described takes the not-yet-released version of Hadoop that can be used w/o any patches.
...
October 4, 2009
Dev House Berlin 2.0 # This weekend DevHouseBerlin took place in the Box119, kindly organized by Jan Lehnardt, sponsored by Upstream and StudiVZ. There were about 30 people gathered in Friedrichshain, hacking and discussing various projects: Mostly Python/ Django, Ruby/ Rails and Erlang people. The first day was reserved for hacking and exchanging ideas. Late afternoon attendees put together a list of talks that were than rated, ranked with the top three chosen for presentation on Sunday.
...
September 30, 2009
Slides are up # The slides for yesterday’s talks just arrived. They are available online at:
Isabel Drost: Brief introduction.
Thorsten Schuett: Solving puzzles with map reduce.
Thilo Goetz: An introduction to jaql.
Uwe Schindler: Lucene 2.9 developments.
Videos will be online early next week.
September 23, 2009
Upcoming: Apache Hadoop Get Together Berlin # This is a friendly reminder that the next Apache Hadoop Get Together takes place next week on Tuesday, 29th of September* at newthinking store (Tucholskystr. 48, Berlin).
Thorsten Schuett, Solving Puzzles with MapReduce.
Thilo Götz, Text analytics on jaql.
Uwe Schindler, Lucene 2.9 Developments.
Big thanks goes to newthinking store for providing the venue for free and to Cloudera for sponsoring videos of the talks.
...
August 24, 2009
Apache Hadoop Event Blog # As Apache Hadoop becomes ever more popular both in industry as well as in research, user groups, conferences and hacking days are being scheduled around the world. The goal of the event calendar blog hosted on wordpress.com is to provide a common space for organizers to announce their events and potential participants to look for new conferences.
August 23, 2009
September Apache Hadoop Get Together @ Berlin # The upcoming Apache Hadoop Get Together Berlin is to take place on September 29th in newthinking store. Details are up on the web page at upcoming and will be sent out to the mailing list soon.
August 17, 2009
September 2009 Hadoop Get Together Berlin # The newthinking store Berlin is hosting the Hadoop Get Together user group meeting. It features talks on Hadoop, Lucene, Solr, UIMA, katta, Mahout and various other projects that deal with making large amounts of data accessible and processable. The event brings together leaders from the developer and user communities. The speakers present projects that build on top of Hadoop, case studies of applications being built and deployed on Hadoop.
...
June 23, 2009
Large Scalability - Papers and implementations # In recent years the Googles and Amazons on this world have released papers on how to scale computing and processing to terrabytes of data. These publications have led to the implementation of various open source projects that benefit from that knowledge. However mapping the various open source projects to the original papers and assigning tasks that these projects solve is not always easy.
...