Hadoop

Videos are up

October 22, 2009
JAQL, Hadoop, mapreduce, Lucene, Video, Get Together

As of yesterday the videos of the last Apache Hadoop Get Together Berlin are available online. Thanks to the speakers for providing insight into their projects and thanks to Cloudera for sponsoring the videos. The next meetup will be announced soon; three talks have already been proposed. In addition, StudiVZ offered to sponsor video taping of the next Get Together. Looking forward to seeing you in Berlin in December. ...

Lucene 2.9 @ Heise

October 6, 2009
Lucene, press, Hadoop, Get Together, Software Foundation

After last week’s Hadoop Get Together, Heise published an in-depth article on the changes and improvements that come with the latest Lucene 2.9 release. Thanks to Simon Willnauer for helping me write this article and patiently explaining several new features. Thanks also to Uwe Schindler for kindly proofreading the article before it was sent out to Heise.

Getting Hadoop trunk up and running from source

October 4, 2009
0.21.0, Hadoop, dhb, Camp, Hacking, dumbo

Having told Thilo about the possibility of writing Hadoop jobs in Python with Dumbo, we spent some time getting Dumbo 0.21 up and running over the past weekend. The first option the wiki proposes is to take a pre-0.21 release and patch it to work with the current Dumbo release. The second option uses the not-yet-released version of Hadoop, which works without any patches. ...

Dev House Berlin 2.0

October 4, 2009
Camp, Hacking, CouchDB, Hadoop, dhb

This weekend DevHouseBerlin took place in the Box119, kindly organized by Jan Lehnardt and sponsored by Upstream and StudiVZ. About 30 people gathered in Friedrichshain, hacking and discussing various projects: mostly Python/Django, Ruby/Rails and Erlang people. The first day was reserved for hacking and exchanging ideas. In the late afternoon, attendees put together a list of talks that were then rated and ranked, with the top three chosen for presentation on Sunday. ...

Slides are up

September 30, 2009
Slides, Berlin, Hadoop, Get Together

The slides for yesterday’s talks just arrived. They are available online: Isabel Drost, Brief introduction; Thorsten Schuett, Solving puzzles with map reduce; Thilo Goetz, An introduction to jaql; Uwe Schindler, Lucene 2.9 developments. Videos will be online early next week.

Upcoming: Apache Hadoop Get Together Berlin

September 23, 2009
Lucene, Berlin, Hadoop, Get Together

This is a friendly reminder that the next Apache Hadoop Get Together takes place next week on Tuesday, 29th of September, at newthinking store (Tucholskystr. 48, Berlin). The scheduled talks are: Thorsten Schuett, Solving Puzzles with MapReduce; Thilo Götz, Text analytics on jaql; Uwe Schindler, Lucene 2.9 Developments. Big thanks go to newthinking store for providing the venue for free and to Cloudera for sponsoring videos of the talks. ...

Apache Hadoop Event Blog

August 24, 2009
Hadoop, Event, Software Foundation

As Apache Hadoop becomes ever more popular in both industry and research, user groups, conferences and hacking days are being scheduled around the world. The goal of the event calendar blog hosted on wordpress.com is to provide a common space for organizers to announce their events and for potential participants to look for new conferences.

September 2009 Hadoop Get Together Berlin

August 17, 2009
JAQL, Hadoop, Software Foundation, Lucene, Event, Get Together

The newthinking store Berlin is hosting the Hadoop Get Together user group meeting. It features talks on Hadoop, Lucene, Solr, UIMA, katta, Mahout and various other projects that deal with making large amounts of data accessible and processable. The event brings together leaders from the developer and user communities. The speakers present projects that build on top of Hadoop and case studies of applications being built and deployed on Hadoop. ...

Large Scalability - Papers and implementations

June 23, 2009
search, Hacking, Free Software, Hadoop, Software Foundation

In recent years the Googles and Amazons of this world have released papers on how to scale computing and processing to terabytes of data. These publications have led to the implementation of various open source projects that benefit from that knowledge. However, mapping the various open source projects to the original papers and identifying the tasks that these projects solve is not always easy. ...