November 1, 2010
Apache Mahout @ Devoxx Tools in Action Track # This year’s Devoxx will feature several presentations coming from the Apache Hadoop ecosystem including Tom White on the basics of Hadoop: HDFS, MapReduce, Hive and Pig as well as Michael Stack on HBase.
< br>
In addition there will be a brief Tools in Action presentation on Monday evening featuring Apache Mahout.
Please let me know if you are going to Devoxx - would be great to meet some more Apache people there, maybe have dinner at one of the conference days.
...
July 17, 2010
Apache Hadoop in Debian Squeeze # After using Mandrake for quite a while (still blaming my boyfriend Thilo for infecting not only my computer but also myself first with that system, then with the more general idea of Free Software
but that’s another story.) after finishing my master’s thesis I started using GNU Debian Linux (back then in the version code-named Woody). Since I always had a GNU Debian on my private box as my main operating system - even installed it on my MacBook following the steps in the Debian Wiki.
...
April 10, 2010
Berlin Buzzwords - Early bird registration # I would like to invite everyone interested in data storage, analysis and search to join us for two days on June 7/8th in Berlin for Berlin Buzzwords - an in-depth, technical, developer-focused conference located in the heart of Europe. Presentations will range from beginner friendly introductions on the hot data analysis topics up to in-depth technical presentations of scalable architectures.
Our intention is to bring together users and developers of data storage, analysis and search projects.
...
March 24, 2010
Bob Schulze on Tips and patterns with HBase # At the last Hadoop Get Together in Berlin Bob Schulze from eCircle in Munich gave a presentation on “Tips and patterns with HBase”. The talk has been video recorded. The result is now available online:
HBase Bob Schulze from Isabel Drost on Vimeo.
Feel free to share and distribute the video. Thanks to Bob for an awesome talk on eCircle’s usage of HBase - and on providing some background information on how HBase was applied to solve your problems.
...
March 17, 2010
Learning To Rank,
dima,
dups,
NOSQL,
Science,
mapreduce,
Mahout,
topic tracking,
topic detection,
hbase,
pnuts,
TU Berlin Seminar on scaling learning at DIMA TU Berlin # Last Thursday the seminar on scaling learning problems took place at DIMA at TU Berlin. We had five students give talks.
The talks started with an introduction to map reduce. Oleg Mayevskiy first explained the basic concept, than gave an overview of the parallelization architecture and finally showed how jobs can be formulated as map reduce jobs.
His paper as well as his slides are available online.
...
March 11, 2010
Slides are available # Slides for the last Hadoop Get Together are available online:
Spatial Search by Chris Male
HBase Patterns by Bob Schulze
Scaling product search with Hadoop and Lucene by Dragan Milosevic
My own little introduction, just in case you are interested.
Videos will follow as soon as the are ready. Watch this space for further updates.
March 11, 2010
Apache Hadoop Get Together March 2010 # Today (or more correctly, yesterday) the March 2010 Hadoop Get Together took place in newthinking store. I arrived rather early to have some time to do some planning for Berlin Buzzwords - got there nearly one hour before the meetup. However it did not take very long until first guests came to the store. So I quickly got my introductory slides in place - Martin from newthinking already had the room setup, camera in place and audio working.
...