August 6, 2011
Machine learning problem settings # Together with Sebastian Schelter I held a Nokia-sponsored (thank you!) lecture on large-scale data analysis and data mining over the past few months. Following a few successful university projects based on Apache Mahout, the goal of this lecture was to introduce students to some of the basic concepts and problems encountered today, in a world where huge datasets are generally available and easy to process with Apache Hadoop.
...
February 19, 2011
Apache Mahout Meetup Amsterdam # Last week I was honoured to be invited as one of the two speakers on Apache Mahout at the Mahout meetup in Amsterdam at JTeam's offices. After free beer, cola and pizza, Frank Scholten gave an overview of Mahout's clustering capabilities. After a brief introduction to Mahout itself, he went into a little more detail on how clustering works in general. He then used a fun data set, a selection of Seinfeld scripts, to guide the audience through the process of choosing the right data preparation steps, coming up with good training parameters and finally evaluating clustering quality.
...
January 25, 2011
Apache Mahout in Amsterdam # On February 7th there will be an Apache Mahout meetup in Amsterdam kindly organised by JTeam. There will be two presentations - one by myself on classification with Apache Mahout as well as a second one by Frank Scholten on clustering with Apache Mahout.
Time: 18:00
Location: Frederiksplein 1, 1017XK Amsterdam, The Netherlands
Looking forward to a few days in Amsterdam.
January 23, 2011
FOSDEM II 2011 # It’s already sort of a nice little tradition for me to spend the first weekend in February in Brussels for FOSDEM. This year I am particularly happy that there will be a Data Analytics Dev Room at FOSDEM. A huge thanks to @ogrisel and @nmaillot, who have done most of the heavy lifting of getting the schedule in place.
Looking forward to an interesting Cloud Track, to meeting Pieter Hintjens, who is going to give a talk on 0MQ, to the DevOps presentation and to lots of very interesting DevRooms.
...
January 22, 2011
O’Reilly Strata Conference # Title: O’Reilly Strata Conference
Location: Santa Clara
Description: Early next February O’Reilly is planning to put on a very interesting conference on the topic of data analysis and the business of generating value from raw digital data. I’m really glad to have received the acceptance notification for my presentation and travel sponsorship from the DICODE project. So see you in Santa Clara.
...
December 13, 2010
Apache Mahout Podcast # During ApacheCon ATL Michael Coté interviewed Grant Ingersoll on Apache Mahout. The interview is available online as a podcast. It covers the goals and current use cases of the project and goes into some detail on the reasons for initially starting it. If you are wondering what Mahout is all about, what you can do with it and which direction development is heading, the interview is a great way to find out more.
...
November 3, 2010
Apache Mahout 0.4 release # Last Sunday the Apache Mahout project published the 0.4 release. Nearly every piece of the code has been refactored and improved since the 0.3 release. The release was timed to happen just before ApacheCon NA in Atlanta. As such it was published on October 31st - the Halloween release, sort of.
The following improvements are especially worth mentioning:
Model refactoring and CLI changes to improve integration and consistency
...
October 31, 2010
Apache Mahout @ Lisbon Codebits # In the second week of November I’ll spend a few days in Lisbon - I never would have thought that I’d return so quickly when I visited this beautiful city on vacation this summer. I’ll be there for Codebits - thanks to Sapo for inviting me.
Back in summer I learned only after I had returned to Germany that someone from Portugal had been looking to meet with other Apache people exactly when I was down there.
...