Mahout

GSoC - one day to go for your application

April 8, 2010
Mahout, GSoC, Software Foundation

GSoC - one day to go for your application # If you are a student interested in participating in Google Summer of Code: Registration closes tomorrow (as in “April 9, 19:00 UTC”). You hopefully published and discussed your proposal at your favourite project already so you have a clear plan of where to go and which milestones to achieve in summer. If you are interested in Apache Mahout: Yes, as last years, we are again looking for students willing to work on awesome student projects this summer. ...

Apache Mahout 0.3 released

March 18, 2010
Mahout, release

Apache Mahout 0.3 released # This week, Apache Mahout 0.3 was released. First of all thanks to all committers and contributors who made that possible: Thanks for all your hard work on making the code even faster and integrating even more algorithms. To the highlights: New: math and collections modules based on the high performance Colt library Faster Frequent Pattern Growth(FPGrowth) using FP-bonsai pruning Parallel Dirichlet process clustering (model-based clustering algorithm) ...

Seminar on scaling learning at DIMA TU Berlin

March 17, 2010
Learning To Rank, dima, dups, NOSQL, Science, mapreduce, Mahout, topic tracking, topic detection, hbase, pnuts, TU Berlin

Seminar on scaling learning at DIMA TU Berlin # Last Thursday the seminar on scaling learning problems took place at DIMA at TU Berlin. We had five students give talks. The talks started with an introduction to map reduce. Oleg Mayevskiy first explained the basic concept, than gave an overview of the parallelization architecture and finally showed how jobs can be formulated as map reduce jobs. His paper as well as his slides are available online. ...

Google Summer of Code starting

March 10, 2010
Mahout, GSoC, Hacking, Software Foundation

Google Summer of Code starting # As published on the Google Open Source blog the application period for mentoring organizations for GSoC starts now. The ASF is already in the process of applying. If you are a student, looking for an interesting project to work on during the coming summer - you might consider participating in GSoC. It does give you are great opportunity to get in touch with successful free software projects, learn how to work in global teams, improve your communication skills and last but not least show and publish your fantastic coding skills. ...

Learning to Rank Challenge

March 9, 2010
Mahout, Science, Learning To Rank, ICML

Learning to Rank Challenge # In one of his recent blog posts, Jeff Dalton published an article on currently running machine learning challenges. Especially interesting for those working on search engines and interested in learning new rankings from data should be the Yahoo! Learning to Rank Challenge to be held in conjunction with this year’s ICML 2010 in Haifa, Israel. The goal is to show that your algorithm does not only scale on real-world data provided by Yahoo! ...

Mahout at Berlin ignite

March 1, 2010
Mahout, Camp, berlin ignite

Mahout at Berlin ignite # This evening the first Berlin ignite event took place in the “Festsaal” in Berlin X-Berg. Organiser of the event was Matt Biddulph from Nokia Gate 5. We had eleven fantastic talks (ok, to be more precise: At least ten fantastic ones, my own can only be judged by the audience ;) ). Topics included things you can learn when starting to collect data, themes from (agile) project management, RepRap machines (see also the Rep Rap FOSDEM 2010 talk), bots and robots. ...

FOSDEM 2010 - 10 years FOSDEM

February 3, 2010
Mahout, Free Software, General

FOSDEM 2010 - 10 years FOSDEM # The final schedule of FOSDEM 2010 is up: Looks like bad news - 306 interesting talks within just one weekend. Lots of interesting talks in the main track including Greg Kroah-Hartman on “Write and Submit your first Linux kernel Patch”, David Recordon from Facebook on “Scaling Facebook with OpenSource tools”, Bernard Li on “Ganglia: 10 years of monitoring clusters and grids”, Andrew Tanenbaum with his “MINIX 3: a Modular, Self-Healing POSIX-compatible Operating System” talk, Benoît Chesneau on “CouchDB! ...

Mahout in Action

January 11, 2010
Mahout

Mahout in Action # As noted earlier by Grant Ingersoll, the first chapters of Mahout in Action are already online at Manning: Sean, Robin, keep up the great work! I would love to read more of the book in the near future.

With a little help from my friends

December 31, 2009
Lucene, Hadoop, Mahout, Berlin, Thanks, TU Berlin

With a little help from my friends # The end of the year 2009 is quickly approaching. To me it feels a little like it ran away far too quickly. So instead of taking part in the annual review of past events, I would like to use it as an opportunity to say thank you: The past twelve months were a lot of fun with lots of interesting, nice people from all over the world. ...