Skip to content

Speaking of Hadoop

We’ve recently switched the backend of Mahalo to use Hadoop for all of our text archiving needs. What’s Hadoop? Glad you asked…

Hadoop: When grownups do open source | The Register

Hadoop is a library for writing distributed data processing programs using the MapReduce framework. It’s got all the makings of a blogosphere hit: cluster computing, large datasets, parallelism, algorithms published by Google, and open source. Every four days or so, a nerd will discover Hadoop, write a “Basic MapReduce Tutorial with Hadoop” tutorial on his blog with some trivial examples, and feel satisfied with himself for educating the world about a yet-undiscovered gem. Comparatively, very few people actually use Hadoop in practice, and those who do don’t write about it. Why? Because they’re adults who don’t care about getting on the front page of Digg.

Read on. It’s great stuff, and you’ll definitely learn something useful if your site needs to…well…scale.

Post a Comment

Your email is never published nor shared. Required fields are marked *
*
*
Loading Mahalo Top 7...
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 United States