Skip navigation

Tag Archives: mapreduce

chum in the waterOne more post on MapReduce and parallel SQL, this time for the folks at O’Reilly Radar.  

Just for the record, I think MapReduce is fine, but not especially interesting technology.  The thing is, the “teachable moment” it presents is really great stuff, because it is bringing people toward data-centric parallel programming.  So it’s good for the data-centric research business in general, and especially for data-centric approaches to parallelism.  

I.e. chum in the water for our research on Lincoln…

Advertisements

One thing I plan to do here is jot down ideas I don’t have time to work on myself. Here’s the first installment in what will hopefully be a running series of “Research Gimme‘s”. Anybody who wants to run with this, I’d love to hear what you’re up to.

So…. who’s going to re-examine Online Aggregation in the Hadoop context? Goodness knows it’d be useful. It will require moving Hadoop beyond a slavish implementation of the Google MapReduce paper. That’s got to be a good thing… Here’s the start of the program:

Read More »