<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: First GigaOm post on data-parallelism is up</title>
	<atom:link href="http://databeta.wordpress.com/2008/11/08/first-gigaom-post-on-data-parallelism-is-up/feed/" rel="self" type="application/rss+xml" />
	<link>http://databeta.wordpress.com/2008/11/08/first-gigaom-post-on-data-parallelism-is-up/</link>
	<description>on computing and data .. in permanent beta</description>
	<lastBuildDate>Wed, 21 Oct 2009 05:02:21 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Siva</title>
		<link>http://databeta.wordpress.com/2008/11/08/first-gigaom-post-on-data-parallelism-is-up/#comment-16</link>
		<dc:creator>Siva</dc:creator>
		<pubDate>Wed, 25 Feb 2009 02:58:54 +0000</pubDate>
		<guid isPermaLink="false">http://databeta.wordpress.com/?p=17#comment-16</guid>
		<description>There&#039;s also the Active Data Repository project (http://www.cs.umd.edu/projects/hpsl/chaos/ResearchAreas/adr/) that proposed using a MapReduce-like paradigm to represent certain computations on multi-dimensional scientific data.</description>
		<content:encoded><![CDATA[<p>There&#8217;s also the Active Data Repository project (<a href="http://www.cs.umd.edu/projects/hpsl/chaos/ResearchAreas/adr/" rel="nofollow">http://www.cs.umd.edu/projects/hpsl/chaos/ResearchAreas/adr/</a>) that proposed using a MapReduce-like paradigm to represent certain computations on multi-dimensional scientific data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jmh</title>
		<link>http://databeta.wordpress.com/2008/11/08/first-gigaom-post-on-data-parallelism-is-up/#comment-15</link>
		<dc:creator>jmh</dc:creator>
		<pubDate>Tue, 24 Feb 2009 21:45:33 +0000</pubDate>
		<guid isPermaLink="false">http://databeta.wordpress.com/?p=17#comment-15</guid>
		<description>I&#039;m not sure how to define equivalent, but any discussion of parallel dataflow software should start with &lt;a href=&quot;http://portal.acm.org/citation.cfm?doid=129888.129894&quot; rel=&quot;nofollow&quot;&gt;Gray and DeWitt&#039;s CACM survey&lt;/a&gt; on parallel databases.  You can work backwards from there through the Gamma and Bubba projects.  More recently there&#039;s &lt;a href=&quot;http://portal.acm.org/citation.cfm?id=1272996.1273005&quot; rel=&quot;nofollow&quot;&gt;Dryad&lt;/a&gt; from Microsoft Research.  I keep meaning to read the &lt;a href=&quot;http://portal.acm.org/citation.cfm?id=1453856.1453865&quot; rel=&quot;nofollow&quot;&gt;Clustera&lt;/a&gt; work from Wisconsin.  And there&#039;s been a variety of academic work around Hadoop itself in the last year or two, including both internals issues and languages (Pig/JAQL/etc).  The Hadoop-centric stuff should be relatively easy to find with a web search.

MapReduce is not so far from data streams.  Our work on FLuX for &lt;a href=&quot;http://portal.acm.org/citation.cfm?id=1007568.1007662&quot; rel=&quot;nofollow&quot;&gt;Fault-tolerant&lt;/a&gt; and &lt;a href=&quot;http://portal.acm.org/citation.cfm?id=375663.375773&quot; rel=&quot;nofollow&quot;&gt;Load-balanced&lt;/a&gt; parallel data streams came out about the same time as Google&#039;s MapReduce, and it&#039;s interesting to compare them.  This grew out of earlier work by the &lt;a href=&quot;http://pages.cs.wisc.edu/~remzi/&quot; rel=&quot;nofollow&quot;&gt;Arpaci&lt;/a&gt;-&lt;a href=&quot;http://pages.cs.wisc.edu/~dusseau/&quot; rel=&quot;nofollow&quot;&gt;Dusseaus&lt;/a&gt; on &lt;a href=&quot;http://portal.acm.org/citation.cfm?id=592637.592639&quot; rel=&quot;nofollow&quot;&gt;River&lt;/a&gt; and &lt;a href=&quot;http://now.cs.berkeley.edu/NowSort/&quot; rel=&quot;nofollow&quot;&gt;NOW-Sort&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure how to define equivalent, but any discussion of parallel dataflow software should start with <a href="http://portal.acm.org/citation.cfm?doid=129888.129894" rel="nofollow">Gray and DeWitt&#8217;s CACM survey</a> on parallel databases.  You can work backwards from there through the Gamma and Bubba projects.  More recently there&#8217;s <a href="http://portal.acm.org/citation.cfm?id=1272996.1273005" rel="nofollow">Dryad</a> from Microsoft Research.  I keep meaning to read the <a href="http://portal.acm.org/citation.cfm?id=1453856.1453865" rel="nofollow">Clustera</a> work from Wisconsin.  And there&#8217;s been a variety of academic work around Hadoop itself in the last year or two, including both internals issues and languages (Pig/JAQL/etc).  The Hadoop-centric stuff should be relatively easy to find with a web search.</p>
<p>MapReduce is not so far from data streams.  Our work on FLuX for <a href="http://portal.acm.org/citation.cfm?id=1007568.1007662" rel="nofollow">Fault-tolerant</a> and <a href="http://portal.acm.org/citation.cfm?id=375663.375773" rel="nofollow">Load-balanced</a> parallel data streams came out about the same time as Google&#8217;s MapReduce, and it&#8217;s interesting to compare them.  This grew out of earlier work by the <a href="http://pages.cs.wisc.edu/~remzi/" rel="nofollow">Arpaci</a>-<a href="http://pages.cs.wisc.edu/~dusseau/" rel="nofollow">Dusseaus</a> on <a href="http://portal.acm.org/citation.cfm?id=592637.592639" rel="nofollow">River</a> and <a href="http://now.cs.berkeley.edu/NowSort/" rel="nofollow">NOW-Sort</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Siva</title>
		<link>http://databeta.wordpress.com/2008/11/08/first-gigaom-post-on-data-parallelism-is-up/#comment-14</link>
		<dc:creator>Siva</dc:creator>
		<pubDate>Tue, 24 Feb 2009 19:09:41 +0000</pubDate>
		<guid isPermaLink="false">http://databeta.wordpress.com/?p=17#comment-14</guid>
		<description>It would be great if you could document equivalent concepts to MapReduce that have been explored in academia.</description>
		<content:encoded><![CDATA[<p>It would be great if you could document equivalent concepts to MapReduce that have been explored in academia.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
