<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Hyperbio &#187; Cluster</title>
	<atom:link href="http://hyperbio.net/category/cluster/feed/" rel="self" type="application/rss+xml" />
	<link>http://hyperbio.net</link>
	<description>Startups and running</description>
	<lastBuildDate>Tue, 17 Jan 2012 12:06:31 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>eBay&#8217;s enormous data warehouses</title>
		<link>http://hyperbio.net/2009/04/30/ebays-enormous-data-warehouses/</link>
		<comments>http://hyperbio.net/2009/04/30/ebays-enormous-data-warehouses/#comments</comments>
		<pubDate>Thu, 30 Apr 2009 14:39:08 +0000</pubDate>
		<dc:creator>Leila Boujnane</dc:creator>
				<category><![CDATA[Cluster]]></category>

		<guid isPermaLink="false">http://hyperbio.net/?p=970</guid>
		<description><![CDATA[Curt Monash meets with Ebay&#8217;s Oliver Ratzesberger and gets us numbers on two of the world&#8217;s largest data warehouses in the world. Look at these Ebay stats! Metrics on eBay’s main Teradata data warehouse include: &#62;2 petabytes of user data 10s of 1000s of users Millions of queries per day 72 nodes &#62;140 GB/sec of [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.monash.com/curtbio.html" target="_blank">Curt Monash</a> meets with Ebay&#8217;s Oliver Ratzesberger and <a href="http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/" target="_blank">gets us numbers on two of the world&#8217;s largest data warehouses in the world</a>. Look at these Ebay stats!</p>
<p style="margin-bottom: 0in;">Metrics on eBay’s main Teradata data warehouse include:</p>
<ul>
<li><strong>&gt;2 petabytes of user data</strong></li>
<li><strong>10s of 1000s of users</strong></li>
<li><strong>Millions of queries per day</strong></li>
<li>72 nodes</li>
<li>&gt;140 GB/sec of I/O, or <strong>2 	GB/node/sec,</strong> or maybe that’s a peak when the workload is 	scan-heavy</li>
<li>100s of production databases being 	fed in</li>
</ul>
<p style="margin-bottom: 0in;">Metrics on eBay’s Greenplum data warehouse (or, if you like, data mart) include:</p>
<ul>
<li><strong>6  1/2 petabytes of user data</strong></li>
<li><strong>17 trillion records</strong></li>
<li><strong>150 billion new records/day, </strong><span>which seems to suggest an 	ingest rate well over </span><strong>50 terabytes/day</strong></li>
<li>96 nodes</li>
<li><strong>200 MB/node/sec</strong> of I/O 	(that’s the order of magnitude difference that triggered my post on 	disk drives)</li>
<li>4.5 petabytes of storage</li>
<li>70% compression</li>
<li>A small number of concurrent users</li>
</ul>
<p>More <a href="http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/" target="_blank">details</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://hyperbio.net/2009/04/30/ebays-enormous-data-warehouses/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

