<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DATAVERSITY &#187; 2011 &#187; September</title>
	<atom:link href="http://www.dataversity.net/2011/09/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataversity.net</link>
	<description></description>
	<lastBuildDate>Mon, 20 May 2013 19:59:15 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Webinar: Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data</title>
		<link>http://www.dataversity.net/webinar-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data-2/</link>
		<comments>http://www.dataversity.net/webinar-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data-2/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 22:33:24 +0000</pubDate>
		<dc:creator>Shannon Kempe</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Information Quality]]></category>
		<category><![CDATA[On Demand Webinars]]></category>
		<category><![CDATA[Webinars]]></category>
		<category><![CDATA[aparna vani]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[dan defend]]></category>
		<category><![CDATA[data quality]]></category>
		<category><![CDATA[information quality]]></category>
		<category><![CDATA[video]]></category>
		<category><![CDATA[webinar]]></category>
		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5972</guid>
		<description><![CDATA[Data Quality Challenges &#38; Solution Approaches in Yahoo!’s Massive Data View more videos from DATAVERSITY This webinar was given in collaboration with: About the Webinar Data is Yahoo!&#8217;s most strategic assets &#8211; from user engagement and insights data to revenue and billing data. Three years ago, Yahoo! invested in a Data Quality program. By applying industry principles and techniques the Data Quality program has provided proactive and reactive system solutions to Audience data issues and root causes by addressing technical challenges of data quality at scale and engaging and leveraging the rest of the organization in the solution: from product teams all through the data stack (data sourcing, ETL, aggs and analytics) to analysts and sciences teams who consume the data. This methodology is now being scaled to the all data across Yahoo! including Search and Display Advertising. This presentation covers: The solution methodology developed which builds in proactive and reactive DQ capabilities into Cloud-based products up-front and includes end-to-end data focus resulting in system improvements and fast issue resolution Solutions for technical challenges in the internet domain in Yahoo!&#8217;s massive data environment including end-to-end data monitoring and alerting, abuse and robot traffic detection, and latency vs. accuracy The DQ [...]]]></description>
				<content:encoded><![CDATA[<div style="width:425px" id="__ss_9492852"> <strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/Dataversity/data-quality-challenges-solution-approaches-in-yahoos-massive-data-9492852" title="Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data" target="_blank">Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9492852" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe>
<div style="padding:5px 0 12px"> View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a> </div>
</p></div>
<h2>
<h2>
<h2>This webinar was given in collaboration with:</h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Yahoo-Logo1.png"><img class="size-medium wp-image-5975 aligncenter" title="Yahoo Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/Yahoo-Logo1-300x184.png" alt="" width="300" height="184" /></a></p>
<h2><span style="color: #16416f;"><strong>About the Webinar</strong></span><strong></strong></h2>
<p><strong>Data is Yahoo!&#8217;s most strategic assets &#8211; from user engagement and insights data to revenue and billing data. Three years ago, Yahoo! invested in a Data Quality program.</strong></p>
<p><strong>By applying industry principles and techniques the Data Quality program has provided proactive and reactive system solutions to Audience data issues and root causes by addressing technical challenges of data quality at scale and engaging and leveraging the rest of the organization in the solution: from product teams all through the data stack (data sourcing, ETL, aggs and analytics) to analysts and sciences teams who consume the data. This methodology is now being scaled to the all data across Yahoo! including Search and Display Advertising.</strong></p>
<p><strong>This presentation covers:</strong></p>
<ul>
<li><strong>The solution methodology developed which builds in proactive and reactive DQ capabilities into Cloud-based products up-front and includes end-to-end data focus resulting in system improvements and fast issue resolution</strong></li>
<li><strong>Solutions for technical challenges in the internet domain in Yahoo!&#8217;s massive data environment including end-to-end data monitoring and alerting, abuse and robot traffic detection, and latency vs. accuracy</strong></li>
<li><strong>The DQ program approach to scale across all Yahoo! data that uses a central and embedded-in-the-businesses model with a strong focus on customer engagement</strong></li>
</ul>
<p>&nbsp;</p>
<h2><span style="color: #16416f;"><strong>About the Speakers</strong></span><strong></strong></h2>
<p><strong>Dan Defend &amp; Aparna Vani</strong></p>
<p><strong><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Dan-Defend1.jpg"><img class="alignleft size-full wp-image-5973" title="Dan Defend" src="http://www.dataversity.net/wp-content/uploads/2011/09/Dan-Defend1.jpg" alt="" width="76" height="95" /></a>Dan Defend earned his Masters in Computer Science from University of Illinois. He has experience at Motorola as an Engineering Manager first in Unix OS and then in embedded cell phone software where he also led analysis and data-driven improvement using Digital Six Sigma methodology. He is currently leading the Data Quality program at Yahoo! where significant improvements are underway involving centralized monitoring and data cleansing across Yahoo!’s audience data pipeline and relying heavily on organizational leverage and distributed ownership and accountability.</strong></p>
<p>&nbsp;</p>
<p><strong><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Aparna-Vani1.jpg"><img class="alignleft size-full wp-image-5974" title="Aparna Vani" src="http://www.dataversity.net/wp-content/uploads/2011/09/Aparna-Vani1.jpg" alt="" width="76" height="85" /></a>Aparna Vani earned her Masters in ECE from University of Houston. She worked as Hardware Design engineer at Compaq and development lead at TVGuide. She has versatile experience as test designer and architecture at Motorola. Currently, she is working as Chief DQ Architect at Yahoo, responsible for data quality strategy and design, working on global organization wide projects like robot filtering and bcookie churn.</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/webinar-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Slides: Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data</title>
		<link>http://www.dataversity.net/slides-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data/</link>
		<comments>http://www.dataversity.net/slides-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 21:32:01 +0000</pubDate>
		<dc:creator>Shannon Kempe</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Information Quality]]></category>
		<category><![CDATA[Slide Presentations]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[data quality]]></category>
		<category><![CDATA[information quality]]></category>
		<category><![CDATA[presentation]]></category>
		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5963</guid>
		<description><![CDATA[Data Quality Challenges &#38; Solution Approaches in Yahoo!’s Massive Data View more presentations from DATAVERSITY To view the recording of this webinar, click HERE. This presentation was given in collaboration with: About the Webinar Data is Yahoo!&#8217;s most strategic assets &#8211; from user engagement and insights data to revenue and billing data. Three years ago, Yahoo! invested in a Data Quality program. By applying industry principles and techniques the Data Quality program has provided proactive and reactive system solutions to Audience data issues and root causes by addressing technical challenges of data quality at scale and engaging and leveraging the rest of the organization in the solution: from product teams all through the data stack (data sourcing, ETL, aggs and analytics) to analysts and sciences teams who consume the data. This methodology is now being scaled to the all data across Yahoo! including Search and Display Advertising. This presentation covers: The solution methodology developed which builds in proactive and reactive DQ capabilities into Cloud-based products up-front and includes end-to-end data focus resulting in system improvements and fast issue resolution Solutions for technical challenges in the internet domain in Yahoo!&#8217;s massive data environment including end-to-end data monitoring and alerting, abuse and [...]]]></description>
				<content:encoded><![CDATA[<div id="__ss_9492309" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data" href="http://www.slideshare.net/Dataversity/data-quality-challenges-solution-approaches-in-yahoos-massive-data" target="_blank">Data Quality Challenges &amp; Solution Approaches in Yahoo!’s Massive Data</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9492309" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></p>
<div style="padding: 5px 0 12px;">View more presentations from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
</div>
<h3>To view the recording of this webinar, click <span style="text-decoration: underline; color: #0000ff;"><strong><a title="Webinar: Data Quality Challenges &amp; Solution Approaches in Yahoo's Massive Data" href="http://www.dataversity.net/archives/5972"><span style="color: #0000ff; text-decoration: underline;">HERE</span></a></strong></span>.</h3>
<h3 style="text-align: center;"></h3>
<h3 style="text-align: center;"><strong>This presentation was given in collaboration with:</strong></h3>
<h2><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Yahoo-Logo.png"><img class="size-medium wp-image-5966 aligncenter" title="Yahoo Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/Yahoo-Logo-300x184.png" alt="" width="300" height="184" /></a></h2>
<h2><span style="color: #16416f;"><strong>About the Webinar</strong></span><strong></strong></h2>
<p><strong>Data is Yahoo!&#8217;s most strategic assets &#8211; from user engagement and insights data to revenue and billing data. Three years ago, Yahoo! invested in a Data Quality program.</strong></p>
<p><strong>By applying industry principles and techniques the Data Quality program has provided proactive and reactive system solutions to Audience data issues and root causes by addressing technical challenges of data quality at scale and engaging and leveraging the rest of the organization in the solution: from product teams all through the data stack (data sourcing, ETL, aggs and analytics) to analysts and sciences teams who consume the data. This methodology is now being scaled to the all data across Yahoo! including Search and Display Advertising.</strong></p>
<p><strong>This presentation covers:</strong></p>
<ul>
<li><strong>The solution methodology developed which builds in proactive and reactive DQ capabilities into Cloud-based products up-front and includes end-to-end data focus resulting in system improvements and fast issue resolution</strong></li>
<li><strong>Solutions for technical challenges in the internet domain in Yahoo!&#8217;s massive data environment including end-to-end data monitoring and alerting, abuse and robot traffic detection, and latency vs. accuracy</strong></li>
<li><strong>The DQ program approach to scale across all Yahoo! data that uses a central and embedded-in-the-businesses model with a strong focus on customer engagement</strong></li>
</ul>
<p>&nbsp;</p>
<h2><span style="color: #16416f;"><strong>About the Speakers</strong></span><strong></strong></h2>
<p><strong>Dan Defend &amp; Aparna Vani</strong></p>
<p><strong><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Dan-Defend.jpg"><img class="alignleft size-full wp-image-5964" title="Dan Defend" src="http://www.dataversity.net/wp-content/uploads/2011/09/Dan-Defend.jpg" alt="" width="89" height="112" /></a>Dan Defend earned his Masters in Computer Science from University of Illinois. He has experience at Motorola as an Engineering Manager first in Unix OS and then in embedded cell phone software where he also led analysis and data-driven improvement using Digital Six Sigma methodology. He is currently leading the Data Quality program at Yahoo! where significant improvements are underway involving centralized monitoring and data cleansing across Yahoo!’s audience data pipeline and relying heavily on organizational leverage and distributed ownership and accountability.</strong></p>
<p>&nbsp;</p>
<p><strong><a href="http://www.dataversity.net/wp-content/uploads/2011/09/Aparna-Vani.jpg"><img class="alignleft size-full wp-image-5965" title="Aparna Vani" src="http://www.dataversity.net/wp-content/uploads/2011/09/Aparna-Vani.jpg" alt="" width="87" height="97" /></a>Aparna Vani earned her Masters in ECE from University of Houston. She worked as Hardware Design engineer at Compaq and development lead at TVGuide. She has versatile experience as test designer and architecture at Motorola. Currently, she is working as Chief DQ Architect at Yahoo, responsible for data quality strategy and design, working on global organization wide projects like robot filtering and bcookie churn.</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/slides-data-quality-challenges-solution-approaches-in-yahoo%e2%80%99s-massive-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bending the Rules of Data Governance</title>
		<link>http://www.dataversity.net/bending-the-rules-of-data-governance/</link>
		<comments>http://www.dataversity.net/bending-the-rules-of-data-governance/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 16:48:58 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Governance and Quality]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Project Management]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[Aristotle]]></category>
		<category><![CDATA[Barry Schartz]]></category>
		<category><![CDATA[bend the ruler]]></category>
		<category><![CDATA[bending the rules]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[dg strategy]]></category>
		<category><![CDATA[effective dg]]></category>
		<category><![CDATA[flexibility]]></category>
		<category><![CDATA[Kenneth Sharpe]]></category>
		<category><![CDATA[Practical Wisdon]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5961</guid>
		<description><![CDATA[by Angela Guess A new article looks at how data governance efforts might benefit from a bit of flexibility. It begins, “Data governance requires the coordination of a complex combination of a myriad of factors, including executive sponsorship, funding, decision rights, arbitration of conflicting priorities… But sometimes this emphasis on enforcing policies makes data governance sound like it’s all about rules. In their book Practical Wisdom, Barry Schwartz and Kenneth Sharpe use the Nicomachean Ethics of Aristotle as a guide to explain that although rules are important, what is more important is ‘knowing the proper thing to aim at in any practice, wanting to aim at it, having the skill to figure out how to achieve it in a particular context, and then doing it.’” The article adds, “Although there’s a tendency to ignore the existing practical wisdom of the organization, successful data governance is not about systematically applying rules or following rigid procedures, and precisely because the dynamic challenges faced, and overcome daily, by business analysts, data stewards, technical architects, and others, exemplify today’s constantly changing business world. But this doesn’t mean that effective data governance policies can’t be implemented.  It simply means that instead of focusing on who [...]]]></description>
				<content:encoded><![CDATA[<p><a title="Convento di San Girolamo in Gubbio" href="http://www.flickr.com/photos/58558794@N07/5434433491/" target="_blank"><img class="alignleft" style="border-width: 0px;border-color: currentColor;border-style: none" src="http://farm6.static.flickr.com/5099/5434433491_2a9c9c3609.jpg" alt="Convento di San Girolamo in Gubbio" width="400" height="228" border="0" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p><a href="http://www.information-management.com/blogs/data_governance_stewardship_BI-10021220-1.html">A new article</a> looks at how data governance efforts might benefit from a bit of flexibility. It begins, “Data governance requires the coordination of a complex combination of a myriad of factors, including executive sponsorship, funding, decision rights, arbitration of conflicting priorities… But sometimes this emphasis on enforcing policies makes data governance sound like it’s all about rules. In their book <em>Practical Wisdom</em>, Barry Schwartz and Kenneth Sharpe use the Nicomachean Ethics of Aristotle as a guide to explain that although rules are important, what is more important is ‘knowing the proper thing to aim at in any practice, wanting to aim at it, having the skill to figure out how to achieve it in a particular context, and then doing it.’”</p>
<p>The article adds, “Although there’s a tendency to ignore the existing practical wisdom of the organization, successful data governance is not about systematically applying rules or following rigid procedures, and precisely because the dynamic challenges faced, and overcome daily, by business analysts, data stewards, technical architects, and others, exemplify today’s constantly changing business world. But this doesn’t mean that effective data governance policies can’t be implemented.  It simply means that instead of focusing on who should lead the way (i.e., top-down or bottom-up), we should focus on what the rules of data governance are made of.”</p>
<p><a href="http://www.information-management.com/blogs/data_governance_stewardship_BI-10021220-1.html" target="_blank">Read more here.</a></p>
<p><a title="Attribution License" href="http://creativecommons.org/licenses/by/2.0/" target="_blank"><img src="http://www.dataversity.net/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" width="16" height="16" align="absmiddle" border="0" /></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a title="kladcat" href="http://www.flickr.com/photos/58558794@N07/5434433491/" target="_blank">kladcat</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/bending-the-rules-of-data-governance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Oracle Plans to Enter the Hadoop, NoSQL Spaces</title>
		<link>http://www.dataversity.net/oracle-plans-to-enter-the-hadoop-nosql-spaces-2/</link>
		<comments>http://www.dataversity.net/oracle-plans-to-enter-the-hadoop-nosql-spaces-2/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 16:46:22 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[big move]]></category>
		<category><![CDATA[documents]]></category>
		<category><![CDATA[evidence]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Larry Ellison]]></category>
		<category><![CDATA[Loader for Hadoop]]></category>
		<category><![CDATA[new NoSQL solution]]></category>
		<category><![CDATA[Oracle]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5958</guid>
		<description><![CDATA[by Angela Guess It appears that Oracle is planning to become a player in both the Hadoop and NoSQL spaces very soon. The article reports, “There has been speculation for a while now that Oracle might someday make its foray into the Hadoop and NoSQL spaces, and next week looks like that time. With regard to Hadoop, CEO Larry Ellison made it clear during last week’s earnings call that the company is working on a connector that will let customers load unstructured data from Hadoop into their Oracle Exadata appliances. Now we have proof — and Oracle’s big data plans don’t stop with Hadoop.” The author continues, “I’ve seen some Oracle-produced content highlighting the company’s plans for a big data platform, apparently slated for launch in the second half of 2012, that not only includes the Hadoop connector — called Oracle Loader for Hadoop — but also a NoSQL database. The goal, it seems, is to let customers acquire data from whatever sources they please and then feed it into an Oracle Exadata data warehouse system. Once there, data can be analyzed via number of means, including existing Oracle technologies such as in-database MapReduce, mining and statistical analysis with R.” [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/oracle_logo.jpg"><img class="alignleft size-medium wp-image-5959" src="http://www.dataversity.net/wp-content/uploads/2011/09/oracle_logo-300x120.jpg" alt="" width="300" height="120" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>It appears that <a href="http://gigaom.com/cloud/get-ready-for-oracles-takes-on-hadoop-nosql/">Oracle is planning to become a player in both the Hadoop and NoSQL spaces</a> very soon. The article reports, “There has been speculation for a while now that Oracle might someday make its foray into the Hadoop and NoSQL spaces, and next week looks like that time. With regard to Hadoop, CEO Larry Ellison made it clear during last week’s earnings call that the company is working on a connector that will let customers load unstructured data from Hadoop into their Oracle Exadata appliances. Now we have proof — and Oracle’s big data plans don’t stop with Hadoop.”</p>
<p>The author continues, “I’ve seen some Oracle-produced content highlighting the company’s plans for a big data platform, apparently slated for launch in the second half of 2012, that not only includes the Hadoop connector — called Oracle Loader for Hadoop — but also a NoSQL database. The goal, it seems, is to let customers acquire data from whatever sources they please and then feed it into an Oracle Exadata data warehouse system. Once there, data can be analyzed via number of means, including existing Oracle technologies such as in-database MapReduce, mining and statistical analysis with R.”</p>
<p>The article adds, “Reaffirming this information Thursday, Larry Dignan at ZDNet highlighted that the Oracle Loader for Hadoop is the topic of multiple sessions at next week’s Oracle OpenWorld conference as is ‘Oracle NoSQL Database.’ What remains to be seen, though, is how heavy Oracle — which has enabled Hadoop integration for some time, actually — will actually invest in Hadoop and NoSQL now that it appears interested in productizing them.”</p>
<p><a href="http://gigaom.com/cloud/get-ready-for-oracles-takes-on-hadoop-nosql/" target="_blank">Read more here.</a></p>
<p><em>photo credit: Oracle</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/oracle-plans-to-enter-the-hadoop-nosql-spaces-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Easier Moves with Cloud Data Centers</title>
		<link>http://www.dataversity.net/easier-moves-with-cloud-data-centers/</link>
		<comments>http://www.dataversity.net/easier-moves-with-cloud-data-centers/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 16:43:49 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cloud-Based Data]]></category>
		<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Charles Babcock]]></category>
		<category><![CDATA[cloud data center]]></category>
		<category><![CDATA[data centers]]></category>
		<category><![CDATA[telecommunications]]></category>
		<category><![CDATA[trend]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5954</guid>
		<description><![CDATA[by Angela Guess Charles Babcock recently looked at the trends surrounding data centers and asked the question, “What do you get when you marry cloud data centers to a telecommunications company? My answer is: the beginnings of a cloud network, a chain of linked data centers that in some cases bring two or more data centers into a position of backing each other up. That&#8217;s something many enterprises would value as they move workloads into the cloud. The service freeze in Amazon Web Services&#8217; northern Virginia center over the Easter weekend served as a reminder of the value of geographic distribution when it comes to backup and recovery.” Babcock continues, “I hope Amazon Web Services is studying the trend. If you were an AWS customer in US East-1 (northern Va.), you couldn&#8217;t easily designate the AWS data center in Dublin as your preferred failover site. You couldn&#8217;t even select AWS&#8217; US West data center, unless you constructed the network links to it yourself. You were stuck using a neighboring Amazon &#8220;availability zone,&#8221; which, it turns out April 22-24, was not necessarily in a separate data center and in some cases froze up at the same time as your primary zone [...]]]></description>
				<content:encoded><![CDATA[<p><a title="From Brockhaus and Efron Encyclopedic Dictionary" href="http://www.flickr.com/photos/49879584@N00/4323862715/" target="_blank"><img class="alignleft" style="border-width: 0px;border-color: currentColor;border-style: none" src="http://farm3.static.flickr.com/2763/4323862715_cdd3920fa2.jpg" alt="From Brockhaus and Efron Encyclopedic Dictionary" width="400" height="398" border="0" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>Charles Babcock recently looked at the <a href="http://www.informationweek.com/news/cloud-computing/infrastructure/231602424">trends surrounding data centers</a> and asked the question, “What do you get when you marry cloud data centers to a telecommunications company? My answer is: the beginnings of a cloud network, a chain of linked data centers that in some cases bring two or more data centers into a position of backing each other up. That&#8217;s something many enterprises would value as they move workloads into the cloud. The service freeze in Amazon Web Services&#8217; northern Virginia center over the Easter weekend served as a reminder of the value of geographic distribution when it comes to backup and recovery.”</p>
<p>Babcock continues, “I hope Amazon Web Services is studying the trend. If you were an AWS customer in US East-1 (northern Va.), you couldn&#8217;t easily designate the AWS data center in Dublin as your preferred failover site. You couldn&#8217;t even select AWS&#8217; US West data center, unless you constructed the network links to it yourself. You were stuck using a neighboring Amazon &#8220;availability zone,&#8221; which, it turns out April 22-24, was not necessarily in a separate data center and in some cases froze up at the same time as your primary zone did. Automated backup to a separate location, however, is a definite possibility, given linked chains in the cloud.”</p>
<p><a href="http://www.informationweek.com/news/cloud-computing/infrastructure/231602424" target="_blank">Read more here.</a></p>
<p><a title="Attribution License" href="http://creativecommons.org/licenses/by/2.0/" target="_blank"><img src="http://www.dataversity.net/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" width="16" height="16" align="absmiddle" border="0" /></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a title="Double--M (formerly DoubleM2)" href="http://www.flickr.com/photos/49879584@N00/4323862715/" target="_blank">Double&#8211;M (formerly DoubleM2)</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/easier-moves-with-cloud-data-centers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Job of the Day: Vice President, Enterprise Data Management</title>
		<link>http://www.dataversity.net/data-job-of-the-day-vice-president-enterprise-data-management/</link>
		<comments>http://www.dataversity.net/data-job-of-the-day-vice-president-enterprise-data-management/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 16:41:32 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Data Job of the Day]]></category>
		<category><![CDATA[Job of the Day]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[data job]]></category>
		<category><![CDATA[Hilton Worldwide]]></category>
		<category><![CDATA[it jobs]]></category>
		<category><![CDATA[jobs in data]]></category>
		<category><![CDATA[Tysons Corner]]></category>
		<category><![CDATA[VA]]></category>
		<category><![CDATA[Vice President of Enterprise Data Management]]></category>
		<category><![CDATA[Virginia]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5951</guid>
		<description><![CDATA[by Angela Guess Hilton Worldwide is searching for a Vice President of Enterprise Data Management in Tysons Corner, VA. The post states, “Responsible for ensuring that the data assets of Hilton Worldwide are supported by a defined and communicated architecture supporting the organization in achieving its strategic goals.  The position requires senior-level expertise in all aspects of Data Management including Master Data Management (MDM), Business Intelligence (BI), Data Warehouse, Database Management and Integration.  He/she will set and govern enterprise data standards, including managing the overall data governance processes.” It continues, “Key artifacts would include Enterprise-level Logical Data modeling, Physical Data modeling, development of a data strategy and associated polices, and selection of capabilities and systems to meet business information needs.  This Vice President is technically responsible for the management, planning, design and implementation of enterprise Data Architecture and Business Intelligence domains to ensure alignment of total Enterprise Architecture with business strategy – tools, guidelines, principles, policies and governance.” Learn more and apply here. photo credit: Hilton]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/hilton_logo.jpg"><img class="alignleft size-medium wp-image-5952" src="http://www.dataversity.net/wp-content/uploads/2011/09/hilton_logo-300x206.jpg" alt="" width="300" height="206" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>Hilton Worldwide is searching for a <a href="https://careers.hilton.com/psc/hrprd/EMPLOYEE/HRMS/c/HRS_HRAM.HRS_CE.GBL?JobOpeningId=73186&amp;SiteId=&amp;Page=HRS_CE_JOB_DTL&amp;">Vice President of Enterprise Data Management</a> in Tysons Corner, VA. The post states, “Responsible for ensuring that the data assets of Hilton Worldwide are supported by a defined and communicated architecture supporting the organization in achieving its strategic goals.  The position requires senior-level expertise in all aspects of Data Management including Master Data Management (MDM), Business Intelligence (BI), Data Warehouse, Database Management and Integration.  He/she will set and govern enterprise data standards, including managing the overall data governance processes.”</p>
<p>It continues, “Key artifacts would include Enterprise-level Logical Data modeling, Physical Data modeling, development of a data strategy and associated polices, and selection of capabilities and systems to meet business information needs.  This Vice President is technically responsible for the management, planning, design and implementation of enterprise Data Architecture and Business Intelligence domains to ensure alignment of total Enterprise Architecture with business strategy – tools, guidelines, principles, policies and governance.”</p>
<p><a href="https://careers.hilton.com/psc/hrprd/EMPLOYEE/HRMS/c/HRS_HRAM.HRS_CE.GBL?JobOpeningId=73186&amp;SiteId=&amp;Page=HRS_CE_JOB_DTL&amp;" target="_blank">Learn more and apply here.</a></p>
<p><em>photo credit: Hilton</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/data-job-of-the-day-vice-president-enterprise-data-management/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>NoSQL Job of the Day: Senior API Developer</title>
		<link>http://www.dataversity.net/nosql-job-of-the-day-senior-api-developer/</link>
		<comments>http://www.dataversity.net/nosql-job-of-the-day-senior-api-developer/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 16:39:24 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Job of the Day]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[NoSQL Job of the Day]]></category>
		<category><![CDATA[Memphis]]></category>
		<category><![CDATA[Mimeo]]></category>
		<category><![CDATA[New York]]></category>
		<category><![CDATA[NoSQL job]]></category>
		<category><![CDATA[NY]]></category>
		<category><![CDATA[Seattle]]></category>
		<category><![CDATA[Senior API Developer]]></category>
		<category><![CDATA[Tennessee]]></category>
		<category><![CDATA[TN]]></category>
		<category><![CDATA[WA]]></category>
		<category><![CDATA[Washington]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5948</guid>
		<description><![CDATA[by Angela Guess Mimeo is looking for a Senior API Developer in New York City, Memphis, or Seattle. According to the post, “The successful candidate will have strong service-oriented design and development expertise as well as experience developing highly scalable, multi-tier web systems. He or she will work closely with other internal development teams, so strong interpersonal skills are important to build relationships with peers and other teams to achieve objectives, maintain objectivity, and give and welcome feedback on design/implementation decisions.” Qualifications for the position include the following: “Deep experience in Microsoft technologies including C#, .NET BCL, LINQ, ASP.NET, WCF, SQL and MSMQ. At least 5 years of experience with web technologies such as JSON, XML, SOAP. Fluent in object oriented design and design patterns. Adept at SQL Server, including ability to understand schemas, write queries, and stored procedures. Experience with parallel computing topics and Microsoft’s PLINQ/TPL APIs. Experience with Unit Testing and Behavior Driven Design. Experience with PaaS or IaaS Platforms such as Microsoft Azure or Amazon Web Services and EC2. Familiarity with capabilities of other service platform frameworks such as PHP, Ruby, Java. Experience with NoSQL solutions.” Learn more and apply here. photo credit: Mimeo]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/mimeo-logo.jpg"><img class="alignleft size-medium wp-image-5949" src="http://www.dataversity.net/wp-content/uploads/2011/09/mimeo-logo-300x108.jpg" alt="" width="300" height="108" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>Mimeo is looking for a <a href="http://tbe.taleo.net/NA8/ats/careers/requisition.jsp?org=MIMEO&amp;cws=1&amp;rid=200&amp;source=Indeed">Senior API Developer</a> in New York City, Memphis, or Seattle. According to the post, “The successful candidate will have strong service-oriented design and development expertise as well as experience developing highly scalable, multi-tier web systems. He or she will work closely with other internal development teams, so strong interpersonal skills are important to build relationships with peers and other teams to achieve objectives, maintain objectivity, and give and welcome feedback on design/implementation decisions.”</p>
<p>Qualifications for the position include the following: “Deep experience in Microsoft technologies including C#, .NET BCL, LINQ, ASP.NET, WCF, SQL and MSMQ. At least 5 years of experience with web technologies such as JSON, XML, SOAP. Fluent in object oriented design and design patterns. Adept at SQL Server, including ability to understand schemas, write queries, and stored procedures. Experience with parallel computing topics and Microsoft’s PLINQ/TPL APIs. Experience with Unit Testing and Behavior Driven Design. Experience with PaaS or IaaS Platforms such as Microsoft Azure or Amazon Web Services and EC2. Familiarity with capabilities of other service platform frameworks such as PHP, Ruby, Java. Experience with NoSQL solutions.”</p>
<p><a href="http://tbe.taleo.net/NA8/ats/careers/requisition.jsp?org=MIMEO&amp;cws=1&amp;rid=200&amp;source=Indeed" target="_blank">Learn more and apply here.</a></p>
<p><em>photo credit: Mimeo</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/nosql-job-of-the-day-senior-api-developer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Management &amp; Application Problems</title>
		<link>http://www.dataversity.net/data-management-application-problems/</link>
		<comments>http://www.dataversity.net/data-management-application-problems/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 17:20:40 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Governance and Quality]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Project Management]]></category>
		<category><![CDATA[data applications]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[data mangement]]></category>
		<category><![CDATA[Data Warehousing]]></category>
		<category><![CDATA[good data management]]></category>
		<category><![CDATA[Loraine Lawson]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5944</guid>
		<description><![CDATA[by Angela Guess Loraine Lawson recently reported on how good data management has been linked to better performance of data applications. She writes, “Primarily, when you think about data integration you think about, well, data – not the applications. In part, that&#8217;s because the way data is processed tends to separate out the two — you pull the data into a data warehouse, data mart, or whatever (hence, the ‘extract’ in ETL) — and then you do whatever it is you want to do and you load it. But it&#8217;s not really so cut and dried, is it? Applications are intimately tied to data creation and use. What&#8217;s more, a recent study by Ovum found that good data management may be critical to the performance of your applications.” Lawson continues, “Ovum surveyed IT executives at 146 large enterprises in North America, Australia and the UK on application performance and management. The survey points to a strong connection between bad management of data and problems with applications. Specifically, 85 percent of the companies complained of application performance problems and the leading culprits trace back to bad data practices such as a lack of standardization, inadequate archiving practices and too many point [...]]]></description>
				<content:encoded><![CDATA[<p><a title="iMac actually being used (1)" href="http://www.flickr.com/photos/83542829@N00/2721721645/" target="_blank"><img class="alignleft" style="border-width: 0px;border-color: currentColor;border-style: none" src="http://farm4.static.flickr.com/3265/2721721645_3467d33c96.jpg" alt="iMac actually being used (1)" width="400" height="266" border="0" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>Loraine Lawson <a href="http://www.itbusinessedge.com/cm/blogs/lawson/application-problems-yet-another-reason-to-focus-on-data-management/?cs=48714">recently reported on</a> how good data management has been linked to better performance of data applications. She writes, “Primarily, when you think about data integration you think about, well, data – not the applications. In part, that&#8217;s because the way data is processed tends to separate out the two — you pull the data into a data warehouse, data mart, or whatever (hence, the ‘extract’ in ETL) — and then you do whatever it is you want to do and you load it. But it&#8217;s not really so cut and dried, is it? Applications are intimately tied to data creation and use. What&#8217;s more, a recent study by Ovum found that good data management may be critical to the performance of your applications.”</p>
<p>Lawson continues, “Ovum surveyed IT executives at 146 large enterprises in North America, Australia and the UK on application performance and management. The survey points to a strong connection between bad management of data and problems with applications. Specifically, 85 percent of the companies complained of application performance problems and the leading culprits trace back to bad data practices such as a lack of standardization, inadequate archiving practices and too many point interfaces. The study also found that 20-30 percent of data is duplicated across applications, which increases application maintenance costs and creates problems for data migration, synchronization and retention.”</p>
<p><a href="http://www.itbusinessedge.com/cm/blogs/lawson/application-problems-yet-another-reason-to-focus-on-data-management/?cs=48714" target="_blank">Read more here.</a></p>
<p><a title="Attribution-ShareAlike License" href="http://creativecommons.org/licenses/by-sa/2.0/" target="_blank"><img src="http://www.dataversity.net/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" width="16" height="16" align="absmiddle" border="0" /></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a title="William Hook" href="http://www.flickr.com/photos/83542829@N00/2721721645/" target="_blank">William Hook</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/data-management-application-problems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting Started with Data Governance</title>
		<link>http://www.dataversity.net/getting-started-with-data-governance/</link>
		<comments>http://www.dataversity.net/getting-started-with-data-governance/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 17:18:20 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Governance and Quality]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[advice]]></category>
		<category><![CDATA[data governance]]></category>
		<category><![CDATA[DG program]]></category>
		<category><![CDATA[getting started]]></category>
		<category><![CDATA[implementing]]></category>
		<category><![CDATA[Michael Stiffler]]></category>
		<category><![CDATA[pointers]]></category>
		<category><![CDATA[tips]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5942</guid>
		<description><![CDATA[by Angela Guess Michael Stiffler recently shared some pointers to keep in mind while implementing a data governance program. First off, Stiffler notes, “Data governance is an iterative program.  You need to set the expectation that the program will evolve, especially as the company&#8217;s business requirements evolve. For instance, you may have to adjust your framework to include other areas that are now important to the business. Or perhaps you understand the corporate culture a little better now and need to adjust your approach to increase your chances of being successful.” He adds, “Don&#8217;t do data governance just for the sake of doing data governance. You need to understand the key business drivers, and tie data governance to activities that will actually increase business value. What business pains can be mitigated through data governance? How should they be prioritized? These are questions you&#8217;ll need to answer.” One question to ask is, “Are you able to measure success? How does the business define success? Dashboards and scorecards are one way of measuring and showing improvements to your data (which is one definition of success). This may also resonate better with the business folks, especially if you have gotten their input in [...]]]></description>
				<content:encoded><![CDATA[<p><a title="Start Starting Line Americorps Cinema Service Night Wilcox Park May 20, 20117" href="http://www.flickr.com/photos/10506540@N07/5749192025/" target="_blank"><img class="alignleft" style="border-width: 0px;border-color: currentColor;border-style: none" src="http://farm3.static.flickr.com/2125/5749192025_17150e9e6c.jpg" alt="Start Starting Line Americorps Cinema Service Night Wilcox Park May 20, 20117" width="400" height="267" border="0" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p>Michael Stiffler <a href="http://blogs.trilliumsoftware.com/trilliuminsights/2011/09/what-works-when-planning-to-operationalize-data-governance.html">recently shared some pointers</a> to keep in mind while implementing a data governance program. First off, Stiffler notes, “Data governance is an iterative program.  You need to set the expectation that the program will evolve, especially as the company&#8217;s business requirements evolve. For instance, you may have to adjust your framework to include other areas that are now important to the business. Or perhaps you understand the corporate culture a little better now and need to adjust your approach to increase your chances of being successful.”</p>
<p>He adds, “Don&#8217;t do data governance just for the sake of doing data governance. You need to understand the key business drivers, and tie data governance to activities that will actually increase business value. What business pains can be mitigated through data governance? How should they be prioritized? These are questions you&#8217;ll need to answer.”</p>
<p>One question to ask is, “Are you able to measure success? How does the business define success? Dashboards and scorecards are one way of measuring and showing improvements to your data (which is one definition of success). This may also resonate better with the business folks, especially if you have gotten their input in defining ‘success.’”</p>
<p><a href="http://blogs.trilliumsoftware.com/trilliuminsights/2011/09/what-works-when-planning-to-operationalize-data-governance.html" target="_blank">Read more here.</a></p>
<p><a title="Attribution License" href="http://creativecommons.org/licenses/by/2.0/" target="_blank"><img src="http://www.dataversity.net/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" width="16" height="16" align="absmiddle" border="0" /></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a title="stevendepolo" href="http://www.flickr.com/photos/10506540@N07/5749192025/" target="_blank">stevendepolo</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/getting-started-with-data-governance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Methods for Tackling Big Data Security</title>
		<link>http://www.dataversity.net/new-methods-for-tackling-big-data-security/</link>
		<comments>http://www.dataversity.net/new-methods-for-tackling-big-data-security/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 17:16:18 +0000</pubDate>
		<dc:creator>A.R. Guess</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Data Daily]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[big data security]]></category>
		<category><![CDATA[data security]]></category>
		<category><![CDATA[David Loshin]]></category>
		<category><![CDATA[interview]]></category>
		<category><![CDATA[new methods]]></category>
		<category><![CDATA[secuirty issues]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=5940</guid>
		<description><![CDATA[by Angela Guess In a recent interview David Loshin discussed emerging methods for handling Big Data security issues. Loshin said, “I had an interesting conversation a few weeks back about whether data in the cloud is better protected than data that&#8217;s sitting in your own systems. We drilled down into that in the context of big data, such as that managed by Hadoop and those types of frameworks; we discussed the fact that the people who are developing Hadoop applications or MapReduce applications are developers. They are presuming access to the data, and the data sitting out on a collection of nodes and in some massively parallel configurations &#8212; presumably, that&#8217;s data in an uncontrolled environment… and there is even greater opportunity for exposure.” Loshin continued, “Essentially, the data needs to be moved over to the framework, be it Hadoop or whatever. It&#8217;s then exposed as the analysis is being done, and then the results are integrated back in, or reconnected back to, for example, some traditional data warehouse or business intelligence framework. When you&#8217;re looking at large collections of data, there&#8217;s the potential once again for lots of data being exposed. On the other hand, if you&#8217;re creating a [...]]]></description>
				<content:encoded><![CDATA[<p><a title="Equinox." href="http://www.flickr.com/photos/40436304@N07/6193334331/" target="_blank"><img class="alignleft" style="border-width: 0px;border-color: currentColor;border-style: none" src="http://farm7.static.flickr.com/6175/6193334331_929becc04b.jpg" alt="Equinox." width="350" height="234" border="0" /></a>by <a href="http://www.dataversity.net/contributors/angela-guess">Angela Guess</a></p>
<p><a href="http://tdwi.org/articles/2011/09/20/Big-Data-Security-2.aspx">In a recent interview</a> David Loshin discussed emerging methods for handling Big Data security issues. Loshin said, “I had an interesting conversation a few weeks back about whether data in the cloud is better protected than data that&#8217;s sitting in your own systems. We drilled down into that in the context of big data, such as that managed by Hadoop and those types of frameworks; we discussed the fact that the people who are developing Hadoop applications or MapReduce applications are developers. They are presuming access to the data, and the data sitting out on a collection of nodes and in some massively parallel configurations &#8212; presumably, that&#8217;s data in an uncontrolled environment… and there is even greater opportunity for exposure.”</p>
<p>Loshin continued, “Essentially, the data needs to be moved over to the framework, be it Hadoop or whatever. It&#8217;s then exposed as the analysis is being done, and then the results are integrated back in, or reconnected back to, for example, some traditional data warehouse or business intelligence framework. When you&#8217;re looking at large collections of data, there&#8217;s the potential once again for lots of data being exposed. On the other hand, if you&#8217;re creating a controlled environment where there is no other means of getting access, one might say that there might be an opportunity for increasing the protection, if you&#8217;re instituting your development framework in the right way.”</p>
<p><a href="http://tdwi.org/articles/2011/09/20/Big-Data-Security-2.aspx" target="_blank">Read more here.</a></p>
<p><a title="Attribution-NoDerivs License" href="http://creativecommons.org/licenses/by-nd/2.0/" target="_blank"><img src="http://www.dataversity.net/wp-content/plugins/photo-dropper/images/cc.png" alt="Creative Commons License" width="16" height="16" align="absmiddle" border="0" /></a> <a href="http://www.photodropper.com/photos/" target="_blank">photo</a> credit: <a title="Rumalowa" href="http://www.flickr.com/photos/40436304@N07/6193334331/" target="_blank">Rumalowa</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/new-methods-for-tackling-big-data-security/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced

 Served from: www.dataversity.net @ 2013-05-20 13:27:50 by W3 Total Cache -->