<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DATAVERSITY &#187; 2011 &#187; October</title>
	<atom:link href="http://www.dataversity.net/2011/10/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dataversity.net</link>
	<description></description>
	<lastBuildDate>Fri, 17 May 2013 18:18:00 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>From &#8220;Overnight&#8221; to &#8220;Real-time&#8221;: A Two-Year NoSQL Case Study</title>
		<link>http://www.dataversity.net/from-overnight-to-real-time-a-two-year-nosql-case-study/</link>
		<comments>http://www.dataversity.net/from-overnight-to-real-time-a-two-year-nosql-case-study/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:43:31 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[Benjamin Anderson]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6714</guid>
		<description><![CDATA[&#160; From &#8220;Overnight&#8221; to &#8220;Real-time&#8221;: A Two-Year NoSQL Case Study, Benjamin Anderson, Meteor Solutions View more videos from DATAVERSITY About the Presentation Meteor Solutions integrates site and advertising analytics to provide major publishers and advertisers the ability to identify and reach their influential visitors with advertising, exclusive content and rewards. Eighteen months ago, Meteor was backed by a relational DB and struggling to keep up with volumes in a batch processing environment that was ill-suited to our graph oriented data model. Today, the service is backed by Cloudant, a distributed document store based on CouchDB, and provides deeper analytics in real-time. This transition enabled 10x growth and allowed us to open our technology to a much broader range of applications &#8212; though not without some bumps along the way. This talk will cover: Overview of our services and specific technical challenges Overview of Cloudant/CouchDB, how we leverage it, and its relation to other SQL, NoSQL, and web technologies in our stack Benefits we&#8217;ve seen and tradeoffs we have had to make Operational lessons learned Future plans: how NoSQL&#8217;s possibilities and limitations influence business, product and operational decisions About the Speaker Benjamin Anderson Director of Engineering Meteor Solutions &#160; Ben has [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9286066" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="From &quot;Overnight&quot; to &quot;Real-time&quot;: A Two-Year NoSQL Case Study, Benjamin Anderson, Meteor Solutions" href="http://www.slideshare.net/Dataversity/from-overnight-to-realtime-a-twoyear-nosql-case-study-benjamin-anderson-meteor-solutions" target="_blank">From &#8220;Overnight&#8221; to &#8220;Real-time&#8221;: A Two-Year NoSQL Case Study, Benjamin Anderson, Meteor Solutions</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9286066" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>Meteor Solutions integrates site and advertising analytics to give major publishers and advertisers the ability to identify and reach their influential visitors with advertising, exclusive content and rewards. Eighteen months ago, Meteor was backed by a relational DB and struggling to keep up with data volumes in a batch-processing environment that was ill-suited to our graph-oriented data model. Today, the service is backed by Cloudant, a distributed document store based on CouchDB, and provides deeper analytics in real time. This transition enabled 10x growth and allowed us to open our technology to a much broader range of applications, though not without some bumps along the way. This talk will cover:</p>
<ul>
<li>Overview of our services and specific technical challenges</li>
<li>Overview of Cloudant/CouchDB, how we leverage it, and its relation to other SQL, NoSQL, and web technologies in our stack</li>
<li>Benefits we&#8217;ve seen and tradeoffs we have had to make</li>
<li>Operational lessons learned</li>
<li>Future plans: how NoSQL&#8217;s possibilities and limitations influence business, product and operational decisions</li>
</ul>
<h2><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/B-anderson-e1320104463712.jpg"><img class="alignleft size-full wp-image-6716" title="B-anderson" src="http://www.dataversity.net/wp-content/uploads/2011/10/B-anderson-e1320104463712.jpg" alt="" width="100" height="100" /></a></p>
<p><strong>Benjamin Anderson</strong><br />
Director of Engineering<br />
<em>Meteor Solutions</em></p>
<p>&nbsp;</p>
<p><em>Ben has several years&#8217; experience building web-focused applications with an emphasis on big data, distributed architectures and non-relational data stores. He graduated from the University of Washington in 2008 with a degree in business, but spends most of his time leading a growing engineering and operations team, hacking on various open-source projects, and remodeling his new home. </em></p>
<p>Connect with Ben on <a title="@banjiewen" href="https://twitter.com/#!/banjiewen" target="_blank">Twitter</a> (<em>@banjiewen</em>).</p>
<p>&nbsp;</p>
<p>For more videos and topics from the NoSQL Now! 2011 Conference, click <a title="NoSQL Now! 2011" href="http://www.dataversity.net/archives/category/education/events/nosql-now-events">HERE</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/from-overnight-to-real-time-a-two-year-nosql-case-study/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>A Tour of the Hypertable Monitoring System</title>
		<link>http://www.dataversity.net/a-tour-of-the-hypertable-monitoring-system/</link>
		<comments>http://www.dataversity.net/a-tour-of-the-hypertable-monitoring-system/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:37:40 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[Douglass Judd]]></category>
		<category><![CDATA[Hypertable]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6712</guid>
		<description><![CDATA[&#160; A Tour of the Hypertable Monitoring System, Douglass R Judd, Hypertable, Inc. View more videos from DATAVERSITY About the Presentation Hypertable is a high performance, open source, scalable database modeled after Google&#8217;s Bigtable. With any scalable database system that is designed to run on a large number of machines, good monitoring is essential. Hypertable&#8217;s monitoring system, first released over a year ago, has evolved into something quite useful. In this presentation, the audience will be taken on a tour of the Hypertable monitoring system. About the Speaker &#160; Douglass R Judd CEO Hypertable, Inc. &#160; Doug is co-founder and CEO of Hypertable, Inc, a company that provides commercial support for Hypertable, a massively scalable, open source database. Doug started the Hypertable open source project in 2007, while working as an Architect at Zvents, and has been actively building the technology ever since. Doug has over a decade of software engineering experience in the area of distributed computing and information retrieval. He joined Inktomi’s Web Search division in 1997 where he held both engineering and management positions. During his five year tenure, he designed and developed large-scale distributed systems, including significant pieces of the crawling and indexing software. Doug earned [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9183229" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="A Tour of the Hypertable Monitoring System, Douglass R Judd, Hypertable, Inc." href="http://www.slideshare.net/Dataversity/a-tour-of-the-hypertable-monitoring-system-douglass-r-judd-hypertable-inc" target="_blank">A Tour of the Hypertable Monitoring System, Douglass R Judd, Hypertable, Inc.</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9183229" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>Hypertable is a high-performance, open-source, scalable database modeled after Google&#8217;s Bigtable. With any scalable database system that is designed to run on a large number of machines, good monitoring is essential. Hypertable&#8217;s monitoring system, first released over a year ago, has evolved into something quite useful. In this presentation, the audience will be taken on a tour of the Hypertable monitoring system.</p>
<h2><strong>About the Speaker</strong></h2>
<p><em><a href="http://www.dataversity.net/wp-content/uploads/2011/10/judd.jpg"><img class="alignleft size-full wp-image-6696" title="judd" src="http://www.dataversity.net/wp-content/uploads/2011/10/judd.jpg" alt="" width="90" height="120" /></a></em></p>
<p>&nbsp;</p>
<p><strong>Douglass R Judd</strong><br />
CEO<br />
<em>Hypertable, Inc.</em></p>
<p>&nbsp;</p>
<p><em>Doug is co-founder and CEO of Hypertable, Inc., a company that provides commercial support for Hypertable, a massively scalable, open-source database. Doug started the Hypertable open-source project in 2007, while working as an Architect at Zvents, and has been actively building the technology ever since. Doug has over a decade of software engineering experience in the area of distributed computing and information retrieval. He joined Inktomi’s Web Search division in 1997, where he held both engineering and management positions. During his five-year tenure, he designed and developed large-scale distributed systems, including significant pieces of the crawling and indexing software. Doug earned a B.S. in Computer Science from U.C. Santa Barbara in 1992 and holds four patents in search technology.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/a-tour-of-the-hypertable-monitoring-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MongoDB at Sailthru: Scaling and Schema Design</title>
		<link>http://www.dataversity.net/mongodb-at-sailthru-scaling-and-schema-design/</link>
		<comments>http://www.dataversity.net/mongodb-at-sailthru-scaling-and-schema-design/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:31:41 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Ian White]]></category>
		<category><![CDATA[MongoDB]]></category>
		<category><![CDATA[Sailthru Dataversity]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6709</guid>
		<description><![CDATA[&#160; MongoDB at Sailthru: Scaling and Schema Design, Ian White, Sailthru View more videos from DATAVERSITY About the Presentation Sailthru handles all of your website&#8217;s email delivery needs, ensuring inbox delivery for transactional and mass mail. Sailthru started out as a MySQL-powered transactional-mail service. Starting in 2009, we migrated to the document-oriented &#8220;NoSQL&#8221; database MongoDB. Moving entirely to MongoDB has allowed us to build complex user profiles that power behaviorally targeted mass emails and onsite recommendations. This talk covers how and why we made the move, and how we use MongoDB today. About the Speaker &#160; Ian White CTO and Co-Founder Sailthru &#160; Ian White is CTO and co-founder of Sailthru, a next-generation email service provider focusing on behavioral analytics. He was formerly the lead of development at Business Insider, where he used the alpha version of MongoDB in production. Ian White studied computer science at Brown University.]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9261223" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="MongoDB at Sailthru: Scaling and Schema Design, Ian White, Sailthru" href="http://www.slideshare.net/Dataversity/mongodb-at-sailthru-scaling-and-schema-design-ian-white-sailthru" target="_blank">MongoDB at Sailthru: Scaling and Schema Design, Ian White, Sailthru</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9261223" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>Sailthru handles all of your website&#8217;s email delivery needs, ensuring inbox delivery for transactional and mass mail. Sailthru started out as a MySQL-powered transactional-mail service. Starting in 2009, we migrated to the document-oriented &#8220;NoSQL&#8221; database MongoDB. Moving entirely to MongoDB has allowed us to build complex user profiles that power behaviorally targeted mass emails and onsite recommendations. This talk covers how and why we made the move, and how we use MongoDB today.</p>
<h2><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/I-White-e1320103699867.jpg"><img class="alignleft size-full wp-image-6710" title="I-White" src="http://www.dataversity.net/wp-content/uploads/2011/10/I-White-e1320103699867.jpg" alt="" width="100" height="150" /></a></p>
<p>&nbsp;</p>
<p><strong>Ian White</strong><br />
CTO and Co-Founder<br />
<em>Sailthru</em></p>
<p>&nbsp;</p>
<p><em>Ian White is CTO and co-founder of Sailthru, a next-generation email service provider focusing on behavioral analytics. He was formerly the lead of development at Business Insider, where he used the alpha version of MongoDB in production. Ian White studied computer science at Brown University.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/mongodb-at-sailthru-scaling-and-schema-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Introduction to Hadoop for Enterprise Applications (Part 1)</title>
		<link>http://www.dataversity.net/introduction-to-hadoop-for-enterprise-applications-part-1/</link>
		<comments>http://www.dataversity.net/introduction-to-hadoop-for-enterprise-applications-part-1/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:25:12 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Serge Blazhievsky]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6707</guid>
		<description><![CDATA[&#160; Introduction to Hadoop for Enterprise Applications, Serge Blazhievsky, LiveOps, Inc. View more videos from DATAVERSITY About the Presentation History of the Map-Reduce technology Typical use-cases: when it should be used and when it is not appropriate Map-Reduce abstraction High-level components HDFS &#8211; Hadoop Distributed File System Main features of namenode and datanodes Performing calculations on top of the Hadoop Distributed File System Main features of jobtracker and tasktrackers Yahoo case study general setup data processing reliability statistics LiveOps case study (near-real-time data processing) data collection rapid calculation on a Hadoop cluster Sharing a cluster between different uses using pool scheduling Key issues in starting a pilot project: data preparation initial setup of the Hadoop cluster ongoing maintenance of the Hadoop cluster Monitoring and maintenance Summary and take-away points About the Speaker Serge Blazhievsky Developer and Architect LiveOps, Inc. Serge Blazhievsky is an experienced developer and architect with a rich background in C++/Java and distributed systems. His latest venture, LiveOps, Inc., uses Hadoop infrastructure for all reporting needs. He designed the LiveOps Hadoop framework in its entirety, and it satisfies very strict performance and availability requirements. Serge&#8217;s prior ventures include Attributor, Inc., where he designed the Hadoop infrastructure used for Internet crawling and web-page [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9434364" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Introduction to Hadoop for Enterprise Applications, Serge Blazhievsky, LiveOps, Inc." href="http://www.slideshare.net/Dataversity/live-ops-part-1-sml" target="_blank">Introduction to Hadoop for Enterprise Applications, Serge Blazhievsky, LiveOps, Inc.</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9434364" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<ol>
<li>History of the Map-Reduce technology</li>
<li>Typical use-cases: when it should be used and when it is not appropriate</li>
<li>Map-Reduce abstraction</li>
<li>High-level components
<ol>
<li>HDFS &#8211; Hadoop Distributed File System</li>
<li>Main features of namenode and datanodes</li>
<li>Performing calculations on top of the Hadoop Distributed File System</li>
<li>Main features of jobtracker and tasktrackers</li>
</ol>
</li>
<li>Yahoo case study
<ul>
<li>general setup</li>
<li>data processing</li>
<li>reliability statistics</li>
</ul>
</li>
<li>LiveOps case study (near-real-time data processing)
<ul>
<li>data collection</li>
<li>rapid calculation on a Hadoop cluster</li>
</ul>
</li>
<li>Sharing a cluster between different uses using pool scheduling</li>
<li>Key issues in starting a pilot project:
<ul>
<li>data preparation</li>
<li>initial setup of the Hadoop cluster</li>
<li>ongoing maintenance of the Hadoop cluster</li>
</ul>
</li>
<li>Monitoring and maintenance</li>
<li>Summary and take-away points</li>
</ol>
<h2><strong>About the Speaker</strong></h2>
<p><strong>Serge Blazhievsky</strong><br />
Developer and Architect<br />
<em>LiveOps, Inc.</em></p>
<p><em>Serge Blazhievsky is an experienced developer and architect with a rich background in C++/Java and distributed systems. His latest venture, LiveOps, Inc., uses Hadoop infrastructure for all reporting needs. He designed the LiveOps Hadoop framework in its entirety, and it satisfies very strict performance and availability requirements. Serge&#8217;s prior ventures include Attributor, Inc., where he designed the Hadoop infrastructure used for Internet crawling and web-page analysis. He holds a master&#8217;s degree in Computer Engineering from Santa Clara University, CA, located in the heart of Silicon Valley. Serge is a regular attendee of, and contributor to, various Hadoop conferences, including the Hadoop User Group at Yahoo, the creator of Hadoop.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/introduction-to-hadoop-for-enterprise-applications-part-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Object-based Semantic Vocabulary Data Models for Agile Business Intelligence</title>
		<link>http://www.dataversity.net/object-based-semantic-vocabulary-data-models-for-agile-business-intelligence/</link>
		<comments>http://www.dataversity.net/object-based-semantic-vocabulary-data-models-for-agile-business-intelligence/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:19:55 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[NoSQL Now! Presentations]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[agile]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data models]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[Geoffrey Malafsky]]></category>
		<category><![CDATA[semantic vocabulary]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6704</guid>
		<description><![CDATA[&#160; Object-based Semantic Vocabulary Data Models for Agile Business Intelligence, Geoffrey P. Malafsky, Phasic Systems Inc View more videos from DATAVERSITY About the Presentation Object-based key-value data model using semantic vocabulary enables Agile data governance, BI, and system requirements by eliminating the time-intensive, rigid, siloed activities of serial requirements analysis, relational/dimensional data modeling, and data integration engineering. Agility is required to match technical design to the speed of business decisions with accurate, common, consistent data. About the Speaker &#160; Geoffrey P. Malafsky CEO Phasic Systems Inc &#160; Dr. Geoffrey Malafsky earned a PhD in Nanotechnology from The Pennsylvania State University. He was a research scientist at the Naval Research Laboratory before becoming a technology consultant in advanced system capabilities for numerous Government agencies and corporate clients. He has over thirty years of experience and is an expert in multiple fields including Nanotechnology, Knowledge Discovery and Dissemination, and information engineering. He founded and operated the technology consulting company TECHi2 prior to founding Phasic Systems Inc where he is the CEO.]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9184507" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Object-based Semantic Vocabulary Data Models for Agile Business Intelligence, Geoffrey P. Malafsky, Phasic Systems Inc" href="http://www.slideshare.net/Dataversity/objectbased-semantic-vocabulary-data-models-for-agile-business-intelligence-geoffrey-p-malafsky-phasic-systems-inc" target="_blank">Object-based Semantic Vocabulary Data Models for Agile Business Intelligence, Geoffrey P. Malafsky, Phasic Systems Inc</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9184507" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>An object-based key-value data model using a semantic vocabulary enables Agile data governance, BI, and system requirements by eliminating the time-intensive, rigid, siloed activities of serial requirements analysis, relational/dimensional data modeling, and data integration engineering. Agility is required to match technical design to the speed of business decisions with accurate, common, consistent data.</p>
<h2><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/G-malafsky.jpg"><img class="alignleft size-full wp-image-6705" title="G-malafsky" src="http://www.dataversity.net/wp-content/uploads/2011/10/G-malafsky.jpg" alt="" width="115" height="150" /></a></p>
<p>&nbsp;</p>
<p><strong>Geoffrey P. Malafsky</strong><br />
CEO<br />
<em>Phasic Systems Inc</em></p>
<p>&nbsp;</p>
<p><em>Dr. Geoffrey Malafsky earned a PhD in Nanotechnology from The Pennsylvania State University. He was a research scientist at the Naval Research Laboratory before becoming a technology consultant in advanced system capabilities for numerous Government agencies and corporate clients. He has over thirty years of experience and is an expert in multiple fields, including Nanotechnology, Knowledge Discovery and Dissemination, and Information Engineering. He founded and operated the technology consulting company TECHi2 prior to founding Phasic Systems Inc, where he is the CEO.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/object-based-semantic-vocabulary-data-models-for-agile-business-intelligence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>NoSQL for the Real-Time User Data Problem</title>
		<link>http://www.dataversity.net/nosql-for-the-real-time-user-data-problem/</link>
		<comments>http://www.dataversity.net/nosql-for-the-real-time-user-data-problem/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:08:46 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[NoSQL Now! Presentations]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Brian Bulkowski]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6698</guid>
		<description><![CDATA[NoSQL for the Real-Time User Data Problem, Brian Bulkowski, Citrusleaf, Inc. View more videos from DATAVERSITY About the Presentation Brian Bulkowski, founder and CEO of Citrusleaf, will focus on the role of innovative NoSQL databases in advertising and the problem of real-time user data. The talk will feature case studies from online and mobile advertising companies who are facing data challenges of 100s of terabytes and requirements for millisecond response times. He will discuss the data problems in real-time bidding applications and present some ideas for moving beyond the last click attribution model. He will illustrate why scalability and speed are so critical and how new technology approaches are driving innovation in digital advertising. About the Speaker &#160; Brian Bulkowski CEO &#38; Founder Citrusleaf, Inc. &#160; Brian Bulkowski is founder and CEO of Citrusleaf. Brian is an expert at building high-scale platforms and middleware that provide ease of use through technical excellence. As Director of Performance, he led a team to build a clustered distributed recommendation engine for Aggregate Knowledge. At Liberate, he was Chief Architect of Cable Solutions, as well as managing cross-functional groups and founding a new product line for the company. At Novell, he learned the game-changing [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<div style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="NoSQL for the Real-Time User Data Problem, Brian Bulkowski, Citrusleaf, Inc." href="http://www.slideshare.net/Dataversity/nosql-for-the-realtime-user-data-problem-brian-bulkowski-citrusleaf-inc" target="_blank">NoSQL for the Real-Time User Data Problem, Brian Bulkowski, Citrusleaf, Inc.</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9229262" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>Brian Bulkowski, founder and CEO of Citrusleaf, will focus on the role of innovative NoSQL databases in advertising and the problem of real-time user data. The talk will feature case studies from online and mobile advertising companies that face data challenges of hundreds of terabytes and requirements for millisecond response times. He will discuss the data problems in real-time bidding applications and present some ideas for moving beyond the last-click attribution model. He will illustrate why scalability and speed are so critical and how new technology approaches are driving innovation in digital advertising.</p>
<h2><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/B-Bulkowski-e1319838757503.jpg"><img class="alignleft size-medium wp-image-6580" title="B-Bulkowski" src="http://www.dataversity.net/wp-content/uploads/2011/10/B-Bulkowski-e1319838364279-196x300.jpg" alt="" width="118" height="180" /></a></p>
<p>&nbsp;</p>
<p><strong>Brian Bulkowski</strong><br />
CEO &amp; Founder<br />
<em>Citrusleaf, Inc.</em></p>
<p>&nbsp;</p>
<p><em>Brian Bulkowski is founder and CEO of Citrusleaf. Brian is an expert at building high-scale platforms and middleware that provide ease of use through technical excellence. As Director of Performance, he led a team to build a clustered distributed recommendation engine for Aggregate Knowledge. At Liberate, he was Chief Architect of Cable Solutions, managed cross-functional groups, and founded a new product line for the company. At Novell, he learned the game-changing power of commodity hardware as Lead of AppleTalk routing for NetWare 3 and 4. Brian has also contributed to start-up efforts in the areas of mobile internet, Wi-Fi mesh technology, high-scale IP-based television, and the contextual web. Brian holds a B.S. from Brown University in Mathematics/Computer Science.</em></p>
<p><a title="Join DATAVERSITY" href="http://www.dataversity.net/membership-overview/join-dataversity" target="_blank"><img class="size-full wp-image-7556 aligncenter" title="Click Here to Win" src="http://www.dataversity.net/wp-content/uploads/2011/10/Click-Here-to-Win.jpg" alt="" width="125" height="125" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/nosql-for-the-real-time-user-data-problem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Genome Sequence Analysis System Built with Hypertable</title>
		<link>http://www.dataversity.net/a-genome-sequence-analysis-system-built-with-hypertable/</link>
		<comments>http://www.dataversity.net/a-genome-sequence-analysis-system-built-with-hypertable/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 23:02:30 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[NoSQL Now! Presentations]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[Douglass Judd]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6695</guid>
		<description><![CDATA[&#160; A Genome Sequence Analysis System Built with Hypertable, Douglass R Judd, Hypertable, Inc. View more videos from DATAVERSITY About the Presentation Deep genome sequencing has revolutionized the fields of biology and medicine. Since January 2008, the capacity to generate sequence data has increased exponentially, far outpacing Moore&#8217;s Law. The emergence of scalable NoSQL database technologies has made the analysis of this vast amount of sequence data not only feasible but cost-effective. The University of California at San Francisco UCSF-Abbott Viral Detection and Discovery Center (led by director Charles Chiu, MD, PhD), Taylor Sittler, MD, and the Hypertable development team have embarked upon a project to build a scalable software platform to facilitate deep sequencing analysis in diagnostic microbiology, transcriptomic analysis, and clinical / environmental metagenomics, areas for which existing commercial and academic solutions are sorely lacking. Doug Judd, the original creator of Hypertable, will present an overview of this genome sequencing analysis system. The presentation will cover the following topics: Rationale for choosing NoSQL Schema design Sources and description of input data Algorithms for generating and querying lookup tables Table sizes and compression ratios Lessons learned during system deployment About the Speaker &#160; Douglass R Judd CEO Hypertable, Inc [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9286408" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="A Genome Sequence Analysis System Built with Hypertable, Douglass R Judd, Hypertable, Inc." href="http://www.slideshare.net/Dataversity/a-genome-sequence-analysis-system-built-with-hypertable-douglass-r-judd-hypertable-inc" target="_blank">A Genome Sequence Analysis System Built with Hypertable, Douglass R Judd, Hypertable, Inc.</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9286408" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<div style="padding: 5px 0pt 12px;">
<p>Deep genome sequencing has revolutionized the fields of biology and medicine. Since January 2008, the capacity to generate sequence data has increased exponentially, far outpacing Moore&#8217;s Law. The emergence of scalable NoSQL database technologies has made the analysis of this vast amount of sequence data not only feasible but cost-effective. The University of California at San Francisco UCSF-Abbott Viral Detection and Discovery Center (led by director Charles Chiu, MD, PhD), Taylor Sittler, MD, and the Hypertable development team have embarked upon a project to build a scalable software platform to facilitate deep sequencing analysis in diagnostic microbiology, transcriptomic analysis, and clinical / environmental metagenomics, areas for which existing commercial and academic solutions are sorely lacking. Doug Judd, the original creator of Hypertable, will present an overview of this genome sequencing analysis system. The presentation will cover the following topics:</p>
<ul>
<li>Rationale for choosing NoSQL</li>
<li>Schema design</li>
<li>Sources and description of input data</li>
<li>Algorithms for generating and querying lookup tables</li>
<li>Table sizes and compression ratios</li>
<li>Lessons learned during system deployment</li>
</ul>
<h2 style="padding: 5px 0pt 12px;"><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/judd.jpg"><img class="alignleft size-full wp-image-6696" title="judd" src="http://www.dataversity.net/wp-content/uploads/2011/10/judd.jpg" alt="" width="90" height="120" /></a></p>
<p>&nbsp;</p>
<p><strong>Douglass R Judd</strong><br />
CEO<br />
<em>Hypertable, Inc.</em></p>
<p><em>Doug is co-founder and CEO of Hypertable, Inc., a company that provides commercial support for Hypertable, a massively scalable, open source database. Doug started the Hypertable open source project in 2007, while working as an Architect at Zvents, and has been actively building the technology ever since. Doug has over a decade of software engineering experience in the area of distributed computing and information retrieval. He joined Inktomi’s Web Search division in 1997, where he held both engineering and management positions. During his five-year tenure, he designed and developed large-scale distributed systems, including significant pieces of the crawling and indexing software. Doug earned a B.S. in Computer Science from U.C. Santa Barbara in 1992 and holds four patents in search technology.</em></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/a-genome-sequence-analysis-system-built-with-hypertable/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Putting Business Intelligence to Work on Hadoop Data Stores</title>
		<link>http://www.dataversity.net/putting-business-intelligence-to-work-on-hadoop-data-stores/</link>
		<comments>http://www.dataversity.net/putting-business-intelligence-to-work-on-hadoop-data-stores/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 22:55:16 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[NoSQL Now! Presentations]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Ian Fyfe]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6691</guid>
		<description><![CDATA[&#160; Putting Business Intelligence to Work on Hadoop Data Stores, Ian Fyfe, Pentaho View more videos from DATAVERSITY About the Presentation An inexpensive way of storing large volumes of data, Hadoop is also scalable and redundant. But getting data out of Hadoop is tough because it lacks a built-in query language. Also, because users experience high latency (up to several minutes per query), Hadoop is not appropriate for ad hoc querying, reporting, and business analysis with traditional tools. The first step in overcoming Hadoop&#8217;s constraints is connecting to Hive, a data warehouse infrastructure built on top of Hadoop, which provides the relational structure necessary for scheduled reporting on large datasets stored in Hadoop files. Hive also provides a simple query language called HiveQL, which is based on SQL and enables users familiar with SQL to query this data. But to really unlock the power of Hadoop, you must be able to efficiently extract data stored across multiple (often tens or hundreds of) nodes with a user-friendly ETL (extract, transform and load) tool that will then allow you to move your Hadoop data into a relational data mart or warehouse where you can use BI tools [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<p>&nbsp;</p>
<div id="__ss_9155471" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Putting Business Intelligence to Work on Hadoop Data Stores, Ian Fyfe, Pentaho" href="http://www.slideshare.net/Dataversity/putting-business-intelligence-to-work-on-hadoop-data-stores-ian-fyfe-pentaho" target="_blank">Putting Business Intelligence to Work on Hadoop Data Stores, Ian Fyfe, Pentaho</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9155471" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>An inexpensive way of storing large volumes of data, Hadoop is also scalable and redundant. But getting data out of Hadoop is tough because it lacks a built-in query language. Also, because users experience high latency (up to several minutes per query), Hadoop is not appropriate for ad hoc querying, reporting, and business analysis with traditional tools.</p>
<p>The first step in overcoming Hadoop&#8217;s constraints is connecting to Hive, a data warehouse infrastructure built on top of Hadoop, which provides the relational structure necessary for scheduled reporting on large datasets stored in Hadoop files. Hive also provides a simple query language called HiveQL, which is based on SQL and enables users familiar with SQL to query this data.</p>
<p>But to really unlock the power of Hadoop, you must be able to efficiently extract data stored across multiple (often tens or hundreds of) nodes with a user-friendly ETL (extract, transform and load) tool that will then allow you to move your Hadoop data into a relational data mart or warehouse where you can use BI tools for analysis.</p>
<p>Attendees will learn how an IT person without Java programming skills can:</p>
<ul>
<li>Integrate with Hadoop and Hive to bring ETL, data warehousing and BI applications to the tasks of analyzing Big Data</li>
<li>Provide key data integration and transformation functionality to Hadoop data</li>
<li>Manage and control Hadoop jobs using a graphical interface</li>
<li>Integrate Hadoop data with data from other sources to drive compelling reporting and analytics for today&#8217;s massive volumes of data</li>
</ul>
<h2><strong>About the Speaker</strong></h2>
<p><a href="http://www.dataversity.net/wp-content/uploads/2011/10/I-Fyfe.png"><img class="alignleft size-full wp-image-6693" title="I-Fyfe" src="http://www.dataversity.net/wp-content/uploads/2011/10/I-Fyfe.png" alt="" width="106" height="146" /></a></p>
<p>&nbsp;</p>
<p><strong>Ian Fyfe</strong><br />
Chief Technology Evangelist<br />
<em>Pentaho</em></p>
<p>&nbsp;</p>
<p><em>Ian Fyfe is responsible for driving adoption of Pentaho&#8217;s BI technologies, focusing on Pentaho&#8217;s customer base and community to ensure their needs are being met and exceeded, and providing input on high-level product strategy and roadmap development. Ian brings extensive experience in the Business Intelligence and Data Warehouse industry, gained at companies including Jaspersoft, PeopleSoft, Epiphany, Informix, and Business Objects.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/putting-business-intelligence-to-work-on-hadoop-data-stores/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Two Years of Production HBase: Lessons Learned</title>
		<link>http://www.dataversity.net/two-years-of-production-hbase-lessons-learned/</link>
		<comments>http://www.dataversity.net/two-years-of-production-hbase-lessons-learned/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 22:47:04 +0000</pubDate>
		<dc:creator>Nerrisa Waite</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Conference and Webinar Communities]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[NoSQL Now!]]></category>
		<category><![CDATA[NoSQL Now! Presentations]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Dataversity]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Rod Cope]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6689</guid>
		<description><![CDATA[Two Years of Production HBase: Lessons Learned, Rod Cope, OpenLogic, Inc. View more videos from DATAVERSITY About the Presentation HBase and friends are built from the ground up to support Big Data, but that doesn&#8217;t make them easy. As with any relatively new and complex technology, there are some rough edges and growing pains to manage. I&#8217;ve learned some hard lessons while deploying HBase tables containing billions of rows and dozens of terabytes on OpenLogic&#8217;s Hadoop infrastructure. Come to this session to learn about some of the &#8220;gotchas&#8221; you might run into when deploying Hadoop and HBase in your own production environment and how to avoid them. Here are some general areas we&#8217;ll explore: Hard-to-find configuration problems and debugging techniques Under-documented yet critical features Deployment recommendations for particular use cases Advice on how to import Big Data Using JRuby/Ruby to make life with Hadoop and HBase easier About the Speaker Rod Cope CTO &#38; Founder OpenLogic, Inc. Rod Cope is the CTO and Founder of OpenLogic, a provider of Open Source support and governance solutions for the enterprise. He has over 25 years of software development experience in a wide range of industries and technologies. Prior to founding [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo.png"><img class="alignnone size-medium wp-image-5642" title="NoSQLNow Logo" src="http://www.dataversity.net/wp-content/uploads/2011/09/NoSQLNow-Logo-300x55.png" alt="NoSQL Now! Conference" width="300" height="55" /></a></p>
<div id="__ss_9249320" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="Two Years of Production HBase: Lessons Learned, Rod Cope, OpenLogic, Inc." href="http://www.slideshare.net/Dataversity/two-years-of-production-hbase-lessons-learned-rod-cope-openlogic-inc" target="_blank">Two Years of Production HBase: Lessons Learned, Rod Cope, OpenLogic, Inc.</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9249320" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="425" height="355"></iframe></div>
<div style="padding: 5px 0 12px;">View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a></div>
<h2 style="padding: 5px 0 12px;"><strong>About the Presentation</strong></h2>
<p>HBase and friends are built from the ground up to support Big Data, but that doesn&#8217;t make them easy. As with any relatively new and complex technology, there are some rough edges and growing pains to manage. I&#8217;ve learned some hard lessons while deploying HBase tables containing billions of rows and dozens of terabytes on OpenLogic&#8217;s Hadoop infrastructure. Come to this session to learn about some of the &#8220;gotchas&#8221; you might run into when deploying Hadoop and HBase in your own production environment and how to avoid them.</p>
<p>Here are some general areas we&#8217;ll explore:</p>
<ul>
<li>Hard-to-find configuration problems and debugging techniques</li>
<li>Under-documented yet critical features</li>
<li>Deployment recommendations for particular use cases</li>
<li>Advice on how to import Big Data</li>
<li>Using JRuby/Ruby to make life with Hadoop and HBase easier</li>
</ul>
<h2><strong>About the Speaker</strong></h2>
<p><strong>Rod Cope</strong><br />
CTO &amp; Founder<br />
<em>OpenLogic, Inc.</em></p>
<p><em>Rod Cope is the CTO and Founder of OpenLogic, a provider of Open Source support and governance solutions for the enterprise. He has over 25 years of software development experience in a wide range of industries and technologies. Prior to founding OpenLogic, Rod worked for General Electric, IBM, IBM Global Services, and Anthem before starting his own consulting company. As a consultant, he has architected solutions for Ericsson, Ford, Manugistics, Integral, Goodyear, and many other companies of all sizes. Rod has spoken on various technical and business topics at JavaOne, OSCON, the Open Source Business Conference, the Next Generation Data Center conference, and other venues around the world. He is currently writing the book &#8220;Cloud Computing in Action&#8221;. He holds both Bachelor&#8217;s and Master&#8217;s degrees in Software Engineering from the University of Louisville.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/two-years-of-production-hbase-lessons-learned/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Webinar: Award Winning Data Governance</title>
		<link>http://www.dataversity.net/webinar-award-winning-data-governance/</link>
		<comments>http://www.dataversity.net/webinar-award-winning-data-governance/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 22:20:28 +0000</pubDate>
		<dc:creator>Shannon Kempe</dc:creator>
				<category><![CDATA[Data Governance and Quality]]></category>
		<category><![CDATA[Data Topics]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[On Demand Webinars]]></category>
		<category><![CDATA[Webinars]]></category>

		<guid isPermaLink="false">http://www.dataversity.net/?p=6676</guid>
		<description><![CDATA[This presentation was given in collaboration with: &#160; Award Winning Data Governance View more videos from DATAVERSITY To view just the slides of the live presentation, click HERE. &#160; About the Webinar View this recording and learn why Sallie Mae was the recipient of the 2011 Data Governance Best Practice Award. This presentation is designed for participants to learn: The &#8220;non-traditional&#8221; Data Governance framework Sallie Mae used and the benefits of that approach How the DG Program contributed to the success of Sallie Mae during tumultuous industry changes How Sallie Mae utilized business initiatives to gain momentum and resources The key critical success factors and lessons learned &#160; About the Speaker Michele Koch is the Director of Enterprise Data Management and the Data Governance Office at Sallie Mae, the leading &#8220;saving, planning, and paying for education&#8221; company in the US. Michele was responsible for the successful design and implementation of the enterprise Data Governance and Data Quality Programs at Sallie Mae. Their Data Governance Program won the 2011 Data Governance Conference Best Practice Award and TDWI’s 2010 Best Practices Award for the Data Governance category. She is also responsible for the data modeling team that provides support to development teams [...]]]></description>
				<content:encoded><![CDATA[<h3 style="text-align: center;"><strong>This presentation was given in collaboration with:</strong></h3>
<p><a title="DGPO" href="http://www.dgpo.org" target="_blank"><img class="aligncenter" title="DGPO" src="http://www.dataversity.net/wp-content/uploads/2011/08/DGPO.png" alt="" width="314" height="108" /></a></p>
<p>&nbsp;</p>
<div style="width:425px" id="__ss_9966509"> <strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/Dataversity/award-winning-data-governance-9966509" title="Award Winning Data Governance" target="_blank">Award Winning Data Governance</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/9966509" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe>
<div style="padding:5px 0 12px"> View more videos from <a href="http://www.slideshare.net/Dataversity" target="_blank">DATAVERSITY</a> </div>
</div>
<h3>To view just the slides of the live presentation, click <a title="Slides: Award Winning Data Governance" href="http://www.dataversity.net/archives/6672"><strong>HERE</strong></a>.</h3>
<p>&nbsp;</p>
<h2><span style="color: #16416f;"><strong>About the Webinar</strong></span></h2>
<h3>View this recording and learn why Sallie Mae was the recipient of the 2011 Data Governance Best Practice Award.</h3>
<h3><strong>This presentation is designed for participants to learn:</strong></h3>
<ul>
<li>
<h3>The &#8220;non-traditional&#8221; Data Governance framework Sallie Mae used and the benefits of that approach</h3>
</li>
<li>
<h3>How the DG Program contributed to the success of Sallie Mae during tumultuous industry changes</h3>
</li>
<li>
<h3>How Sallie Mae utilized business initiatives to gain momentum and resources</h3>
</li>
<li>
<h3>The key critical success factors and lessons learned</h3>
</li>
</ul>
<p>&nbsp;</p>
<h2><span style="color: #16416f;"><strong>About the Speaker</strong></span></h2>
<h3><a href="http://www.dataversity.net/wp-content/uploads/2011/10/Koch_Michele.jpg"><img class="alignleft" title="Koch_Michele" src="http://www.dataversity.net/wp-content/uploads/2011/10/Koch_Michele.jpg" alt="" width="72" height="97" /></a>Michele Koch is the Director of Enterprise Data Management and the Data Governance Office at Sallie Mae, the leading &#8220;saving, planning, and paying for education&#8221; company in the US. Michele was responsible for the successful design and implementation of the enterprise Data Governance and Data Quality Programs at Sallie Mae. Their Data Governance Program won the 2011 Data Governance Conference Best Practice Award and TDWI’s 2010 Best Practices Award for the Data Governance category. She is also responsible for the data modeling team that provides support to development teams and the business user community. Her 28 years of experience include applying structured analysis and design methods for process and data modeling, managing client/server and mainframe DBAs, and consulting at Fortune 500 companies. Michele received dual master&#8217;s degrees in MIS and Computer Systems Applications from The American University and a bachelor&#8217;s degree from Cornell University.</h3>
<h3>Michele is a co-founder of the newly formed Data Governance Professionals Organization (DGPO).</h3>
]]></content:encoded>
			<wfw:commentRss>http://www.dataversity.net/webinar-award-winning-data-governance/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
