<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Irfan&#039;s View</title>
	<atom:link href="http://blogs.sybase.com/irfankhan/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.sybase.com/irfankhan</link>
	<description>Just another WordPress site</description>
	<lastBuildDate>Tue, 15 May 2012 15:36:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>Big Data Will Change the (Very) Old IT Org Chart</title>
		<link>http://blogs.sybase.com/irfankhan/2012/05/big-data-will-change-the-very-old-it-org-chart/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/05/big-data-will-change-the-very-old-it-org-chart/#comments</comments>
		<pubDate>Tue, 15 May 2012 15:11:36 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Careers]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=208</guid>
		<description><![CDATA[It’s not surprising that the first known information storage system—the clay tablet—was used for structured data. Yes, the Sumerians used their cuneiform writing to compile data on the wealth of temples around 3200 BCE. The temple’s leader held a title of sangu, which roughly translates to accountant, (in effect, the CFO). The sangu reported directly [...]]]></description>
			<content:encoded><![CDATA[<p>It’s not surprising that the first known information storage system—the clay tablet—was used for structured data. Yes, the Sumerians used their cuneiform writing to <a href="http://www.historyworld.net/wrldhis/PlainTextHistories.asp?historyid=ab33">compile data</a> on the wealth of temples around 3200 BCE. The temple’s leader held a title of sangu, which roughly translates to accountant, (in effect, the CFO). The sangu reported directly to the king, (the CEO of Sumer, if you will). </p>
<p><a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_17789854.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_17789854-150x150.jpg" alt="" title="shutterstock_17789854" width="150" height="150" class="alignleft size-thumbnail wp-image-209" /></a>The sangu’s scribes (let’s liken them to the IT department) entered and stored data about such valuables as livestock and beer, the fermenting of which the Sumerians are credited with discovering. This penchant for data led the Sumerians to another breakthrough. Around 2600 BCE they invented a mathematical table that had rows and columns for calculating. We recognize it today as the <a href="http://ukcatalogue.oup.com/product/9780198508410.do">spreadsheet</a>. And guess what they used it for? Inventory management.</p>
<p>I mention this only because so little seems to have changed since those days in Mesopotamia. That is, IT is still mostly beholden to our modern equivalent to the sangu. I came across a <a href="http://www.cio.com/article/593262/CFOs_to_CIOs_You_Work_for_Me_Now">recent study</a> by Gartner and the Financial Executives Research Foundation that claims that 42% of IT departments report to the CFO with only one-third answering directly to the king, er, CEO. </p>
<p>This hoary tradition of putting the CIO under the finance leader makes sense when most of what the IT department did was count things and store information about them in structured formats, especially when most of the data related to the worth of the enterprise or kingdom. Having your top finance person being responsible for all aspects of the information, including the systems used to manage the data, whether that be clay tablets or data warehouses, makes sense in that context.</p>
<p>But I predict that Big Data will upend that ancient organizational history because it’s not just numbers about the value of inventory or weekly cash flow. Getting insight from Big Data requires a broader understanding of the business, one that should take more CIOs out from under the CFO’s shadow, if they take the lead on applying analytics to these new data sources. </p>
<p>Gartner also says that in 2012 CEOs will be pushing for more analytics projects, in part because they want to exploit Big Data for growth. This could be a great year, then, for CIOs to make the case that they need to report directly to the CEO. And taking the helm on Big Data projects might be the best way to end the ancient history of IT’s slot on the corporate org chart.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/05/big-data-will-change-the-very-old-it-org-chart/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Worth of Big Data</title>
		<link>http://blogs.sybase.com/irfankhan/2012/05/the-worth-of-big-data/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/05/the-worth-of-big-data/#comments</comments>
		<pubDate>Tue, 08 May 2012 16:01:07 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[Unstructured data]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=203</guid>
		<description><![CDATA[You may have heard that some in the industry believe that 99.9% of Big Data is “worthless.” Indeed, I’ve written about the scientists at the Large Hadron Collider who discard the vast majority of the petabytes of data their experiments produce. And we all know about the huge swathes of our SANs wasted on unused [...]]]></description>
			<content:encoded><![CDATA[<p>You may have <a href="http://www.v3.co.uk/v3-uk/news/2171134/majority-worthless-oracle-s-mark-hurd">heard</a> that some in the industry believe that 99.9% of Big Data is “worthless.” Indeed, I’ve written about the scientists at the Large Hadron Collider who discard the vast majority of the petabytes of data their experiments produce. And we all know about the huge swathes of our SANs wasted on unused data. Organizations confront  mountains of extraneous and redundant data. That’s a well-understood problem. </p>
<p>But is it really “worthless”?</p>
<p>I vigorously disagree. Even if data is not usable to the task at hand, it may become so later to another analyst. And if it is redundant, understanding what processes are causing the repetition might lead to improved business performance. Understanding the whole of a Big Data opportunity means being able to discern what part of your data set is worthwhile and which is not. That does not mean that what you discard today will be worthless tomorrow. It depends on the questions you ask the data and how you ask them and when.</p>
<p>More to the point, as Hal Varian, Google’s Chief Economist <a href="http://www.aspeninstitute.org/sites/default/files/content/docs/pubs/The_Promise_and_Peril_of_Big_Data.pdf">observes</a>, to truly conduct valid predictive analytics, perhaps the most important potential of Big Data, you need to start with a random data set, even if it’s a <a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_79161421.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_79161421-150x150.jpg" alt="" title="shutterstock_79161421" width="150" height="150" class="alignright size-thumbnail wp-image-204" /></a>small one. However, as engineers at Google know, to get that truly random data set, that sliver of data needs to come from a massive amount of information. Without a large enough pool of data to draw from, the validity of your data set and subsequent analytics loses precision. In other words, even unused, Big Data generates the most valid data sets for modeling.</p>
<p>Philip Russom, TDWI’s director of data management, adds another critical aspect of Big Data. He argues that Big Data is discovery oriented, where you look for facts you never knew before. He warns if you over massage Big Data you risk eliminating outliers in the data, which might be exactly what you need to find, such as in credit card fraud. </p>
<p>Blithely dismissing 99.9% of Big Data as worthless shows a lack of nuance and insight when it comes to understanding the Big Data problems modern enterprises face. Seeing the world strictly through a traditional database mentality, where only the purest, cleanest, most massaged, and manageably small data set is trusted, leads to bald and false proclamations about the worth of Big Data. </p>
<p>Such a view does not lead to effective predictive analytics. It does not lead forward. It strikes me as purely a defensive position from those who do not have the tools to exploit the enormous worth of Big Data.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/05/the-worth-of-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Analytics in Our iPad Era</title>
		<link>http://blogs.sybase.com/irfankhan/2012/05/analytics-in-our-ipad-era/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/05/analytics-in-our-ipad-era/#comments</comments>
		<pubDate>Tue, 01 May 2012 15:31:10 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Mobile Commerce]]></category>
		<category><![CDATA[Mobility]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=197</guid>
		<description><![CDATA[Tablets are increasingly the tech tool of choice for CEOs. According to the 2012 CEO &#038; Senior Business Executive survey by Gartner, 41% of CEOs polled said they use an iPad. Mark Raskino, an analyst with the research firm, says he expects that percentage to be at least 50% by now. And it’s growing. Given [...]]]></description>
			<content:encoded><![CDATA[<p>Tablets are increasingly the tech tool of choice for CEOs. According to the 2012 CEO &#038; Senior Business Executive survey by Gartner, 41% of CEOs polled said they use an iPad. Mark Raskino, an analyst with the research firm, <a href="http://www.gartner.com/id=1957516">says he expects</a> that percentage to be at least 50% by now. And it’s growing.</p>
<p>Given this rapid adoption by executives, Raskino suggested during a webinar last month that CIOs should “put some cool app on the CEO’s iPad” to show off the innovation and capabilities of his or her IT department. Few apps, in my mind, would be more effective than one built around analytics, particularly one that showed how applying data to business issues can foster growth because, as the Gartner survey showed, it’s the top concern for CEOs across the globe.</p>
<p>Although there are some very interesting analytics apps on the iTunes App Store for iPad, as you can imagine, most are for entry-level analytics businesses. I don’t think CEOs at large organizations would be that impressed with most of them. (Plus, the App Store’s predilection toward consumers means that a search for “Analytics” in the Business section of Apple’s online marketplace yields numerous curious results, such as  “PiPuzzle: Valentine’s Day Edition,” “Circus Challenge,” and “Palmistry—destiny in your hand,” among others. And, while palm reading could help some CIOs in reading their CEO’s views on IT, it’s probably not the best app with which to start.)</p>
<p>However, the Gartner survey does suggest  what kind of app CEOs might appreciate for their tablet: a customer-centric one. According to the Gartner research, nearly 60% of CEOs see increasing their investment in their CRM infrastructure, more than any other IT area. The second item on a CEO’s IT wish list for 2012: data-driven management.</p>
<p>Could you ask for a better combination for making a CEO happy? Customers are the foundation for any business growth. CRM systems are rich with customer data about your business. Creating an app that delivers relevant, decision-focused data about your customers, then, should be a high priority for any IT team that has a CEO carrying an iPad.</p>
<p><a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_92708947.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/05/shutterstock_92708947-150x150.jpg" alt="" title="shutterstock_92708947" width="150" height="150" class="aligncenter size-thumbnail wp-image-198" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/05/analytics-in-our-ipad-era/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scientists Push Big Data Limits</title>
		<link>http://blogs.sybase.com/irfankhan/2012/04/scientists-push-big-data-limits/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/04/scientists-push-big-data-limits/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 16:59:57 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Data Management]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=192</guid>
		<description><![CDATA[Physicists studying the results of tests at the Large Hadron Collider (LHC) at CERN, the Swiss nuclear research laboratory outside Geneva, have a lot of data to ponder. In fact, they have nearly 50% more than they had originally estimated. Initially, the LHC was expected to generate 15 petabytes of usable data each year. Recent [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_95287372.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_95287372-150x150.jpg" alt="" title="shutterstock_95287372" width="150" height="150" class="alignleft size-thumbnail wp-image-193" /></a>Physicists studying the results of tests at the Large Hadron Collider (LHC) at CERN, the Swiss nuclear research laboratory outside Geneva, have a lot of data to ponder. In fact, they have nearly 50% more than they had originally estimated. Initially, the LHC was <a href="http://public.web.cern.ch/public/en/LHC/Computing-en.html">expected to generate </a>15 petabytes of usable data each year. Recent <a href="http://www.zdnet.co.uk/news/cloud/2012/03/14/cern-drafts-in-cloud-to-handle-data-loads-40095265/">reports</a> have raised that number to more than 22 petabytes annually. </p>
<p>However, CERN says that its tests <a href="http://www.lhc-closer.es/php/index.php?i=1&#038;s=3&#038;p=12&#038;e=0">produce vastly greater amounts of data</a> than gets studied. Some experiments can create up to one petabyte of data per second. Lucky for the data storage managers, not to mention the tired-eye physicists, on average all but 200 Mbytes per second of that petabyte are deemed “uninteresting data” and discarded by the system.</p>
<p>It’s Big Data quantities like we see at the LHC that helped prompt the U.S. government late last month to <a href="http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release_final_2.pdf">announce</a> $200 million dollars in research and development funds specifically for scientists confronting the data deluge. According to a statement from the White House, the funds are necessary to develop the technologies capable of “managing, analyzing, visualizing, and extracting useful information from large and diverse data sets.” And it’s hoped that this investment in Big Data management “will accelerate scientific discovery and lead to new fields of inquiry that would otherwise not be possible.”</p>
<p>I am optimistic that those of us on the technology side will be up to the task of handling the data needs of science. But even I was daunted by recent news of the Square Kilometer Array (SKA). Headquartered in Manchester, UK, with a target completion date in 2024, the proposed 20-nation radio telescope research project dwarfs all other Big Data initiatives I’ve seen so far. This single project is <a href="http://arstechnica.com/science/news/2012/04/future-telescope-array-drives-development-of-exabyte-processing.ars?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=rss">currently estimated</a> to produce one exabyte of data every day; or the equivalent of <a href="http://www.skatelescope.org/about/facts-figures/">six weeks worth of the total volume of data traversing the Internet in 2011.</a></p>
<p>Large-scale, multi-nation collaborative science, such as the LHC and SKA, are relatively small in number and confront Big Data problems on a scale few of us can imagine. But they bear watching because almost all of us can learn from their Big Data solutions.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/04/scientists-push-big-data-limits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Windstorm of Big Data</title>
		<link>http://blogs.sybase.com/irfankhan/2012/04/a-windstorm-of-big-data/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/04/a-windstorm-of-big-data/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 14:06:41 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[R&D]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=184</guid>
		<description><![CDATA[In his fascinating and comprehensive 1984 study Heaven’s Breath: A Natural History of the Wind, Lyall Watson observed, “Wind is invisible.” And, as a result, “There are no photographs of the wind.” But he wrote that before Big Data blew into town. Now we have a moving picture of the wind. Data visualization experts Fernanda [...]]]></description>
			<content:encoded><![CDATA[<p>In his fascinating and comprehensive <a href="http://www.goodreads.com/book/show/80654.Heaven_s_Breath">1984 study</a> Heaven’s Breath: A Natural History of the Wind, Lyall Watson observed, “Wind is invisible.” And, as a result, “There are no photographs of the wind.”</p>
<p>But he wrote that before Big Data blew into town. Now we have a moving picture of the wind.</p>
<p>Data visualization experts Fernanda Viégas and Martin Wattenberg have given us <a href="http://hint.fm/wind/">this</a> beautiful near real-time view of wind in motion throughout the continental United States. The striking live image uses the massive National Digital Forecast Database maintained by the National Weather Service. By clicking on the map you can drill down and see the wind blowing in your area of interest; or simply become mesmerized by flowing patterns the wind makes.<a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_71733247.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_71733247-150x150.jpg" alt="" title="shutterstock_71733247" width="150" height="150" class="alignright size-thumbnail wp-image-186" /></a></p>
<p>In addition to being a thing of beauty itself, the visualization also offers practical insight for energy companies in deploying wind farms to produce electricity. It can also be a tool for firefighters in the American West in their annual battle with forest fires. And even farmers can use the visualization to help them locate the best places to establish effective wind breaks to fight wind erosion, a major <a href="http://www.weru.ksu.edu/new_weru/problem/problem.shtml">hindrance</a> to agricultural productivity.</p>
<p>Viégas and Wattenberg are hunting for similar wind databases from other regions on the globe so they can build a model for those geographies as well. If you know of any such databases, you should let them know. </p>
<p>The promise of Big Data is to help us see things we never could before. No where is that truer than with this visualization of wind.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/04/a-windstorm-of-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HANA’s Database Strategy Stands Tall</title>
		<link>http://blogs.sybase.com/irfankhan/2012/04/hana%e2%80%99s-database-strategy-stands-tall/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/04/hana%e2%80%99s-database-strategy-stands-tall/#comments</comments>
		<pubDate>Fri, 13 Apr 2012 13:26:05 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Predictive analytics]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=175</guid>
		<description><![CDATA[Admittedly, I’m a technology-focused person, which is why in my initial post about SAP’s news this week about its in-memory database, HANA, I dwelled on its superior performance. After all, when real-world customers are seeing 400,000 times improvement in database responsiveness, well, it gets your attention. But there was much more SAP brought to the [...]]]></description>
			<content:encoded><![CDATA[<p>Admittedly, I’m a technology-focused person, which is why in my <a href="http://www.itworld.com/data-centerservers/266880/saps-hana-database-big-performance-big-data">initial post</a> about SAP’s news this week about its in-memory database, HANA, I dwelled on its superior performance. After all, when real-world customers are seeing 400,000 times improvement in database responsiveness, well, it gets your attention.</p>
<p>But there was much more SAP brought to the competitive landscape than the fastest database available. It brought a well-conceived strategy to win customers and increase market share. Oh, and it brought money to implement the plan. Lots of money. About a half billion dollars.</p>
<p><em>Businessweek</em>’s <a href="http://www.businessweek.com/news/2012-04-10/sap-plans-337-million-fund-to-encourage-customers-to-adopt-hana">headline</a> sums up our new database world nicely:</p>
<p><strong>SAP Challenges Oracle With $500 Million Hana Incentive<br />
</strong><br />
Of course, how SAP spends the money is what will change the game. In that regard the company has devised two-pronged strategy that, first, will help CIOs decide it’s time to move off slow, legacy databases ($337 million) and, second, encourage startups to build applications on top of the world’s fastest database ($155 million). </p>
<p>That seems like the right balance of investments. First, there a lot of CIOs with tired, overworked databases undermining application performance. But they are always fighting for IT resources and need to pick their budget battles carefully. Merely showing CFOs how fast a HANA database infrastructure will be for a company is not always going to be persuasive. But revealing the partnership mentality SAP is bringing to the table in terms of its investment with customers will definitely help win over the keepers of the purse strings. As one observer <a href="http://www.zdnet.com/blog/btl/sap-goes-database-happy-can-it-dent-oracle/73837">summed up nicely</a>: “The game plan: Push long-time database partner Oracle out of the way and take a bigger chunk of the enterprise IT spending pie. It’s unclear how this adventure will turn out, but SAP has some cojones.”</p>
<p>Step two is to increase the application portfolio on HANA by funding startups. With far fewer software startups by comparison to IT customers, fewer dollars are needed. But there’s another less money is needed: software developers naturally gravitate toward the best, fastest, most advanced technology. They have a built-in desire to partner with state-of-the-art vendors. A little extra venture capital money, well, that’s icing on the cake for many of them.</p>
<p>Another area SAP thought through carefully is pricing. The company made it simple. As <em>Information Week</em> <a href="http://www.informationweek.com/news/software/enterprise_apps/232900098">points out</a>, in a similar ERP analytics environment with Oracle you would need three licenses. “With Hana there&#8217;s just one license,” the news source says.</p>
<p>Finally, training will be a top priority. As <em>Computerworld</em> <a href="http://www.computerworlduk.com/news/it-business/3350301/sap-building-on-hana-become-big-player-in-databases-mobile/">reports</a>: “SAP is also focused on making sure developers and database administrators have access to the best training materials for HANA, as well as development and test environments at no cost….”</p>
<p>This is a complete strategy that over the long term threatens Oracle. That’s why Larry Ellison recently said that SAP “must be on drugs” to consider competing against his company. But as <em>Forbes</em> magazine <a href="http://">countered:</a><a href="http://www.forbes.com/sites/siliconangle/2012/04/10/sap-makes-mobile-acquisition-takes-on-oracle-launched-real-time-venture-fund/"> “Well SAP is doing some really good performance enhancing drugs. SAP is standing tall shouting that they are indeed in the database market.”<br />
&#8211;<br />
<a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_31617739.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_31617739-300x300.jpg" alt="" title="shutterstock_31617739" width="300" height="300" class="aligncenter size-medium wp-image-178" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/04/hana%e2%80%99s-database-strategy-stands-tall/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SAP’s HANA Database: Big Performance for Big Data</title>
		<link>http://blogs.sybase.com/irfankhan/2012/04/sap%e2%80%99s-hana-database-big-performance-for-big-data/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/04/sap%e2%80%99s-hana-database-big-performance-for-big-data/#comments</comments>
		<pubDate>Wed, 11 Apr 2012 14:23:04 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Databases]]></category>
		<category><![CDATA[Healthcare]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[R&D]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=170</guid>
		<description><![CDATA[Yesterday, Sybase’s parent company, SAP, made a shocking claim that demolishes the conventional wisdom in the enterprise software industry. The claim: SAP is a database company. Not just any database company, but the database company for Big Data. I know, I know. SAP is an enterprise resource planning (ERP) and business applications company. Its software [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, Sybase’s parent company, SAP, made a shocking claim that demolishes the conventional wisdom in the enterprise software industry. The claim: SAP is a database company. Not just any database company, but the database company for Big Data.</p>
<p>I know, I know. SAP is an enterprise resource planning (ERP) and business applications company. Its software runs on databases like those from Sybase and other companies. That’s true. But the fact is, SAP is also database company.</p>
<p>Don’t believe me? Well, in addition to having the products in its Sybase arsenal, SAP makes the fastest, most robust database available today: <a href="http://www.sap.com/solutions/technology/in-memory-computing-platform/hana/overview/index.epx">HANA</a>. HANA’s in-memory, massively parallel processing architecture makes it ideally suited for the era of Big Data.  And I mean really Big Data.</p>
<p>In a recent independent <a href="https://www.experiencesaphana.com/community/blogs/blog/2012/04/09/sap-hana--scale-out-performance-test-results--early-findings">benchmark</a> HANA raced through a 100TB test database with 100 billion records. First, HANA achieved a 20x data compression level, which was remarkable. More impressive, though, was that with no caching, indexing, or materializing of the query results, the query responses were a mere 300 to 500 milliseconds. Compare this to some Oracle documentation that has claimed it was “lightning fast” at processing 100 million records in one second. HANA, then, can run 1,000 times more data in less than one-half the time than Oracle. <a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_92964202.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_92964202-150x150.jpg" alt="" title="shutterstock_92964202" width="150" height="150" class="alignright size-thumbnail wp-image-171" /></a></p>
<p>Beyond benchmarks, in the real world of Wall Street, one HANA application is using Sybase CEP (Complex Event Processing) to feed more than 2.1 million updates per second into the database. In a retail environment in Japan, one customer achieved 400,000 times performance improvement over its previous database environment. Adobe uses HANA to analyze customer data in real time and T-Mobile runs three HANA databases to analyze and reduce customer churn. It’s stories like these that make HANA the fastest growing product in SAP history.</p>
<p>HANA portends to change not just perceptions about SAP as a database company, but real lives. Imagine, if you will, a cancer patient sitting in a doctor’s office having a sample taken to analyze her genome. With those test results, medical professionals can prepare a targeted, personally designed treatment program for the patient. The problem is that today those test results take at least one month to complete. With HANA, the potential is that the doctor will get those results during the patient’s initial office visit and can begin the treatment immediately.</p>
<p>HANA dramatically changes the database industry landscape. Now, according to <a href="http://www.gartner.com/technology/reprints.do?id=1-17OKEN0&#038;ct=111014&#038;st=sb#h-d2e248">Gartner</a>, SAP has “put extraordinary competitive pressure on the megavendors, including IBM, Microsoft and Oracle” to follow its lead. However, they are a long, long way behind the new database technology leader.</p>
<p>Big Data is fast becoming SAP’s best friend because it cries out for a database like HANA. And with Big Data on everyone’s lips these days they’ll soon be talking about SAP as the database company for our Big Data era. And it won’t be shocking news to anyone. </p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/04/sap%e2%80%99s-hana-database-big-performance-for-big-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>In-Database Compression</title>
		<link>http://blogs.sybase.com/irfankhan/2012/04/in-database-compression/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/04/in-database-compression/#comments</comments>
		<pubDate>Tue, 03 Apr 2012 14:23:03 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Data Management]]></category>
		<category><![CDATA[Databases]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=166</guid>
		<description><![CDATA[Recently, we polled some of our customers about their estimated costs to manage a single terabyte of data in a year. Although there was no consensus—the numbers ranged from $25,000 to $100,000 per annum—there was a grudging acceptance that these costs are inevitable. After all, you still need the storage hardware, network connectivity, power, and [...]]]></description>
			<content:encoded><![CDATA[<p>Recently, we polled some of our customers about their estimated costs to manage a single terabyte of data in a year. Although there was no consensus—the numbers ranged from $25,000 to $100,000 per annum—there was a grudging acceptance that these costs are inevitable. After all, you still need the storage hardware, network connectivity, power, and more as well as the nominal labor to keep the data reliably available.</p>
<p>Plus, more people inside companies have more requirements to use more data more often for their jobs. This not only adds to the massive amounts of data being retained, it puts pressure on IT departments to have more of that information fingertip-ready for users. In addition, virtually all enterprises are hanging onto their data for longer periods. Heightened governance and regulatory concerns make storing data the default policy in many organizations. </p>
<p>Gartner <a href="http://www.cioinsight.com/c/a/Latest-News/Data-Growth-Now-a-FirstTier-Challenge-for-Enterprises-Gartner-Reports-273073/">estimates</a> large companies are seeing growth rates between 40%-60% annually. So, it doesn’t take long for those data store costs to mount quickly.</p>
<p>How can you manage these costs? Well, first of all, you need to have a data retirement policy. When can you safely archive and ultimately delete information not covered by regulations? Naturally, each organization will have their own requirements. And some data, even seldom accessed information, might never be removed such as CRM data on long-term customers and HR information on employees. </p>
<p>But other information—and there’s a lot of it—can be retired after negotiating with the information owners. And you can add leverage to the discussions by making certain that the data owners also own the costs of the data store over time. Nothing like a line item on their budget to send department heads to the delete key.</p>
<p>Yet, with hot data inside a database, storage costs can often be overlooked. That’s because the key metric for a production database is generally performance. Putting any constraints on performance might jeopardize a service-level agreement, so it’s a step seldom taken lightly.<br />
<a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_80259724.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/04/shutterstock_80259724-150x150.jpg" alt="" title="shutterstock_80259724" width="150" height="150" class="aligncenter size-thumbnail wp-image-167" /></a><br />
With the release of Sybase Adaptive Server Enterprise 15.7, in-database compression is now an option. Needless to say, rates will vary, but we’ve seen between 40%-80% compression ratios, certainly enough to make a notable impact on storage costs.  </p>
<p>What’s more important, I think, is that performance remains the top priority for ASE users, so the new compression capabilities does not put your SLA at risk. ASE can compress a single row so as to eliminate empty spaces in fixed-length columns; page directory and page index compression can be applied at the page or block level; and large objects (LOB) are compressed in-database. The overall effect is to actually reduce I/O time. Backup performance is also greatly improved as a result.</p>
<p>There’s more information here in <a href="http://www.sybase.com/files/White_Papers/Sybase-ASE-Compression-WP.pdf">this white paper</a>. If you have data storage cost problems—and you do—it’s worth a moment of your time.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/04/in-database-compression/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Democratization of Analytics</title>
		<link>http://blogs.sybase.com/irfankhan/2012/03/the-democratization-of-analytics/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/03/the-democratization-of-analytics/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 14:37:54 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Predictive analytics]]></category>
		<category><![CDATA[R&D]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=160</guid>
		<description><![CDATA[It’s an election year in the United States and it’s also time for the nation’s annual college basketball “March Madness” event. Both contests bring out the data geeks who lay claim to having the best predictions about which side will take first place; whether it’s hardball politics on the podium or roundball on hardwood courts, [...]]]></description>
			<content:encoded><![CDATA[<p>It’s an election year in the United States and it’s also time for the nation’s annual college basketball “March Madness” event. Both contests bring out the data geeks who lay claim to having the best predictions about which side will take first place; whether it’s hardball politics on the podium or roundball on hardwood courts, understanding the troves of Big Data involved gives the smart money the best insights into who will win and lose.</p>
<p>As a citizen of Great Britain, I do not take sides in either contest. However, I am rather partial to the data geeks who, increasingly, are helping democratize the use of analytics in daily life. By exposing the average citizen to the value of analytics, these data geeks are not just improving the public’s understanding of their world, they are fostering an interest in the field at a time when we need more analytics experts.<br />
<a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/03/shutterstock_25448113.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/03/shutterstock_25448113-150x150.jpg" alt="" title="shutterstock_25448113" width="150" height="150" class="alignleft size-thumbnail wp-image-161" /></a><br />
For example, computer scientists at the University of Illinois have created a <a href="http://bracketodds.cs.illinois.edu/">website</a> that uses analytics to help college basketball fans pick the most likely winners in the NCAA Men’s Basketball Tournament. At the same time, the <em>New York Times</em> has hired a well-known <a href="http://en.wikipedia.org/wiki/Psephology">psephologist</a> to write a regular <a href="http://fivethirtyeight.blogs.nytimes.com/">blog about Big Data and politics</a>. Both efforts not only raise awareness of the use of data in our lives, they underscore the accuracy analytics brings to an event before any hoops are shot or votes are cast. They show, with reliable track records, that using analytics can improve your understanding of not just the past, but of the future as well. That’s very appealing to people.</p>
<p>Hollywood also gave a boost to understanding analytics when it touted the sports movie <em>Moneyball</em> as an Oscar contender for Best Picture at this year’s Academy Awards. The movie raked in more than $100 million in ticket sales where a few million filmgoers witnessed Brad Pitt’s character discuss the importance of analytics in picking the best ballplayers for the least cash, a classic business intelligence decision process. As a Tinseltown flack might say about analytics, “You can’t buy that kind of publicity.”</p>
<p>More important than high-profile exposure of analytics to wider audiences is encouraging the broader use of the technology. Arguably, no one has done a better job at putting BI tools into the hands of everyday people than Google. In addition to publishing blogs and articles aimed at small business users to help them exploit their own business data, it has posted ten, <a href="http://www.youtube.com/playlist?list=PL85CE2D27BC6FD84B">simple how-to videos</a> targeting analytics tyros. Like the tools themselves, Google offers the instructive videos for free.</p>
<p>I suspect that the current popular interest in analytics will continue because data increasingly has become intertwined with our modern lives. More than ever, we see ourselves and our actions in the context of data. And I hope that the hype that Hollywood movies, election cycles, and sporting events bring to the technology translates into genuine interest among younger people to study the underlying math in analytics. As we all know from the now-famous <a href="http://www.mckinsey.com/Features/Big_Data">McKinsey report</a> on Big Data, we face a shortfall of hundreds of thousands of analysts, data-savvy managers, and analytics experts in the coming years; a shortfall that could put many enterprises at a competitive disadvantage.</p>
<p>Without attracting more young people to the field, the competition for the few skilled practitioners available will become heavily weighted in favor of only the largest and wealthiest organizations. But if all of this recent publicity does, indeed, spur more students to major in mathematics and statistics, the building blocks of analytics, then the democratization of analytics will have been a great success at more than just the box office.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/03/the-democratization-of-analytics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Overrated Analytics?</title>
		<link>http://blogs.sybase.com/irfankhan/2012/02/overrated-analytics/</link>
		<comments>http://blogs.sybase.com/irfankhan/2012/02/overrated-analytics/#comments</comments>
		<pubDate>Tue, 28 Feb 2012 08:19:52 +0000</pubDate>
		<dc:creator>Irfan Khan, Sybase CTO</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Operational BI]]></category>
		<category><![CDATA[Predictive analytics]]></category>

		<guid isPermaLink="false">http://blogs.sybase.com/irfankhan/?p=153</guid>
		<description><![CDATA[It would never occur to me to seriously ask whether business analytics are overrated. And if I posed the question in this blog, you would already know my answer before I reached a conclusion. Not surprisingly, my answer is unequivocally no. If anything, business analytics are somewhat underrated. Still, it is legitimate question and one [...]]]></description>
			<content:encoded><![CDATA[<p>It would never occur to me to seriously ask whether business analytics are overrated. And if I posed the question in this blog, you would already know my answer before I reached a conclusion. Not surprisingly, my answer is unequivocally no. If anything, business analytics are somewhat underrated.</p>
<p>Still, it is legitimate question and one posed by Bloomberg Businessweek Research Services in a recently published report called “The Current State of Business Analytics: Where Do We Go From here?” (You can download the survey <a href="http://tdwi.org/whitepapers/2011/12/sas_the-current-state-of-business-analytics/asset.aspx?tc=assetpg">here</a> if you have an account at The Data Warehouse Institute.) While the answer derived from their research is less emphatic than mine, it is ultimately supportive of my position.</p>
<p>I’m not going to review all of the information in the report. But there are a few key areas of interest relating to the question at hand. First, there is the expansion of business analytics within companies. Compared to a similar survey conducted in 2009 when 90% of $100+ million companies around the globe said they used business analytics, 97% say they do today. Further, 58% of respondents say their use of analytics increased significantly or moderately in the past 12 months. </p>
<p>Another point in the survey underscoring the acceleration of analytics in business is the hunt for talent. Forty-three percent of those polled have current plans to add to their analytics personnel, while another 22% acknowledge they need more people in the area, but for various reasons are not recruiting today.</p>
<p>That growth tells me that virtually every mid-to-large enterprise accepts the value of analytics. Perhaps the only data-driven business tool more prevalent in an organization today is the spreadsheet.</p>
<p>That’s because, as the report notes, “money talks.” The top three reasons companies are using analytics are to reduce costs, improve profits, and manage risk. In short, analytics are successful because they can directly help an enterprise succeed in their business.</p>
<p>However, the report’s authors seem to equivocate on the value of analytics in one area because the survey reveals that companies continue to rely on intuition more than they do data during the decision-making process. In its words: “While organizations recognize that business analytics provide additional insight for decision-making, survey results seem to show that analytics cannot fully replace experience and knowledge.” They put the ratio at 60/40 in favor of intuition over analytics.</p>
<p>That’s not a bad thing, from my perspective. And, in fact, a more nuanced survey might have discovered how tightly integrated intuition and analytics are in the most successful enterprises. By that, I mean, the best queries and models executed inside an analytics engine are those derived from business experience. Good intuition leads to savvy queries which bring the best results. In fact, the report supports my interpretation by noting that in companies “using analytics effectively,” that 60/40 ratio of intuition to analytics shifts to 53/47.<br />
<a href="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/02/shutterstock_10619575.jpg"><img src="http://blogs.sybase.com/irfankhan/wp-content/uploads/2012/02/shutterstock_10619575-150x150.jpg" alt="" title="shutterstock_10619575" width="150" height="150" class="alignleft size-thumbnail wp-image-157" /></a><br />
To me, the ideal balance for effective organizations is a 50/50 balance of business know-how and analytics. Maybe the next Bloomberg Businessweek poll will show that. Until then, I’ll argue that business analytics is slightly underrated.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.sybase.com/irfankhan/2012/02/overrated-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

