Wednesday, October 15, 2008

Ellison hypes Oracle's data warehouse appliance

This story appeared on Network World at
http://www.networkworld.com/columnists/2008/100708-kobielus.html

Ellison hypes Oracle's data warehouse appliance

By James Kobielus , Network World , 10/07/2008

The high-end data warehousing wars are fast upon us. Vendors are launching ever more scalable DW solutions. And they're delivering them with more aggressive -- and slippery -- performance claims.

The DW industry's new battlefront is petabyte scalability. This refers to a DW platform's ability to ingest, store, process and deliver an order-of-magnitude more data than today's typical terabyte-size warehouses. In this regard, the competitive high ground is still held by pioneering DW-appliance provider Teradata. That vendor recently released a high-end, shared-nothing, massively parallel processing (MPP) DW appliance that can scale to an astounding 10 petabytes across as many as 1,024 compute/storage nodes.

Oracle and HP recently joined the petabyte battle with all guns blazing. At Oracle's annual OpenWorld conference, they jointly announced general availability of a new petabyte-scalable DW appliance: the HP Oracle Database Machine, which includes the HP Exadata Storage Server. They touted its "extreme" performance and scaling features, bolstering those claims through public demos and beta-tester testimonials.

Most significant, they enlisted none other than Oracle CEO Larry Ellison and HP honchos Mark Hurd and Ann Livermore to unveil the new offering from the conference's main stage.

Clearly, the HP Oracle Database Machine is highly strategic for both companies. It provides a platform for Oracle to sell more database licenses and for HP to sell more server and storage hardware into DW deployments. It will almost certainly get the partners onto vendor short lists, alongside Teradata, for petabyte-scale DW solutions, which are increasingly being deployed in such vertical markets as telecommunications, government and financial services.

Also, it helps them blunt the momentum of DW appliance up-and-comer Netezza, whose platform, like the new Oracle/HP offering, performs SQL processing in an intelligent storage layer, thereby accelerating queries and table scans against very large data sets.

For sure, the recent Oracle/HP announcement was substantial and has shifted the competitive dynamics in the high-end DW market. But it was also an exercise in pure, albeit well-engineered, marketing hype. Predictably, it triggered an immediate firestorm of heated retorts from aggrieved competitors, which will almost certainly escalate in coming months.

In the fog of war, the first casualty is perspective, and that's certainly the case in this competitive fracas. Buyers of DW solutions should exercise extreme caution when evaluating the new Oracle/HP solution vis-à-vis comparably scalable offerings from Teradata, Sybase, Greenplum, IBM and others. You'll definitely need to apply the standard caveats to Larry Ellison's bold price/performance claims for his new monster DW appliance. And considering that Ellison was employing the native marketing speak of the DW arena, you'll need to apply the same grains of salt to his competitors' tails. Everybody in the DW market presents their self-serving performance story in much the same way as Oracle's big kahuna.

For starters, Ellison studded his talk with what might be regarded as the "virtuous coefficients" of DW performance enhancement: 10x, 20x, 30x, 40x, 50x, as high as 72x speedups have been documented by beta testers of the HP Oracle Database Machine. Of course, every DW professional knows that these performance boosts are extremely sensitive to myriad implementation factors, such as what you put in a SQL "where" clause, how many table joins you perform, whether and how you compress the data and so forth.

The performance enhancements are also relative to whatever DW configuration -- well-engineered or otherwise -- the beta testers had implemented prior to getting their hands on this shiny new uber-appliance. Note the tag line near the end of Ellison's presentation (emphasis added): "10-50x faster than current Oracle data warehouses."

Also, Oracle's big boss hammered Teradata and Netezza with benchmarks that were ostensibly apples-to-apples. However, Ellison's presentation seriously lacked the detailed footnoting that would be necessary to ascertain that he was indeed comparing his product against comparably configured instances of rival offerings that were processing comparable workloads. Where are those fast-talking, TV-commercial pharmaceutical disclaimer readers when we need them?

But even without aid of a magnifying glass, it was clear that Ellison was comparing his appliance directly to the Teradata 2550 and Netezza 10100 on the basis of a single common-denominator, configuration-wise: They all have a one-rack footprint. That's an odd basis for comparison. Those competitors do in fact have higher-end DW-appliance models, with more capacity, that might serve as a better basis for performance and price comparisons. Somehow, though, Oracle chose to overlook that fact. Why did it size up a 168-terabyte Oracle/HP machine against 43-terabyte offerings from Teradata and Netezza respectively?

Furthermore, Oracle somehow failed to benchmark these same solutions on the full range of performance criteria that actually matter in DW and business intelligence (BI) deployments, such as query response times, concurrent usage, mixed workload support, load speed and transaction throughput. Of course, even if Oracle had provided reliable, unbiased, third-party benchmarks in all of these areas, it would have been useless if the company didn't apply to comparably configured Teradata and Netezza offerings.

And the price-comparison chart -- including those same rival solutions -- was also seriously deficient. Most notably, the HP Oracle Database Machine's overall price, as presented by Ellison, lacked the requisite Oracle Database Real Application Cluster license fees. However, the stated prices for the Teradata and Netezza solutions definitely included the database management systems that come configured into those offerings (though, of course, Netezza has a free open source database, PostgreSQL, at the heart of its offering). So when you factor in all relevant costs, the new HP Oracle Database Machine doesn't look quite as attractive on the common-denominator of acquisition price per usable terabyte of production data.

Finally, Ellison, like most DW vendors, implicitly presented his solution's architectural approach as the gold standard against which all others must be disparaged. That, of course, is a highly debatable proposition.

For one thing, Oracle Database 11g -- the software heart of the appliance -- is still a general-purpose relational DBMS that has one foot in DW but another solidly planted in online transaction processing (OLTP). By constast, Teradata, Sybase, Netezza, Greenplum and other competitors have optimized their DBMSs for DW from the get-go, and do not support OLTP.

Also, Oracle's new appliance implements a shared-disk storage-area network architecture. By most accounts, shared-disk approaches are inherently less scalable than the shared-nothing MPP approach at the heart of DW solutions from, among others, Teradata and Greenplum.

And the Exadata storage layer can only parallelize SQL queries, and only against structured relational data. In its present incarnation, the Exadata storage grid cannot be used to execute a wider range of analytic functions or handle unstructured and semi-structured data types. Consequently, it is not applicable to the new generation of "content DWs" or for any of the in-database analytics that might be applied to the myriad nonrelational data types that reside in those warehouses.

Of course, Larry Ellison didn't go into anywhere near this degree of industry context. His job was and is to sell the world on an important new Oracle product and partnership, and he did so quite well. We shouldn't expect his direct competitors to be any more frank about their respective DW solutions' limitations. No commercial DW platform can optimally address every business-analytics requirement, now and future.

Sorting through the field of high-end DW solutions is getting more difficult, due to the diversity of vendor approaches. IT professionals need to read between the lines of DW vendors' increasingly breathtaking product announcements -- and talk to a consultant or analyst in the know -- before deciding if Oracle, HP or any other solution provider is truly breaking new ground.

If you find all of these complexities and caveats extremely confusing, and you're having trouble deciding which high-end appliance-based solutions can support the most extreme petabyte-scale workloads, welcome to the new DW market.

All contents copyright 1995-2008 Network World, Inc. http://www.networkworld.com