SQL Server 2012 PDW has a feature called PolyBase, that enables you to integrate Hadoop data with PDW data. By using PDW with PolyBase capabilities, a user can:
- Use an external table to define a table structure for Hadoop data.
- Query Hadoop data by running SQL statements
- Integrate Hadoop data with PDW data by running a PDW query that joins Hadoop data to a relational PDW table.
- Persist Hadoop data in PDW by querying Hadoop and saving the results to a PDW table.
- Use Hadoop as an online data archive by exporting PDW data to Hadoop. Since the data is stored online in Hadoop, user will be able to retrieve the data by querying it from PDW.
In the video below, which highlights a solution to a problem that involves sending help to evacuate potential victims of a hurricane, Microsoft SQLCAT Senior Program Manager Murshed Zaman demonstrates how to solve a customer question using relational data from SQL Server Parallel Data Warehouse 2012 (PDW 2012) and non-relational data stored inside Hadoop. The demo will show how you can analyze data by combining the capabilities of Power View and Power Pivot for Excel, Hadoop, and PDW. This video focuses on the PolyBase feature of SQL Server Parallel Data Warehouse 2012. PowerPivot and PowerView were added to the demonstration to help visualize the data results.
For step-by-step instructions on creating the PowerView report please visit Cindy Gross’ blog “Hurricane Sandy Mash-Up: Hive, SQL Server, PowerPivot & Power View.”
For more information on SQL Server Parallel Data Warehouse Appliance visit http://www.microsoft.com/en-us/sqlserver/solutions-technologies/data-warehousing/pdw.aspx