Microsoft Azure Stream Analytics (ASA) is a fully managed cloud service for real-time processing of streaming data. ASA makes it easy to set up real-time analytic computations on data flowing in from devices, sensors, web sites, applications and infrastructure systems. It supports a powerful high-level SQL-like language that dramatically simplifies… Read more
Massive parallel processing (MPP) is the future for data warehousing.
So what is MPP? SQL Server is a Symmetric Multiprocessing (SMP) solution, which essentially means it uses one server. MPP provides scalability and query performance by running independent servers in parallel. That is the quick definition. For more… Read more
SQL Server 2016 was recently announced. Top new features include:
- Always Encrypted protects data at rest and in motion. With Always Encrypted, SQL Server can perform operations on encrypted data and best of all, the encryption key resides with the application in the customers trusted environment. Encryption and decryption of…
At the recent Microsoft Build Developer Conference, Executive Vice President Scott Guthrie announced the Azure Data Lake. It is a new flavor of Azure Storage which can handle streaming data (low latency, high volume, short updates), is geo-distributed, data-locality aware and allows individual files to be sized at… Read more
Analytics Platform System (APS) is Microsoft’s massively parallel processing (MPP) data warehouse technology. This has only been available as an on-prem solution (see video Overview of Microsoft Analytics Platform System). Until now. At the recent Microsoft Build Developer Conference, Executive Vice President Scott Guthrie announced the… Read more
In case you were wondering what happened to the TechEd conferences, Microsoft is now bringing together the best of previously individual events – the Management Summit, the Exchange, SharePoint, Lync, Project, and TechEd conferences – and then taking it to the next level, based on what customers and partners have… Read more
A NoSQL database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. NoSQL is often interpreted as Not-only-SQL to emphasize that they may also support SQL-like query languages. Most NoSQL databases are designed to store… Read more
A “data lake” is a storage repository, usually in Hadoop, that holds a vast amount of raw data in its native format until it is needed. It’s a great place for investigating, exploring, experimenting, and refining data, in addition to archiving data. Some characteristics of a data lake include:
Microsoft Azure provides you two options when hosting your SQL Server-based data warehouse: Microsoft Azure SQL Database and SQL Server in Azure Virtual Machine. Which one is appropriate based on the size of the data warehouse? What are some hardware features to choose from for an Azure VM for… Read more
I see a lot of confusion on what exactly is an Operational Data Store (ODS). While it can mean different things to different people, I’ll explain what I see as the most common definition. First let me mention that an ODS is not a data warehouse or data mart. A… Read more
Thanks to everyone who attended my session “Building a Big Data Solution” (Building an Effective Data Warehouse Architecture with Hadoop, Cloud and MPP) for Pragmatic Works today. The abstract for my session is below and the recording will be available here tomorrow. I hope you enjoyed it!
Here is the… Read more
In an effort to understand Power BI and all the products it encompasses, I have made this slide deck to hopefully make things easy for you: Power BI Made Simple.
It is a presentation that covers all the products under the Power BI umbrella. I give an overview of… Read more
Advanced Analytics, or Business Analytics, refers to future-oriented analysis that can be used to help drive changes and improvements in business practices. It is made up of four phases:
Descriptive Analytics: What is generally referred to as “business intelligence”, this phase is where a lot of digital… Read more
Ever wonder about Microsoft’s product roadmap? With Microsoft rapidly releases products and services, they realized the need to provide better transparency. Well wonder no more! They have released the Cloud Platform Roadmap Site (visit here).
Roadmaps are given for: Microsoft Azure, Intune, Power BI, and Visual Studio Online; server… Read more
Long has the question been asked “Which Microsoft tool do I use for dashboards?”. SSRS, Excel, PowerView, Report Builder and PerformancePoint are all candidates. But that has all changed, and the future of dashboarding is here: Power BI Designer.
The Power BI Designer is a new companion application for Power… Read more
The following blog describes how to use certain cloud-based Power BI for Office 365 products (Power View and Q&A) on a Power BI Site that will reference data on-premises. To do this you will use certain on-premise Power BI products (Power Pivot and Power Query) to gather the data. Note… Read more
The Analytics Platform System (APS), which is a renaming of the Parallel Data Warehouse (PDW), has just released an appliance update (AU3), which is sort of like a service pack, except that it includes many new features. These appliance updates are made available frequently, about every 3-4 months.… Read more
The traditional data warehouse has served us well for many years, but new trends are causing it to break in four different ways: data growth, fast query expectations from users, non-relational/unstructured data, and cloud-born data. How can you prevent this from happening? Enter the modern data warehouse, which is… Read more
The Azure Data Factory is a service designed to allow developers to integrate disparate data sources. It is a platform somewhat like SSIS in the cloud to manage the data you have both on-prem and in the cloud.
It provides access to on-premises data in SQL Server and cloud data… Read more
I have previously blogged about the Fast Track Data Warehouse, a reference configuration optimized for data warehousing (see Microsoft SQL Server Reference Architecture and Appliances). That was version 3.0 of the reference configuration that was for SQL Server 2008 R2. Released a couple of years ago was version 4.0… Read more