Microsoft made available the first technical preview of its new Microsoft Azure Stack offering today. It was announced last week. Azure Stack brings the cloud model of computing to every datacenter – your own private cloud. Azure Stack is a new hybrid cloud platform product that enables organizations to… Read more
This blog describes the various approaches you can use to migrate an on-premises SQL Server database to Azure SQL Database.
In this migration process you migrate both your schema and your data from the SQL Server database in your current environment into SQL Database, provided the existing database passes… Read more
Azure SQL Database is a relational database-as-a-service in the cloud. It uses a special version of Microsoft SQL Server as its backend that is nearly identical to SQL Server (see Azure SQL Database Transact-SQL differences). While there are many benefits to using SQL Database over SQL Server, in this… Read more
Previously I covered what a data lake is (including the Azure Data Lake and enhancements), and now I wanted to touch on the main reason why you might want to incorporate a data lake into your overall data warehouse solution.
To refresh, a data lake is a landing zone,… Read more
Microsoft Azure is a cloud computing platform and infrastructure, created by Microsoft, for building, deploying and managing applications and services through a global network of Microsoft-managed and Microsoft partner-hosted datacenters. Included in this platform are multiple ways of storing data. Below I will give a brief overview of each so… Read more
So you have data in Azure Blob Storage and are concerned about reliability. Have no fear! There are four replication options for redundancy:
1. Locally Redundant Storage (LRS): All data in the storage account is made highly durable and available within a facility/datacenter by replicating transactions synchronously to three different… Read more
In a previous blog I talked about copying on-prem data to Azure Blob Storage (Getting data into Azure Blob Storage). Let’s say you have copied the data and it is sitting in Azure Blob Storage (or an Azure Data Lake) and you now want to copy it… Read more
- The Azure Data Lake has been renamed to the Azure Data Lake Store. The…
If you have on-prem data and want to copy it to Azure Blob Storage in the cloud, what are all the possible ways to do it? There are many, and here is a quick review of them:
AzCopy: A popular command-line utility designed for high-performance uploading, downloading, and copying… Read more
I see a lot of confusion about the place and purpose of the many new database solutions (“NoSQL databases”) compared to the relational databases solutions that have been around for many years. So let me try to explain the differences and best use cases for each.
First lets clarify these… Read more
In my Introduction to Hadoop I talked about the basics of Hadoop. In this post, I wanted to cover some of the more common Hadoop technologies and tools and show how they work together, in addition to showing how they work well with Microsoft technologies and tools. So you don’t… Read more
The Analytics Platform System (APS), which is a renaming of the Parallel Data Warehouse (PDW), has just released an appliance update (AU4), which is sort of like a service pack, except that it includes many new features. Below is what is new in this release:
AU4 continues to… Read more
Yesterday at the Microsoft World Wide Partner Conference in Orlando Microsoft announced the Cortana Analytics Suite, which is a new package of data storage, information management, machine learning, and business intelligence software in a single convenient monthly subscription. Microsoft’s Cortana personal digital assistant, until now available to consumers on mobile… Read more
Just announced is the Microsoft Azure Data Catalog, which is an enterprise metadata catalog / portal for the self-service discovery of data sources. It becomes available on Monday next week, July 13, 2015. Check out this short video on it. My response to this is – woo hoo! I have… Read more
Polyglot Persistence is a fancy term to mean that when storing data, it is best to use multiple data storage technologies, chosen based upon the way data is being used by individual applications or components of a single application. Different kinds of data are best dealt with different data stores.… Read more
Microsoft Azure Stream Analytics (ASA) is a fully managed cloud service for real-time processing of streaming data. ASA makes it easy to set up real-time analytic computations on data flowing in from devices, sensors, web sites, applications and infrastructure systems. It supports a powerful high-level SQL-like language that dramatically simplifies… Read more
Massive parallel processing (MPP) is the future for data warehousing.
So what is MPP? SQL Server is a Symmetric Multiprocessing (SMP) solution, which essentially means it uses one server. MPP provides scalability and query performance by running independent servers in parallel. That is the quick definition. For more… Read more
SQL Server 2016 was recently announced. Top new features include:
- Always Encrypted protects data at rest and in motion. With Always Encrypted, SQL Server can perform operations on encrypted data and best of all, the encryption key resides with the application in the customers trusted environment. Encryption and decryption of…
At the recent Microsoft Build Developer Conference, Executive Vice President Scott Guthrie announced the Azure Data Lake. It is a new flavor of Azure Storage which can handle streaming data (low latency, high volume, short updates), is geo-distributed, data-locality aware and allows individual files to be sized at… Read more
Analytics Platform System (APS) is Microsoft’s massively parallel processing (MPP) data warehouse technology. This has only been available as an on-prem solution (see video Overview of Microsoft Analytics Platform System). Until now. At the recent Microsoft Build Developer Conference, Executive Vice President Scott Guthrie announced the… Read more