Delta Lake over Spark Connect
All Spark Connect Posts
I have just finished an update for the spark connect dotnet lib that contains the DeltaTable implementation so that we can now use .NET to maintain...
2025-06-20
8 reads
All Spark Connect Posts
I have just finished an update for the spark connect dotnet lib that contains the DeltaTable implementation so that we can now use .NET to maintain...
2025-06-20
8 reads
I recently encountered an interesting issue with ADF where the publish feature suddenly attempted to republish every single object, claiming they were new, despite having incrementally published changed objects...
2025-07-02 (first published: 2025-06-15)
170 reads
All Spark Connect Posts
Introduction There have been quite a few changes in the last couple of months and I just wanted to give a quick update on the current...
2024-11-16
14 reads
All Spark Connect Posts
I recently published the latest version of the Spark Connect Dotnet library which includes support for the new Variant data type in Apache Spark 4.0 here....
2024-10-12
15 reads
All Spark Connect Posts
Code
What to do? At some point we will want to do something with delta lake and so I wanted to explore the options. Before we do...
2024-02-17
3 reads
All Spark Connect Posts
Code
Goal of this post The goal of this post is to look at creating a SparkSession and a DataFrame that will wrap the Range relation and...
2024-02-10
34 reads
All Spark Connect Posts
Code
Goal of this post I wanted to explore what the Spark Connect API looked like from other languages, I am not a php developer - I...
2024-02-10
83 reads
All Spark Connect Posts
Code
Goal of this post So there are two goals of this post, the first is to take a look at Apache Arrow and how we can...
2024-02-05
71 reads
All Spark Connect Posts
Code
Goal of this post In this post we will continue looking at the gRPC API and the AnalyzePlan method which takes a plan and analyzes it....
2024-02-01
103 reads
All Spark Connect Posts
Code
Goal of this post In the first two posts, we looked at how to run some Spark code, firstly against a local Spark Connect server and...
2024-01-26
277 reads
By Steve Jones
The PASS Summit goes on tour this year, with the final date the first...
By Arun Sirpal
The Business Critical tier of Azure SQL Managed Instance offers the read-scale out feature...
By Rohit Garg
Cloud computing is essential for modern development, data storage, and scalable applications. Setting up...
Comments posted to this topic are about the item How a Legacy Logic Choked...
Comments posted to this topic are about the item Sometimes, Troubleshooting Is Hard
Comments posted to this topic are about the item Dimensional Modeling Case Study, Part...
I have this table in SQL Server 2022:
CREATE TABLE CustomerLarge (CustomerID INT NOT NULL IDENTITY(1, 1) CONSTRAINT CustomerLargePK PRIMARY KEY CLUSTERED , CustomerName VARCHAR(20) , CustomerContactFirstName VARCHAR(40) , CustomerContactLastName VARCHAR(40) , Address VARCHAR(20) , Address2 VARCHAR(20) , City VARCHAR(20) , CountryCode CHAR(3) , Postal VARCHAR(20) ) GOIf I check the columns_updated() function return in a trigger, what is the data returned? See possible answers