Microsoft Fabric and PySpark: Coding tricks to improve your solutions
PySpark has some unconventional syntaxes which provide power to the development process, making it easier.
2024-10-02
PySpark has some unconventional syntaxes which provide power to the development process, making it easier.
2024-10-02
This article looks at the old style of calculating a running total and how to convert that to code that uses a window function.
2024-09-30
2,010 reads
This tip covers the differences and similarities between SQL Server index rebuild and index reorganize operations.
2024-09-30
Azure Data Studio (ADS) is a lightweight IDE built...
2024-09-27
746 reads
Many years ago, my wife and I wrote an article for Datamation, a major trade publication at the time, under the title, “Don’t Warehouse Dirty Data!” It’s been referenced quite a few times over the decades but is nowhere to be found using Google these days. The point is, if you have written a report using data, you have no doubt felt the pain of dirty data and it is nothing new.
2024-09-27
Join over 170 global data experts offering more than 150 sessions designed for all skill levels. Explore the schedule to plan your days on-site, secure your ticket, and book your travel before prices increase.
2024-09-27
This article explores how raising the Cost Threshold for Parallelism (CTFP) affects missing index recommendations in SQL Server along with examples.
2024-09-25
Learn a step-by-step method to save your ADF pipelines in Git and automatically release new changes to test and production.
2024-09-23 (first published: 2023-10-16)
2,217 reads
Have you ever surfed the internet and felt overwhelmed by personalized advertisements that appear to know your every desire? Have you ever been amazed at how accurate some weather forecasts can be? These actions are based on data processing. EsProc SPL and SQL (note, SPL stands for Structured Processing Language and SQL stands for Structured Query Language). SQL is a major language widely encountered and used for data manipulation within relational databases. While EsProc SPL is another effective tool for data processing, especially suited for complex computations and in-memory operations, it is less commonly encountered.
2024-09-23
Calculating elapsed time between time values can be an interesting problem that is easily solved even when data anomalies are present.
2024-09-20 (first published: 2013-09-26)
18,245 reads
By Brian Kelley
It's like disaster recovery (and business continuity) planning is the end-of-term research paper that...
Look, we all know data is king these days. But having mountains of data...
This topic keeps coming up with my customers so the purpose of this blog...
Hi, how to send emails on Azure Hyperscale server and which sp we...
I was trying to access SSRS report in host server browser , throwing an...
Sorry, more bakery stuff. Say I buy ingredients over time and I want to...
If I have durable memory-optimized tables, what is the starting point for estimating storage space?
See possible answers