Using the FP-Growth Algorithm to Mine Useful Patterns in Data
This article looks at using the FP-Growth algorithm from Python to mine data in SQL Server.
This article looks at using the FP-Growth algorithm from Python to mine data in SQL Server.
Introduced in SQL Server 2025 CTP 1.3, the PRODUCT() function acts similarly to SUM(), but multiplies values rather than adds them. It is an aggregate function in SQL Server and therefore operates on a data set, rather than on scalar values.
We have some requirements that we meet a particular setting or value, but often we have guidelines and recommendations. Do you know the difference?
This article shows how you can easily create connections in your Power BI workspace that use Identity Authentication to connect to your data.
The advantage of using triggers is that the same processing can occur regardless of where or how the data has been inserted, updated or deleted. In this article, we look at several examples of where and why triggers could be useful along with an example use case.
Many of the GenAI services are using the free model of the past, where they use your data in ways you might not expect. Now, a court is ensuring OpenAI keeps your chat data around.
In Level 1 of the Stairway to Azure SQL Hyperscale, we learn about the architecture and create a hyperscale instance.
In this article, we’ll revisit the dimension models we created. We wrote the entire SQL statement for the dimension by hand, and the dimensions themselves were very rudimentary; they lacked a surrogate key and there were no audit columns (such as insert date and update date). We’ll show you how we can expand the dimensions using Jinja, but also how we can minimize development effort by baking reusable patterns into the Jinja code.
Page splits are an often-overlooked performance killer in SQL Server. In this article, we take a forensic look at how serial inserts differ from mid-table inserts, revealing why inserting rows out of order causes hidden page splits, increased IO, and fragmentation. Using a wide-column table, we demonstrate both scenarios and decode their impact with page-level analysis.
Efficient query performance in Amazon Redshift often comes down to how well you manage...
By gbargsley
Welcome back to PowerShell Strikes Back. We’re three weeks in, and the training is...
By Steve Jones
Last week I was honored with a trip to Rome for the Redgate President’s...
Comments posted to this topic are about the item Why Your Index Isn't Being...
Comments posted to this topic are about the item Distance Metric Algorithms
Comments posted to this topic are about the item What Can AI Really Do?
What are the distance metric algorithms that can be used in VECTOR_DISTANCE()?
See possible answers