Think LSNs Are Unique? Think Again - Preventing Data Loss in CDC ETL
When using only an LSN to track which changes you have processed, you can lose data. This article shows how to avoid this problem.
2025-12-15
2,337 reads
Over the past few years, dbt (short for data build tool) has become popular in the data engineering world for handling ELT processes. dbt takes on the role of the “T”: it is responsible for transforming data that has already been loaded into a data store.
2024-12-23
Learn the basics of Apache Druid and how it can be used to ingest data into a data lake.
2023-11-10
14,573 reads
Learn about the evolutionary journey of ETL (Extract, Transform, Load) from traditional processes to modern cloud solutions.
2023-10-18
This article looks at manipulating data using Python to unpivot data received in a cross tab format.
2023-06-30
3,370 reads
A comparison of the lambda and kappa architectures for real time ingestion and analysis of big data.
2023-06-19
5,338 reads
This Python 3 script is designed to take CSV file data pasted into the csv_data variable and generate SQL insert statements that can be used to insert the data into a MySQL database. The script is easy to use and can save you a lot of time when working with large amounts of data.
2023-05-08 (first published: 2023-05-05)
5,682 reads
Data movement is a fundamental piece of a data engineer’s duties, and recently I’ve been thinking about the art of data movement. What are some of the most important pieces that a data engineer needs to think about when confronted with data ingestion? There is of course data exporting as well, and in that case, […]
2022-11-11
6,047 reads
2021-04-23
633 reads
An ETL process involving Excel runs into problems because the users weren't aware of a limitation.
2020-10-19
305 reads