Think LSNs Are Unique? Think Again - Preventing Data Loss in CDC ETL
When using only an LSN to track which changes you have processed, you can lose data. This article shows how to avoid this problem.
2025-12-15
2,358 reads
Over the past few years, dbt (short for data build tool) has become popular in the data engineering world for handling ELT processes. dbt takes on the role of the “T”: it is responsible for transforming the data inside a data store.
2024-12-23
Learn the basics of Apache Druid and how it can be used to ingest data into a data lake.
2023-11-10
14,573 reads
Learn about the evolutionary journey of ETL (Extract, Transform, Load) from traditional processes to modern cloud solutions.
2023-10-18
This article looks at manipulating data using Python to unpivot data received in a cross tab format.
2023-06-30
3,370 reads
A comparison of the lambda and kappa architectures for real-time ingestion and analysis of big data.
2023-06-19
5,343 reads
This Python 3 script is designed to take CSV file data pasted into the csv_data variable and generate SQL insert statements that can be used to insert the data into a MySQL database. The script is easy to use and can save you a lot of time when working with large amounts of data.
2023-05-08 (first published: 2023-05-05)
5,685 reads
Data movement is a fundamental piece of a data engineer’s duties, and recently I’ve been thinking about the art of data movement. What are some of the most important pieces that a data engineer needs to think about when confronted with data ingestion? There is of course data exporting as well, and in that case, […]
2022-11-11
6,049 reads
2021-04-23
633 reads
An Excel limitation that the users didn't know about causes problems in an ETL process.
2020-10-19
306 reads