Handling Row-level Errors in ADF Data Flows

diponkar.paul, 2021-06-16 (first published: 2021-05-02)

If you are working with ADF (Azure Data Factory) data flows, then you may have noticed there was a new feature released in November 2020, which is useful to capture any error while inserting/updating the records in a SQL database. This article will describe how to setup the error row handling feature and why it's important to set up this feature. This feature is shown below in the sink database.

Fig 1: Error row handling at sink database

For error handling there are two options to choose from:

1. Stop on first error (default)
2. Continue on error

These options are shown below in the drop down.

Fig 2: Error row handling options

By default, ADF pipeline will stop at the first error. However, the main purpose of this feature is to use the Continue on error option to catch and log the error so that we can look at later and take action accordingly.

Let's change the settings to catch errors. The below figures show the settings and will also describe each item. Please follow the numbering of each item in Fig 3.

Fig 3: Settings Continue on error1) Error row handling: Since we wanted to catch errors, we have chosen "Continue on error".

2) Transaction Commit: Choose whether the data flow will be written in a single transaction or in batches. I have chosen single, which means whenever there is failure, it will store the record that failed. Batch will store the error records when the full batch is completed.

3) Output rejected data: You need to check this box to store the error rows. The whole point of error row handling is you want to know the error records. If so, please tick check mark. Though you can avoid this, if there are any errors, you will not know which record(s) caused the error.

4) Linked Service: Put the linked service and test the connection

5) Storage folder path: This is the path where you would like to store the error records from a file.

6) Report success on error: I don't select this checkbox, since I want to know if there is a failure.

After changing the settings, when you run the pipeline and there is any error in the dataset, it will be stored in the storage folder you have provided at no 5 in the settings.

In general, when there is a failure at the time of inserting records to the database, it takes some time to find the reason for failure. You may have to go through large chunk of your dataset to find the root cause. Through this feature, the error records will be captured and stored in the storage so you will be able to identify the reason for any error very quickly. And if you would like to ingest those error rows, then you can fix those records and re-run the pipeline.

Working with NULL Values in an ADF Data Flow?

by diponkar.paul

SQLServerCentral

Azure Data Factory

Learn how to replace NULL values in an ADF data flow.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(1)

You rated this post out of 5. Change rating

2021-01-14

46,121 reads

Discuss

How to Implement CI/CD in Azure Data Factory (ADF)

by diponkar.paul

SQLServerCentral

Learn how you can use CI/CD with your ADF Pipelines and Azure DevOps using ARM templates.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(3)

You rated this post out of 5. Change rating

2023-03-10

13,703 reads

Discuss

Understanding the Mapping Data Flow Activity in Azure Data Factory

by Randheer Parmar

SQLServerCentral

ETL/SSIS/Azure Data Factory

Learn about using Mapping Data Flows in Azure Data Factory.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(2)

You rated this post out of 5. Change rating

2022-04-25

9,702 reads

Discuss

How to Recover a Corrupted Azure Data Factory Integration Runtime

by diponkar.paul

SQLServerCentral

I would like to share my recent experience with Azure Data Factory (ADF) where AutoResolveIntegrationRuntime become corrupted and how did I recover it. I still don't know how the Integration Runtime (IR) was corrupted. However, if it happens, then this article will help you to solve the issue. Problem In general, the ADF AutoResolveIntegrationRuntime should […]

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(2)

You rated this post out of 5. Change rating

2024-09-02 (first published: 2021-11-05)

4,074 reads

Discuss

Dynamically Add a Timestamp To Files in Azure Data Factory

by diponkar.paul

SQLServerCentral

This article will describe how to add your local timestamp at the end of the each file in Azure Data Factory (ADF). In general, ADF gets a UTC timestamp, so we need to convert the timestamp from UTC to EST, since our local time zone is EST. For example, if the input Source file name […]

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

(2)

You rated this post out of 5. Change rating

2021-04-22

30,286 reads

Discuss

Handling Row-level Errors in ADF Data Flows

Rate

Share

Tags

Share

Rate

Handling Row-level Errors in ADF Data Flows

Rate

Share

Tags

Share

Rate

Related content

Working with NULL Values in an ADF Data Flow?

How to Implement CI/CD in Azure Data Factory (ADF)

Understanding the Mapping Data Flow Activity in Azure Data Factory

How to Recover a Corrupted Azure Data Factory Integration Runtime

Dynamically Add a Timestamp To Files in Azure Data Factory