Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
All questions, symptoms of errors are described with possible resolution below:
Why I'm seeing invalid source error from scanning job.
- There can be two reasons why you see this error:
- The delta table doesn't exist in the location
- The data in the file aren't in a valid delta format.
 
I'm setting up data quality scans for my Fabric delta tables. I see all data assets in the data quality view, I selected one asset and applied rules for data quality scanning, but the scan is failing.
- There can be number of reasons why your data quality scanning is failing:
- Data in tables aren't correct delta format. Make sure that your data are in delta format.
- Make sure the data map scan ran successfully, if not then rerun data map scan.
- Delete any previous data quality runs for the data asset.
 
Why I'm seeing this error message: No connection can be used. Try to create connection first?
- To profile data or to run data quality scanning, you first need to configure data source connection. This alert disappears after you have created a data source connection.
Why is the total count of profiled data showing less than the total count in my Azure Data Lake Storage Gen2 delta table?
- Microsoft Purview Data Quality is using 1 Million sample size for profiling. This sample is taken randomly. If your delta table has more than million records, then total count won't match.
Why do I see an action about data quality score is missing for a data product, I see the score in the data product when I browsed the data product view.
- When the action was created, there wasn't any data quality score for that data product. Data quality scanning ran after the action was created and the score published for the data product. Recommend to close the action once the remediation is done to avoid confusion.
Data quality rule creation from "Suggest rules" throws an error about a "date" column when trying to add all 30 suggested rules
- This is because the schema data type is unsupported state in the data quality schema view. You could change the data type to date by selecting the schema management toggle and save it. After you changed the data type you should be able to add the rule.
When trying to add all suggested rules it throws error about "ObserverId already exists"
- Most likely, the same/identical rule has already been added to a column. When you try to add same/identical rule to a column the application throws this error message.
Why my scheduled job is skipping instead of running? I see the Skipped for data quality scanning jobs
- The DQ Job has a functionality to check and run DQ only if there has been changes since the last run, which is performed to check the delta history. Skipped merely means there have been no changes in the data since last run and the spark run for DQ isn't performed. Skipped!= Failed
When I select profile data tab, I see number of columns preselected. Can I change the selected columns?
- Microsoft Purview Data Quality is using an AI assisted profiling solution. Preselected columns are selected using the Microsoft Purview Data Profiling AI. You can deselect preselected columns and reselect based on criticality of the columns and select save and run to run profiling.
Why I can't select some of the data assets from data quality asset list page to profile and scan?
- There can be few reasons:
- Those data assets are published from unsupported data sources
- The file format of those data assets isn't supported
 
Why my profiling job is failing for the supported data sources?
- Check the schema to make sure that there's no column name with spaces. The current version doesn't support column names with spaces.
Why I can't run data quality scanning and data profiling for CSV, tsv, and text files?
- Microsoft Purview Data Quality currently supports the Delta format of Parquet, Delta, Iceberg ORC, and Iceberg AVRO. Microsoft Purview Data Quality doesn't support CSV, tsv, and text files.
Why don't I see the data quality freshness rule in the rule list?
- Data quality freshness isn't supported for Azure SQL tables. If your data asset is an Azure SQL table, then the freshness rule won't be listed to select and apply to the data asset.
Why I see datatype Undefined for some columns of a data asset schema in DQ schema page?
- It seems data type for all columns hasn't been identified correctly. You can import the schema to resolve the issue (to update the datatype). Select the schema menu item from the data duality overview page, select the Schema management toggle, and select Import schema. After importing the schema, select the schema management toggle again to save the updated schema. 
- Data quality freshness isn't supported for Azure SQL tables. If your data asset is an Azure SQL table, then the freshness rule won't be listed to select and apply to the data asset. 
My DQ scan job failed. I see an error message 'Internal service error occurred, please retry, or contact Microsoft support.' What should I do to troubleshoot?
- There can be many reasons the scan is failing with this error message:
- User isn't authorized to perform the current operation for the workspace that user is trying to access for the data quality scan.
- Error code 403, meaning access to data sources is forbidden temporarily.
- Granted access to the data source for your managed identity (MSI) has expired.
- Microsoft Purview managed identity (MSI) needs contributor access to the Microsoft Fabric workspace. If the contributor access for the Microsoft Purview MSI hasn't been provided to the Microsoft Fabric workspace, then the data quality scan fails.
 
Why am I getting delta format error even though I'm using delta format?
- We support Spark 3.4 Delta 2.4. Make sure that you're using delta lake version 2.4.
Why am I seeing the error when I selected a reference data asset to configure Table lookup rule?
- The reason is you have selected a data asset that isn't part linked or referred to a data product under the same governance domain. To select the right data asset:
- Select select reference table, as indicated in this image:   
- Cancel current selection, as indicated in this image:   
- After canceling the current selection, select another asset. 
 
How can I configure access to data source for Microsoft Purview MSI?
- Here's MSI configuration guide. You find the details in this document.
All our data sources are behind the private end point (in virtual network), Can Microsoft Purview access data in virtual network for data quality scanning?
- Yes, Microsoft Purview supports Managed Virtual Network for data quality scanning. See the Microsoft Purview managed virtual network configuration article.
Where can I find good documentation about expression function to create custom rules?
- You find the documentation references and examples in the Data Quality rule page.
Why is my data quality scan for Fabric Lakehouse table failing?
- There can be many reasons:
- Make sure that your lakehouse tables are discoverable in Data Map with schema. 
- Make sure that you're using SPN for Data Map scan and MSI for DQ scan 
- Make sure that you have configured DQ connection with MSI 
- Make sure that Microsoft Purview MSI has contributor access to your fabric workspace 
- Enable OneLake setting: Users can access data stored in OneLake with apps external to Fabric   
- Learn how to configure data quality for Fabric Lakehouse. 
- Learn how to configure a Data Map scan setup for Fabric. 
 
