End to End Lineage not visible between Azure Databricks and Power BI in MS Purview

SudhakarReddy Marepalli 0 Reputation points
2026-02-12T19:13:57.9666667+00:00

Issue Description

We are implementing Microsoft Purview for enterprise data governance for a client.

The client’s data platform consists of:

  • Azure Data Factory for orchestration
  • Azure Databricks with Unity Catalog for data storage and transformations
  • Power BI for analytics and reporting

We have completed the following successfully:

  • Scanned Azure Databricks Unity Catalog and captured table-level and column-level lineage across Databricks layers
  • Scanned Power BI and can view lineage for datasets, reports, and dashboards

However, the lineage is currently disconnected:

  • Databricks lineage appears independently
  • Power BI lineage appears independently
  • There is no end-to-end lineage connection from Databricks source tables through transformations and into Power BI datasets, reports, and dashboards
  • Transformation logic is not shown anywhere in the lineage

Requirement

The client requires a single, end-to-end lineage view in Microsoft Purview that:

  • Starts from Databricks source tables
  • Flows through all Databricks transformation layers
  • Shows the transformation logic, or at least the details of what each transformation does
  • Extends to downstream Power BI datasets, reports, and dashboards consuming those tables

Questions

Could you please confirm:

  1. What is the supported method in Microsoft Purview to enable cross-platform lineage from Azure Databricks Unity Catalog tables to Power BI datasets, reports, and dashboards?
  2. Are there any specific scanner configurations, permissions, or prerequisites required on either the Databricks or Power BI side to allow Purview to associate upstream Databricks assets with downstream Power BI artifacts?
  3. Does Microsoft Purview rely on Azure Data Factory, direct dataset connections, or metadata correlation to establish lineage between Databricks and Power BI?
  4. Are there any known constraints that prevent Databricks-to-Power BI lineage from appearing as a single end-to-end flow?
  5. Are transformation-level or cross-service lineage enhancements planned or on the roadmap?
  6. If not supported, is this considered a product limitation or a configuration gap?

Business Impact

Without connected lineage across Databricks and Power BI, the client cannot achieve full transparency for data governance, impact analysis, and audit requirements.

Azure Databricks

An Apache Spark-based analytics platform optimized for Azure.

3 answers

  1. Manoj Kumar Boyini 9,180 Reputation points Microsoft External Staff Moderator
    2026-02-12T20:51:59.7333333+00:00

    Hi SudhakarReddy Marepalli,

    Based on the current capabilities of Microsoft Purview and the architecture you are using (Azure Databricks with Unity Catalog feeding Power BI), the behavior you are observing is expected. Purview can extract lineage inside Azure Databricks (both table-level and column-level, from Unity Catalog), and it can also extract dataset, model, report, and dashboard lineage inside Power BI. However, the platform does not automatically connect these two lineage graphs. Even if Power BI sources data directly from Databricks, Purview does not receive the metadata needed to infer that upstream relationship, so Databricks lineage and Power BI lineage appear as two disconnected graphs.

    Regarding configuration, your scanning setup on both sides is correct. Databricks requires the workspace and catalogs to be registered and scanned with a scanning identity that has Metadata Reader or equivalent permissions in Unity Catalog. Power BI requires tenant registration, admin consent for metadata scanning, and full dataset/report scanning. Even with all prerequisites met, Purview still cannot stitch the lineage, because it connects services only when it detects a metadata-visible data movement or transformation activity. These activities are usually produced by orchestrators like Azure Data Factory, Synapse pipelines, or supported Spark lineage connectors. A direct Power BI → Databricks connection does not expose an activity or data-movement event that Purview can correlate.

    This leads to the core limitation: Purview currently cannot produce an automatic end-to-end lineage from a Databricks Unity Catalog table through transformations and into Power BI datasets, even though it successfully captures lineage inside each platform. Additionally, transformation‑level logic (SQL/Python from Databricks notebooks) is not extracted today, so Purview cannot display the internal transformation steps between layers.

    If end-to-end lineage is required, there are supported ways to approximate it. Some customers introduce an orchestrator such as Azure Data Factory between Databricks and the downstream consumption layer, because ADF activities generate lineage events that Purview can use to link upstream and downstream assets. Another option is to use the Databricks → Purview lineage connector (OpenLineage), which improves dataflow visibility inside Databricks, although it still does not create automatic stitching to Power BI. For cases where a complete governance view is mandatory, Purview allows manual lineage creation through the UI or through the Apache Atlas API, enabling you to draw the missing link between Databricks assets and Power BI datasets.
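A minimal sketch of the manual-lineage option described above, assuming the Apache Atlas v2 entity model that Purview exposes: a `Process` entity whose `inputs` and `outputs` reference already-scanned assets by `qualifiedName`. The helper name `build_lineage_process`, the `manual_lineage://` qualified-name scheme, and the default type names `databricks_table` and `powerbi_dataset` are illustrative assumptions, not confirmed by this thread; check the actual type names and qualified names of your scanned assets in Purview before using anything like this.

```python
import json


def build_lineage_process(source_qualified_name, target_qualified_name,
                          source_type="databricks_table",   # assumed type name
                          target_type="powerbi_dataset",    # assumed type name
                          process_name="databricks_to_powerbi"):
    """Build an Atlas-style bulk payload containing one Process entity
    that links an upstream asset to a downstream asset."""

    def ref(type_name, qualified_name):
        # Reference an existing entity by its unique qualifiedName.
        return {
            "typeName": type_name,
            "uniqueAttributes": {"qualifiedName": qualified_name},
        }

    process = {
        "typeName": "Process",
        "guid": "-1",  # negative GUID: Atlas convention for a not-yet-created entity
        "attributes": {
            # Hypothetical naming scheme; any unique string works.
            "qualifiedName": (
                f"manual_lineage://{source_qualified_name}->{target_qualified_name}"
            ),
            "name": process_name,
            "inputs": [ref(source_type, source_qualified_name)],
            "outputs": [ref(target_type, target_qualified_name)],
        },
    }
    return {"entities": [process]}


if __name__ == "__main__":
    # Placeholder qualified names; copy the real ones from the asset pages in Purview.
    payload = build_lineage_process(
        "<databricks-table-qualified-name>",
        "<powerbi-dataset-qualified-name>",
    )
    print(json.dumps(payload, indent=2))
```

The resulting JSON would then be sent to the Purview account's Atlas v2 bulk entity endpoint with a Microsoft Entra ID bearer token. The exact endpoint path depends on the account type and API version, so it is deliberately left out of the sketch. One `Process` entity per Databricks table and Power BI dataset pair is the simplest design; it keeps each manual link independently removable once native stitching becomes available.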

    The scenario you described is a current product limitation rather than a configuration issue. Purview supports Databricks lineage and Power BI lineage individually, but the platform does not yet support native, automatic stitching between these systems. ADF-based orchestration or manual lineage creation are presently the only ways to form a complete end-to-end picture.

    Hope this helps. Please let us know if you have any questions or concerns.

    1 person found this answer helpful.

  2. SudhakarReddy Marepalli 0 Reputation points
    2026-02-16T16:32:09.97+00:00

    Hi Manoj,

    Thank you for your detailed reply. I wanted to check one more thing.

    Will I be able to see the transformation logic executed within Azure Databricks notebooks? If not fully supported, are there any recommended workarounds or alternative approaches to enable visibility of these transformation details?

    Thanks,

    Sudhakar

  3. Q&A Assist
    2026-02-12T19:14:13.76+00:00
    AI-generated content. This question contains an answer created with AI from Q&A Assist.

    To enable cross-platform lineage from Azure Databricks Unity Catalog to Power BI in Microsoft Purview, consider the following points based on the current capabilities:

    1. Supported Method for Cross-Platform Lineage: Microsoft Purview captures lineage for assets scanned through it. For end-to-end lineage to be visible, ensure that both Azure Databricks and Power BI assets are included in the scans. This means that lineage is only shown for objects that have been scanned and are connected through the data flow.
    2. Scanner Configurations and Permissions: Ensure that all relevant Databricks workspaces and Power BI datasets are scanned by Microsoft Purview. You may need to check that the necessary permissions are granted for Microsoft Purview to access and scan the assets in both Databricks and Power BI. Specific privileges may be required to link external metadata and lineage relationships.
    3. Lineage Establishment: Microsoft Purview does not inherently rely on Azure Data Factory for lineage between Databricks and Power BI. Instead, it captures lineage based on the assets scanned and their relationships. If Azure Data Factory is used for orchestration, it can facilitate lineage if it is also scanned and integrated into the lineage view.
    4. Known Constraints: There are limitations that may prevent Databricks-to-Power BI lineage from appearing as a single flow. For instance, if not all objects involved in the data flow are scanned, the lineage will be incomplete. Additionally, lineage may not be preserved for renamed objects or if certain transformation logic is not captured in the lineage system tables.
    5. Transformation-Level Enhancements: As of now, there is no specific information available regarding planned enhancements for transformation-level lineage or cross-service lineage in Microsoft Purview. It is advisable to keep an eye on updates from Microsoft regarding future capabilities.
    6. Product Limitation vs. Configuration Gap: If the desired lineage is not supported, it may be considered a product limitation rather than a configuration gap, as the current capabilities of Microsoft Purview dictate how lineage is captured and displayed.

    In summary, achieving a connected lineage view requires careful configuration of scans and permissions across both Azure Databricks and Power BI, along with an understanding of the limitations in the current lineage capabilities of Microsoft Purview.

