An Apache Spark-based analytics platform optimized for Azure.
Based on the current capabilities of Microsoft Purview and the architecture you are using (Azure Databricks with Unity Catalog feeding Power BI), the behavior you are observing is expected. Purview is able to extract lineage inside Azure Databricks—both table-level and column-level from Unity Catalog, and it can also extract dataset, model, report, and dashboard lineage inside Power BI. However, the platform does not automatically connect these two lineage graphs. Even if Power BI is sourcing data directly from Databricks, Purview does not receive the metadata needed to infer that upstream relationship, so Databricks lineage and Power BI lineage appear as two disconnected graphs.
Regarding configuration, your scanning setup on both sides is correct. Databricks requires the workspace and catalogs to be registered and scanned with a scanning identity that has Metadata Reader or equivalent permissions in Unity Catalog. Power BI requires tenant registration, admin consent for metadata scanning, and full dataset/report scanning. Even with all prerequisites met, Purview still cannot stitch the lineage, because it connects services only when it detects a metadata-visible data movement or transformation activity. These activities are usually produced by orchestrators like Azure Data Factory, Synapse pipelines, or supported Spark lineage connectors. A direct Power BI → Databricks connection does not expose an activity or data-movement event that Purview can correlate.
This leads to the core limitation: Purview currently cannot produce an automatic end-to-end lineage from a Databricks Unity Catalog table through transformations and into Power BI datasets, even though it successfully captures lineage inside each platform. Additionally, transformation‑level logic (SQL/Python from Databricks notebooks) is not extracted today, so Purview cannot display the internal transformation steps between layers.
If end-to-end lineage is required, there are supported ways to approximate it. Some customers introduce an orchestrator such as Azure Data Factory between Databricks and the downstream consumption layer, because ADF activities generate lineage events that Purview can use to link upstream and downstream assets. Another option is to use the Databricks → Purview lineage connector (OpenLineage), which improves dataflow visibility inside Databricks, although it still does not create automatic stitching to Power BI. For cases where a complete governance view is mandatory, Purview allows manual lineage creation through the UI or through the Apache Atlas API, enabling you to draw the missing link between Databricks assets and Power BI datasets.
The scenario you described is a current product limitation rather than a configuration issue. Purview supports Databricks lineage and Power BI lineage individually, but the platform does not yet support native, automatic stitching between these systems. ADF-based orchestration or manual lineage creation are presently the only ways to form a complete end-to-end picture.
Hope this helps, Please let us know if you have any questions and concerns.