Why do you need to use a complex data lineage graph to work out where specific data goes in your data platform? When you can track n trace that data just like you track and trace your courier parcel
How does it respond to the situation where the exact same data value exists in different tables?
It traverses the Data Map to identify the Tiles that are upstream (lets call that a Bounded Data Flow)
This reduces the number of tables that it search for the data value.
It then searches every table in the Bounded Data Flow and records the number of times it finds that value in each Tile.
It searches every column in the Tile.
I have a backlogged item to either allow me to pick the column, or show the count per column in the Tile.
Not a priority right now, as I tend to open the Tile and use Data Search to see the columns that hold it in a Tile.
As always a few clicks I can remove when it makes the priority.
But im not sure if count by column or Menu Anywhere to display the value in the Tile Details is the biggest value.
So will keep using it until one of those (or another pattern) becomes the obvious next iteration.
Now im overthinking it.
Should it be a Data Flow Ensemble rather than a Bounded Data Flow ……..
How does it respond to the situation where the exact same data value exists in different tables?
It traverses the Data Map to identify the Tiles that are upstream (lets call that a Bounded Data Flow)
This reduces the number of tables that it search for the data value.
It then searches every table in the Bounded Data Flow and records the number of times it finds that value in each Tile.
It searches every column in the Tile.
I have a backlogged item to either allow me to pick the column, or show the count per column in the Tile.
Not a priority right now, as I tend to open the Tile and use Data Search to see the columns that hold it in a Tile.
As always a few clicks I can remove when it makes the priority.
But im not sure if count by column or Menu Anywhere to display the value in the Tile Details is the biggest value.
So will keep using it until one of those (or another pattern) becomes the obvious next iteration.
Now im overthinking it.
Should it be a Data Flow Ensemble rather than a Bounded Data Flow ……..