Settings
https://github.com/sqlparser/sqlflow_public/blob/master/sqlflow_guide_cn.md#setting
Last updated
https://github.com/sqlparser/sqlflow_public/blob/master/sqlflow_guide_cn.md#setting
Last updated
Input different request parameters to the graph API to get different results:
showRelationType
If direct dataflow = true then it is fdd and we will show only direct dataflow. If direct dataflow = true and indirect dataflow=true, then it is fdd,fddi,fdr,frd. We will show both direct dataflow and indrect dataflow.
dataflowOfAggregateFunction
Whether take the parameters in the COUNT function as direct or indirect dataflow
ignoreRecordSet
true or false
ignoreFunction
true or false
showConstantTable
true or false
showTransform
true or false
We can observe the changes on parameters when switching different modes:
You will be able to set the config when creating data lineage with the SQL Edior as well as giving the configurable parameters under the setting
section during the job creation.
Customizing parameters when visualizing your SQL:
Giving the configurable parameters under the setting
section during the job creation:
direct dataflow
On/Off
Whether show direct dataflow or not.
indirect dataflow
On/Off
Whether show indirect dataflow or not.
dataflow of count function
direct/indirect
Whether take the parameters in the COUNT function as direct or indirect dataflow
show intermediate recordset
On/Off
Show intermediate recordset or not.
show function
On/Off
Show function
or not
show transform
On/Off
Show transform
or not
show constant
On/Off
show constant
or not
Take the following sql as example:
When we set all values as On
and direct
as the value in dataflow of count function when creating the job/visualizing the SQL, we will get following data lineage:
All direct dataflows and indirect dataflows are present in above data lineage. We show the functions in the sql and intermediate recordSets are shown.
The indirect dataflows will not be present if we turn Off
indirect dataflow:
You can read more theory about what is direct dataflow/indirect dataflow and why aggrect function COUNT is different.