Analyze data linege from a database
The dlineage tool can connect to the database instance and analyze the metadata to generate the data lineage automatically.
Connect and analyze data lineage
Use /fromdb
parameter to export metadta from the database.
/fromdb
parameter:
-dbVendor: Database type, Use colon to split dbVendor and version if specific version is required. (:, such as dbvmysql:5.7)
-host: Database host name (ip address or domain name)
-port: Port number
-db: Database name
-user: User name
-pwd: User password
-extractedDbsSchemas: Export metadata under the specific schema. Use comma to split if multiple schema required. We can use this flag to improve the export performance.
-excludedDbsSchemas: Exclude metadata under the specific schema during the export. Use comma to split if multiple schema required (such as ,). We can use this flag to improve the export performance.
-extractedViews: Export metadata under the specific view. Use comma to split if multiple views required (such as ,). We can use this flag to improve the export performance.
/exportonly
just export metadata.json, no further data analysis.
/metadataoutput
specifies the metadata output directory and file name.
for example, connect to an Oracle database and analzye the data lineage.
java -jar data_flow_analyzer.jar /fromdb "-dbVendor dbvoracle -host 127.0.0.1 -port 1521 -db orcl -user scott -pwd tiger"
Export the meatadata only
You can also export the meatadata from database and analzye the metadata in two steps:
Only export the metadta
java -jar data_flow_analyzer.jar /fromdb "-dbVendor dbvoracle -host 127.0.0.1 -port 1521 -db orcl -user scott -pwd tiger" /exportonly /metadataoutput metadata.json
the metadata.json exported in this step can also be used with /env
paramter to resolve the ambiguous columns problem in SQL query.
analyze the metadta that generated in the previous step
java -jar gudusoft.dlineage.jar /t oracle /f metadata.json
Last updated