Integration
3rd party apps ..
Last updated
Was this helpful?
3rd party apps ..
Last updated
Was this helpful?
x
Pentaho Data Quality (PDQ) integrates with Pentaho Data Catalog (PDC) enabling the user to pull metadata information from PDC to semantic layer in PDQ.
The following information is pulled from PDC:
Connections
Domains
Tags
Before you connect to PDC and pull the metadata, consider the mappings between PDC & PDQ.
● A Term in the attribute level will pulled and mapped to the same attribute in PDQ for the asset.
● The Tags will also be mapped for the same assets.
● The PDC Glossary will be added as a domain to the asset in PDQ - with prefix HitachiPDC_
● The creation and assignment of metadata will occur in a PDC job.
● The data source in PDQ needs to be the same name in PDQ.
The following is the mapping between PDC & PDQ:
Connections
Data Source
Domains
Glossary
Tags
Tags
Terms
Terms
Click on the Hitachi | Pentaho Data Catalog Tile.
Enter in the following connection details:
Host
https://pdc.pdc.lab
User
system_admin@hv.com
Password
Welcome123!
Limitation
The integration is only supported for the following connectors :
● MSSQL
● Oracle
● Snowflake
If a term is created under a category, then only the parent glossary will be created in PDQ
The username and password has to be provided by the user and the asset has been selected for a connection in PDQ. The integration will not pull the username and password of a data source connection from PDC.
Select all the Field options
Validate & save the connection.
The user can use the refresh button to refresh the integration data in PDQ.
The refresh will only refresh the semantic section of PDQ and not update the mappings to the asset until the Catalog update task is triggered as part of the asset run.
x
x