feat(ingest/looker): Added delta-lake to platform naming convention involving 2 parts
#12061
base: master
Based on the logic here
datahub/metadata-ingestion/src/datahub/ingestion/source/delta_lake/source.py
Lines 204 to 212 in b74ba11
    logger.debug(f"Ingesting table {table_name} from location {path}")
    if self.source_config.relative_path is None:
        browse_path: str = (
            strip_s3_prefix(path) if self.source_config.is_s3 else path.strip("/")
        )
    else:
        browse_path = path.split(self.source_config.base_path)[1].strip("/")

    data_platform_urn = make_data_platform_urn(self.source_config.platform)
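To make the quoted logic easier to follow outside the source class, here is a minimal standalone sketch. The function name `derive_browse_path` and the local `strip_s3_prefix` stand-in are hypothetical; the real helper in the delta-lake source may handle more cases, and the bucket and paths are made up.

```python
from typing import Optional


def strip_s3_prefix(path: str) -> str:
    # Hypothetical stand-in for the helper used in the source:
    # drops a leading s3://, s3a:// or s3n:// scheme if present.
    for prefix in ("s3://", "s3a://", "s3n://"):
        if path.startswith(prefix):
            return path[len(prefix):]
    return path


def derive_browse_path(
    path: str, base_path: str, relative_path: Optional[str], is_s3: bool
) -> str:
    # Mirrors the branch structure of the quoted snippet.
    if relative_path is None:
        return strip_s3_prefix(path) if is_s3 else path.strip("/")
    # With a relative path configured, keep only what follows base_path.
    return path.split(base_path)[1].strip("/")


# Made-up example locations.
no_relative = derive_browse_path(
    "s3://bucket/sales/orders", "s3://bucket/sales", None, True
)
with_relative = derive_browse_path(
    "s3://bucket/sales/orders", "s3://bucket/sales", "sales", True
)
print(no_relative)
print(with_relative)
```

Under these assumptions the result is shaped like a storage path rather than a `db.table` identifier.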
I thought they would also have a `database.tablename` identifier in Looker when using `sql_table_name`, and so would not require the DEFAULT_DATABASE that is mentioned in the Looker connection.
I saw some docs here as well: https://docs.delta.io/latest/delta-batch.html
But let me know what would be the best way forward. We have `database.tablename` in `sql_table_name` and would like that to be the identifier for upstream lineage as well.
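As a rough illustration of the request above, the sketch below turns a two-part `sql_table_name` into a dataset URN without prepending a default database. It is self-contained and does not call DataHub's own helpers; the function names and the table `analytics.orders` are hypothetical, and the URN layout is written out by hand on the assumption that it matches DataHub's usual `urn:li:dataset:(platform, name, env)` form.

```python
def make_dataset_urn_sketch(platform: str, name: str, env: str = "PROD") -> str:
    # Hand-written URN layout; an assumption, not a call into DataHub.
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{name},{env})"


def upstream_urn_from_sql_table_name(
    sql_table_name: str, platform: str = "delta-lake", env: str = "PROD"
) -> str:
    # Strip quoting characters that may wrap identifiers in LookML.
    name = sql_table_name.replace('"', "").replace("`", "")
    # Use database.tablename as-is; no default database is prepended.
    return make_dataset_urn_sketch(platform, name, env)


urn = upstream_urn_from_sql_table_name("`analytics`.`orders`")
print(urn)
```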
@k7ragav Looking at our golden files, delta-lake seems to have the format shown in datahub/metadata-ingestion/tests/integration/delta_lake/delta_lake_minio_mces_golden.json, line 6 in 9fb2df1.
What do your delta-lake URNs look like? Are you using our delta-lake ingestion source, or something else?
Overall, I don't think this change does what we want. That said, my understanding of how delta-lake works is somewhat weak.
@hsheth2 We do not use the
@hsheth2 Do you have some suggestions on how to move forward based on the explanation below?
I briefly checked how our different sources work right now.
@k7ragav, what do you think about this?
Here is an example of what this looks like in the Hive tests:
Checklist
Added `delta-lake` to the platforms that follow the `db.table_name` naming convention.
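A minimal sketch of what a two-part naming convention could mean when resolving a Looker `sql_table_name` follows. This is not the Looker source's actual code: `TWO_PART_PLATFORMS`, `qualify_table_name`, and the example names are hypothetical, and the platform set is illustrative apart from `delta-lake`, which this PR adds, and `hive`, which is mentioned in the thread.

```python
from typing import Optional

# Illustrative set of platforms whose tables are addressed as db.table_name.
TWO_PART_PLATFORMS = {"hive", "delta-lake"}


def qualify_table_name(
    sql_table_name: str, platform: str, default_db: Optional[str]
) -> str:
    parts = sql_table_name.split(".")
    # A two-part name on a two-part platform is already fully qualified.
    if len(parts) == 2 and platform in TWO_PART_PLATFORMS:
        return sql_table_name
    # Otherwise treat the two parts as schema.table and prepend the
    # connection's default database when one is configured.
    if len(parts) == 2 and default_db:
        return f"{default_db}.{sql_table_name}"
    return sql_table_name


two_part = qualify_table_name("analytics.orders", "delta-lake", "default_db")
three_part = qualify_table_name("analytics.orders", "snowflake", "default_db")
print(two_part)
print(three_part)
```

With this shape, adding a platform to the set is what stops the default database from being prepended to its `db.table_name` identifiers.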