Glue Table Already Exists for Temporary Tables #689
-
For context, I am running two Airflow tasks at the same time, for the same table. I have implemented idempotency, so each task is running for a different timespan. My understanding is that this should be fine for an incremental model. However, what happens is that an error gets thrown that the Glue Table already exists: My guess is that this is happening to the temporary tables being generated for each dbt run. Please correct me if my understanding is wrong. Assuming this is correct, is there a way to concatenate a UUID to the temporary table names so that tasks can be run in parallel. Thanks so much! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 7 replies
-
@GabrielEisenbergOlympus that's the case for hive tables. To configure what you want you can use in your dbt model this config then:
In addition to that, you can consider to use |
Beta Was this translation helpful? Give feedback.
@GabrielEisenbergOlympus that's the case for hive tables.
This PR introduce the possibility of using unique_tmp_table_suffix to achieve what you want, but it works only for hive tables and stategy='insert_overwrite' (I believe that we can add also
append
as a stategy).To configure what you want you can use in your dbt model this config then:
In addition to that, you can consider to use
temp_schema
to store those unique tables in another glue database to isolate more access on underlying tables.