Fix run operation return codes (#1377) #1406

beckjake · 2019-04-17T17:30:00Z

When we run an operation, set the return code on error.

Also, since wilt-chamberlain totally broke the run-operation command in multiple ways, I fixed all that. And added tests so I don't do that again.

I also added a commit call that happens after successfully executing the macro. Maybe that would be better as a flag? I just wanted it for testing, but I can't really see not wanting to commit.

Make dbt run-operation actually function at all with the RPC changes On exceptions that occur outside of actual macro execution, catch them and return failure appropriately

cmcarthur

looks good. @drewbanin should we document run-operation in wilt chamberlain?

cmcarthur · 2019-04-18T15:01:14Z

core/dbt/task/run_operation.py

@@ -22,19 +21,44 @@ def _get_macro_parts(self):
    def _get_kwargs(self):
        return dbt.utils.parse_cli_vars(self.args.args)

-    def run(self):
+    def run_unsafe(self):


can you make this _run_unsafe

drewbanin · 2019-04-19T20:28:14Z

@cmcarthur yeah, I think Wilt is the right time to release this for real.

@beckjake I think it's actually pretty important that we don't add a commit at the end of operations. Some queries (like vacuum / analyze on Redshift) are not allowed to run inside of transactions. I think that if people want to run transactional code through operations, they should add the begin and commit explicitly.

If we do this, then we need to be really sure that we don't begin a transaction implicitly, otherwise the query won't be committed. Do we have a good way of ensuring that connections never have open connections when the operations SQL runs?

drewbanin · 2019-04-19T20:30:03Z

core/dbt/task/run_operation.py

+            result = self._run_unsafe()
+        except dbt.exceptions.Exception as exc:
+            logger.error(
+                'Encountered a dbt exception while running a macro: {}'


This is an exception that dbt is aware of, but it's not a dbt exception per se. I just don't want folks to think that dbt did something wrong if their operations fail. Can we make this say:

Encountered an error while running an operation:

beckjake · 2019-04-21T22:37:33Z

Some queries (like vacuum / analyze on Redshift) are not allowed to run inside of transactions. I think that if people want to run transactional code through operations, they should add the begin and commit explicitly.

Ok, I can do that instead then.

If we do this, then we need to be really sure that we don't begin a transaction implicitly, otherwise the query won't be committed.

I think that's already true. If your macro is implemented via call statement currently like I kind of assume it is, you must make sure not set auto_begin=False. Right?

Do we have a good way of ensuring that connections never have open connections when the operations SQL runs?

I assume you mean "... connections never have open transactions ...". Yeah. adapter.clear_transaction() is reliable, in my experience.

drewbanin

This looks great - good call on adding tests.

One thing: I think we'll want to clear the transaction before calling the operation macro. This is helpful for queries (like vacuum) that must be run outside of a transaction:

{% macro vacuum() %}
  {% call statement(auto_begin=false) %}
    vacuum public.events
  {% endcall %}
{% endmacro %}

If you invoke this operation, you'll see:

VACUUM cannot run inside a transaction block

I don't actually see an explicit begin; in the logs, so I think this is that psycopg2 thing where new connections are created with an open transaction. I assumed that setting auto_begin=false would prevent this from happening, but maybe we can be heavy-handed and just add

adapter.clear_transaction()

right before the call to adapter.execute_macro()

This seems to work ok, and I believe it no-ops on databases that don't have transactions.

drewbanin · 2019-04-24T00:50:08Z

core/dbt/task/run_operation.py

+                .format(exc)
+            )
+            logger.debug('', exc_info=True)
+            return False, None


Could we conceivably return a dict that encodes this error message? That would be useful for an eventual API that wants to do something with the results when an operation fails.

I think we can leave this as-is for now, but want to keep it in the back of our minds when we revisit the RunResults node contract.

Sure, though in the RPC case I'd probably reach for overriding the run method entirely (the message would end up in the response logs anyway)

drewbanin

LGTM

Jacob Beck added 3 commits April 17, 2019 11:26

run-operation fixes

13dd720

Make dbt run-operation actually function at all with the RPC changes On exceptions that occur outside of actual macro execution, catch them and return failure appropriately

Acquire a connection before executing the macro, and commit after

a72a4e1

add tests

5b74c58

cmcarthur approved these changes Apr 18, 2019

View reviewed changes

PR feedback: run_unsafe -> _run_unsafe

4730789

drewbanin reviewed Apr 19, 2019

View reviewed changes

no automatic transactions

08c5f9a

drewbanin reviewed Apr 24, 2019

View reviewed changes

PR feedback: Add a clear_transaction call

ca02a58

drewbanin approved these changes Apr 25, 2019

View reviewed changes

beckjake merged commit 5762e5f into dev/wilt-chamberlain Apr 25, 2019

beckjake deleted the fix/run-operation-return-codes branch April 25, 2019 14:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix run operation return codes (#1377) #1406

Fix run operation return codes (#1377) #1406

beckjake commented Apr 17, 2019

cmcarthur left a comment

cmcarthur Apr 18, 2019

drewbanin commented Apr 19, 2019

drewbanin Apr 19, 2019

beckjake commented Apr 21, 2019

drewbanin left a comment

drewbanin Apr 24, 2019

beckjake Apr 24, 2019

drewbanin left a comment

Fix run operation return codes (#1377) #1406

Fix run operation return codes (#1377) #1406

Conversation

beckjake commented Apr 17, 2019

cmcarthur left a comment

Choose a reason for hiding this comment

cmcarthur Apr 18, 2019

Choose a reason for hiding this comment

drewbanin commented Apr 19, 2019

drewbanin Apr 19, 2019

Choose a reason for hiding this comment

beckjake commented Apr 21, 2019

drewbanin left a comment

Choose a reason for hiding this comment

drewbanin Apr 24, 2019

Choose a reason for hiding this comment

beckjake Apr 24, 2019

Choose a reason for hiding this comment

drewbanin left a comment

Choose a reason for hiding this comment