dbt run for new or modified models only

Created on 13 Aug 2019 · 7Comments · Source: fishtown-analytics/dbt

Describe the feature

Be able to run only new or modified queries within a terminal command. For example, dbt run --models path --type new, modified

Describe alternatives you've considered

Apply tags to any new or updated models, and remove these "in-dev" tags when pushing to master.

Who will this benefit?

This will especially benefit analysts that are building entirely new data models from scratch and trying to deploy them into dev for the first time. This could involve creating new base models (or simply adding a new measure) and a variety of dependent mart models depending on the project. With complex projects, models can live in a variety of folders. Rather than having to tag every model touched, or rely on color coding in their text editor of choice (i.e. modified models are highlighted "green" in Atom), or simply run everything, this "new + modified only" function would save some admin time.

enhancement

Source

sagarvelagala

Most helpful comment

I really like that idea! I hadn't considered "smart tags" like this before.

We have some new code shipping in 0.15.0 that will only re-compile "changed" models. Maybe we can leverage that to do something like what you're describing? Check out the PR here if you're interested: https://github.com/fishtown-analytics/dbt/pull/1646

drewbanin on 15 Aug 2019

❤4 👍1

All 7 comments

Hey @sagarvelagala - cool idea! Can you think of a good mechanism for dbt to determine if a model is new or changed? One good option is to leverage git. Check out this post which shows you how to do something similar from outside of dbt: https://discourse.getdbt.com/t/tips-and-tricks-about-working-with-dbt/287/2

drewbanin on 13 Aug 2019

@drewbanin I don't have a very technical answer if that's what you're looking for =). In my mind I was thinking there would be a way to compare the compiled query files from the most recent run by a user to the new ones? From there, I imagine there is a way to leverage the "tag" feature in dbt and basically create a layer of "smart" tags that are generated by dbt every time a run command happens. There's a lot of cool stuff that could be done with a smart tagging feature that is probably inaccessible to an analyst that just knows SQL (aka me). Examples could be "new," "modified," or even "previously failed" tags if a model was involved in a run failure (or test failure!) in the previous run.

sagarvelagala on 15 Aug 2019

I really like that idea! I hadn't considered "smart tags" like this before.

drewbanin on 15 Aug 2019

❤4 👍1

I think an easy way to implement this would be to store the last modified date for each model. Then when this command is run compare the last modified date of the current files with the ones in the previous one and only run those.

KimchaC on 30 May 2020

We'd be very interested in something like this in DBT Cloud - we have a test-on-pull set up from github, but our preprod test runs are getting a bit long and expensive - would rather just dbt run changes and then test all.
The bash command linked above doesn't work in cloud (it doesn't error in develop, just runs forever) - interested if any other thoughts have come up to achieve it since last comments on this job?

ciejer on 5 Jun 2020

We've been thinking a lot about this! Check out https://github.com/fishtown-analytics/dbt/issues/2465

jtcohen6 on 5 Jun 2020

I'm going to call this resolved! (https://github.com/fishtown-analytics/dbt/issues/2641)

Please re-open, of course, if there's a nuance here that's missing in our implementation, which shipped as a beta feature in dbt v0.18.0 (docs).

jtcohen6 on 9 Sep 2020

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Optionally skip `expand_target_column_types` during incremental materialization

smomen · 3Comments

"year is out of range error" when loading results with the words "last month"

clrcrl · 3Comments

Relation doesn't exist for dbt test after `dbt run -m state:modified` if the test was modified but the model wasn't

joellabes · 3Comments

support persist_docs on all plugins

drewbanin · 3Comments

Add `partitions` to bq AdapterSpecificConfigs

jtcohen6 · 3Comments