Rasa version:
Rasa Core 0.14.0
Rasa NLU 0.14.4
Python version:
3.6.8
Operating system (windows, osx, ...):
osx
Issue:
Rasa NLU model training takes the Duckling URL from the config.yml file and writes it into the metadata.json file of the trained model.
We use docker-compose for local testing and Kubernetes (k8s) for cloud test/prod.
docker-compose and k8s network between containers differently: docker-compose uses service names (e.g. duckling) while our k8s setup uses localhost, so we need a different Duckling URL for local vs. cloud testing.
We've separated the URLs into environment files, but Rasa training writes the URL into the model's metadata.json. This means the model has to be retrained between local (docker-compose) and cloud (k8s) testing. It would make more sense to keep the URL outside the model, in a config file controlled by the environment and build process, so the trained model can be copied rather than retrained for no reason other than a URL change.
e.g.
for docker-compose: "url": "http://duckling:8000"
for k8s: "url": "http://localhost:8000"
Content of configuration file (config.yml):
for docker-compose:
```yaml
pipeline:
# other stuff
- name: ner_duckling_http
  url: http://duckling:8000
```
for cloud k8s:
```yaml
pipeline:
# other stuff
- name: ner_duckling_http
  url: http://localhost:8000
```
Content of domain file (domain.yml) (if used & relevant):
not relevant
Thanks for raising this issue, @MetcalfeTom will get back to you about it soon.
Hey @cmcc13, does this help?
> In addition to setting the default ``url`` of your duckling server in the configuration, you can also change the url of your duckling server (without needing to re-train your model) by setting the ``RASA_DUCKLING_HTTP_URL`` environment variable.
See relevant issue here
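If that environment variable works as described, the URL can be set per environment at deploy time instead of at training time. A sketch for the docker-compose side (the service name and URL value are assumptions):

```yaml
# docker-compose override sketch (illustrative; service name and URL are assumed)
services:
  rasa:
    environment:
      # overrides whatever Duckling URL was written into the model's metadata.json at training time
      - RASA_DUCKLING_HTTP_URL=http://duckling:8000
```

The same variable could be set via `env` in the Kubernetes Deployment spec to supply the prod value.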
This might be a workaround, but I think the URL should not be put into the model in the first place. The URL should be read from a YAML file (config, .env or endpoints). We'll give the environment variable a go. Thanks.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed due to inactivity. Please create a new issue if you need more help.
I found the same problem - if I change the Duckling HTTP URL in the config file, it requires a complete retrain. Please consider fixing this, as it's very unintuitive - I spent a lot of time trying to figure out why the URL change was not getting picked up before stumbling on this issue.
I agree. I'm not sure how best to handle this: either the URL should be part of endpoints.yml (but then it would still need to be definable in the config for NLU-only models?) or its value shouldn't influence the fingerprinting.
@wochinge in progress but no assignee?
Thanks, fixed it :-)
@wochinge why are we not fixing that?
Because
So basically the ratio of benefit to effort is very poor.
We have to change the Duckling URL regularly because the dev and prod environments are different, so in practice we need to change it daily.
for docker-compose "url": "http://duckling:8000",
for k8s "url": "http://localhost:8000"
Actually @cmcc13, the prod URL is now "duckling.default.svc.cluster.local.:8000", and that could change later if we do more advanced GKE service work. But in dev we just want to docker-compose up and let Docker sort out all the networking. So @wochinge, it's a bit of a headache for our team not to have all of this in a config file.
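For reference, that prod hostname follows Kubernetes' service DNS scheme (`<service>.<namespace>.svc.cluster.local`). A Service manifest sketch that would expose Duckling at that address (the selector labels are assumptions, not taken from the actual cluster):

```yaml
# Kubernetes Service sketch (illustrative; selector labels are assumed)
apiVersion: v1
kind: Service
metadata:
  name: duckling        # resolves as duckling.default.svc.cluster.local inside the cluster
  namespace: default
spec:
  selector:
    app: duckling       # must match the labels on the Duckling pods
  ports:
    - port: 8000        # port exposed by the service
      targetPort: 8000  # container port on the Duckling pod
```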
@Yoomtah
As far as I understand it, you have two setups, correct? One docker-compose and one K8s? And are they completely separate or are you sharing the trained models between them? Because if you are not sharing the models between these two deployments, then you have to retrain either way.
For each chatbot that we have (I believe it's 5), we have two model files: model-dev and model-prod. These models are identical except that they were trained with different Duckling URLs. Depending on our environment, we then build a Rasa Docker container with one of these files.
The Duckling URL is the only thing necessitating two training runs and two model files per bot. We have a different action server URL for dev and prod as well, but that is easily changed in the endpoints.yml file.
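For comparison, the action server URL can already be swapped per environment without retraining because it lives in endpoints.yml; a minimal sketch (the hostname here is an assumption):

```yaml
# endpoints.yml sketch (illustrative; hostname is assumed)
action_endpoint:
  url: "http://action-server:5055/webhook"   # point at the dev or prod action server per environment
```

An equivalent entry for the Duckling URL is essentially what this issue is asking for.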
Ah, I think I'm getting it now.
1) You train a model in the dev environment and decide it's worth promoting to the prod environment
2) You can't promote it to the prod environment because the Duckling URL is different, and then you have to retrain it, right?
Would an easy workaround be to add an alias for duckling to your hosts file? (https://en.wikipedia.org/wiki/Hosts_(file))
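As a sketch of that suggestion (the IP is a placeholder for wherever Duckling is actually reachable in a given environment), the hosts file entry would map the hostname baked into the model to the right address:

```
# hosts file entry sketch (placeholder IP; substitute the address Duckling is reachable at)
127.0.0.1    duckling
```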
This problem still exists and is very unintuitive. Every endpoint can be configured in endpoints.yml except the Duckling one. It makes automated deployment, e.g. via Helm, very messy.