Pipelines: [Project Health] Dependency upgrade process

Created on 28 Oct 2020 · 61Comments · Source: kubeflow/pipelines

KFP has many dependencies like

library versions (e.g. tfx, mlmd)
images (e.g. dockerhub/python, alpine, ...)

Benefits of updating these on a timely manner:

Upstream image updates likely remove many vulnerabilities that we will need to investigate manually each time later
We need to keep in sync with tfx/mlmd etc to know problems early.

We could stop pinning image versions to try getting above benefits, but this approach makes KFP repo non-deterministic. Tests may start to fail suddenly with no clue when some images released a new tag.

The best long term solution I can imagine is:

build a script that programmatically update a dependency's version in KFP repo, we can use this to ease manual update efforts.
build a bot that periodically run above script and send a PR to KFP repo, admins can approve these PRs if they pass presubmit tests.

We'll likely need to set this up for each type of dependency.
A script that programmatically update a dependency is of higher priority, because so far update efforts have been highly manual and time-costing.

In this approach,

there's minimal ongoing efforts needed to keep images up-to-date
tests/builds won't suddenly fail without any clue
if there are incompatibility issues with a dependency's new version, we can discover it early

prioritp2 sizL

Source

Bobgy

Most helpful comment

@rarkins A side question: can I confirm, do @renovate-bot already sign Google's CLA?

Yes. The app is installed and active on 400+ repos within google, googleapis, GoogleCloudPlatform and ampproject, plus potentially some other Google orgs that I'm not aware of.

rarkins on 27 Jan 2021

🚀1 🎉1

All 61 comments

I think docker image update is a good first candidate, because it is of low possibility to break us on upgrade, also we need the timely vulnerability fixes coming from it.

Bobgy on 28 Oct 2020

There are several main areas that might need updating:

Python SDK deps
Backend Go deps
Kubernetes deps (e.g. Argo, Istio)
Deps of micro services (Metadata Writer, Visualization Server, etc.)
frontend deps
MLMD client/server version

For python (SDK and Visualizations) we already have update scripts.

capri-xiyue on 19 Jan 2021

@Bobgy Regarding the docker tags, it seems dependabot can already do this.
https://dependabot.com/blog/dependabot-now-supports-docker/

Edit:
~~It seems it is also already setup to do so, see https://github.com/kubeflow/pipelines/pull/4962, but~~ Maybe some settings can be optimized as dependabot limits the amount of PRs it creates which I think should be higher than the default (5 I believe).

I just saw that the dependency was in a *.py file. In that case it is a question of setting up dependabot to scan docker files as well.

DavidSpek on 20 Jan 2021

@Bobgy I've created a script that will scan the repository for files named *ockerfile* but skipping /components/deprecated*, package*.json but skipping /*node_modules* and *requirements.txt. It then goes on to create the yaml file for dependabot so it scans the correct directories. It is setup for dockerfiles, npm, gomod and python at the moment, and I believe this should cover almost all the code in the repo. It is trivial to further customize what folders are selected if further customization is needed.

As it stands now, there are about 130 PRs that will be created with this configuration, so it might be advisable to have some form of plan to implement it in stages or be ready to quickly go through lots of the PRs. Another option is to create a target branch for all these PRs so they can be merged into that first rather than master.
https://github.com/kubeflow/pipelines/pull/5015

DavidSpek on 20 Jan 2021

👍2

For reference, the PRs that will be created can be seen here: https://github.com/DavidSpek/pipelines/pulls

DavidSpek on 20 Jan 2021

To add to the above PRs, by default the dependabot security alerts can only be seen by repo admins. While I understand it might not be desirable to have this information be public (although all it takes is forming the repo to find out), I do think it is important that WG members are able to see the security alerts so they can quickly see what dependabot PRs have a high priority for being merged. I'm not sure how this would work wrt https://github.com/kubeflow/internal-acls, but here are the instructions to do it through the UI:

https://docs.github.com/en/github/administering-a-repository/managing-security-and-analysis-settings-for-your-repository#granting-access-to-security-alerts

DavidSpek on 20 Jan 2021

@Bobgy, I could be wrong but I believe my PR covers the entire scope of what was described in the original post. If you agree I will change the PR so that it links to this issue and can close it when it gets merged.

DavidSpek on 25 Jan 2021

Based on the example https://github.com/DavidSpek/pipelines/pulls,
it looks like it will create one PR for each dependency.

I'm worried that too much PRs will get created for one major update and it causes burden for devs to review it. In addition, I'm afraid it will cause large workloads of test infra if we create one PR for one specific dependency upgrade.

I'd prefer we will have one single large PR for dependency update of a specific area if people need to review such dependency update PRs.
For example, one large pr for all dependency update of go code, one large pr for all dependency update of python deps and one large pr for all dependency update of front end deps. We may need to write our own scripts for dependency update of each specific area.