Azure-pipelines-tasks: Searching for a way to allow specific matrix entries to fail

Created on 10 Jan 2019 · 20 Comments · Source: microsoft/azure-pipelines-tasks

Environment

Issue Description

We have set up Azure Pipelines as the automatic CI service for diesel.rs. That works mostly fine.
The Rust ecosystem has several compiler versions, notably a stable version, a beta version, and a nightly version. The latter is updated every night and may not work. To detect such broken versions, projects are encouraged to also run their CI with the nightly version of the Rust compiler, but to set up the CI in such a way that failing builds with the nightly compiler are ignored. I have found no way to do this with Azure Pipelines.

Labels: Build, question, stale

weiznich

👍12

Most helpful comment

This is going to end up being a serious pain point for effective adoption in both the Rust and Ember.js communities, where it's common to run against the current stable release, the current beta release, and often also the current unstable/nightly/master channel. (We're doing it against beta on ember-cli-typescript.) In that case, it's perfectly acceptable for the beta channel to fail, and it shouldn't report that the whole build failed as a result. Unfortunately, because it's at a job or task level, it's not possible to handle with matrixes as far as I can see (and as your comment suggests).

chriskrycho on 13 Apr 2019

👍3

All 20 comments

This may be possible with our yaml builds. @ericsciple do you know if this can be done?

moswald on 11 Feb 2019

@vtbassmatt fyi

@weiznich Unfortunately it is not allowed today. The current solution would be to model it as a separate job.
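
A rough sketch of the separate-job approach, assuming a Rust project. The job names, the `rustToolchain` variable, and the `rustup`/`cargo` commands are illustrative assumptions and are not taken from the diesel.rs configuration; it also assumes `rustup` is available on the build image. The toolchains that must pass stay in a matrix, while nightly gets its own job with job-level `continueOnError`, so that a nightly failure should not fail the run as a whole:

```yaml
# Hypothetical azure-pipelines.yml; job names and commands are assumptions.
jobs:
  # Toolchains that must pass stay in the matrix.
  - job: test_required
    displayName: 'Tests (stable, beta)'
    pool:
      vmImage: 'ubuntu-latest'
    strategy:
      matrix:
        stable:
          rustToolchain: stable
        beta:
          rustToolchain: beta
    steps:
      - script: |
          set -e
          rustup toolchain install $(rustToolchain)
          cargo +$(rustToolchain) test
        displayName: 'cargo test'

  # Nightly is modeled as its own job so that continueOnError
  # applies to it alone.
  - job: test_nightly
    displayName: 'Tests (nightly, allowed to fail)'
    continueOnError: true
    pool:
      vmImage: 'ubuntu-latest'
    steps:
      - script: |
          set -e
          rustup toolchain install nightly
          cargo +nightly test
        displayName: 'cargo test (nightly)'
```

The cost is duplication: the steps are written twice, which is what the template-based workarounds further down try to avoid.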

ericsciple on 11 Feb 2019

@ericsciple how do you prevent a task failure from bubbling up to the job (or job failure from bubbling up to the pipeline)? I could update the docs with a pattern for implementing allow-failure, but I don't actually know how to do it 😀

vtbassmatt on 22 Feb 2019

@vtbassmatt continueOnError is a task-level control and job-level control. However, when set on the job I believe it applies to all configurations.
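
For reference, a minimal sketch of the two places `continueOnError` can appear. The job name, the `rustToolchain` variable, and the test command are hypothetical:

```yaml
# Hypothetical fragment; names and commands are assumptions.
jobs:
  - job: test
    # Job level: as described above, this is believed to cover every
    # matrix configuration, so it cannot single out the nightly leg.
    continueOnError: true
    strategy:
      matrix:
        stable:
          rustToolchain: stable
        nightly:
          rustToolchain: nightly
    steps:
      - script: cargo +$(rustToolchain) test
        displayName: 'cargo test'
        # Task level: a failure in this step is recorded,
        # but later steps still run.
        continueOnError: true
```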

My initial thoughts are: that behavior makes sense, especially as we think about deployment phases coming soon. If we need additional control, we might want an additional option somewhere else.

This is a similar problem to condition. The property actually exists at the phase-level, not on each job configuration.

A workaround would be to use templates instead of matrix.

Rather than adding additional controls, it makes me wonder whether the correct long term solution here is something like templates without actually requiring you to declare a separate file.

ericsciple on 22 Feb 2019

chriskrycho on 13 Apr 2019

👍3

Thanks @chriskrycho - good to hear reinforcement of how important this is. If you want to point me at an example pipeline, I might be able to suggest a workaround. (And to be clear, it'll be a workaround, not the solution we want to deliver.)

vtbassmatt on 15 Apr 2019

@vtbassmatt thanks! Here is our config for the ember-cli-typescript project:

chriskrycho on 15 Apr 2019

Gotcha. Here's how I'd work around the limitation. In the template, add a parameter allowedToFail that defaults to false, then update the lines that run tests:

  - ${{ if not(eq(parameters.allowedToFail, 'true')) }}:
    - script: |
        ${{ parameters.emberTestCommand }}
      displayName: ${{ parameters.emberTestDisplayName }}
  - ${{ if eq(parameters.allowedToFail, 'true') }}:
    - script: |
        ${{ parameters.emberTestCommand }}
        exit 0
      displayName: ${{ parameters.emberTestDisplayName }}
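
The parameter declaration itself is not shown above. A minimal sketch of what the top of the template might look like, assuming the mapping-style `parameters` block; the parameter names come from the snippets in this thread, but every default value here is a placeholder:

```yaml
# .azure/ci-template.yml (top of file); all default values are assumptions.
parameters:
  emberCliVersion: 'latest'
  emberTestCommand: 'yarn test'       # placeholder command
  emberTestDisplayName: 'Run tests'   # placeholder label
  allowedToFail: 'false'              # string, compared against 'true' in the conditions
```

Keeping `allowedToFail` as a string matches the `eq(parameters.allowedToFail, 'true')` comparison used in the conditional steps.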

Then in your top-level pipeline, you'll have to have separate matrixes for the allowed-to-fail legs:

  - job: ember_cli_versions
    displayName: 'ember-cli'
    dependsOn: linux_fixed
    pool:
      vmImage: 'ubuntu-16.04'
    strategy:
      matrix:
        release:
          eCliVersion: latest
    steps:
      - template: .azure/ci-template.yml
        parameters:
          emberCliVersion: $(eCliVersion)
  - job: ember_cli_versions_allowed_failures
    displayName: 'ember-cli [allowed failures]'
    dependsOn: linux_fixed
    pool:
      vmImage: 'ubuntu-16.04'
    strategy:
      matrix:
        beta:
          eCliVersion: beta
    steps:
      - template: .azure/ci-template.yml
        parameters:
          emberCliVersion: $(eCliVersion)
          allowedToFail: 'true'

vtbassmatt on 15 Apr 2019

👍1

    - script: |
        ${{ parameters.emberTestCommand }}
        exit 0
      displayName: ${{ parameters.emberTestDisplayName }}

With this use of exit 0, it seems like a reviewer would have to actually dig into the test logs of each passing pull request, in order to ascertain whether a given code change introduced a build failure on these allowed-to-fail jobs.

It's still very important to surface feedback about these failures in a first class way.

An example implementation might involve using GitHub's CheckRun feature to report failures with a "neutral" or "warning" state in the pull request "checks" page, while still marking the Status as green.

Example of neutral status: [screenshot: github_apps_checks_annotations]

Currently, Azure Pipelines does not make full use of this feature.

Azure Pipelines warnings: [screenshot]

By adding this capability, developers could do something like

script: <test-command> || echo -e "\043#vso[task.logissue type=warning;] Yikes!!"

to effectively turn failures into warnings, while having them surface in the GitHub UI just like they do in the Pipelines UI


mike-north on 18 Apr 2019

👍2

@mike-north fair point, I didn't think about that. You can actually inject an error and still not fail the leg, which might be what's needed.
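
A sketch of what that could look like, using the `task.logissue` and `task.complete` logging commands. The test command and messages are placeholders, and whether the result shows up in the GitHub checks UI is exactly the open question raised above:

```yaml
# Hypothetical step; the test command and messages are placeholders.
steps:
  - bash: |
      if ! cargo +nightly test; then
        # Record the failure as an error issue on the run...
        echo "##vso[task.logissue type=error]Tests failed on an allowed-to-fail leg"
        # ...and mark the step as succeeded with issues instead of failed.
        echo "##vso[task.complete result=SucceededWithIssues;]Allowed failure"
      fi
    displayName: 'Tests (allowed to fail)'
```

Compared with a bare `exit 0`, the failure should at least be visible on the run summary in the Pipelines UI without opening the test logs.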

vtbassmatt on 18 Apr 2019

@vtbassmatt I assigned you to this issue while the correct functionality is being ironed out.

moswald on 20 May 2019

I am in the Python world and looking into migrating similar functionality from Travis CI. I think this issue covers it, but if not, please let me know whether to open a new one or look at a different one. Thanks!

xref spacetelescope/synphot_refactor#194 and astropy/astropy#8445

pllim on 6 Jun 2019

@vtbassmatt Any updates? We would also need that for some projects.

letmaik on 25 Jul 2019

Nothing to report, sorry.

vtbassmatt on 25 Jul 2019

@vtbassmatt - any updates to share?

I'm adding Python 3.9 to our build in https://github.com/HypothesisWorks/hypothesis/pull/2445, and I'd love to have an allowed-to-fail matrix entry for 3.10 / nightly too.

Zac-HD on 21 May 2020

@Zac-HD sorry, still nothing. It hasn't been forgotten, but we've had a number of other priorities. Does one of the above workarounds help at all?

vtbassmatt on 21 May 2020

👍1

No worries, I'll look into setting up a separate job for it, or might just get around to moving to GitHub Actions as I have for smaller projects 😄

Zac-HD on 21 May 2020

👍1

@Zac-HD, if you have "allowed to fail" working for Actions, I would like the recipe. Thank you!

pllim on 21 May 2020

@pllim I'll be experimenting with https://help.github.com/en/actions/reference/workflow-syntax-for-github-actions#jobsjob_idcontinue-on-error and will let you know how it goes.
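
For what it's worth, a sketch of how that option can be driven by a matrix flag, following the pattern in the linked workflow syntax documentation. The `channel` and `experimental` matrix keys and the Rust commands are illustrative assumptions, and it assumes `rustup` is available on the runner:

```yaml
# Hypothetical .github/workflows/ci.yml; matrix keys and commands are assumptions.
name: CI
on: [push, pull_request]

jobs:
  test:
    runs-on: ubuntu-latest
    # Only the legs flagged as experimental are allowed to fail.
    continue-on-error: ${{ matrix.experimental }}
    strategy:
      fail-fast: false
      matrix:
        channel: [stable, beta]
        experimental: [false]
        include:
          - channel: nightly
            experimental: true
    steps:
      - uses: actions/checkout@v4
      - run: |
          rustup toolchain install ${{ matrix.channel }}
          cargo +${{ matrix.channel }} test
```

With `fail-fast: false`, a failing nightly leg does not cancel the stable and beta legs.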

Zac-HD on 21 May 2020

👀1 👍1

This issue is stale because it has been open for 180 days with no activity. Remove the stale label or comment on the issue otherwise this will be closed in 5 days

github-actions[bot] on 17 Nov 2020
