Airflow: Create guide for Dataflow operators

Created on 8 Apr 2020 · 3 Comments · Source: apache/airflow

Description

Hello,

A guide that describes how to use Dataflow service operators would be useful.

We have an example DAG for this service, so the guide should not be a big challenge.

If anyone is interested in this task, I am willing to provide all the necessary tips and information.

Other guides are available:
https://airflow.readthedocs.io/en/latest/howto/operator/index.html

All example DAGs for GCP are available in the example_dags directory.

Best regards,

Kamil

Use case / motivation

N/A

Related Issues

Missing guide for GCP: https://github.com/apache/airflow/issues/8209

docs · good first issue · feature · Google

All 3 comments

I can take this on - this is a service I've worked heavily in.

@tanjinP Fantastic. I am currently working on this integration with the Dataflow team, so this guide would be very helpful. I saw that you volunteered for 3 services, but this one is the most important. I will be happy to share my thoughts on the integration and to develop the guide together with you. I am sure the Dataflow team will be willing to review it as well. Dataflow is a hot topic right now, so I'm very happy.

@tanjinP I would be happy if you added a note that asynchronous execution is recommended:
https://cloud.google.com/dataflow/docs/guides/specifying-exec-params#python_8
I would like to create sensors that will allow more efficient use of resources.
