
Sign up to save your podcasts
Or


Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.
In this episode, we’re joined by Ethan Shalev, Data Engineer at Wix, to discuss how Wix operates Airflow at massive scale, migrates to Airflow 3 and uses AI to accelerate development.
Key Takeaways:
00:00 Introduction.
02:13 Wix structures data engineering across multiple product-focused organizations.
03:40 Migrating nearly 8,000 DAGs to Airflow 3 requires careful planning.
04:31 Migration creates an opportunity to remove long-standing legacy Airflow code.
05:32 Internal playbooks and Cursor rules standardize and speed up DAG migrations.
07:39 Airflow 3 introduces backfills, DAG versioning and asset-aware scheduling.
09:16 Deferrable operators reduce scheduler congestion in large Airflow environments.
12:54 AI-generated code still requires review and strong testing practices.
14:52 Moving to managed Airflow reduces operational burden on internal platform teams.
15:57 Improving multi-tenancy and UI personalization remains a key Airflow need.
Resources Mentioned:
Ethan Shalev
https://www.linkedin.com/in/eshalev/
Wix | LinkedIn
https://www.linkedin.com/company/wix-com/
Wix | Website
https://www.wix.com/
Apache Airflow
https://airflow.apache.org/
Astronomer
https://www.astronomer.io/
Trino
https://trino.io/
Apache Iceberg
https://iceberg.apache.org/
Cursor
https://cursor.sh/
Airflow Summit
https://airflowsummit.org/
Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.
#AI #Automation #Airflow
By Astronomer5
2020 ratings
Modern data orchestration at scale demands reliability, speed and thoughtful adoption of new tooling. As organizations grow, keeping pipelines efficient while supporting more teams becomes a critical challenge.
In this episode, we’re joined by Ethan Shalev, Data Engineer at Wix, to discuss how Wix operates Airflow at massive scale, migrates to Airflow 3 and uses AI to accelerate development.
Key Takeaways:
00:00 Introduction.
02:13 Wix structures data engineering across multiple product-focused organizations.
03:40 Migrating nearly 8,000 DAGs to Airflow 3 requires careful planning.
04:31 Migration creates an opportunity to remove long-standing legacy Airflow code.
05:32 Internal playbooks and Cursor rules standardize and speed up DAG migrations.
07:39 Airflow 3 introduces backfills, DAG versioning and asset-aware scheduling.
09:16 Deferrable operators reduce scheduler congestion in large Airflow environments.
12:54 AI-generated code still requires review and strong testing practices.
14:52 Moving to managed Airflow reduces operational burden on internal platform teams.
15:57 Improving multi-tenancy and UI personalization remains a key Airflow need.
Resources Mentioned:
Ethan Shalev
https://www.linkedin.com/in/eshalev/
Wix | LinkedIn
https://www.linkedin.com/company/wix-com/
Wix | Website
https://www.wix.com/
Apache Airflow
https://airflow.apache.org/
Astronomer
https://www.astronomer.io/
Trino
https://trino.io/
Apache Iceberg
https://iceberg.apache.org/
Cursor
https://cursor.sh/
Airflow Summit
https://airflowsummit.org/
Thanks for listening to “The Data Flowcast: Mastering Apache Airflow® for Data Engineering and AI.” If you enjoyed this episode, please leave a 5-star review to help get the word out about the show. And be sure to subscribe so you never miss any of the insightful conversations.
#AI #Automation #Airflow

32,100 Listeners

228,383 Listeners

537 Listeners

626 Listeners

144 Listeners

3,976 Listeners

25 Listeners

139 Listeners

10,182 Listeners

58,066 Listeners

5,530 Listeners

13 Listeners

8 Listeners

25 Listeners

139 Listeners