Having used a variety of modern ETL frameworks over the past few years, I am considering writing a hands-on book about what I have learned along the way.
If I may ask, what problems do you find most difficult to solve in real-world ETL setups?
There are too many different tools in the space. I've been heavily researching workflow / ETL frameworks this week, and even after culling the ones that seemed like poor fits, I'm still left with:
- https://github.com/getpopper/popper
- https://github.com/lyft/flyte
- https://aws.amazon.com/step-functions/
- https://github.com/spotify/luigi
- https://github.com/dagster-io/dagster