David Suarez
Dec 19, 2021

Nice post, it triggered some of my thoughts. However I see a few disadvantages of taking this approach.

First, good luck with convincing source maintainers to push their files into a streaming service…

Second, if you can convince them to do so, why not pushing the files directly to your data lake so you skip the complexity of streaming?

Once you start bothering source maintainers for this kind of things, you are likely to get an extra bottleneck for the data ingestion in case the process fails. I would only go this direction in case your team have full permissions and responsibility to maintain the whole process.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

David Suarez
David Suarez

Written by David Suarez

Passionate about modern Cloud Data Engineering.

No responses yet

Write a response