David Suarez
Feb 14, 2022

--

Ehm… RDDs in 2022?
Dataframes API was made for a reason. You’ll see then that you get basically the same performance in both languages since Catalyst Optimizer resolves to same RDD DAG. Only reason to go for Scala nowadays is needing lots of UDFs or using Datasets API for extra data type control…

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

David Suarez
David Suarez

Written by David Suarez

Passionate about modern Cloud Data Engineering.

Responses (2)

Write a response