Data Engineering

Which Apache Spark tool is the best?

Sarah Floris
Towards AI


Providing background on the different tools that use Apache Spark

Photo by Warren Wong on Unsplash

You are watching your code run one last time before you push it to staging to test it in the cloud. An error pops up that says “Connection Lost.” “Again?” you think to yourself. “This is not even a tenth of the data that I will need to process in these Python jobs.”

