December 19, 2014
Solving MapReduce Performance Problems With Sharded Joins
Sometimes the answer to a sluggish data pipeline isn’t more power in the Hadoop cluster, but a shift in technique. [...]
All of our lovely Spotify users generate many terabytes of data every day. All the songs that are listened to, [...]
As we all know, Hadoop is great and here at Spotify we are big fans of it. We use it [...]