Why JSON is Breaking Your CPU — And How Nvidia GPUs Are Fighting Back

Ever felt the pain of processing massive, deeply nested JSON files? In this episode, we dive into the bottlenecks of handling terabytes of complex JSON data in modern systems — and why standard CPUs just can’t keep up. From the rise of JSON as the web’s data language to how Apache Spark tackles scale with distributed processing, we unpack the real problem: CPU inefficiency at scale.
Then we take you inside the story of a team that tried to accelerate JSON queries using GPUs and failed... at first. What went wrong? Cache thrashing, warp divergence, and sparse data. But with clever engineering (query grouping, alphabetic ordering, and parallel tokenization), they unlocked speedups of up to 3.2x.
Whether you're a data engineer, developer, or just fascinated by what happens under the hood of big data systems, this is an episode you don’t want to miss.