This talk analyzes how Spark uses memory from three angles in turn: memory between execution and storage, memory across tasks, and memory across operators running within the same task. It also briefly introduces Project Tungsten.
How to arbitrate memory between execution and storage? First, some basic concepts. Spark has two kinds of memory:
Execution memory: used for shuffles, joins, sorts, and aggregations
...