I just don't understand it. Is it because the AMD driver team is lazy? NVIDIA still offers such a function, and I bet it comes in handy when optimizing CUDA programs. However, there is no equivalent in the ATI Stream SDK, OpenCL, etc. What is wrong? OpenCL claims to be able to handle the memory hierarchy, but what good does it do to hide video memory usage?