Originally Posted by Thefumigator
He's not upset about it, he's just telling a fact. Its not a matter of pissing anyone off.
If you use an intel compiler x86, most likely it will work fast on intel cpus and not so fast on non-intel cpus, but its not intentionally, its by nature. The same goes for this c++ AMD OpenCL compiler. It might not generate code optimized for nvidia or intel GPUs. They could even have a hard time making it work fast on their own AMD gpus...
So the fact is, ok, AMD released this c++ thingy, and its fine. Now nvidia should sum up his efforts and make something similar unless the AMD compiler is good enough on nvidia gpus too.
Never said he was... just seemed funny to me in a sort of sad way, day to day complaining about how some company does something but its not enough even if it adds no cost to the consumer.
Compile optimizations are intentionally... and you've argued my point, just as intel is not responsible for how intel compiled code works on AMD in the same way AMD is not responsible for how their OpenCL compiled coded works on Nvidia GPUs. As for how hard it's for them, it could be but they are doing it.
Golubev: Things got better since last time I’ve took a look at OpenCL, after an year (of very “hard” work I guess) AMD made possible to use BFI_INT, BIT_ALIGN_INT directly from OpenCL kernels (via bitselect() and amd_bitalign()). I was amazed how easy to write GPU kernels for AMD cards now while their performance is nearly the same as hand-written IL kernels
It's still not perfect, but hey, small steps.
Nvidia, yeah... they're more likely to steal the code, adapt and rebrand it.