- Joined
- Jul 31, 2014
- Messages
- 533 (0.14/day)
System Name | Diablo | Baal | Mephisto |
---|---|
Processor | Ryzen 9800X3D | 2x Xeon E5-2697v4 | i7-13900H |
Motherboard | ASRockRack B650D4U-2L2T/BCM | Supermicro X10DRH-iT | Lenovo Thinkpad P1 Gen 6 |
Cooling | Custom loop | SC846 Chassis cooled| dual-fanned heatpipes with LM |
Memory | 64GiB DDR5-5600 ECC | 256GiB DDR4-3200 ECC RDIMM | 64GiB DDR5-5600 |
Video Card(s) | RTX 3090 Ti Founder's Edition | Embedded ASPEED2400 | RTX 5000 Ada Mobile (80W) |
Storage | many, many SSDs and HDDs.... |
Display(s) | Dell U3014 + Dell U3011 | SMCI IPMI KVMoIP | 3840×2400 Samsung OLED |
Case | Caselabs TH10A | Supermicro SC846 | Lenovo Thinkpad P1 Gen 6 |
Audio Device(s) | Creative SoundBlaster X4 | None | On-board + Moondriver2 Ti + Bluetooth |
Power Supply | Corsair AX1600 | 1200W PSU (Delta) | Lenovo 230W or 300W |
Mouse | Logitech G604 |
Keyboard | 1985 IBM Model F 122-key, Lenovo integrated |
VR HMD | The wait for 4K per eye is long and winding.... |
Software | FAAAR too much to list |
I'm more worried about the mode switching from 32 to 64 this will require. To be honest, I don't have much info about the cost of these ops, but considering mixed 32/64bit programs aren't that common (heck, I know of none!), I'm not very optimistic.
I doubt that. This would entail writing a CUDA-like infrastructure that does provide what CUDA does. Most feasible solution would probably be leveraging existing APIs; AMD's ROCm and Intel's equivalent.
Mode-switching isn't too much of a problem. Iirc you can access AVX2 registers and instructions from a 32bit IA-32 contexts just fine (exactly like MMX and SSE; AVX2 is just a faster and wider successor to SSE, which was a successor to MMX).
As for the CUDA part, between ZLUDA (as mentioned by Denver) and just reimplementing the necessary bits internally, it's not that much of a problem - just changes the timescale really.
But isnt it like aged beyond believe? like dont we have far superior modern solutions?
Physics have not changed in a few billion years, brother

I suspect that even recent CPUs such as Ryzen 9700X might not be fast enough to outperform older GPUs when running PhysX simulations.
It's more than just speed - you also have to factor data transfer between main memory and VRAM, setting up and tearing down GPU kernels. This is why something like Path of Exile which can generate frankly *ridiculous* amounts of physics particles does their graphical (meaning no interaction back into the gameloop) physics entirely on CPU - the latency of back and forth CPU-GPU processing is just too high.