You're mistaken. Layers are not swapped between GPUs by default. Each card will do it's own thing.IIRC, you cannot 'spread' LLM workloads across seperate VRAM segments.
It's so fucking dumb. A top slot, 3-4 slots of fucking nothing, then a 5.0 "AI" slot at the very bottom of the board.I know. I've given up on looking towards AM5 until a new PROMOTORY is released. The standardized fanout for PCIe on AM5 is unironically, literally, and un-amusingly "retarded".
Everybody who worked on X870 should get fired.