Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.
The forums have been upgraded with support for dark mode. By default it will follow the setting on your system/browser. You may override it by scrolling to the end of the page and clicking the gears icon.
It should be fit into the 20GB VRAM,
Will check a different Gemma 27B model tomorrow.
-----------
latter ----
Somewhat better performance with this
Q4_K_M 16.5 GB
https://huggingface.co/lmstudio-community/gemma-3-27b-it-GGUF
We need more cases supporting vertical GPU mounting with at least 4 slot thickness, while the rest of the slots are available on the mainboard!
Please Be Quiet, Cooler Master, Fractal Design!
New, supposedly super efficient QWEN 3 is here
https://huggingface.co/Qwen/Qwen3-8B — 63 token/s with Q8
https://huggingface.co/Qwen/Qwen3-30B-A3B — 18 tokens/s with Q6_K
https://huggingface.co/Qwen/Qwen3-235B-A22B — n/a :D
What monster rig you have?
And also, what speeds you can get?
I could get one more of the kit I have to run it, but it would be still like 0.7 token/s or maybe even less :D
I am not sure about the actual devs, they do what the management demanding from them.
So they need to rush and crunch - I am pretty sure they could finish and polish all TES games nicely, Consider the wide scope they always aiming for.
More time (and money) needed for that. So I will not damn...