Budget solutions

40 views
Nick Antonaccio
Nick AntonaccioAdmin
May 16, 2026 at 22:41
#1

There are lot of people building systems that use old GPUs, retired repurposed datacenter hardware, and other options that are less expensive than mainstream consumer GPU hardware.

This guy got Qwen 3. 27b at 8 bit quant (that's the very slow dense version with a verse high precision quantization) running at 20 tokens per second on a GMKtec Evo T1 mini PC with 2 AMD Mi50-32gb GPUs!

https://www.youtube.com/watch?v=ll-N6IFlZvk

Here's what Gemini had to say about that system:

Building this custom hardware setup will cost approximately $1,000 to $1,250 in total. The bulk of this will be the mini PC, while the used server GPUs are highly affordable. A cost breakdown of the components:

  1. GMKtec Evo T1 Mini PCPrice: $820 - $1,000 (depending on retailer deals and configuration) Details: The Evo T1 features an Intel Core Ultra 9 285H processor and a native OCuLink port.Where to buy: You can purchase it directly from the GMKtec Store or check listings on Amazon.

  2. AMD Radeon Instinct MI50 (32GB) GPUsPrice: $150 - $250 per card (used/refurbished) Details: These are retired data center server GPUs. While they offer massive VRAM for the price, they were not designed for consumer desktop use. Where to find them: You can typically find these on eBay or secondary hardware markets.

  3. OCuLink eGPU Docks & Cables Price: $90 - 200 Details: To run these GPUs via OCuLink, you will need an eGPU PCIe docking station (like the AD-GP1 eGPU dock) that supports external PCIe 4.0. You will also need a separate desktop power supply (PSU) to provide adequate power (around 300W} per card) to the MI50 GPUs. Where to buy: eGPU docks are available on Newegg or directly from the GMKtec Advance Series eGPU Page.

Important Technical Considerations

Dual GPU Support: Most OCuLink eGPU docks only support a single graphics card. To run two Mi50s, you would need to modify your setup using multi-port PCIe switch boards, which can add $150 - $300 in hardware complexity.

Lack of Display Outputs: The Mi50 is designed for headless compute tasks. It has no display output enabled on the stock VBIOS. You will need to flash a custom vBIOS (such as a Radeon VII vBIOS) to get video outputs working.

Cooling: Server GPUs generally lack built-in fans. You will need to build or buy custom fans/shrouds to keep them from overheating

Nick Antonaccio
Nick AntonaccioAdmin
May 16, 2026 at 22:49 (edited, 2 revisions)
#2

This guy got Qwen 3.6 27b running on 2 ancient 1080ti GPUs, using turboquant:

https://www.youtube.com/watch?v=7V8G7cQBrus

I certainly wouldn't recommend going that low - it's more just an interesting experiment to push the limit of what's possible - but the fact is, you can actually run a useful LLM, with enough context to get real work accomplished, for somewhere around $500.

Please login to post a reply.

© 2026 AI By Nick.