Hello, I want to standardize my home servers and reduce them to 3 Proxmox computers. 2x a Tiny server and a slightly more powerful one for AI (ollama/open webui and deepseek-r1-70b, CPU based only, no GPU).
For the more powerful server, I am wavering between 2 processors: i9-10940X vs. i9-14900KS.
i9-10940X
- 14 Cores (3,30-4,8 GHz == 67,2 GHz)
- 28 Threads
- Quadro-Channel DDR4-2933 (PC4-23466, 93.9GB/s)
i9-14900KS
- 24 Cores (8Power+16Economy - 2,40-6,2 GHz == 117,6GHz)
- 32 Threads
- Dual-Channel DDR5-5600 (PC5-44800, 89.6GB/s)
I don’t like the Idea of the Power/Economy-Cores… And the newer i9 has only dual-channel for RAM instead of quad. But it has double of GHz.
Which is better for my solution? I also want a relative low idle power consumption.
thank you all!
Need max AVX instructions. Anything with P/E cores is junk. Only enterprise P cores have the max AVX instructions. When P/E are mixed the advanced AVX is disabled in microcode because the CPU scheduler is unable to determine if a process thread contains an AVX instruction and there is no asymmetrical scheduler that handles this. Prior to early 12k series Intel, the microcode for P enterprise could allegedly run if swapped manually. This was “fused off” to prevent it, probably because Linux could easily be adapted to asymmetrical scheduling but Windows would probably not. The whole reason W11 had to be made was because of the E-cores and the way the scheduler and spin up of idol cores works, at least according to someone on Linux Plumbers for the CPU scheduler ~2020. There are already asymmetric schedulers in Android ARM.
Anyways I think it was on Gamer’s Nexus in the last week or two that Intel was doing some all P core consumer stuff. I’d look at that. According to chips and cheese, the primary CPU bottleneck for tensors is the bus width and clock management of the L2 to L1 cache.
I do alright with my laptop, but haven’t tried R1 stuff yet. The 70B llama2 stuff that I ran was untenable for CPU only with a 12700 with just CPU. It is a little slower than my reading pace when split with a 16 GB GPU, and that was running a 4 bit quantization version.
At which altar should we pray to the idol cores?
Plan 9
Thank you! I will look into Gamer’s Nexus.