I've been running various models on a Mac Pro 2013 (8 cores, 32 GB RAM) at about...

neverartful · 2026-06-02T01:04:36 1780362276

I have and use a Mac Pro 2013 too. Mine is 8 cores with 64 GB RAM. I haven't used mine for any LLM workloads, but it does just fine for most stuff. My biggest concern with it is the OS. I'm still running macOS (the latest supported version) but it's getting continually further out-of-date security wise all the time.

fooker · 2026-06-01T10:07:46 1780308466

What are the tasks that do well with 8-10 t/s ?

wazoox · 2026-06-01T12:19:55 1780316395

The sort of task you don't expect to end immediately. If extracting data from a bunch of PDFs takes 1 hour or the whole night, that doesn't make much difference to me. It's not fast enough for auto completion and slightly too slow for chat (but bearable IMO).

fooker · 2026-06-01T15:08:25 1780326505

Running a local llm at 10 t/s overnight to extract data from a few PDFs will burn more in electricity than paying cents for the hosted kimi models.

You can (sometimes) break even if you have a workstation GPU.

wazoox · 2026-06-01T20:24:08 1780345448

Sometimes data privacy is paramount.