Would matmul acceleration in the M5 do much to improve token generation? I'm planning to upgrade when the new Studio is updated to M5, but if it's not much of a shift I'll probably look at a second-hand M3 Studio with 256GB+ of RAM.
Edit: of course, the software would need to actually leverage the neural accelerators, so whether it supports them in the first place is another variable.