Could we use a small e-GPU for prefill and the large unified VRAM on Macs for the best of both worlds?
Could we use a small e-GPU for prefill and the large unified VRAM on Macs for the best of both worlds?