RESEARCH28
Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra
arXiv CS.LGΒ·May 19, 2026
This research systematically optimizes real-time diffusion model inference on Apple M3 Ultra, exploring various techniques like CoreML conversion, quantization, and model distillation. The study achieved 22.7 FPS for 512x512 img2img transformation by combining CoreML conversion of SDXS-512 with a 3-thread camera pipeline.
Read original β