【推理引擎】模型压缩04:训练后量化QAT深度解读!与量化部署核心原理! 训练后量化分为动态离线量化(Post Training Quantization Dynamic, PTQ Dynamic)和静态离线量化(Post Training Quantization Static, PTQ Static),不管是哪种量化方式,同样需要在端侧真正部署起来。