how can I go about applying o3 quant config (static per-tensor a8w8) to llama? I don't see this in example notebooks
how can I go about applying o3 quant config (static per-tensor a8w8) to llama? I don't see this in example notebooks