Z.ai releases open-source GLM-4.6
Z.ai, formerly Zhipu AI, has released GLM-4.6, an open-source large model with enhanced agentic coding and expanded deployment options. The release marks the first FP8 and Int4 quantization integration on Cambricon chips and also runs with native FP8 precision on Moore Threads GPUs via the vLLM inference framework.