tompark
9 hours ago
More context at: "Moonshot's Kimi K2 uses a 1T-parameter MoE architecture with 32B active parameters and outperforms models like GPT-4.1 and DeepSeek-V3 on key benchmarks" <https://www.techmeme.com/250712/p11#a250712p11>
9 hours ago
More context at: "Moonshot's Kimi K2 uses a 1T-parameter MoE architecture with 32B active parameters and outperforms models like GPT-4.1 and DeepSeek-V3 on key benchmarks" <https://www.techmeme.com/250712/p11#a250712p11>