Kimi, from Beijing-based Moonshot AI, is a series of open-source large language models that has reached parity with, and on several benchmarks surpassed, OpenAI’s GPT-5, Anthropic’s Claude, and Google’s Gemini. With a 1-trillion-parameter mixture-of-experts architecture, 32 billion active parameters, and input-token pricing as much as 100x lower than some competitors, Kimi represents a significant shift in the global AI landscape [1].

What Is Kimi?

Kimi is the flagship AI chatbot and model series developed by Moonshot AI (月之暗面, meaning “Dark Side of the Moon”), a Beijing-based artificial intelligence company founded in March 2023 by Yang Zhilin, Zhou Xinyu, and Wu Yuxin, three Tsinghua University graduates [2]. The company name pays homage to Pink Floyd’s The Dark Side of the Moon, released exactly 50 years before the startup’s founding [3].

The Kimi series has evolved rapidly since its October 2023 debut:

  • Kimi (October 2023): Initial chatbot capable of processing 200,000 Chinese characters per conversation [4]
  • Kimi K1.5 (January 2025): First model to match OpenAI o1’s reasoning capabilities [5]
  • Kimi K2 (July 2025): Open-source trillion-parameter model with industry-leading coding performance [6]
  • Kimi K2 Thinking (November 2025): Reasoning-focused variant outperforming GPT-5 and Claude Sonnet 4.5 [7]
  • Kimi K2.5 (January 2026): Native multimodal model with vision and video capabilities [8]

💡 Key Insight: Moonshot AI reached a $3.8 billion valuation as of October 2025, backed by Alibaba, Tencent, and IDG Capital, making it the highest-valued unicorn among China’s “Six AI Tigers” [9].

How Does Kimi Work?

Architecture and Technical Specifications

Kimi K2 and its successors employ a Mixture-of-Experts (MoE) architecture that represents a significant engineering achievement:

| Specification | Kimi K2/K2.5 |
| --- | --- |
| Total Parameters | 1 trillion |
| Active Parameters | 32 billion |
| Context Window | 256K tokens |
| Training Data | 15.5T tokens |
| Architecture | 61 layers, 384 experts |
| Vision Encoder | 400M parameters (K2.5) |
| License | Modified MIT |

The model activates only 32 billion parameters during inference, achieving efficiency comparable to much smaller models while retaining the knowledge capacity of a trillion-parameter system [10].
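
To make the "only a fraction is active" idea concrete, here is a minimal sketch of top-k expert routing for a single token. It is a generic illustration of the MoE technique, not Moonshot's implementation: the expert count comes from the table above, while `TOP_K = 8`, the random router scores, and the function names are assumptions made for this example.

```python
import math
import random

NUM_EXPERTS = 384   # expert count from the specification table
TOP_K = 8           # assumption: experts chosen per token (not stated in this article)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k=TOP_K):
    """Pick the top_k experts for one token and renormalize their weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    mass = sum(probs[i] for i in chosen)
    return [(i, probs[i] / mass) for i in chosen]


if __name__ == "__main__":
    rng = random.Random(0)  # stand-in for a learned router's output
    logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
    selected = route_token(logits)
    print(f"experts used for this token: {len(selected)} of {NUM_EXPERTS}")
    print(f"share of total parameters active: {32e9 / 1e12:.1%}")
```

Only the selected experts run for a given token, which is why compute per token tracks the 32 billion active parameters (about 3.2% of the total) rather than the full trillion.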

The Muon Optimizer

Moonshot AI scaled the Muon optimizer to large-model training; the company claims it is about twice as computationally efficient as the standard AdamW optimizer and that it enabled training a trillion-parameter model with “zero training instability” [11]. Separately, Moonshot’s paper detailing the Mooncake serving architecture earned the Erik Riedel Best Paper Award at the USENIX FAST conference [12].

Reinforcement Learning Approach

The Kimi K1.5 technical report reveals Moonshot’s reinforcement learning methodology achieves state-of-the-art reasoning through:

  • Long context scaling: Processing up to 2 million Chinese characters in a single prompt [13]
  • Improved policy optimization: Eliminating complex techniques like Monte Carlo tree search
  • No process reward models: Simplifying the training pipeline while maintaining performance [14]

Kimi K1.5 achieved 77.5 on AIME mathematics benchmarks, 96.2 on MATH-500, and the 94th percentile on Codeforces, matching OpenAI’s o1 [15].

Kimi vs. Claude and ChatGPT: Feature Comparison

| Feature | Kimi K2.5 | GPT-5.2 | Claude 4.5 Opus | Gemini 3 Pro |
| --- | --- | --- | --- | --- |
| Parameters | 1T total / 32B active | Undisclosed | Undisclosed | Undisclosed |
| Context Window | 256K | 200K | 200K | 2M |
| Open Source | Yes | No | No | No |
| Vision Capabilities | Native | Yes | Yes | Yes |
| Video Processing | Yes | Limited | Limited | Yes |
| Input Price (per 1M tokens) | $0.15 | $1.25 | $15 | Varies |
| Output Price (per 1M tokens) | $2.50 | $10 | $75 | Varies |
| HLE Benchmark | 50.2% (w/ tools) | 45.5% | 43.2% | 45.8% |
| SWE-Bench Verified | 76.8% | 80.0% | 80.9% | 76.2% |

⚠️ Price Advantage: Kimi’s input tokens are priced 100x lower than Claude Opus 4’s ($0.15 vs. $15 per 1M) and its output tokens 30x lower ($2.50 vs. $75 per 1M), a dramatic cost differential for enterprises [16].
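
The ratios can be checked directly from the prices quoted in this article. The sketch below uses the article's own per-million-token figures; the workload of 10M input and 1M output tokens is a hypothetical chosen only to show how the blended saving depends on the input/output mix.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "Kimi K2.5": {"input": 0.15, "output": 2.50},
    "GPT-5.2": {"input": 1.25, "output": 10.00},
    "Claude 4.5 Opus": {"input": 15.00, "output": 75.00},
}


def price_ratio(other, kind, baseline="Kimi K2.5"):
    """How many times more expensive `other` is than `baseline` for `kind` tokens."""
    return PRICES[other][kind] / PRICES[baseline][kind]


def job_cost(model, input_tokens, output_tokens):
    """Cost in USD of one job with the given token counts."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000


if __name__ == "__main__":
    for name in ("GPT-5.2", "Claude 4.5 Opus"):
        print(name, round(price_ratio(name, "input"), 1), round(price_ratio(name, "output"), 1))
    # Hypothetical workload: 10M input tokens and 1M output tokens.
    for name in PRICES:
        print(name, f"${job_cost(name, 10_000_000, 1_000_000):.2f}")
```

For that hypothetical workload the totals come to $4.00 versus $225.00, a blended gap of roughly 56x: the 100x figure applies to input tokens only, so output-heavy workloads see a smaller saving.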

What Makes Kimi Unique?

1. Open-Source Strategy

Unlike OpenAI, Anthropic, and Google, Moonshot AI releases full model weights under a Modified MIT License. The only restriction: products exceeding 100 million monthly users or $20 million monthly revenue must display “Kimi K2” in the interface [17].
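
Read literally, the restriction is a simple either-or threshold check. The sketch below encodes this article's summary of the clause; the function and constant names are hypothetical, and the license text itself remains the authority on how users and revenue are measured.

```python
MAU_THRESHOLD = 100_000_000          # monthly users, per the license summary above
REVENUE_THRESHOLD_USD = 20_000_000   # monthly revenue in USD, per the same summary


def must_display_kimi_k2(monthly_users, monthly_revenue_usd):
    """Return True if a product crosses either threshold and so must show "Kimi K2"."""
    return monthly_users > MAU_THRESHOLD or monthly_revenue_usd > REVENUE_THRESHOLD_USD


if __name__ == "__main__":
    print(must_display_kimi_k2(2_000_000, 500_000))       # small product
    print(must_display_kimi_k2(150_000_000, 500_000))     # large user base
```

Because "exceeding" is treated as strictly greater than, a product sitting exactly at both thresholds is not flagged in this reading.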

This approach follows a trend among Chinese AI companies to counter U.S. technology restrictions through open-source proliferation [18]. As one analyst noted: “The hope is countries apart from China will use these models to ensure large amounts of applications are built on these Chinese models” [19].

2. Agentic Intelligence

Kimi K2 and K2.5 are explicitly designed for agentic tasks—autonomous multi-step operations requiring tool use, reasoning, and problem-solving:

  • Tool calling: Native support for 200-300 sequential tool calls without human intervention [20]
  • Agent Swarm (K2.5): Self-directed coordination of multiple domain-specific agents working in parallel [21]
  • BrowseComp performance: 78.4% in Agent Swarm mode, compared to GPT-5’s 57.8% [22]
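
As a rough picture of what a long sequential tool-calling run involves, the sketch below drives a loop in which a policy either requests a tool or returns a final answer, with a cap on the number of calls. Everything here is a stand-in: `next_action` is a hypothetical placeholder for a model call, the toy policy and `increment` tool exist only so the loop can run, and the cap of 300 simply mirrors the upper figure quoted above.

```python
def run_agent(next_action, tools, task, max_tool_calls=300):
    """Drive a sequential tool-calling loop until the policy returns a final answer.

    next_action(task, history) is a hypothetical stand-in for a model call. It returns
    either ("call", tool_name, argument) or ("final", answer).
    """
    history = []
    for _ in range(max_tool_calls):
        action = next_action(task, history)
        if action[0] == "final":
            return action[1], history
        _, tool_name, argument = action
        result = tools[tool_name](argument)
        history.append((tool_name, argument, result))
    raise RuntimeError(f"no final answer within {max_tool_calls} tool calls")


# Toy policy and tool, for illustration only: add 1 until the target is reached.
def toy_policy(task, history):
    current = history[-1][2] if history else 0
    if current >= task:
        return ("final", current)
    return ("call", "increment", current)


if __name__ == "__main__":
    answer, trace = run_agent(toy_policy, {"increment": lambda x: x + 1}, task=250)
    print(answer, len(trace))
```

Each tool result is appended to the history that the next decision sees, so the difficulty in practice is staying coherent as that history grows over hundreds of steps.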

🔧 Developer Perspective: “K2 is the first model I feel comfortable using in production since Claude 3.5 Sonnet,” said Pietro Schirano, founder of AI startup MagicPath [23].

3. Long-Context Leadership

Moonshot AI pioneered ultra-long context processing:

  • March 2024: Kimi claimed 2 million Chinese characters per prompt [24]
  • Current: 256K tokens standard across K2 series
  • Practical application: Legal documents, fiction writing, deep financial analysis [25]

The demand surge caused a two-day outage in March 2024, prompting a public apology from the company [26].

4. Native Multimodality (K2.5)

Kimi K2.5 introduces MoonViT, a 400-million-parameter vision encoder enabling:

  • Image and video understanding
  • Code generation from visual specifications (UI designs, video workflows)
  • Visual data processing through autonomous tool orchestration [27]

The model can replicate website user journeys from video demonstrations alone, a capability previously unavailable in open-source models [28].

The Chinese AI Landscape

Market Position

Kimi’s journey reflects the volatile Chinese AI market:

  • August 2024: Ranked #3 in monthly active users among Chinese AI chatbots [29]
  • June 2025: Dropped to #7 following DeepSeek’s disruptive R1 release [30]
  • Post-K2: Reclaimed prominence with open-source releases

The “Six AI Tigers”

Moonshot AI is one of six Chinese AI startups dubbed the “Six Tigers” [31]:

  1. Moonshot AI (Kimi)
  2. Zhipu AI (GLM)
  3. MiniMax (MiniMax-M2)
  4. 01.AI (Yi)
  5. Baichuan
  6. StepFun

DeepSeek, although a major competitor, is not usually counted among the six.

Funding Trajectory

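| Date | Round | Amount | Valuation | Lead Investors |
| --- | --- | --- | --- | --- |
| 2023 | Seed | $60M | $300M | HongShan, Zhen Fund |
| Feb 2024 | Series B | $1B | $2.5B | Alibaba, HongShan |
| Aug 2024 | Series C | $300M | $3.3B | Tencent, Gaorong Capital |
| Oct 2025 | Series D | $600M | $3.8B | IDG Capital, Tencent [32] |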

Challenges and Limitations

Despite impressive benchmarks, Kimi faces several challenges:

  • Hallucinations: Initial reviews noted instances of fabricated information, an issue common to all LLMs [33]
  • Tool integration: Counterpoint analysts noted that tooling for integrating K2 with existing tech systems is still being developed [34]
  • Geopolitical barriers: U.S. restrictions on Chinese technology limit Western enterprise adoption
  • Market competition: DeepSeek’s ultra-low-cost models continue pressuring all Chinese AI players

FAQ

Is Kimi free to use?

Yes. Kimi is available free through its web interface and mobile app. API access starts at $0.15 per million input tokens and $2.50 per million output tokens, significantly cheaper than GPT-5 ($1.25/$10) or Claude Opus 4 ($15/$75) [35].

Can I run Kimi locally?

Yes. Kimi K2, K2 Thinking, and K2.5 weights are available on Hugging Face. The models run on inference engines including vLLM, SGLang, KTransformers, and TensorRT-LLM. Native INT4 quantization enables efficient deployment [36].
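
"Locally" still means server-class hardware, and a back-of-the-envelope estimate of weight storage shows why quantization matters. The sketch below is arithmetic only: it assumes every parameter is stored at the stated precision and ignores quantization scales, the KV cache, activations, and any layers kept at higher precision, so real footprints will differ.

```python
TOTAL_PARAMS = 1_000_000_000_000   # 1 trillion total parameters, per the spec table
ACTIVE_PARAMS = 32_000_000_000     # 32 billion active per token

BITS_PER_PARAM = {"BF16": 16, "INT8": 8, "INT4": 4}


def weight_gigabytes(num_params, bits):
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_PARAM.items():
        print(f"{name}: ~{weight_gigabytes(TOTAL_PARAMS, bits):,.0f} GB for all weights")
```

Under these assumptions INT4 brings the full set of weights to roughly 500 GB, down from about 2,000 GB at 16-bit precision. All experts must be held in memory even though only 32 billion parameters are used per token, so the total count, not the active count, sets the memory requirement.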

How does Kimi compare to DeepSeek?

Both are Chinese open-source models, but Kimi targets agentic and coding tasks while DeepSeek focuses on general reasoning. DeepSeek disrupted markets in January 2025 with ultra-low pricing; Kimi responded with superior benchmark performance in coding and tool use.

Who founded Moonshot AI?

Yang Zhilin, a 31-year-old Tsinghua University graduate with a computer science PhD from Carnegie Mellon University, founded Moonshot AI with Zhou Xinyu and Wu Yuxin. Yang previously worked at Google Brain and Meta AI, and co-authored Transformer-XL [37].

What does “Moonshot AI” mean?

The Chinese name (月之暗面) translates to “Dark Side of the Moon,” inspired by founder Yang Zhilin’s favorite Pink Floyd album, released exactly 50 years before the company’s founding [38].


Conclusion

Kimi represents a pivotal development in the global AI race: a Chinese open-source model achieving parity with—and occasionally exceeding—the most advanced Western proprietary systems. Its combination of trillion-parameter scale, aggressive pricing, open licensing, and agentic capabilities positions it as a serious alternative for enterprises and developers worldwide.

The rapid evolution from Kimi’s 2023 debut to K2.5’s multimodal agent swarm demonstrates the pace of Chinese AI development. As Google DeepMind CEO Demis Hassabis acknowledged, Chinese AI models may be only “months” behind U.S. counterparts [39]. For enterprises evaluating AI solutions, Kimi offers a compelling case study in how open-source, efficiency-optimized architectures can compete with billion-dollar proprietary systems.


Footnotes

  1. VentureBeat, “Moonshot’s open source Kimi K2 Thinking outperforms GPT-5, Claude Sonnet 4.5” (November 2025)

  2. Wikipedia, “Moonshot AI” (accessed February 2026)

  3. TechCrunch, “China’s Moonshot AI zooms to $2.5B valuation” (February 2024)

  4. Ibid.

  5. arXiv, “Kimi k1.5: Scaling Reinforcement Learning with LLMs” (January 2025)

  6. Reuters, “China’s Moonshot AI releases open-source model to reclaim market position” (July 2025)

  7. VentureBeat, op. cit.

  8. Hugging Face, “Kimi K2.5 Model Card” (January 2026)

  9. Bloomberg, “Alibaba Leads Record Deal to Mint $2.5 Billion China AI Firm” (February 2024); TechNode, “Moonshot AI raising new funding” (October 2025)

  10. Hugging Face, “Kimi K2-Instruct Model Card” (July 2025)

  11. arXiv, “Muon is Scalable for LLM Training” (February 2025)

  12. SCMP, “Chinese team wins award for AI booster” (March 2025)

  13. SCMP, “Moonshot AI’s Kimi Chatbot offers paid service” (May 2024)

  14. arXiv, op. cit.

  15. Ibid.

  16. CNBC, “Alibaba-backed Moonshot releases new Kimi AI model” (July 2025)

  17. Hugging Face, “Kimi K2 License” (July 2025)

  18. Reuters, op. cit.

  19. CNBC, “Chinese tech companies accelerate AI model rollouts” (January 2026)

  20. VentureBeat, op. cit.

  21. Hugging Face, “Kimi K2.5 Model Card” (January 2026)

  22. Ibid.

  23. CNBC, op. cit. (July 2025)

  24. SCMP, op. cit. (May 2024)

  25. TechCrunch, op. cit.

  26. SCMP, op. cit. (May 2024)

  27. Hugging Face, “Kimi K2.5 Model Card” (January 2026)

  28. Wikipedia, op. cit.

  29. aicpb.com, cited in Reuters (July 2025)

  30. Ibid.

  31. Quartz, “Meet the ‘Six Tigers’ that dominate China’s AI industry” (March 2025)

  32. Pandaily, “Kimi Nears $600 Million Funding Round” (October 2025); Bloomberg, op. cit.

  33. CNBC, op. cit. (July 2025)

  34. Ibid.

  35. CNBC, op. cit. (July 2025); Hugging Face, op. cit.

  36. Hugging Face, “Kimi K2-Instruct Model Card” (July 2025)

  37. TechCrunch, op. cit.

  38. Ibid.

  39. CNBC, “Google DeepMind: China AI models ‘months’ behind” (January 2026)
