An artificial intelligence (AI) laboratory based in Hangzhou, east China's Zhejiang Province, has set Silicon Valley abuzz with the release of its state-of-the-art model. Trained at a fraction of the cost of mainstream models like OpenAI's ChatGPT, this breakthrough has sparked discussions about China's accelerating advancements in AI technology.
DeepSeek, founded by hedge fund manager Liang Wenfeng, unveiled its R1 model last Monday. The release was accompanied by a detailed paper outlining a novel approach to training large-scale reinforcement learning (RL) models without relying on supervised fine-tuning (SFT) as a preliminary step. Within days, DeepSeek's app soared to the top of the iPhone free app charts in both China and the United States, surpassing the previously dominant ChatGPT.
Igniting Debates in the Tech World
The emergence of DeepSeek's R1 model has ignited a heated debate in Silicon Valley about whether better-resourced U.S. AI companies, including Meta and OpenAI, can maintain their technological advantage. Some AI experts have described DeepSeek's advancements as countering attempts to curb China's high-tech ambitions.
Meanwhile, Liang has become a focal point of discussion in China. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought opinions and suggestions from experts, entrepreneurs, and representatives across various sectors—including education, science, culture, health, and sports—on a draft government work report.
Who Is Liang Wenfeng?
Liang Wenfeng graduated from Zhejiang University with a degree in artificial intelligence. In 2016, he co-founded the quantitative hedge fund High-Flyer, which quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, using machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, Liang founded DeepSeek with the aim of advancing the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research where curiosity-driven exploration could drive meaningful advancements in AI.
Embracing 'Long-Termism'
For Liang, DeepSeek represents a commitment to foundational research driven by deep curiosity. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenges of exploring complex fields like finance and the potential of AGI. Liang focuses on understanding the essence of human intelligence and the processes that underlie it, believing that such exploration is crucial despite the lack of immediate commercial incentives.
Liang has maintained a low profile, granting interviews only to Anyong, a sub-brand of China's commercial tech media 36Kr, in 2023 and 2024. In these interviews, he shared insights into his philosophy and vision, emphasizing the importance of long-term commitment to research and innovation.
A New Chapter in AI Innovation
DeepSeek's rapid ascent reflects a broader trend of innovation and ambition within China's tech landscape. As the company continues to challenge established norms and push the boundaries of AI research, industry observers are keenly watching how this 'mysterious force from the East' will shape the future of artificial intelligence globally.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com