In the rapidly evolving world of artificial intelligence (AI), a new player has emerged from the Chinese mainland, capturing global attention. DeepSeek, an AI lab based in Hangzhou, Zhejiang Province, has set Silicon Valley abuzz with its groundbreaking AI model, challenging industry giants and reshaping the competitive landscape.
Founded by the enigmatic entrepreneur Liang Wenfeng, DeepSeek recently unveiled its R1 model, a state-of-the-art reinforcement learning (RL) system trained without relying on supervised fine-tuning (SFT) as a preliminary step. This innovation allows for significant cost reductions in training large-scale AI models, positioning DeepSeek at the forefront of AI research.
Within days of its release, DeepSeek's app soared to the top of the iPhone free app charts in both China and the United States, surpassing the once-dominant ChatGPT. The swift rise of DeepSeek has ignited a heated debate in Silicon Valley about whether better-resourced U.S. AI companies can maintain their technological advantage.
An Entrepreneur's Journey
Liang Wenfeng, a Zhejiang University graduate in Artificial Intelligence, is no stranger to leveraging cutting-edge technology. In 2016, he co-founded the quantitative hedge fund High-Flyer, which quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, utilizing machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, driven by a deep curiosity and commitment to foundational research, Liang founded DeepSeek with the aim of advancing the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research where curiosity-driven exploration could drive meaningful advancements in AI.
Liang has remained low-profile, granting interviews exclusively to Anyong, a sub-brand of China's commercial tech media 36Kr, in 2023 and 2024. He describes DeepSeek as more of a side project or hobby, emphasizing that understanding the essence of human intelligence is crucial despite the lack of immediate commercial incentives.
The breakthrough achieved by DeepSeek has drawn criticism from some AI experts in the United States, who describe it as counterproductive to attempts to curb China's high-tech ambitions. Nevertheless, the success of DeepSeek highlights the dynamic and rapidly advancing AI landscape in the Chinese mainland.
Last week, Liang was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought opinions and suggestions from experts, entrepreneurs, and representatives across various sectors on a draft government work report. Liang's participation underscores the growing importance of AI research and development in national strategic planning.
As DeepSeek continues to innovate, its impact on the global AI community is undeniable. The story of Liang Wenfeng and DeepSeek exemplifies the spirit of innovation and the pursuit of knowledge that is driving the next generation of technological advancements.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com