Editor's note: In the realm of artificial intelligence (AI), Liang Wenfeng and his creation, DeepSeek, are emerging as a "mysterious force from the East." CGTN is producing a series on AI to delve into the power of innovation and its global impact. In this article, we take you behind the scenes to explore the man behind DeepSeek, his ideology, and his journey.
An artificial intelligence lab based in Hangzhou, east China's Zhejiang Province, has set Silicon Valley abuzz with the release of its state-of-the-art model, trained at a fraction of the cost of mainstream models such as OpenAI's ChatGPT. The breakthrough has drawn criticism from many AI experts online, who describe it as "counterproductive" to the U.S.'s attempt to curb China's high-tech ambitions.
DeepSeek, founded by hedge fund manager Liang Wenfeng, unveiled its R1 model last Monday, accompanied by a detailed paper outlining how to train a large-scale reinforcement learning (RL) model without relying on supervised fine-tuning (SFT) as a preliminary step.
Within days, DeepSeek's app soared to the top of the iPhone free app charts in both China and the U.S., surpassing the once-dominant ChatGPT. The release of DeepSeek's R1 model has ignited a heated debate in Silicon Valley about whether better-resourced U.S. AI companies, including Meta and OpenAI, can maintain their technological advantage.
Meanwhile, Liang has become a focal point of discussion in China. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought opinions and suggestions from experts, entrepreneurs, and representatives across various sectors—including education, science, culture, health, and sports—on a draft government work report.
About Liang Wenfeng
Liang Wenfeng graduated from Zhejiang University with a degree in Artificial Intelligence. In 2016, he co-founded the quantitative hedge fund High-Flyer, which quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, using machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, Liang took a bold step by founding DeepSeek, a lab dedicated to research aimed at advancing artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research, where curiosity-driven exploration could drive meaningful advancements in AI.
DeepSeek's 'Long-Termism'
For Liang, DeepSeek is more than a business venture; it's a quest fueled by deep curiosity and a commitment to foundational research. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenge of exploring complex fields like finance and the potential of AGI. Liang's focus is on understanding the essence of human intelligence and the processes that underlie it, believing that such exploration is crucial despite the lack of immediate commercial incentives.
"We are not just building an AI model; we are exploring the boundaries of human cognition," Liang said in a rare interview with Anyong, a sub-brand of China's commercial tech media 36Kr. "Our goal is to push the envelope of what's possible in AI research."
DeepSeek's philosophy of 'long-termism' emphasizes patience and persistence in research, valuing long-term potential over short-term gains. This approach has enabled the company to innovate in ways that larger, more commercially focused entities may overlook.
As DeepSeek continues to make waves in the AI community, the world watches to see how Liang Wenfeng and his team will shape the future of artificial intelligence.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com