Editor's note: In the ever-evolving world of artificial intelligence (AI), Liang Wenfeng and his creation, DeepSeek, are emerging as a significant force from the East. This article delves into the mind behind DeepSeek, exploring Liang's ideology and journey.
An AI lab based in Hangzhou, east China's Zhejiang Province, has captured global attention with the release of its cutting-edge model. Trained at a fraction of the cost of mainstream models like OpenAI's ChatGPT, DeepSeek's breakthrough has sparked conversations worldwide.
Founded by hedge fund manager Liang Wenfeng, DeepSeek unveiled its R1 model last Monday. Accompanied by a detailed paper, the release outlines how to train a large-scale reinforcement learning (RL) model without relying on supervised fine-tuning (SFT) as a preliminary step.
Within days, DeepSeek's app soared to the top of the iPhone free app charts in both China and the U.S., surpassing the previously dominant ChatGPT. The rapid ascent has ignited debates in tech hubs like Silicon Valley about the shifting dynamics in AI innovation.
Meanwhile, Liang has become a focal point of discussion in China. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought insights from experts, entrepreneurs, and representatives across various sectors—including education, science, culture, health, and sports—on a draft government work report.
About Liang Wenfeng
Liang Wenfeng graduated from Zhejiang University with a degree in artificial intelligence. In 2016, he co-founded the quantitative hedge fund High-Flyer, which quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, utilizing machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, Liang took a bold step by founding DeepSeek, focusing on advancing the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research, where curiosity-driven exploration could drive meaningful advancements in AI.
Liang has remained low-profile, granting interviews only to Anyong, a sub-brand of China's commercial tech media 36Kr, in 2023 and 2024. Below are translated excerpts from these interviews, offering a glimpse into his philosophy and vision.
DeepSeek's Commitment to Long-Term Innovation
For Liang, DeepSeek is more than just a business venture; it's a passion project driven by deep curiosity and a commitment to foundational research. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenge of exploring complex fields like finance and the potential of AGI.
Liang's focus is on understanding the essence of human intelligence and the underlying processes that drive it. He believes that exploring these complex questions is crucial for the advancement of AI, despite the lack of immediate commercial incentives. This long-term approach positions DeepSeek at the forefront of innovation, prioritizing meaningful advancements over short-term gains.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com