Editor's note: In the realm of artificial intelligence, Liang Wenfeng and his creation, DeepSeek, are emerging as a mysterious force from the East. This article delves into the power of innovation and its global impact by exploring the man behind DeepSeek, his ideology, and his journey.
An artificial intelligence lab based in Hangzhou, east China's Zhejiang Province, has set the tech world abuzz with the release of its state-of-the-art model. Trained at a fraction of the cost of mainstream models such as OpenAI's ChatGPT, this breakthrough has drawn attention and sparked discussions about China's advancements in AI technology.
DeepSeek, founded by hedge fund manager Liang Wenfeng, unveiled its R1 model last Monday. The release was accompanied by a detailed paper outlining how to train a large-scale reinforcement learning model without relying on supervised fine-tuning as a preliminary step. Within days, DeepSeek's app soared to the top of the iPhone free app charts in both China and the United States, surpassing the once-dominant ChatGPT.
The unveiling of DeepSeek's R1 model has ignited a heated debate in Silicon Valley about whether better-resourced U.S. AI companies, including Meta and OpenAI, can maintain their technological advantage. The rapid rise of DeepSeek underscores the dynamic nature of the global AI landscape and highlights the innovative spirit driving China's tech sector.
About Liang Wenfeng
Liang Wenfeng graduated from Zhejiang University with a degree in Artificial Intelligence. In 2016, he co-founded the quantitative hedge fund High-Flyer, which quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, using machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, Liang founded DeepSeek, aiming to advance the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research, where curiosity-driven exploration could drive meaningful advancements in AI.
Liang has remained low-profile, granting interviews only to Anyong, a sub-brand of China's commercial tech media 36Kr, in 2023 and 2024. These rare glimpses into his philosophy reveal a deep commitment to foundational research and a belief in the importance of understanding the essence of human intelligence.
DeepSeek's 'Long-Termism'
For Liang, DeepSeek represents more than just a business venture; it's a pursuit driven by deep curiosity and a commitment to exploring complex fields. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenge of delving into the intricacies of AI and AGI. Liang's focus is on understanding the underlying processes of human intelligence, believing that such exploration is crucial for meaningful advancements in the field.
His approach reflects a philosophy of 'long-termism,' prioritizing sustained research over immediate commercial gains. This mindset has positioned DeepSeek as a significant player in the AI landscape, fostering innovation and pushing the boundaries of what is possible in artificial intelligence.
The story of Liang Wenfeng and DeepSeek exemplifies the dynamic and rapidly evolving nature of the global AI industry. As China's tech sector continues to grow, figures like Liang are shaping the future of artificial intelligence, contributing to a more diverse and competitive global landscape.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com