In the rapidly evolving world of artificial intelligence (AI), a new name has begun to resonate: DeepSeek, an AI lab based in Hangzhou, in east China's Zhejiang Province. Founded by the enigmatic hedge fund manager Liang Wenfeng, DeepSeek has recently disrupted the global AI landscape with the release of its groundbreaking R1 model.
The R1 model, unveiled just last week, has been making headlines for its advanced capabilities and the unconventional approach taken in its development. Unlike mainstream models such as OpenAI's ChatGPT, DeepSeek's R1 was trained without relying on supervised fine-tuning (SFT) as a preliminary step. This innovative methodology allowed the team to train a large-scale reinforcement learning (RL) model at a fraction of the cost typically associated with such endeavors.
A Rising Star in the AI Arena
The launch of the R1 model sent ripples through Silicon Valley, sparking debates among AI experts about the future of technological leadership. Within days, DeepSeek's app skyrocketed to the top of the iPhone free app charts in both China and the United States, surpassing established competitors and capturing the attention of tech enthusiasts worldwide.
Critics have described DeepSeek's rapid ascent as a challenge to the technological advantage previously held by better-resourced U.S. AI companies. The breakthrough has also been viewed as a counterpoint to attempts at curbing the Chinese mainland's high-tech ambitions, highlighting the nation's growing prowess in the field of AI.
The Man Behind the Vision: Liang Wenfeng
Despite the sudden spotlight, founder Liang Wenfeng remains a figure shrouded in mystery. A graduate of Zhejiang University with a degree in Artificial Intelligence, Liang co-founded the quantitative hedge fund High-Flyer in 2016. The fund quickly gained recognition for its innovative use of AI-driven trading strategies, fully integrating machine learning models to predict market trends and make data-driven investment decisions by 2021.
In May 2023, Liang embarked on a new journey by founding DeepSeek, aiming to push the boundaries of AI research and advance the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek is envisioned as a platform for long-term, fundamental research where curiosity and exploration drive meaningful advancements without the immediate pressures of commercial success.
Embracing Long-Termism
Liang's philosophy centers around what he calls \"long-termism.\" For him, DeepSeek is more than a business venture; it is a passion project fueled by a deep commitment to foundational research. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenge of exploring complex fields and uncovering the essence of human intelligence.
\"Understanding the underlying processes of intelligence is crucial,\" Liang shared in one of his rare interviews with Anyong, a sub-brand of China's tech media 36Kr. \"It's not just about creating profitable applications but about pushing the boundaries of what we know and can achieve with AI.\"
A Quiet Influence
Liang's low-profile demeanor contrasts sharply with the impact his work is having on the global stage. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought input from experts across various sectors on a draft government work report. Liang's participation underscores the significance of DeepSeek's contributions to the Chinese mainland's technological ambitions.
As DeepSeek continues to make strides, the global tech community watches with keen interest. Whether Liang's long-term vision will reshape the AI landscape remains to be seen, but one thing is certain: DeepSeek represents a new and intriguing chapter in the story of artificial intelligence.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com