An AI startup in Hangzhou, east China's Zhejiang Province, is making waves in the global tech community with its groundbreaking advancements in artificial intelligence. DeepSeek, founded by hedge fund manager and AI enthusiast Liang Wenfeng, has released its R1 model, a state-of-the-art AI system trained at a fraction of the cost of mainstream models like OpenAI's ChatGPT.
DeepSeek's R1 model has quickly gained attention for its innovative approach to training large-scale reinforcement learning (RL) models without the need for supervised fine-tuning (SFT) as a preliminary step. Within days of its release, DeepSeek's app soared to the top of the iPhone free app charts in both China and the United States, surpassing established competitors.
This rapid rise has sparked discussions within the global AI community about the shifting landscape of technological innovation. Industry experts are debating whether this signifies a new era where innovative startups like DeepSeek can challenge the dominance of well-resourced tech giants.
In China, the spotlight has turned to DeepSeek's founder, Liang Wenfeng. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang gathered insights from experts, entrepreneurs, and representatives across various sectors on a draft government work report.
Who is Liang Wenfeng?
A graduate of Zhejiang University with a degree in Artificial Intelligence, Liang co-founded the quantitative hedge fund High-Flyer in 2016. The firm quickly gained recognition for its innovative use of AI-driven trading strategies, fully integrating AI into its operations by 2021 to predict market trends and make data-driven investment decisions.
In May 2023, Liang founded DeepSeek with a vision to advance the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek is envisioned as a platform for long-term, foundational research, driven by curiosity and a commitment to exploring the essence of human intelligence.
Liang has maintained a low profile, granting interviews only to Anyong, a sub-brand of China's commercial tech media 36Kr, in 2023 and 2024. Through these interviews, Liang shared insights into his philosophy and vision for AI.
Embracing 'Long-Termism'
For Liang, DeepSeek represents more than a business venture; it is a passion project rooted in a deep curiosity about the nature of intelligence. He acknowledges that foundational research often yields low immediate returns but believes that exploring complex fields like finance and AGI is crucial for meaningful advancements.
\"My focus is on understanding the essence of human intelligence and the processes that underlie it,\" Liang said in an interview with Anyong. \"Such exploration is essential, even without immediate commercial incentives.\"
A New Era for Global AI Innovation
DeepSeek's rapid ascent highlights the increasingly dynamic landscape of global AI research and development. Startups like DeepSeek are demonstrating that innovation can emerge from diverse regions, contributing to a more interconnected and collaborative global tech community.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com