An artificial intelligence lab based in Hangzhou, east China's Zhejiang Province, has set Silicon Valley abuzz with its groundbreaking state-of-the-art model. Trained at a fraction of the cost of mainstream models like OpenAI's ChatGPT, DeepSeek's innovative approach is challenging the global AI landscape.
DeepSeek, founded by hedge fund manager Liang Wenfeng, unveiled its R1 model last Monday. Accompanied by a detailed paper, the model outlines how to train a large-scale reinforcement learning (RL) model without relying on supervised fine-tuning (SFT) as a preliminary step. This breakthrough has drawn both admiration and criticism, with some AI experts describing it as countering the U.S.'s efforts to curb China's high-tech ambitions.
Within days of its release, DeepSeek's app soared to the top of the iPhone free app charts in both China and the U.S., surpassing the once-dominant ChatGPT. The rapid ascent has ignited a heated debate in Silicon Valley about whether well-resourced U.S. AI companies, including Meta and OpenAI, can maintain their technological advantage.
Meanwhile, Liang has become a focal point of discussion in China. Last week, he was invited to a symposium in Beijing, where Chinese Premier Li Qiang sought opinions and suggestions from experts, entrepreneurs, and representatives across various sectors—including education, science, culture, health, and sports—on a draft government work report.
Who Is Liang Wenfeng?
Graduating from Zhejiang University with a degree in Artificial Intelligence, Liang co-founded the quantitative hedge fund High-Flyer in 2016. The fund quickly gained recognition for its innovative use of AI-driven trading strategies. By 2021, High-Flyer had fully integrated AI into its operations, utilizing machine learning models to predict market trends and make data-driven investment decisions.
In May 2023, Liang took a bold step by founding DeepSeek, aiming to advance the field of artificial general intelligence (AGI). Unlike traditional for-profit ventures, DeepSeek was envisioned as a platform for long-term, fundamental research. Liang's vision was to create a space where curiosity-driven exploration could drive meaningful advancements in AI.
DeepSeek's 'Long-Termism'
For Liang, DeepSeek is more than a business venture; it's a manifestation of his deep curiosity and commitment to foundational research. He acknowledges that basic research often yields low immediate returns on investment, yet he is captivated by the challenge of exploring complex fields like finance and the potential of AGI. Liang's focus is on understanding the essence of human intelligence and the processes that underlie it, believing that such exploration is crucial despite the lack of immediate commercial incentives.
\"The pursuit of AGI is not just about technological advancement; it's about unraveling the mysteries of human cognition,\" Liang said in a rare interview with Anyong, a sub-brand of China's tech media 36Kr. \"DeepSeek represents a commitment to long-termism, where the journey of discovery is just as important as the destination.\"
Liang's low-profile approach has intrigued many in the tech industry. By granting interviews only to select media outlets, he maintains a sense of mystery around both himself and DeepSeek. This strategy aligns with his philosophy of letting the work speak for itself, focusing on innovation rather than publicity.
The Future of AI with DeepSeek
DeepSeek's rapid ascent signals a shift in the global AI landscape. By offering a model that rivals established platforms at a fraction of the cost, Liang and his team are democratizing access to advanced AI technologies. This move could potentially disrupt the market and redefine how AI development is approached worldwide.
The debate sparked by DeepSeek's R1 model also highlights the growing competitiveness between China and the U.S. in the field of AI. As DeepSeek continues to innovate, it may serve as a catalyst for further advancements and collaborations in AI research.
For now, Liang remains focused on his vision of long-term, curiosity-driven research. His journey with DeepSeek is just beginning, and the tech world is watching closely to see how this \"mysterious force from the East\" will continue to shape the future of artificial intelligence.
Reference(s):
Behind China's rising AI startup DeepSeek: Who is Liang Wenfeng?
cgtn.com