Breaking Barriers in Domestic AI Model Development

Advertisements

In January of this year, DeepSeek, a pioneering company in artificial intelligence, made waves worldwide with the launch of its general-purpose large model, DeepSeek-R1. This model has garnered significant attention thanks to its standout features of low cost and high performance, marking a momentous milestone in China's AI development and offering a treasure trove of insights for the industry at large.

The magic behind DeepSeek’s success lies in its innovative technologies: Parallel Thread Execution (PTX), Mixture of Experts (MoE), Multi-head Latent Attention (MLA), and Multi-Token Prediction (MTP). These advances enable DeepSeek to significantly enhance model performance even in the context of limited computing power compared to its international counterparts. Remarkably, they have managed to trim training costs down to a mere 10% of the industry standard. This achievement not only lowers the barriers to deploying large models but also illustrates the feasibility of compensating for power shortages through algorithmic optimization. It presents a refreshing alternative to the Western model of AI development, which is often characterized by the mantra of “throwing resources at problems” in hopes of miraculous breakthroughs.

Moreover, DeepSeek has embraced an entirely open-source strategy, making its algorithms, model weights, and training details publicly accessible. This principle of openness empowers global developers to draw upon, refine, and deploy the model, thus cultivating an ecosystem conducive to innovation. This approach is poised to disrupt the typical winner-takes-all competition landscape, allowing for a more collaborative and dynamic development environment.

Despite these significant advancements, it is crucial to recognize that China still faces challenges in its path towards original innovation in the AI sphere. As of 2023, only one Chinese institution made it to the list of the top ten most cited research institutes in generative AI. When examining areas like AI patents, deep learning models, and machine learning hardware, there remains a noticeable gap between China and the United States, indicating that the journey is far from complete.

Currently, the foundational frameworks necessary for data management in China are nascent. Mechanisms for data acquisition and exchange are often inadequate, making it difficult for industries to access both industry-specific and public data. Consequently, the available data for training large models is limited. Quality data labeling, which is crucial for supplying high-quality datasets, is hampered by a shortage of specialized personnel. This deficiency is especially pronounced in sectors like healthcare and autonomous driving, where precise and expert-level data annotation is urgently needed and challenging to meet.

On a global scale, the influence of domestically developed large models like DeepSeek is still in its infancy within the global technology ecosystem. Domestically, the journey from fundamental AI research through to technical innovation and practical application has not been fully realized. There are several bottlenecks in the flow of essential elements such as technology, funding, data, and talent, preventing the creation of an efficient ecological loop that could enable further iterations of large models.

To address these challenges, it is crucial to reinforce AI foundational research and technological innovation. There should be a concerted effort to develop national strategic scientific capabilities in the AI sector, pushing for interdisciplinary collaboration that fuses AI with foundational disciplines such as mathematics, physics, and brain science. This would elevate the level of foundational research in AI significantly. Additionally, promoting open-source initiatives in AI technology is vital. By centering efforts around open-source projects, there can be a collective push for technological innovation that involves contributors, providers, users, and operators alike.

Furthermore, the construction of extensive datasets must be strategically coordinated. This involves accelerating the establishment of data management frameworks, leveraging government data openings to integrate enterprise and industry data, and spurring the construction of public datasets and specialized application datasets. For diverse application scenarios, detailed standards for data labeling should be devised, complemented by specialized training in sectors such as healthcare and autonomous driving to enhance data labeling quality.

There is also a pressing need to nurture and scale AI startups. Uncovering valuation models and platform designs specifically tailored to China's unique context can significantly strengthen the early-stage valuation process for AI startups. Such measures would provide government and financial institutions with precise methods to identify high-potential, high-value AI ventures, injecting vitality into the development and expansion of domestic AI technologies.

Lastly, fostering an independent AI industrial ecosystem is paramount. By tapping into the vast reservoirs of data and rich application scenarios present in China, it is essential to mobilize superior forces such as research institutions and tech-leading enterprises. A focused effort on domains like intelligent manufacturing and autonomous driving should be prioritized, leading to the establishment of innovation centers for large model applications. By leveraging domestic technologies, it will be possible to build a comprehensive industry platform integrating data, algorithms, and computing power. This platform could facilitate the development of standardized and modular models, middleware, and application software, advancing deep collaboration along the industrial chain and continuously enriching and iterating on the independent industrial ecosystem.

Write A Review

Etiam tristique venenatis metus,eget maximus elit mattis et. Suspendisse felis odio,

Please Enter Your 5 star Reviews*

Breaking Barriers in Domestic AI Model Development

Write A Review

Keyword

Recent Post

Category