DeepSeek "is getting bigger": raised 50 billion yuan in financing, purchasing domestic computing power, recruiting talents, and sprinting towards AGI
On the evening of June 29th, DeepSeek announced that the official version of V4 will be officially launched in mid-July.
The announcement states that to more reasonably allocate resources and improve service stability, the API pricing strategy will be adjusted synchronously after the official version is released, and a peak-valley pricing mechanism will be introduced.
In the past two weeks, there have been two other major events related to DeepSeek:
First, on June 16th, DeepSeek completed its first round of external financing since its establishment, raising a total of 51 billion yuan and achieving a valuation of nearly 400 billion yuan, breaking the principle set by founder Liang Wenfeng of "no financing, no listing, and no commercialization."
Second, just over ten days after the financing, on June 27th, the DeepSeek team jointly published a paper titled "DSpark: Confidence-Scheduled Speculative Decoding with Semi-Autoregressive Generation" with Peking University. Founder Liang Wenfeng is listed among the authors of the paper.
If the task of DeepSeek in the first half was to prove that it could develop a world-class large model, in the second half, when "capital" and "technology" converge, DeepSeek will use the financing funds to prove to the world that it is a truly commercial company.
01 After 50 billion yuan in financing, DeepSeek starts to recruit more people
Since the financing, Cui Tianyi, the new leader of the DeepSeek Harness team, has been extremely busy.
On the evening of June 25th, DeepSeek posted large-scale recruitment information on social platforms, covering 7 major categories including algorithms, R & D, operation and maintenance, products, data engineers, and functional departments, with a total of 33 positions. The work locations include Beijing and Hangzhou, and all positions accept interns.
On various social platforms such as DeepSeek's official website and official account, Boss Zhipin, X, and Xiaohongshu, you can see this alumnus of Liang Wenfeng from Zhejiang University, who joined the team in March this year, actively recruiting and taking the time to "refute rumors."
This is a picture of Cui Tianyi responding on Xiaohongshu to the comment that "DeepSeek only recruits people with Tsinghua-level education and experience with Doubao."
Previously, DeepSeek was more like a silent, low-key, and hardworking technician, backed by the quantitative private equity fund "Magic Square Quant" founded by Liang Wenfeng. Magic Square Quant had an annualized return rate of 56.55% in 2025 and managed assets of over 70 billion yuan, allowing DeepSeek not to rely on external capital.
In the eyes of the outside world, DeepSeek seems to "not be short of money."
The potential reason for Liang Wenfeng to change his attitude towards financing may be the loss of core talents and the intensifying competition for talents in the external market.
As a reference, Zhipu, the "first large model stock" that has been listed earlier, had a total market value of nearly HK$1 trillion as of June 30th, and the market value of MiniMax also exceeded HK$130 billion.
In contrast, the options in the hands of DeepSeek employees are still worthless. The company has no financing, no listing, and no external valuation reference.
"Without financing, its valuation won't increase. Even if employees have options, their value won't rise. Compared with Zhipu, MiniMax, and some other large model teams, whose valuations have either skyrocketed or soared after listing, it's certain that DeepSeek's employees won't stay," said a senior industry insider close to DeepSeek when talking about the reason for financing at this juncture.
The cost of retaining talents is also a significant expense.
In 2025, DeepSeek's greatest competitive advantage came from a team of over a hundred highly educated "geniuses." Dozens of top researchers, along with founder Liang Wenfeng, who has a strong sense of technological idealism, created DeepSeek - R1.
This year, on one hand, major Internet companies are continuously poaching top AI researchers with high salaries. On the other hand, the market demand for AI talents has surged. Public data shows that the median monthly salary for algorithm positions generally exceeds 24,000 yuan, and the monthly salary of top talents exceeds 50,000 yuan. The premium for AI talents continues to expand.
Qiancheng Wuyou's "Report on the Demand for AI Talents in the 2026 Campus Recruitment Market"
Among the positions in this large-scale recruitment, in addition to full-stack development/algorithms, AI core system R & D, operation and maintenance, and products, it is worth noting that functional departments such as HR, legal, finance, procurement, and administration are also expanding.
The signal released by the all-round "expansion" is that DeepSeek is still improving its organizational capabilities as a technology company.
Transitioning from "product-driven" to "organization-driven" is also an inevitable path for many technology companies. After the organization matures, the platform improves, and the talent incentive mechanism is established, large-scale product dividends will begin to be released.
Completing 50 billion yuan in financing is, on one hand, necessary for retaining talents, and on the other hand, provides the basic conditions for completing the organizational structure. However, in this process, can DeepSeek "turn around lightly" and still maintain the efficiency of a "small team operation" and the flexibility of the decision-making chain after the expansion? This is the first question that DeepSeek needs to answer in the "second half."
02 DeepSeek moves towards heavy assets
Among the 33 positions in the recruitment information posted on June 25th, some positions are worthy of attention, namely the IDC (Internet Data Center) data center team, which is involved in infrastructure construction.
As early as mid-April, DeepSeek posted its first batch of data center positions in Ulanqab, Inner Mongolia, including senior data center operation and maintenance engineers and senior data center delivery managers. In June, DeepSeek added the position of "IDC design and planning engineer."
From data center operation and maintenance, delivery to design and planning, since the beginning of this year, DeepSeek's talent layout has extended from models to computing power infrastructure construction.
This is the recruitment information for the DeepSeek IDC data center team positions.
As large models enter the stage of large-scale training and inference, the competition among AI model companies will inevitably enter a hardware competition in "infrastructure," which forces DeepSeek to join the heavy-asset and capital-intensive game of building its own computing power clusters, just like the leading large model companies at the forefront of Silicon Valley.
According to public data, US technology giants Alphabet, Amazon, Meta, and Microsoft are expected to invest a total of about $650 billion this year to expand their artificial intelligence-related infrastructure. Anthropic and OpenAI have also repeatedly emphasized in their publicly disclosed financial documents that they will continue to increase their investment in computing power infrastructure.
For example, Anthropic is expected to pay about $1.25 billion per month to SpaceX just for data center capacity leasing, which amounts to $15 billion a year, not including GPU procurement, network, and operation and maintenance costs.
Pan Helin, a member of the Expert Committee on Information and Communication Economics of the Ministry of Industry and Information Technology, pointed out in a previous interview that in the current wave of AI investment, it is an inevitable trend for large model companies to raise funds. "Not only DeepSeek, but Google has also raised $80 billion. The industry has entered a stage of heavy capital investment."
In order not to fall behind in the construction of computing power heavy assets, DeepSeek must "open up new sources of funds," obtain financing, and then invest in the construction of infrastructure such as computing power and data centers.
It is worth noting that DeepSeek is building its computing power infrastructure under the restriction of overseas advanced computing power exports, which means that the above computing power will be driven by domestic chips. When DeepSeek V4 was released, DeepSeek mentioned the exploration of domestic computing power on its official page and in its technical report.
At the end of May, Huawei proposed the "Tao (τ) Law," attempting to break through the bottleneck caused by the slowdown of Moore's Law through full-stack collaborative optimization of devices, chips, and systems. Domestic large models are also accelerating their adaptation to domestic computing power, and the goal of domestic computing power exploration is to build "autonomous and controllable AI infrastructure."
03 On the eve of AGI
For a basic model company like DeepSeek, having only capital and organization is not enough to support long-term leadership. The key is to continuously produce original technologies.
A senior computing power industry insider also emphasized that DeepSeek's large-scale financing in this round is to incentivize the team and retain core talents. "Only with sufficient financing can a good model be trained, and a leading position can be established."
Academic papers are an important microcosm of DeepSeek's exploration of new technologies.
According to incomplete statistics, in the past two years, DeepSeek has publicly published about 27 core technology papers, covering research directions such as MoE (Mixture of Experts), reinforcement learning, large code models, mathematical reasoning, and multimodality, which almost correspond to each generation of its core models and key technological breakthroughs.
On June 27th, DeepSeek quietly updated a paper related to the new technology "DSpark" on GitHub.
In the paper, DeepSeek proposed a new inference acceleration framework "DSpark," which significantly improves the inference speed and system throughput of large models without changing the model's capabilities.
Different from previous papers, this one does not iterate on a new model. Instead, it adds a speculative decoding module based on the original DeepSeek - V4 - Pro and DeepSeek - V4 - Flash, with the focus on optimization at the engineering implementation level.
DeepSeek intends to deploy DSpark in the online service system of DeepSeek - V4 to reduce the waste of computing power caused by invalid verifications when handling real user traffic.
The paper mentions that "without changing the underlying model architecture, the generation speed has been increased by 60 - 85%." For AI companies that process a large number of API requests every day, every reduction in computing power cost will increase the profit margin.
According to a report from VentureBeat in February this year, the cost of model training will only increase. Dario Amodei, the CEO of Anthropic, predicts that the training cost of the next generation of cutting-edge models will reach $5 - 10 billion.
The above-mentioned senior computing power industry insider said that DeepSeek will also enter the same capital-intensive stage of model training.
To support the iterative training of DeepSeek's models, financing is inevitable.
Many media have interpreted that DeepSeek's publication of a new paper just over ten days after the financing seems to prove that the company's core rhythm remains unchanged, and the R & D team still maintains a high-frequency output.
Whether it is launching new models, building data centers, or transforming from a "genius laboratory" into a commercially operated AI company, the ultimate goal of DeepSeek's evolution is AGI.
In an interview with "Dark Waves" in 2024, Liang Wenfeng clearly stated: "What we are doing is AGI (Artificial General Intelligence). Large language models may be the inevitable path to AGI."
During the free period, users can tolerate problems such as response failures, conversation interruptions, and API rate limits. Once the official version of DeepSeek V4 is launched in July, users' requirements for stability will increase. If these problems persist, it will directly affect whether users are willing to integrate DeepSeek into their real work processes and business systems.
In the paid segment, developer ecosystem, and enterprise scenarios, DeepSeek's engineering capabilities will face more stringent tests. Whether the model can truly be applied on a large scale is also a threshold that must be crossed to enter the AGI era.
While users hope that DeepSeek can promote cost reduction and efficiency improvement of Chinese large models, the company has its own calculations: in the critical period of AGI, the primary goal is to retain talents, followed by continuous model training and technological iteration. It also cannot lose the "battle" for data center infrastructure.
Relying solely on Liang Wenfeng's quantitative private equity fund, Magic Square Quant, is far from enough.
In 2026, it was an inevitable choice for DeepSeek to start financing. As stated in the recruitment announcement posted by DeepSeek on the evening of June 25th:
"Humanity is currently on the eve of AGI."
Next, DeepSeek will face a more long-term and capital-intensive AGI competition. Can it continue to "lead the way" in the AGI process as it did in the "large model" competition? This is the second question that DeepSeek needs to answer in the "second half."
This article is from "Tencent Technology", author: Xie Ruirui, editor: Xu Qingyang, published by 36Kr with authorization.