Exclusive by 36Kr: Hong Kong University professor LI Hongyang launches a startup developing a general full-body embodied brain, securing hundreds of millions in seed funding from ZhenFund, Gaorong Capital, IDG, 5Yuan Capital and other investors
Author | Huang Nan
Editor | Yuan Silai
Yingke has exclusively learned that Archon Robotics, a company focusing on general whole - body embodied brains, recently completed a seed - round financing of hundreds of millions of yuan. The investors in this round include leading US - dollar funds such as ZhenFund, Banyan Capital, IDG Capital, and Matrix Partners, as well as the joint fund of Gobi Partners and the University of Hong Kong, MiraclePlus, and Shanghai Institute of Innovation and Intelligence.
Light Source Capital served as the exclusive financial advisor.
The funds from this round will be mainly used for the research and development of the whole - body humanoid basic model, the collection of multi - modal whole - body motion data, the expansion of the talent team, and the establishment of R & D centers in multiple locations and the industrial cooperation ecosystem, to accelerate the implementation of the open - source humanoid base model within this year.
Archon Robotics was founded in April 2026, and its R & D headquarters is located in Caohejing Development Zone, Xuhui District, Shanghai. The company focuses on the research and development of the general whole - body humanoid base model, constructs whole - body intelligence, and provides humanoid robots with the ability of human - like whole - body mobile operation to accelerate the realization of embodied intelligence entering thousands of households.
Dr. Li Hongyang, the founder, is currently an assistant professor at the University of Hong Kong, the assistant dean of the School of Computing and Data Science, and a tutor at Shanghai Institute of Innovation and Intelligence. The end - to - end autonomous driving project UniAD led by him won the Best Paper Award at CVPR 2023, which is the only work by a mainland academic institution to receive this award in the past decade. In 2026, he won the RSS Early Career Award at the top international conference in the robotics field, becoming the first Chinese scholar to receive this award since its establishment 20 years ago.
Dr. Li Tianyu, the co - founder and CEO, is one of the first graduates of Shanghai Institute of Innovation and Intelligence and a doctor from Fudan University. As a core developer, he was deeply involved in the research and development of Huawei's mass - produced autonomous driving ADS 4.0 World Engine solution. Dr. Chen Li, the co - founder and Head of AI, is the first author of the Best Paper of UniAD. He graduated from the Zhiyuan Honors Program of Shanghai Jiao Tong University and once won the President's PhD Scholarship of the University of Hong Kong.
The core team of Archon Robotics all come from top research teams in autonomous driving, robotics, and large - model fields at universities such as the University of Hong Kong, Tsinghua University, Shanghai Jiao Tong University, Fudan University, and Zhejiang University, with both breakthroughs in original algorithms and experience in the implementation of ultra - large - scale industrial systems.
The Archon team raises their glasses at the Everest Base Camp. Taken in February 2024 at Mount Everest (Photo source/Enterprise)
The embodied intelligence industry is entering a critical differentiation moment. Public data from Omdia and other sources show that in the first half of 2026, there were a total of 288 financing events in the domestic embodied intelligence and robotics fields, with the disclosed total financing amount exceeding 46 billion yuan, approaching the scale of 55.4 billion yuan in the whole year of 2025. However, the huge amount of financing has not brought about a convergence of technical consensus.
Most current embodied solutions have inherent limitations: the existing training data mainly consist of desktop first - person perspective videos, single - arm or gripper actions, lacking the original human interaction logic such as whole - body center - of - gravity adjustment, torso leverage, and multi - limb coordination. This means that most robots can only perform fixed - point grasping. When faced with daily tasks that require whole - body coordination, such as opening a door, making a bed, or opening and closing a door while holding an object with both hands, they have difficulty adapting to variables independently.
The root cause of this limitation lies in the structural gap in data infrastructure. Li Tianyu, the CEO of Archon Robotics, told Yingke, "The available embodied data sets in the market seem large, but in fact, the information that is truly effective for whole - body humanoid training is extremely limited."
The first - person perspective video data set only supports recording the pictures seen by human eyes, lacking key action pose information other than the hand appearance, such as squatting, bending over, and leaning sideways. The labeled data of robotic arms and grippers are mostly concentrated in the plane category, only recording the end - point trajectory. The model can learn how the manipulator moves, but it is difficult to understand how to interact with the environment. The total amount of real - humanoid robot data is extremely low, and the collection cost is hundreds to nearly a thousand yuan per hour. The samples of compound tasks related to whole - body multi - joint coupling are almost blank in the past data pool.
The deficiencies in these three types of data point to the same problem: the most core information for humans to complete daily actions, such as how the whole body coordinates, how the center of gravity shifts, and how force is transmitted from the lower limbs to the upper limbs, has hardly been recorded in the existing data.
Take a simple daily scenario as an example. When a person pulls a light door and a heavy door, there is almost no difference in the hand trajectory; they first grasp the handle and pull it back, and regardless of the amount of force used, the pose is always synchronized with the movement of the door. The real difference lies in the whole - body dimension. When facing a light door, a person can pull it open while standing upright; when facing a heavy door, the body needs to tilt forward to shift the center of gravity and use body weight to resist the resistance.
This information about the movement of the center of gravity is only recorded in the whole - body data and contains the essential differences in the physical properties of objects. Simply put, if the model only learns single - dimension information for a long time, it may be able to perform the action of "pulling the door," but it cannot understand what "how heavy the door is" means at the human - body level.
"The long - term absence of information has locked the current capabilities of robots at the level of fixed - desktop grasping. There is a data gap between this and the diverse tasks in the real home environment," Li Tianyu said. "To break through this ceiling, we must go back to the source and redefine the logic of data collection."
The Archon team believes that migrating from the solution of a wheeled chassis with two arms to a humanoid robot has essential differences in structure, motion control, and perception dimensions, and it is not a simple morphological upgrade. When overseas companies have just realized the complexity of humanoid tasks, they have already set their goals in this area.
Archon Robotics aims at an almost untouched field, developing a general whole - body humanoid base model. Its core concept, Human Body Learning, is to learn the whole - body pose and coordination mode of humans, rather than simply tracking the trajectory of the end - effector. By learning from human whole - body actions, robots can acquire the "wisdom of limb coordination" and have complete whole - body interaction capabilities.
By depositing the "intelligence" of the humanoid robot's actions as much as possible in the mid - brain level that is independent of the body, during this period, the capabilities learned by the mid - brain are not bound to a specific robot; it outputs the whole - body motion trajectory, rather than joint - angle instructions for a specific model, enabling the model to have the potential to migrate across different bodies. As the data collection becomes more sufficient and the covered scenarios become more diverse, the representational ability of the mid - brain becomes stronger, and the types of bodies to which Archon Robotics' embodied whole - body brain can migrate become more extensive.
Based on this judgment, Archon Robotics will build a new data collection system. Li Hongyang, the founder, believes that the evolution path of embodied data is evolving from real - machine teleoperation to hand - held devices and first - person perspectives, and the ultimate goal is human - centric full - humanoid data with complete human perception elements and whole - body action labels.
At the same time, Archon Robotics will also introduce multi - dimensional perception modalities such as touch, and match them with higher - precision whole - body and hand capture devices. Li Tianyu told Yingke that data diversity and data quality are more crucial than simply the data scale. "One piece of whole - body data covering the movement of the center of gravity and the change of the torso angle has a much higher information density than a hundred pieces of desktop data with only hand trajectories."
The data collection method determines what the model can learn, and the ability shortcomings of the model in turn define the goal of the next collection. Once this "collection - training - feedback" closed - loop starts to operate, it will form a continuously self - strengthening data barrier: every time a round of collection and training is completed, the model's ability is improved, and the system's understanding of "what data is truly useful" becomes more accurate, and the efficiency and quality of the next round of collection will be further improved.
This tests not only the engineering ability at the algorithm level but also the systematic understanding of the fundamental question of "what the model really needs to learn from the physical world." And this understanding is precisely the core judgment ability of Archon Robotics.
Yingke has learned that Archon Robotics plans to release its first humanoid native base model in the second half of 2026.
In the view of the Archon Robotics team, for humanoid robots to truly enter families from the laboratory, what is needed is not just a perfect single - point demonstration, but the ability for robots to work continuously and reliably in a complex, dynamic, and unstructured home environment. The upper limit of this ability fundamentally depends on the depth of the model's understanding of the physical world.
Archon Robotics chooses to go back to the starting point of embodied intelligence and answer this question again: what kind of body and what kind of data to learn from determine how far the robot can ultimately go.
The following is an excerpt from an interview between Yingke and Dr. Li Hongyang, the founder of Archon Robotics, and Dr. Li Tianyu, the CEO (slightly edited):
Yingke: The current embodied industry has not yet entered the convergence stage. What is the underlying judgment for Archon Robotics to develop a general whole - body humanoid base model?
Li Tianyu: There are two essential differences between our approach and the mainstream solutions in the market.
The first is the hardware form. Since its establishment, Archon Robotics has targeted a complete humanoid robot. The core problem to be solved is the linkage of multiple joints of the whole body and the dynamic adjustment of the center of gravity. Most teams choose a wheeled chassis with two arms, which is understandable from an engineering perspective, as it has good stability, relatively low technical thresholds, and is easy to produce demos in the short term. However, in the home scenario, a wheeled chassis cannot even cross the threshold, let alone perform daily operations such as squatting, climbing, and squeezing through gaps. The humanoid form is not an option but an inevitable end - state.
The second is the underlying model architecture. We are developing a dedicated humanoid native embodied large model, rather than fine - tuning an existing VLA or world model. The difference between the two is that the former learns the underlying logic of human interaction with the physical world from whole - body data, while the latter teaches the model how to move an object on a fixed desktop. The model architecture of Archon Robotics has been designed for whole - body movement from the very beginning. The brain is responsible for task understanding and long - term planning, the mid - brain learns the whole - body movement representation across different bodies, and the cerebellum is responsible for real - time pose tracking and balance maintenance. The three layers work together to output the whole - body motion trajectory, rather than simply end - point instructions.
Based on this approach, we have built an independent data collection link. On the one hand, we collect a large amount of whole - body motion data of ordinary people in real - world scenarios. On the other hand, we combine it with the actual operation data of humanoid robots for joint training, enabling the model to replicate the innate whole - body coordination and interaction ability of humans.
Archon Robotics aims to build a humanoid brain that can truly face an unpre - set home environment. Robots need to independently sense surrounding changes and randomly handle various differentiated tasks: the weight of objects, their placement positions, and the room layout are constantly changing, without a fixed script. Its core requirement is strong generalization and migration ability, so that it can learn a set of logic and handle thousands of different scenarios.
Yingke: You once compared the current development level of the embodied intelligence industry to the L1.5 stage of autonomous driving in a public interview. What are the bases for this comparison? Which specific dimensions does it correspond to?
Li Hongyang: We can borrow the grading logic from L1 to L5 in autonomous driving to understand the evolution of embodied intelligence. L1 is basic teleoperation with limited generalization; only when it truly reaches L2 can the system complete a complete operation closed - loop independently in a specific scenario without human intervention.
In the current embodied industry, most public demonstrations are essentially remote teleoperation or highly pre - set script execution. Most actions, object positions, and environmental layouts are fixed in advance, and the system will fail if any variable is changed. No product can handle continuous home tasks with multiple steps and changes independently without human intervention.
Therefore, the judgment of the L1.5 stage means that the industry can produce demos for single - point tasks, but it is still far from "complete autonomy in a specific scenario." The key to crossing this threshold lies in data. However, the existing data sets do not contain the signals required for humanoid robots to make independent decisions and coordinate the whole body in a real home environment. Simply increasing the model scale with the current data will not reach the L2 level.
To cross this gap, we must redefine the logic of data collection, shifting from "collecting the movement of robots from a single perspective" to "collecting how the whole body interacts with the environment," and from "fixed tasks in pre - set scenarios" to "diverse exploration in an open world." This is also what Archon Robotics has been doing from the very beginning.
Comments from investors:
Qin Tianyi, Managing Director of ZhenFund said that he has known Li Hongyang for many years and witnessed his team tackling the most difficult and important issues, from UniAD to BEVFormer, and then to the vision of Archon Robotics, "the general whole - body humanoid base model." They never chase benchmarks but firmly push the boundaries. Even more commendable is that the OpenDriveLab founded by Li Hongyang has become a talent hub in the fields of autonomous driving and embodied intelligence, nurturing a group of high - spirited top young scholars, among whom Li Tianyu and Chen Li, the co - founders of Archon Robotics, are outstanding representatives.
Xin Wang, Partner of Banyan Capital said that they really like the Archon Robotics team led by Professor Li Hongyang. They have pure technical ideals, profound insights, and the courage to lead in innovation, and will never be followers. In the field of intelligent driving, projects such as UniAD, BEVFormer, and WorldEngine, and in the field of embodied intelligence, projects such as Agibot, WholeBodyVLA, and Ego - humanoid, have continuously proven that the Archon team has the ability to continuously innovate and produce world - influential results. Today, Archon Robotics has chosen a difficult but correct path, challenging the problem of the whole - body dexterous operation brain of humanoid robots. They are very glad to participate in Archon's cause and look forward to Archon becoming a world - class embodied brain company.
Meng Xing, Partner of Matrix Partners said that he met Hongyang in 2023 when he was working on autonomous driving. At that time, UniAD had just won the Best Paper Award at CVPR, which was the most sensational event in the industry. What convinced him was not the trophy but his way of asking questions: when everyone was competing in their respective modules, he started from the ultimate goal of planning and deduced what the system should look like, integrating the backbones of all modules and making everything serve the final planning. Three years later, it's still the same Hongyang at Archon. While others are asking "what can be trained with the existing data," he is asking "what should robots learn from humans." Human Body Learning in the field of embodied intelligence is just like the planning - oriented approach in autonomous driving back then - both define the starting point with the end - state. Today, the embodied intelligence field does not lack money, engineers, or computing power, but only lacks this kind of taste: the ability to distinguish what is worth doing in the noise. He believes that the most precious thing is a person's judgment ability to ask the right questions across two technological waves.
Li Guanle, Managing Director of Gobi Partners and General Manager of University Innovation and Technology said that Archon Robotics is not a traditional robotics company but a silicon - based embodied intelligence company driven by algorithms and data engines. The team has the rare ability to define the industry's technical paradigm. Relying on self - developed original algorithms and mature engineering implementation capabilities, in the current situation where the industry standards have not been unified, it is expected to