ICCA 2026 Program | Thursday June 18, 2026


ThAT1	Assembly Hall
Robotics 3	Regular Session
Chair: Huang, Hailong	Hong Kong Polytechnic University
Co-Chair: Gao, Zhi	Wuhan University

08:30-08:45, Paper ThAT1.1
Development of Flapping Robots Using Piezoelectric Fiber Composites - Introduction of Click Mechanism -

Suzuki, Keisuke	The University of Electro-Communications
Sato, Ryuki	The University of Electro-Communications
Ming, Aiguo	The University of Electro-Communications
Keywords: Robotics, Motion Control, Modeling and Control of Complex Systems Abstract: Developing agile flapping robots like insects is still an important task to achieve successful potential applications of the flapping robots. Authors have been working on developing flapping robots using a new type of piezoelectric material, that is, piezoelectric fiber composites, while mimicking the structure and the control of insects. To enhance the performance of the flapping robots, this paper describes an introduction of click mechanism to a flapping robot using piezoelectric fiber composites inspired from insects. The simulation-based design is performed to achieve flapping motions with large flapping amplitude and high speed by utilizing the click mechanism. A prototype of flapping robot using the designed click mechanism has been fabricated and large flapping amplitude and high speed due to the click mechanism have been confirmed by experimental results.

08:45-09:00, Paper ThAT1.2
Hybrid Control Architecture for Mobile Robot Fleets Integrating Centralized Coordination and Decentralized Navigation

Abutalipov, Kaisar	Satbayev University
Tuleshov, Yerkebulan	Satbayev University
Issabekov, Zhanibek	Satbayev University
Rakhimzhanov, Rustem	Satbayev University
Rakhmetova, Perizat	Satbayev University
Keywords: Robotics, Motion Control, Modeling and Control of Complex Systems Abstract: Mobile systems with multiple robots are widely used in warehouse and production logistics, where work efficiency is determined not only by the quality of local navigation of an individual robot, but also by the consistency of movement of the entire fleet. With the increasing number of robots in a limited environment, characteristic problems arise: conflicts at intersections, mutual delays, queues in narrow corridors and mutual blockages. Fully decentralized approaches provide good local adaptation, but do not guarantee a global order of movement. Fully centralized approaches allow you to create conflict-free plans, but they are sensitive to computational complexity and the mismatch between a discrete model and continuous execution. The paper proposes a hybrid fleet management framework for mobile robots that combines a centralized level of coordination based on multi-agent pathfinding and decentralized execution using the Nav2 navigation stack in the ROS2/Gazebo environment. Conflict-based search is used as a centralized planning method that forms a globally consistent discrete plan. For practical implementation, the Fleet Manager intermediate coordination module has been developed, which converts a discrete plan into a sequence of goals and waypoint commands for Nav2, tracks the progress of robots and initiates replanning in case of deviations. Experimental validation was conducted at two levels: a discrete Python model and continuous simulation in Gazebo. Three modes are compared: centralized coordination in a discrete model as a reference, decentralized execution using Nav2 alone, and hybrid management of MAPF + Nav2. The results show that in simple scenarios, Nav2-only mode can provide acceptable behavior, but in conditions of corridors and conflicts, the likelihood of blockages increases and waiting times worsen. The hybrid approach enhances stability and predictability of fleet motion by integrating global coordination with local adaptability.

09:00-09:15, Paper ThAT1.3
Convexity-Exploiting Successive Convexification for Safe Drone Racing

Shen, Zhipeng	The Hong Kong Polytechnic University
Zhou, Shiyu	City University of Hong Kong
Huang, Hailong	Hong Kong Polytechnic University
Keywords: Robotics, Motion Control, Optimal Control Abstract: Autonomous drone racing requires time-optimal trajectories that are both dynamically feasible under the full nonlinear quadrotor model and safe with respect to reliable gate traversal. This paper presents a tractable trajectory-optimization framework based on successive convexification augmented with systematic convexity exploitation. We incorporate geometric gate decomposition by representing gate-passage requirements as convex constraints (intersections of half-spaces and second-order cones), thereby certifying complete traversal without integer variables or intricate nonconvex constraint mechanisms. Beyond accelerating the main optimization, we show that the exploited convex structure also enables strong initialization: a single convex program can provide a high-quality initial guess and, in many cases, a competitive solution. Extensive comparative evaluations demonstrate significant computational improvements from successive convexification and convexity exploitation, while real-world flight experiments validate the physical feasibility and repeatability of the resulting trajectories.

09:15-09:30, Paper ThAT1.4
S2Loc: A Temporal-Geometric Consistent Framework for Long-Term LiDAR Localization

Wu, Yezhen	Wuhan University of Technology
Wang, Qiang	Wuhan University of Technology
Keywords: Robotics, Real-time Systems Abstract: Reliable long-term localization is essential for autonomous navigation, yet significant drift often arises when transient environmental changes are incorrectly matched to the pre-built static map. We propose S2Loc, a hierarchical framework that replaces passive error tolerance with active constraint verification. By explicitly decoupling feature reliability from feature existence, the system ensures that only spatiotemporally consistent geometric features are utilized for localization. Specifically, the Stable Structure Perception (SSP) module functions as a strict validation layer to extract dominant structural planes while actively intercepting structural inconsistencies. This allows the Structure-Guided Matching (SGM) registration strategy to leverage exclusively these validated constraints to optimize the pose estimation. Extensive experiments on the NCLT dataset and in industrial environments subject to frequent layout changes demonstrate that S2Loc reduces translational errors by an average of 46.6%, maintaining robustness even under environmental variations (with a Global Change Rate reaching up to 41.8%).

09:30-09:45, Paper ThAT1.5
HSURE: Hierarchical Safety-Aware Exploration Guided by Unknown Regions Using Dynamic Sparse Graphs

Yang, Wenbin	Wuhan University
Xu, Jingzhong	Wuhan University
Gao, Zhi	Wuhan University
Zhou, Zhiyu	Wuhan University
Lin, Feng	National University of Singapore
Keywords: Robotics, Sensor/Data Fusion Abstract: Autonomous exploration with Unmanned Ground Vehicles (UGVs) demands a balance between exploration efficiency and navigation safety. However, existing methods typically focus on known regions or frontiers, neglecting the informative potential of unknown regions,resulting in redundant backtracking or suboptimal efficiency. To address this issue, we introduce HSURE, a hierarchical safety-aware autonomous exploration framework guided by unknown regions. We explicitly decompose unknown regions by topological connectivity and construct an observability-driven safety assessment using dynamic sparse graphs in known space. Leveraging these hybrid representations, we devise a coarse-to-fine safety-aware exploration planner. The global planner computes an optimal coverage path and the local planner refines this into a safe and executable trajectory. Extensive simulation and real-world experiments demonstrate the proposed method outperforms state-of-the-art baselines, significantly reducing exploration time and trajectory redundancy while maintaining navigational safety.

09:45-10:00, Paper ThAT1.6
Toggle-Assisted Electric Control Brake Knee Joint for Maintaining Step-Adaptive Swing Phase Angle

Rei, Ito	Mie University
Masaki, Senzaki	Mie University
Yano, Ken'ichi	Mie University
Manabu, Goto	MASEN Engineering Corporation
Katsuhiko, Tori	IMASEN Engineering Corporation
Keywords: Robotics, Sensor/Data Fusion Abstract: Compact passive single-axis prosthetic knees are widely prescribed because of their low mass and short build height, yet their swing motion is largely unregulated. At low cadences, this can cause (i) insufﬁcient minimum toe clearance (MTC) due to premature knee extension after peak ﬂexion, and (ii) knee-ﬂexed initial contact caused by terminal impact and prolonged late swing, both of which increase fall risk. This paper presents a compact uniaxial prosthetic knee that electronically actuates a conventional load-brake through a toggle mechanism, enabling swing-phase knee-angle holding at arbitrary timings with reduced actuator torque. The knee applies braking at two key instants: it holds the knee at peak ﬂexion until the predicted MTC instant and locks the knee at full extension in late swing until initial contact. A prototype (build length 205 mm, mass 918 g) was tested in level walking with one unilateral transfemoral amputee at cadences of 60–90 rpm, compared with two conventional passive knees. At 75 rpm, the proposed knee increased MTC from 0.5 mm to 23.7 mm and eliminated toe scufﬁng, while at 60 rpm, fall prevention was achieved by maintaining the fully extended position.

10:00-10:15, Paper ThAT1.7
HandCept: A Visual-Inertial Fusion Framework for Accurate Proprioception in Dexterous Hands

Huang, Junda	Chinese University of Hong Kong
Guo, Honghao	The Chinese University of Hong Kong
Wu, Hao	National University of Singapore
Li, Yitong	National University of Singapore
Liu, Zhengyang	Meta
Ang Jr, Marcelo H.	NUS
Zhou, Jianshu	National University of Singapore
Keywords: Robotics, Sensor/Data Fusion, Modeling and Control of Complex Systems Abstract: In the pursuit of general robotic manipulation, dexterous hands require reliable and scalable proprioception, which remains a key bottleneck due to limitations in sensing integration and generality. This paper presents HandCept, a visual-inertial proprioception framework for accurate joint angle estimation without relying on embedded joint sensors. HandCept integrates a wrist-mounted RGB-D camera and distributed 9-axis IMUs to estimate link poses through complementary sensing modalities. A zero-shot visual pipeline trained on synthetic data provides global 6D pose observations, while inertial measurements deliver high-frequency orientation updates. These asynchronous signals are fused via a latency-compensated Extended Kalman Filter, enabling real-time, drift-free estimation under dynamic conditions. Experimental results show that HandCept achieves joint angle errors within 2^{circ} to 4^{circ} without observable drift, outperforming visual-only and inertial-only baselines. The proposed framework further demonstrates strong IMU stability and cross-device uniformity, allowing a shared reference frame and simplified calibration. HandCept provides a generalizable and hardware-efficient solution for dexterous hand proprioception, supporting robust closed-loop manipulation in real-world environments.


ThAT2	Room 256
Advanced Control, Planning and Learning of Unmanned Systems	Regular Session
Chair: Liang, Hongtao	Shaanxi Normal University
Co-Chair: Bai, Jianjun	Hangzhou Dianzi Univ
Organizer: Li, Huiping	Northwestern Polytechnical University
Organizer: Zong, Guangdeng	Qufu Normal University
Organizer: Liang, Hongtao	Shaanxi Normal University
Organizer: Liu, Xiaotao	Xidian University

08:30-08:45, Paper ThAT2.1
Adaptive Trajectory Tracking Control for Underactuated AUVs with Prescribed Performance and RBFNN-Based Current Compensation (I)

Zhu, Shuyi	Hangzhou Dianzi Unversity
Liu, Zhiyao	Hangzhou Dianzi University
Bai, Jianjun	Hangzhou Dianzi Univ
Chen, Yun	Hangzhou Dianzi University
Keywords: Adaptive Control, Nonlinear Systems and Control, Networked Control Abstract: This paper proposes an adaptive trajectory tracking scheme for underactuated AUVs subject to time-varying ocean currents and uncertainties. A fast-convergent RBFNN observer with fractional-order terms is designed to accurately estimate disturbances at the kinematic level. By integrating Prescribed Performance Control (PPC) into the backstepping framework, tracking errors are strictly confined within a predefined performance funnel. Simulation results verify that the proposed method ensures superior transient stability and robust disturbance rejection compared to standard backstepping controllers.

08:45-09:00, Paper ThAT2.2
Globally Asymptotic Formation Control of Networked USVs with Output and Input Constraints (I)

Liang, Hongtao	Shaanxi Normal University
Yu, Junzhi	College of Engineering, Peking University
Li, Huiping	Northwestern Polytechnical University
Keywords: Adaptive Control, Fuzzy and Neural Systems, Multi-agent Systems Abstract: This article addresses the asymptotic formation control issue for networked unmanned surface vehicles (USVs) with output and input constraints. Specifically, a leader-follower scheme is developed to make each USV follow its reference trajectory without collisions among vehicles. Moreover, a global prescribed performance control is proposed to ensure tracking errors converge to optimal constrained boundaries regardless of initial conditions. Additionally, an adaptive formation controller is designed with the auxiliary variable to guarantee asymptotic convergence of the tracking system towards the origin with a small residual set, where the first-order filtering is introduced to avoid repeated differentiation and the fuzzy logic system is employed to attenuate effects of uncertainties and disturbances. Based on Lyapunov stability theorem, the rigorous closed-loop stability is achieved in terms of convergence and boundedness. Finally, numerical simulations show the effectiveness of the proposed method.

09:00-09:15, Paper ThAT2.3
Risk-Averse Tracking Control for Autonomous Heavy-Duty Trucks in High-Speed Obstacle Avoidance Scenarios

Wang, Yuanxin	Beijing Institute of Technology
Meng, Guoli	Beijing Institute of Technology
Li, Erhang	Beijing Institute of Technology
Wei, Hongqian	Beijing Institute of Technology
Yu, Huilong	Beijing Institute of Technology
Xi, Junqiang	School of Mechanical Engineering, Beijing Institute of Technology
Keywords: Learning-based Control, Nonlinear Systems and Control, Optimal Control Abstract: 重型卡车的高速机动因非线性动力学而带来重大控制挑战，而高重心使其容易发生灾难性翻车。为此，我们提出了一个风险厌恶型非线性模型预测控制（NMPC）框架。首先，构建结构化残差物理知情神经网络（rPINN），以补偿模型与实际非线性车辆动力学之间的不匹配，从而提升多步预测的准确性，同时确保物理一致性。其次，通过基于网格的可达性分析离线构建安全操作包络（SOE）。在不同速度下识别出稳定边界，并用凸多胞体近似实现实时优化。此外，稳定包络映射到可微风险势场，并集成到NMPC中作为软约束。该机制主动降低风险，防止车辆接近稳定边界。模拟表

09:15-09:30, Paper ThAT2.4
Crossing the Sim-To-Real Barrier in RL for Quadrotor Control

Zhao, Zeyuan	Shanghai Jiao Tong University
Zhou, Junyu	Shanghai Jiao Tong University
Li, Xianwei	Shanghai Jiao Tong University
Keywords: Learning-based Control, Nonlinear Systems and Control, Robotics Abstract: Reinforcement learning (RL) has shown promise for quadrotor control, but sim-to-real transfer remains highly challenging: policies that work well in simulation often fail or even crash in reality. The core difficulty lies in unmodeled dynamics and disturbances, including actuator-level thrust dynamics affected by nonlinear aerodynamic drag, body-relative airflow, and multimodal environmental noise, that are usually omitted or over-simplified in simulation. To address these issues, we propose an RL-based controller with three key design choices: (i) careful selection of temporal parameters such as motor inertia and control delay to stabilize reward-action mapping, (ii) an input space design that uses body-rates history and rotation-matrix attitude representation to balance efficiency, robustness, and accuracy, and (iii) diffusion-based noise modeling that captures complex real-world disturbances beyond simple parametric approximations. We validate our approach on Crazyflie quadrotors across diverse and challenging trajectories unseen during training, where our policy significantly outperforms classical controllers in tracking accuracy. These results show that principled design in modeling and training can enable reliable zero-shot sim-to-real transfer of RL policies for quadrotor flight.

09:30-09:45, Paper ThAT2.5
Predefined-Time Disturbance-Rejection Control for UAVs without Flow Angle Measurements

Li, Jinbai	Beihang University
Wang, Honglun	Beihang University
Wang, Yanxiang	Beihang University
Liu, Yiheng	Beihang University
Keywords: Nonlinear Systems and Control, Learning-based Control Abstract: Aiming at the challenge of achieving precise control performance when the flow angle of unmanned aerial vehicles (UAVs) cannot be measured directly, a predefined-time disturbance-rejection control (PTDRC) method without flow angle measurements is proposed. Based on the six-degree-of-freedom (6-DOF) nonlinear model of the UAV, an affine nonlinear form containing position, path angle, flow angle, angular rate, and velocity loop is derived; an flow angle estimation network (FAEN) is developed using a deep learning–based approach, in which the network structure contains three stacked gated recurrent unit (GRU) layers followed by a fully-connected layer; and a PTDRC method for the UAV without flow angle measurements is designed based on the estimated flow angle. Simulation results demonstrate that the FAEN exhibits stronger robustness against coefficient perturbations compared with the extended Kalman filter (EKF). Under different aerodynamic coefficient perturbations and turbulence intensities, the proposed PTDRC method achieves higher control accuracy and stronger disturbance rejection performance than the linear active disturbance rejection control (LADRC).

09:45-10:00, Paper ThAT2.6
Model-Based Accelerated Safe Reinforcement Learning for Constrained Trajectory Planning of Autonomous Vehicles

Guo, Jiawei	School of Mathematics, Southeast University
Fu, Junjie	Southeast University
Keywords: Nonlinear Systems and Control, Learning-based Control, Motion Control Abstract: Deep Reinforcement Learning (DRL) shows great potential in achieving high-performance autonomous navigation but suffers from high sample complexity and exploration risks. This paper proposes a safety-aware Model-Based Reinforcement Learning (MBRL) framework to address these issues. First, we integrate a High-Order Control Barrier Functions (HOCBFs) module to ensure safety. This module strictly constrains policy actions within safe regions, significantly minimizing training violations. Second, we apply a modular Stochastic Value Gradient (SVG) scheme that aligns policy updates with safety-filtered execution. Finally, to enhance sample efficiency, we employ a Dyna-style architecture augmented by a Sparse Gaussian Process (SGP) dynamics model. We initialize the model via offline pre-training and fine-tune it online. This design reduces model mismatch and keeps the computational cost manageable. Experimental results show improved learning efficiency and safety performance compared to the baselines.


ThAT3	Room 267
Advancements in Intelligent Perception and Autonomous Decision-Making	Regular Session
Chair: Zhang, Chunmei	Taiyuan University of Science and Technology
Co-Chair: Zhao, Jiayi	Taiyuan University of Technology
Organizer: Cheng, Lan	Taiyuan University of Technology
Organizer: Zhang, Chunmei	Taiyuan University of Science and Technology
Organizer: Zhang, Jia	Beijing Institute of Technology

08:30-08:45, Paper ThAT3.1
Confidence-Aware Point Cloud Optimization for Sparse-View 3D Gaussian Splatting (I)

Zhao, Jiayi	Taiyuan University of Technology
Cheng, Lan	Taiyuan University of Technology
Zhang, Jia	Beijing Institute of Technology
Xu, XinYing	Taiyuan University of Technology
Keywords: Learning Systems Abstract: To address the artifacts arising from low-quality geometric initialization in 3D Gaussian Splatting under sparse views, this paper proposes a general, plug-and-play self-supervised point cloud optimization framework. Seamlessly integrated into the InstantSplat pipeline, our method leverages monocular depth estimation as a geometric constraint. Specifically, it constructs a gradient-based spatial confidence mask to suppress edge noise in the depth prior, dynamically calibrates depth scales via confidence-weighted statistical moment matching, and optimizes the point cloud using a combination of Charbonnier loss and an adaptive anchor regularization mechanism. Experimental results demonstrate that our approach effectively eliminates geometric artifacts, significantly enhancing the visual quality and robustness of novel view synthesis compared to the baseline.

08:45-09:00, Paper ThAT3.2
Q-Learning Model Predictive Control with Adaptive Learning Period for Systems with Unknown Parameters (I)

Peng, He	North University of China
Xiaoli, Luan	Jiangnan University
Wen, Jiwei	Jiangnan University
Zhao, Zhiliang	North University of China
Keywords: Learning-based Control, Robust and H infinity Control, Linear Systems Abstract: In the present study, a Q-learning model predictive control with adaptive adjustment of learning period is proposed for unknown linear discrete-time systems to optimize the mixed H_2/H_infty performance index. To improve the learning efficiency of the control gain, the influence function is used to construct the quantitative relationship between the measured data and learning performance. Meanwhile, to achieve better control performance in the case of system disturbance mutation, the receding horizon optimization of model predictive control combined with Q-learning is presented. Finally, effectiveness of the proposed algorithm is numerically evaluated.

09:00-09:15, Paper ThAT3.3
Adaptive Point Set Aggregation for Large Scale Maximal Covering Location Problems (I)

Zhu, Ruiyi	Beijing Institute of Technology
Liu, Yaxuan	Beijing Institute of Technology
Chen, Chen	Beijing Institute of Technology
Keywords: Modeling and Control of Complex Systems, Sensor Networks Abstract: Maximal covering location problems (MCLP) are fundamental in spatial optimization with widespread applications in sensor networks, smart cities, and robotics. The growing need for fine-grained decision-making in modern systems results in both large-scale candidate facility location and demand point sets, rendering this NP-hard combinatorial optimization beyond the practical computational limits of existing solvers. Simple aggregation methods improve scalability, but often incur a significant loss in solution quality, motivating the need for intelligent aggregation mechanisms. To address this, the paper proposes an adaptive aggregation framework that effectively reduces problem dimensionality while maintaining high-quality coverage. The proposed method establishes an iterative framework of aggregation-optimization-refinement, driven by novel criteria: a coverage inconsistency evaluation that detects discrepancies between approximate and actual coverage, and a local gain potential analysis that identifies regions with high potential for solution improvement. Using these indicators to selectively refine spatial granularity, the framework adaptively preserves high-quality coverage while maintaining computational efficiency. Simulations with up to 100,000 points demonstrate that the proposed algorithm achieves high-quality coverage comparable to leading solvers with substantial runtime reductions, confirming its scalability for solving large-scale MCLP.

09:15-09:30, Paper ThAT3.4
Pulse Charging Strategy for Lithium Batteries Based on Deep Reinforcement Learning (I)

Chen, Boyang	Taiyuan University of Technology
Ren, Mifeng	Taiyuan University of Technology
Zhang, WenJie	Taiyuan University of Technology
Keywords: Energy Efficiency, Control Applications, Learning-based Control Abstract: ，同时减少极化 4182; 延长电池寿命ᦁ 2; 精准的控制至关 7325; 要。本文将深度结合带脉冲充电的强 1270; 学习提出一个新型充电控制方 7861;。关注贫困群体传统方法的泛化 3021; 力，这方法动态调整初 2987; 条件训练，使模型能 2815; 适应多样化的ࠠ 5; 电环境并迅速响应 6032; 的作情况场景。它有效地增强了 7169; 型的泛化能力，同时 6174; 著抑制充电时的偏振效 4212;。模拟结果证明通过该方法 5292; 极化电压保持ߎ 1; 变约0.2V，温度在充电过程。

09:30-09:45, Paper ThAT3.5
An Automated Data Synthesis Framework for Visual-Language Navigation Training Based on 3D Gaussian Splatting (I)

Ma, Runze	Taiyuan University of Technology
Zhang, Weiqiang	Taiyuan University of Technology
Hao, Lingguang	Taiyuan University of Technology
Cheng, Lan	Taiyuan University of Technology
Keywords: Intelligent and AI Based Control, Learning Systems, Learning-based Control Abstract: 视觉语言导航的泛化能力（VLN）模型受到以下的稀缺性极大限制。高质量、多样化的训练数据。传统数据采集方法依赖于昂贵的机器人平台，人工注解劳动密集型，导致有限的注释数据规模和显著的场景偏差。为了克服这一点瓶颈，本文提出了一种新颖的端到端解决方案自动化VLN训练数据综合框架。其核心创新在于高效地大规模生产训练数据——具有精确几何结构和丰富的语义，以及自然语言指令——仅使用多视图RGB 目标场景的图像序列作为输入。具体来说，我们首先，采用三维高斯喷溅技术重建高保真、可渲染的神经场景从图像中表示。随后，我们设计了一个数据驱

09:45-10:00, Paper ThAT3.6
SD-MOMPA-Based Approach for Multi-UAV Cooperative Task Allocation (I)

Yang, Xiaokang	Taiyuan University of Science and Technology
Zhang, Chunmei	Taiyuan University of Science and Technology
Guo, Hong ge	Taiyuan University of Science and Technology
Keywords: Optimal Control Abstract: Multi-UAV cooperative task allocation is a typical multi-objective constrained optimization problem with discrete combinatorial structure, strong constraint coupling, and conflicting objectives. To address these issues, this paper proposes a State-aware(S) Decomposition-guided(D) Multi-Objective Marine Predators Algorithm (MOMPA), abbreviated as SD-MOMPA. A multi-objective model is first established by considering total system energy consumption, maximum completion time, and load variance under task uniqueness, energy, capacity, and time-window constraints. Then, the decomposition idea from MOEA/D is introduced into MOMPA to enhance search directionality along different Pareto preferences; state-aware phase scheduling is designed to adaptively coordinate exploration and exploitation using population diversity, improvement rate, and feasibility ratio; and local replanning is applied to infeasible solutions to preserve favorable task-allocation structures. Comparative experiments show that SD-MOMPA achieves competitive convergence performance and obtains a well-distributed Pareto solution set, demonstrating its effectiveness for constrained multi-UAV task allocation.


ThAT4	Room 269
Collective Behavior Regulation Inspired by Swarm Intelligence and Its Applications	Regular Session
Chair: Yang, Qingkai	Beijing Institute of Technology
Co-Chair: Zeng, Xianlin	Beijing Institute of Technology
Organizer: Yang, Qingkai	Beijing Institute of Technology

08:30-08:45, Paper ThAT4.1
Consensus for Multi-Agent Systems with Stochastic Network and Noises by LLM (I)

Shang, Jinxin	Fuzhou University
Qi, Yiwen	Fuzhou University
Keywords: Multi-agent Systems Abstract: This paper studies the consensus problem for stochastic multiagent systems (SMASs) under compound noises (additive and multiplicative noises) over stochastic networks via a Large Language Model (LLM).Existing challenges primarily stem from the difficulty in obtaining appropriate control gains to rapidly mitigate noises under stochastic network conditions.Furthermore, the simultaneous presence of these networks and compound noises complicates the transformation into unified error dynamics.To address these issues, this paper applies LLM inference to the adaptive selection of control gains alpha(k).By utilizing the LLM's powerful reasoning capabilities and ability to process complex contextual information, this approach dynamically selects appropriate control gains to cope within complex environments.Specifically, a stochastic approximation (SA) protocol is employed where the LLM-designed gains, subject to specific convergence constraints, effectively reduce the influence of noises for SMASs.In addition to ensuring state boundedness through Lyapunov theory, a semi-decomposition technique to establish consensus among agents is employed.The effectiveness of our LLM-based control scheme is ultimately demonstrated by a representative numerical example.

08:45-09:00, Paper ThAT4.2
NE Seeking for Linear Multi-Agent Systems with Time-Varying Costs (I)

Chongyuan, Hu	Bupt
Xu, Chengzhi	Beijing University of Posts and Telecommunications
Sun, Qiming	Beijing University of Posts and Telecommunications
Tang, Yutao	Beijing University of Posts and Telecommunications
Keywords: Nonlinear Systems and Control, Multi-agent Systems Abstract: This paper addresses the problem of distributed Nash equilibrium seeking for multi-agent systems composed of heterogeneous linear agents. Unlike most existing works that assume static cost functions or single-integrator dynamics, we consider a dynamic noncooperative game where the cost functions evolve over time and the agents are subject to high-order physical constraints. We conduct a hierarchical design consisting of a distributed algorithm for the Nash equilibrium seeking at the upper level and a reference-tracking controller at the lower level. Rigorous theoretical analysis of the system performance reveals that the algorithm performance is fundamentally limited by the variation rate of the realtime NE. Numerical simulations are presented to validate the effectiveness of the proposed algorithm.

09:00-09:15, Paper ThAT4.3
Synergizing Geometric Fabrics with Population-Based Reinforcement Learning for Dexterous Manipulation of Articulated Objects (I)

Zhu, Yiming	Zhejiang University
Li, Zihao	Zhejiang University
Lang, Yilin	Zhejiang University
Ren, Qinyuan	Zhejiang University
Keywords: Learning-based Control, Robotics Abstract: Dexterous manipulation of articulated objects necessitates navigating a high-dimensional configuration space riddled with discontinuous contact dynamics. Standard end-to-end Reinforcement Learning often struggles in this regime, frequently converging to jittery, unstable policies. To bridge this gap, we propose a hierarchical control framework that synergizes the extensive exploration capabilities of Population-Based Training with the structural priors of Geometric Fabrics embedded directly into the learning loop. We construct a decoupled attractor landscape that guides the arm and fingertips toward task-relevant poses, strictly confined by anisotropic repulsive metrics for self-collision avoidance and singular barrier potentials for joint limits. This formulation not only leverages the diverse experience collection of a population of agents but also effectively projects the RL problem into a physically consistent manifold, transforming the policy's role from raw motor command generation to high-level energy shaping. Extensive validation in the massively parallelized IsaacLab simulation environment demonstrates that our GF-Delta strategy achieves a 97.47% success rate in drawer-opening tasks. Compared to standard joint-space baselines, our method exhibits superior convergence efficiency and generates smooth, feasible trajectories without requiring complex reward engineering.

09:15-09:30, Paper ThAT4.4
Attention-Enhanced Artificial Potential Field with Deep Reinforcement Learning for Multi-UAV Cooperative Pursuit (I)

Wang, Yunan	Beihang University
Hua, Yongzhao	Beihang University
Li, Xiaoduo	Beihang University
Dong, Xiwang	Beihang University
Keywords: Multi-agent Systems, Intelligent and AI Based Control, Motion Control Abstract: To address the problem of multi-UAV cooperative pursuit of a high-speed evader in obstacle-rich environments, this paper proposes a hybrid cooperative pursuit algorithm trained under the centralized training with decentralized execution paradigm. The proposed algorithm employs an attention encoder to produce compact representations of dynamic neighbor observations and adopts the Twin Delayed Deep Deterministic Policy Gradient framework to output continuous potential field parameters, achieving cooperative pursuit and obstacle avoidance control through attention-weighted potential fields. The Dual-Channel Gradient Decoupling architecture is designed to segregate the multi-network optimization pathways, thereby resolving the Actor--Critic gradient conflict arising from the shared attention encoder. In addition, the Critic-Anchored KL Regularization is introduced to suppress the target-value non-stationarity caused by fluctuations in the attention distribution. Simulation experiments demonstrate that the algorithm attains an average capture success rate of 88% with a peak of 92%, outperforms the baseline in both convergence speed and final performance.

09:30-09:45, Paper ThAT4.5
Deep Koopman Operator-Based Linear Quadratic Regulator for Quadrotor Pursuit-Evasion Game (I)

Yang, Xin Mei	Beijing Institute of Technology
Dong, Wei	Beijing Institute of Technology
Cai, Yeyun	Beijing Institute of Technology
Shi, Xiang	Beijing Institute of Technology
Zhang, Lele	Beijing Institute of Technology
Wang, Chunyan	Beijing Institute of Technology
Deng, Fang	Beijing Institute of Technology
Keywords: Learning-based Control, Robotics, Control Applications Abstract: This paper proposes a deep Koopman operator-based output regulation framework to address the pursuit-evasion (PE) game problem of quadrotors with strongly nonlinear dynamics. First, for the nonlinear quadrotor dynamics, a deep neural network is trained to lift the physical states into a high-dimensional latent linear space. Then, an output matrix is introduced to map the lifted states to task-relevant outputs. Next, a discrete-time linear quadratic regulator (LQR) with an output-weighted quadratic cost is formulated to yield a Riccati-based controller for efficient regulation. In contrast to existing Koopman-based control schemes, the proposed approach improves the modeling fidelity for nonlinear flight dynamics. Finally, simulation demonstrates that the proposed controller outperforms the baseline controller and also provides a principled and computationally efficient alternative to purely reinforcement learning (RL)-based policies.

09:45-10:00, Paper ThAT4.6
Solving Imperfect-Information Dynamic Defender-Attacker Blotto Games Based on Progressive-Expanding CFR (I)

He, Yuman	Beijing Institute of Technology
Zeng, Xianlin	Beijing Institute of Technology
Dou, Lihua	Beijing Institute of Technology
Keywords: Learning Systems, Modeling and Control of Complex Systems Abstract: This work studies imperfect-information dynamic Defender-Attacker Blotto games under graph constraints. In this game, two players make multi-stage decisions on how to allocate and move limited resources to control critical nodes, with each player observing only part of the opponent’s resources at each stage, highlighting the challenges of imperfect information. Existing reinforcement learning methods suffer from low sample efficiency and lack theoretical guarantees, while classic counterfactual regret minimization (CFR) algorithms struggle with low computational efficiency in large action spaces. To address this, we propose a Progressive-Expanding Counterfactual Regret Minimization (PE-CFR) algorithm. This method uses hybrid strategy pruning to construct a high-quality initial action subset and applies progressive expansion mechanism to explore the full action space. Experiments on networks with 3 to 6 nodes show that PE-CFR significantly outperforms the baseline in both convergence speed and computational efficiency. The equilibrium generated reveals that resource allocation strategy should consider network topology and the opponent's mobility, rather than solely consider values of nodes.


ThAT5	Room 259
Distributed Optimization, Game and Learning Algorithms with Their Applications in Cyber-Physical Systems	Regular Session
Chair: Hua, Haochen	Hohai University
Organizer: Xiao, Shunyuan	Nanjing University of Posts and Telecommunications
Organizer: You, Keyou	Tsinghua University
Organizer: Ye, Maojiao	Nanjing University of Science and Technology
Organizer: Liu, Zhao-Qing	Nanjing University of Posts and Telecommunications

08:30-08:45, Paper ThAT5.1
GFW-YOLO: A Small Traffic Sign Recognition Method Optimized for Detailed Features (I)

Huang, Chaohong	Chongqing University of Technology
Shen, Xiaoman	Chongqing University of Technology
Han, Shuchang	Chongqing University of Technology
Liu, Wei	Chongqing University of Technology
Keywords: Automated Guided Vehicles Abstract: ，交通标志检测在智能交通系统（ITS）与高级驾驶辅助系统（ADAS）通过提供关键支持驾驶安全和事故预防。致解决小靶遗漏、遮挡等问题，以及复杂环境中的低探测精度，即论文提出了一种新的交通标志检测算法针对基于YOLOv8n框架的小型对象进行了优化。首先，通过积分开发C2f-CGLU模块通道门控线性单元（CGLU）进入瓶颈主干中C2f模块的结构，因此加强模型对局部细节的感知，部分遮挡符号的通道更新功能。此外，设计轻量级功能精炼融合引入（FRF）模以重建 YOLOv8n 的交叉卷积颈部结构;该模块，称为C2f-FRF，促进了更优越的特征细节融合同时减少总参数数。此外，原始的损失函数ඪ

08:45-09:00, Paper ThAT5.2
Privacy-Preserving Distributed Estimation of Global Storage Capacity Via Dynamic Differential Privacy (I)

Zhang, Yun	Jinan University
Cui, Zhongrui	Jinan University
Chang, Le	Shanghai University of Electric Power
Keywords: Control of Distributed Generation Systems, Energy Efficiency, Control of Smart Power Delivery Systems Abstract: Distributed Energy Storage Systems (DESSs) are critical for grid stability, yet their effective scheduling relies on accurate knowledge of the global total capacity. However, the capacity data of individual storage units is commercially sensitive. Traditional distributed consensus algorithms allow for decentralized estimation but often expose the initial states of nodes to neighbors, leading to privacy leakage. To address this challenge, this paper proposes a privacy-preserving distributed estimation strategy based on textit{Dynamic Differential Privacy}. We design a time-varying noise injection mechanism where Laplace noise is added to the consensus process. The noise scale is initialized at a high level to mask raw data and follows a decay function, reducing to zero at a pre-set time step T. Theoretical analysis proves that this method ensures the privacy of individual units during the transient phase and achieves exact convergence to the true global capacity at step T, effectively balancing the trade-off between privacy protection and estimation accuracy.

09:00-09:15, Paper ThAT5.3
Emergency Frequency Restoration in Weak Islanded Microgrids: A Noise-Suppressing Prescribed-Time Approach (I)

Ye, Ying	State Grid Company
Xu, Bingyan	State Grid Company
Chen, Yunfeng	State Grid Company
Cao, Chun	State Grid Company
Tang, Heng	State Grid Company
Tu, Niehua	Shanghai Electric Power Industry
Han, Yi	Shanghai Electric Power Industry
Tang, Xuyin	Shanghai Electric Power Industry
Keywords: Control of Distributed Generation Systems, Nonlinear Systems and Control, Multi-agent Systems Abstract: Frequency stability in islanded microgrids is often compromised by the low inertia of inverter-based interfaces and the prevalence of measurement noise in weak communication links. Traditional secondary control strategies, such as asymptotic or finite-time methods, typically suffer from slow convergence rates and are sensitive to noise, leading to steady-state fluctuations or prolonged recovery times. To address these challenges, this paper proposes a noise-resilient distributed secondary control strategy tailored for the emergency frequency recovery of weak microgrids. By employing a prescribed-time control framework with time-varying gains, the proposed method ensures that frequency synchronization is achieved exactly within a user-defined time window, regardless of initial system states. Theoretical analysis reveals that the inherent growing gain of the controller effectively suppresses bounded measurement noise as the deadline approaches, ensuring deterministic recovery precision. Numerical simulations on a modified IEEE 33-bus system validate that the proposed strategy can restore frequency to the nominal value within 5 seconds, exhibiting superior robustness compared to conventional fixed-gain approaches.

09:15-09:30, Paper ThAT5.4
Demand Response Decision Optimization for EV Aggregators in V2G Systems Via Embedded-Crossed Graph Attention Reinforcement Learning (I)

Ruan, Mengxin	Hohai University
Hua, Haochen	Hohai University
Ma, Luyao	Hohai University
Zhou, Yang	Changsha University of Science and Technology
Jiang, Yingjin	China Quality Certification Centre
Sidorov, Denis	Energy Systems Institute, Siberian Branch of the Russian Academy of Sciences
Gertrudes, João Bosco	State University of Feira De Santana
Keywords: Energy Efficiency, Multi-agent Systems Abstract: With the increasing penetration of electric vehicles (EVs), vehicle-to-grid (V2G) technology enables them to act as flexible resources within the power grid. Electric vehicle aggregators (EVAs) play a crucial role in coordinating large-scale EV charging and discharging, yet they face the dual challenge of maximizing economic benefits of EVAs while maintaining user satisfaction. To address this issue, this paper develops a solution method based on Stackelberg game theory. As leaders, EVAs determine charging prices and V2G compensation to optimize profits and achieve peak shaving and valley filling, while users, as followers, respond to regulatory signals and adjust their charging behaviors accordingly. An embedded cross multi-agent actor–critic (EC-MAAC) algorithm is further proposed to address the scalability issues arising from large-scale EV participation in V2G systems, while alleviating the reliance of conventional game-theoretic solvers on convexity and differentiability assumptions for tractable equilibrium computation. Simulation results demonstrate that the proposed EC-MAAC approach achieves a 15.1% reduction in EVA operating costs, a 44.6% increase in user rewards, and an improvement in SoC satisfaction from 91.2% to 96.8%, effectively balancing economic efficiency and user satisfaction.

09:30-09:45, Paper ThAT5.5
Fully Distributed Optimal Coordination of Uncertain Euler-Lagrange Systems with Relative Output Measurement (I)

Liu, Tianyu	City University of Hongkong
Liu, Lu	City University of Hong Kong
Keywords: Multi-agent Systems, Networked Control, Adaptive Control Abstract: This paper investigates the distributed optimal coordination problem for multiple uncertain Euler-Lagrange systems. We first propose a novel measurement-based distributed optimal coordinator to generate the local reference trajectory without relying on communication networks. Notably, the coordinator gains are designed as two time-varying decreasing functions with distinct decay rates, which avoids the dependence on global information and, meanwhile, mitigates the error terms stemming from the replacement of communicated variables with physical outputs. Then, a norm-estimation-based adaptive controller is designed to achieve accurate reference tracking, where only one updated parameter needs to be introduced, regardless of the dimensions of disturbances and uncertainties. By constructing an appropriate composite Lyapunov function and utilizing the property of a perturbed time-varying differential inequality, it is proved that agents’ outputs can asymptotically converge to the global optimum. The proposed scheme is fully distributed and inherently immune to cyber-attacks.

09:45-10:00, Paper ThAT5.6
Differentially Private Distributed Nash Equilibrium Seeking for Aggregative Games under an Event-Triggered Mechanism (I)

Teng, Yuxin	Nanjing University of Science and Technology
Liu, Chao	Nanjing University of Science and Technology
Ye, Maojiao	Nanjing University of Science and Technology
Keywords: Multi-agent Systems, Networked Control, Optimal Control Abstract: This paper considers differentially private distributed Nash equilibrium seeking for aggregative games under an event-triggered mechanism. By incorporating an event-triggered scheme with a privacy protected aggregate estimation mechanism, a distributed Nash equilibrium seeking strategy is proposed. In the proposed strategy, the privacy protection is achieved by using Laplace noises to mask the information exchanged among the players, in which the transmission instants are determined by an event-triggered mechanism. By gradually weakening inter-player interactions, the proposed strategy ensures that players' actions can be driven exactly to the Nash equilibrium while ensuring rigorous epsilon-differential privacy with reduced communication costs. The effectiveness of the proposed strategy is verified using a numerical example.

10:00-10:15, Paper ThAT5.7
Quantized Asynchronous H∞ Control for T-S Fuzzy Systems Based on Fuzzy State Observers (I)

Yang, Jing-yu	University of Science and Technology Beijing
Guo, Xianggui	University of Science and Technology Beijing
Ding, Dawei	Chool of Automation and Electrical Engineer-Ing, University of Science and Technology Beijing, Beijing 100083, Ch
Hao, Liying	Dalian Maritime University
Keywords: Robust and H infinity Control Abstract: This work explores the observer-based controller problem for nonlinear networked dynamics where quantization non-idealities, exogenous disturbances, and asynchronous premise constraints are explicitly accounted for. A novel switching mechanism, leveraging the bounds of the MFs within an interval Type-2 Takagi–Sugeno (T–S) fuzzy framework, is proposed to control the approximated nonlinear model effectively. The proposed strategy obviates the need for precise MF information, accommodates premise asynchrony, and maintains low conservatism. The errors induced by quantization are formulated as bounded disturbances, which are suppressed under a prescribed H∞ performance index. Within the Lyapunov stability framework, conditions are derived to ensure asymptotic stability for the integrated system. The efficacy of the proposed approach, along with its advantages over existing methods, is subsequently confirmed via illustrative simulations.


ThAT6	Room 264
Embodied Intelligent Robotics, Active Perception, and Human-Machine Interaction Technologies	Regular Session
Chair: Wang, Xiangyang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Organizer: Jun, Cheng	Chinese Academy of Sciences
Organizer: Qieshi, Zhang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Organizer: Ziliang, Ren	Dongguan University of Technology

08:30-08:45, Paper ThAT6.1
UGPT: Uncertainty-Guided Dynamic Prompt Tuning for Vision-Language Models (I)

Huang, Baoqin	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Wu, Fuxiang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Jun, Cheng	Chinese Academy of Sciences
Abduhalimzoda, Abdukarim	Tajik Technical University Named after Academician M.S. Osimi
Song, Chengqun	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Keywords: Intelligent and AI Based Control, Estimation and Identification, Sensor/Data Fusion Abstract: Adapting frozen vision-language models (VLMs) to downstream tasks via prompt tuning has emerged as a parameter-efficient alternative to full fine-tuning. However, existing prompt tuning methods still lack explicit control over how strongly prompts should adapt to different samples, despite the significant variation in sample difficulty and prediction reliability---a critical limitation for trustworthy deployment in complex real-world scenarios such as autonomous perception and intelligent control systems. We propose UGPT (Uncertainty-Guided Dynamic Prompt Tuning), a framework that explicitly leverages visual prediction uncertainty as a sample-level control signal for prompt modulation. UGPT computes normalized entropy from a frozen zero-shot CLIP branch, maps it through a lightweight Uncertainty Injection Network (UIN) into prompt-space perturbation vectors, and applies broadcast injection to generate sample-adaptive dynamic prompts. Under the 16-shot few-shot setting, UGPT uses only 41K trainable parameters in total, achieves 81.58% Top-1 accuracy on ImageNet-1K, and obtains consistent improvements across four out-of-distribution benchmarks (Avg. OOD: 62.0%). Ablation studies and interpretability analyses confirm that the perturbation magnitude increases monotonically with uncertainty, validating the principled "harder samples receive stronger modulation" mechanism.

08:45-09:00, Paper ThAT6.2
CAMD-HER: Competence-Aware Multi-Dimensional Curriculum Hindsight Experience Replay for Sparse-Reward Dexterous Control (I)

Xu, Zhenyu	University of Macau
Ziliang, Ren	Dongguan University of Technology
Qieshi, Zhang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Pun, Chi-Man	University of Macau
Keywords: Robotics, Man-machine Interactions, Control Applications Abstract: Sparse-reward goal-conditioned reinforcement learning remains difficult in dexterous robotic control, where informative successful interactions are rare during early exploration. Although Hindsight Experience Replay (HER) improves sample efficiency by relabeling failed trajectories with achieved goals, standard HER usually adopts fixed hindsight-goal sampling heuristics and does not explicitly account for the agent's current competence. Meanwhile, existing curriculum strategies often rely on a single difficulty dimension, which may be insufficient for high-dimensional dexterous tasks. In this paper, we propose Competence-Aware Multi-Dimensional Curriculum HER (CAMD-HER), an extension of HER for sparse-reward goal-conditioned learning. CAMD-HER introduces a competence-aware hindsight relabeling mechanism that prioritizes candidate goals according to competence matching, novelty, and learning progress, and a multi-dimensional curriculum strategy that progressively adjusts task difficulty through success threshold, goal sampling range, reset perturbation, and observation noise. The proposed method is integrated into an off-policy HER-based training framework and implemented in a Stable-Baselines3-compatible manner. Experiments on the HandReach benchmark show that CAMD-HER improves training effectiveness over standard DDPG+HER by accelerating learning and achieving better performance under sparse rewards. The results indicate that jointly adapting replay selection and environment difficulty is an effective way to improve sample efficiency in goal-conditioned dexterous reinforcement learning.

09:00-09:15, Paper ThAT6.3
Hybrid Spiking Neural Network for Action Recognition (I)

Chen, Yan	Dongguan University of Technology
Ziliang, Ren	Dongguan University of Technology
Qieshi, Zhang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Anvarzoda, Daler	Tajik Technical University Named after Academician M.S. Osimi
Bandishova, Risolat	Tajik Technical University Named after Academician M.S. Osimi
Keywords: Man-machine Interactions Abstract: 基于神经网络激增的人类动作识别（SNN）因其低功率优势。然而，现有基于SNN的方法受限于训练困难，这些困难由尖峰的不可微性，阻止了准确性远超人工神经网络（人工焦虑论）。为解决这一瓶颈，本研究提出了一个集成两者结合的混合人工神经网络-卫星网络（混合AS）两流架构中的网络范式。澳大利亚国立网络布兰奇确保高识别精度，而SNN分支则保持能量效率。为了实现有效的跨分支互补性，我们引入显著性评分计算和多粒度采样机制深度功能整合。HMDB-51上的广泛实验， UCF-101和Kinetics-400证明了混合AS 在保持 SNN的低功耗特性，提升效率动作识别。

09:15-09:30, Paper ThAT6.4
SpikeAttn-YOLO: An Attention-Enhanced Spiking Neural Network for Energy-Efficient Object Detection (I)

Zhou, Jun	Dongguan University of Technology
Ziliang, Ren	Dongguan University of Technology
Qieshi, Zhang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Abdunabi, Qosimov	Tajik Technical University Named after Academician M.S. Osimi
Shavkat, Kholov	Tajik Technical University Named after Academician M.S. Osimi
Keywords: Man-machine Interactions, Adaptive Control Abstract: Spiking Neural Networks (SNNs) represent a class of biologically-inspired neural models that employ discrete spike-based communication, contrasting with the continuous-valued activations of traditional Artificial Neural Networks (ANNs). This event-driven paradigm offers significant advantages in power efficiency and temporal processing capabilities, positioning SNNs as promising alternatives for energy-constrained applications.However, two critical challenges impede their widespread adoption: the persistent performance gap compared to ANNs and the substantial computational overhead associated with training. The non-differentiable nature of spike generation prevents direct application of standard backpropagation, while existing training methodologies present fundamental limitations. ANN- to-SNN conversion techniques typically require extended temporal sequences and sacrifice inherent spatiotemporal dynamics,whereas surrogate gradient methods demand complete temporal unrolling, resulting in computational burdens comparable to conventional ANNs. To address these challenges, this work introduces the Integer Leaky Integrate-and-Fire (I-LIF) neuron model, which substantially reduces training complexity while maintaining adaptive capability.Additionally, we develop two specialized SNN architectural components: a convolution-based processing block and a Transformer-based attention block,specifically designed to overcome performance degradation in object detection applications. Extensive experimental validation on COCO,Gen1 and PASCAL VOC benchmarks demonstrates substantial performance improvements, confirming the potential of SNNs for complex visual perception tasks and advancing their practical deployment in real-world systems.

09:30-09:45, Paper ThAT6.5
MaskedStar: Orientation-Aware Star-Masked Depthwise Convolution for Lightweight Detection (I)

Liang, Zhihui	Dongguan University of Technology
Ziliang, Ren	Dongguan University of Technology
Gao, Hongchao	OPT Machine Vision Tech Co., Ltd, Dongguan
Chen, Hang	OPT Machine Vision Tech Co., Ltd, Dongguan
Liu, Ying	OPT Machine Vision Tech Co., Ltd, Dongguan
Keywords: Intelligent and AI Based Control, Fault Detection and Diagnostics, Control Applications Abstract: Real-time defect detection in industrial and infrastructural environments often faces a dilemma: dense convolutions are computationally expensive, while lightweight operators provide limited inductive bias for high-frequency, anisotropic patterns such as cracks. This paper proposes MaskedStar, a masked depthwise convolution that restricts learnable kernel locations to two complementary star supports: a horizontal–vertical cross and a diagonal “X”. Building on MaskedStar, two plug-and-play backbone blocks: MaskedStar-S, which fuses dual MaskedStar branches with different dilations via lightweight ChannelSE attention, and MaskedStar-D, which additionally blends an optional full-context branch for mid-level pyramid features, are designed. Replacing the P3/P4 C3k2 blocks in a YOLO11n baseline yields consistent improvements on three benchmarks: NEU-DET (75.8/44.1 → 76.9/44.4), RDD-MotorBike (92.8/59.6 → 93.4/59.8), and RDD-Drone (59.7/33.6 → 62.0/35.8) for mAP50/mAP50:95. Overall, the observed improvements suggest that explicit orientation priors are a practical option for lightweight detectors targeting elongated patterns. Index Terms— Object detection, orientation-aware convolution, surface defect detection, road damage detection

09:45-10:00, Paper ThAT6.6
A VLM-Driven High-Fidelity Domain Randomization Framework for Imitation Learning (I)

Huo, Ziyun	Guilin University of Electronic Technology
Wu, Fuxiang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Hao, Fusheng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Song, Chengqun	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Jun, Cheng	Chinese Academy of Sciences
Liu, Jianming	Guilin University of Electronic Technology
Keywords: Robotics, Learning-based Control, Intelligent and AI Based Control Abstract: Training robust vision-based robotic manipulation policies requires large-scale data, yet real-world collection is costly and unsafe. We present a high-fidelity simulation framework for data generation and policy learning, built on Isaac Lab with a closed-loop teleoperation pipeline for a UR5 manipulator and Robotiq 2F-85 gripper. The framework unifies demonstration collection, domain randomization, and policy training in a single pipeline. A central component is a semantic-aware parameter identifier that helps overcome a limitation of conventional DR by preventing unconstrained sampling from decoupling physical parameters from visual appearance, which otherwise yields semantically inconsistent scenarios that introduce spurious visual–physical correlations. To resolve this, we leverage a Vision-Language Model to infer material categories from RGB observations and assign physically plausible nominal values with confidence-aware bounds for friction and density. We validate the framework by training a Vision-Language-Action policy on the generated data. The results show that the parameter identifier attains high hit rates on material property prediction, and that the proposed DR meaningfully improves average task success over a position-only baseline.

10:00-10:15, Paper ThAT6.7
Combining Data Distribution and Adaptive Inference for Robotic Grasping in Vision-Language-Action (I)

Zhong, Lingye	Guilin University of Electronic Technology
Wu, Fuxiang	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Hao, Fusheng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Song, Chengqun	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Jun, Cheng	Chinese Academy of Sciences
Liu, Jianming	Guilin University of Electronic Technology
Keywords: Learning-based Control, Robotics, Intelligent and AI Based Control Abstract: Recent advances in vision--language--action (VLA) models enable end-to-end robotic manipulation by learning from demonstrations. However, existing approaches have predominantly focused on scaling model architectures or expanding dataset size, often overlooking the interaction between training data distribution and inference strategies. To address this gap, we jointly study data distribution and inference strategy for robotic grasping, emphasizing coordinated distribution design and adaptive inference. Our approach integrates a compositional data collection strategy that decomposes the task state space into orientation and spatial factors, efficiently expanding coverage with a limited number of demonstrations. Additionally, we propose an adaptive inference mechanism that dynamically adjusts the execution horizon during critical task phases, thereby enhancing task performance. Embedding-space analysis suggests that performance saturates once the relevant state-space dimensions are sufficiently covered. Real-robot experiments validate our approach, demonstrating a 98% grasp success rate with only 81 demonstrations. Furthermore, applying adaptive inference to a moderately covered dataset improves the success rate from 90% to 97%, suggesting that inference-level refinement can complement data coverage, though it cannot fully compensate for insufficient distribution design.


ThBT1	Assembly Hall
Automated Guided Vehicles	Regular Session
Chair: Ding, Yulong	Tongji University

10:30-10:45, Paper ThBT1.1
Adaptive and Navigation under Global Guidance Degradation: A Candidate-Set Driven Approach

Guan, Xinyu	Zhejiang University
Ren, Qinyuan	Zhejiang University
Keywords: Automated Guided Vehicles, Control Applications, Adaptive Control Abstract: In complex environments such as industrial logistics, mobile robots relying on global planning are susceptible to communication latency, computational fluctuations, and non-convex environmental constraints. These factors often render reference paths stale or cause global guidance failures, resulting in stagnation or collisions at the control level. To address these navigation challenges, this paper proposes a candidate-set driven local resilient decision framework. By constructing a hybrid candidate set (model-based and policy-based) and incorporating receding horizon online evaluation, the proposed method reduces dependency on global paths while achieving an adaptive trade-off between local navigation efficiency and safety margins. Furthermore, an embedded fail-safe mechanism employs a conservative baseline strategy to guarantee robust operation within tightly constrained environments. Experimental results demonstrate that the proposed approach effectively avoids local deadlocks, reduces collision risks, and enhances navigation efficiency under conditions where global reference paths are significantly delayed or unreliable, while ensuring real-time performance.

10:45-11:00, Paper ThBT1.2
Straight-Line Tracking of Unicycle Mobile Robots Using Saturation Controller

Yan, Hongjiao	South China University of Technology
Xu, Dabo	South China University of Technology
Keywords: Automated Guided Vehicles, Control Applications, Robotics Abstract: This paper considers the problem of straight-line tracking for unicycle mobile robots when the reference angular velocity is trivial and the persistence of excitation condition is a lack. To overcome this limitation, we present a controller to ensure global exponential tracking with input constraints. Simulation results confirm that the method achieves fast convergence.

11:00-11:15, Paper ThBT1.3
Energy-Efficient Yaw Stability Control for Four-Wheel-Independent-Drive Electric Vehicles

Lu, Linying	Yanshan University
Fang, Jiayi	Beijing Institute of Technology
Zhang, Ting	Tsinghua University
Jia, Liheng	Yanshan University
Fang, Yiming	Yanshan University
Keywords: Automated Guided Vehicles, Energy Efficiency, Control Applications Abstract: Four-wheel-independent-drive electric vehicles (FWID-EVs) provide high control flexibility and actuator redundancy, making them promising for improving both yaw stability and energy efficiency. However, existing studies mainly focus on stability enhancement, while the energy cost associated with direct yaw moment control is often neglected. In addition, the control emphasis between stability and efficiency should vary with the instantaneous vehicle stability state. To this end, this paper proposes a stability-aware yaw control framework for FWID-EVs. At the upper layer, a yaw stability controller is designed using prescribed performance control (PPC) and SMC to regulate the vehicle's yaw response and ensure lateral stability. At the lower layer, a phase-plane-based stability index is constructed, and a driving torque allocation strategy that considers tire utilization and motor efficiency is proposed. Simulation results demonstrate that the proposed approach can effectively maintain yaw stability while reducing energy consumption, thereby achieving a better balance between safety and energy efficiency for FWID-EVs.

11:15-11:30, Paper ThBT1.4
Safe and Efficient Motion Coordination for Multi-AGV Systems with Dangerous Circle Detection

Yang, Runbang	Nankai University
Liu, Jingxuan	Nankai University
Chen, Fei	Nankai University
Keywords: Automated Guided Vehicles, Motion Control Abstract: In this paper, we propose a motion coordination strategy for multi-AGV systems operating on predefined closed paths, in which path preprocessing and dangerous circle detection serve as the two key mechanisms. Through path preprocessing, we explicitly analyze the network topology to identify conflict-prone segments and encode potential interaction relationships among AGVs, thereby transforming implicit collision and deadlock risks into a structured representation available for online decision-making. Building upon this representation, we employ a dangerous circle detection method to identify cyclic blocking configurations that are the fundamental cause of deadlocks under bidirectional motion. This enables us to shift deadlock avoidance from reactive resolution to proactive prevention by evaluating one-step-ahead motion states and suppressing the formation of dangerous circles via selective stopping and resuming actions, without resorting to global path replanning.

11:30-11:45, Paper ThBT1.5
Human-Like Trajectory Generation for Full-Process Curve Driving: An Explicit Parametric Approach

Su, Shaoka	Tongji University
Chen, Hui	Tongji University
Yang, Jiaxin	Tongji University
Lin, Huilong	School of Automotive Studies, Tongji University
Keywords: Automated Guided Vehicles, Optimal Control, Motion Control Abstract: Conventional Lane Centering Control (LCC) algorithms typically rely on the passive geometric tracking of the lane centerline during curve navigation. Consequently, they fail to replicate the active pose-adjustment mechanisms exhibited by human drivers throughout the entry, mid-curve, and exit phases, thereby degrading the human-like quality and safety margins of the system. To address these limitations, this study proposes a human-like trajectory generation approach tailored for full-process curve driving. First, an explicit parameter-driven trajectory generation model is established. By introducing five critical physical parameters, including a pre-positioning lateral offset and a time factor, this model quantitatively translates implicit human driving expertise into a controllable "pre-positioning mechanism." Furthermore, a hierarchical trajectory generation method is developed. The initial stage constructs a baseline trajectory incorporating pre-positioning features using the explicit parameters. Subsequently, a Quadratic Programming (QP) model integrating a jerk penalty is formulated to eliminate curvature discontinuities while faithfully preserving the pre-positioning geometric anchors, ensuring high-order dynamic smoothness. Finally, comprehensive closed-loop validation is conducted utilizing a Carla-Speedgoat Hardware-in-the-Loop (HIL) platform integrated with a curvature-feedforward Stanley controller. Experimental results demonstrate that the proposed approach, while fully compatible with the Conventional Centerline Tracking (CCT) approach, consistently generates dynamically executable trajectories with prominent human-like characteristics.

11:45-12:00, Paper ThBT1.6
Self-Supervised LiDAR-Camera Fusion with Joint Embedding Predictive Architecture for 3D Object Detection

Chen, Tianjie	The University of Hong Kong
Xie, Le	Shanghai Jiao Tong University
Zhang, Teng	The University of Hong Kong
Keywords: Automated Guided Vehicles, Sensor/Data Fusion, Intelligent and AI Based Control Abstract: Autonomous driving systems require reliable 3D perception for tasks such as detection and segmentation. In recent years, multimodal integration of LiDAR’s spatial information with camera’s contextual insights has achieved significant progress in this domain using supervised learning techniques. However, supervised models depend on extensive labeled data, which limits their widespread deployment. Selfsupervised learning (SSL) addresses this challenge by pretraining on unlabeled data to develop transferable features, yet prior SSL fusion techniques struggle with bridging modality differences, achieving precise alignment, and extracting meaningful abstractions. Inspired by the Joint Embedding Predictive Architecture, we propose a two-stage SSL pretraining method that fuses LiDAR and camera signals, improving semantic coherence and facilitating efficient adaptation to downstream tasks such as 3D object detection. The evaluations demonstrate significant performance gains in data-scarce scenarios, outperforming traditional SSL methods and competing supervised methods.


ThBT2	Room 256
Advanced Optimal Control, Path Planning, and Sensing for Autonomous Intelligent Vehicles	Regular Session
Chair: Zhang, Kunwu	China University of Geosciences
Organizer: Mu, Bingxian	University of Prince Edward Island
Organizer: Shen, Chao	Carleton University
Organizer: Xu, Binyan	University of Guelph
Organizer: Liu, Fuqiang	Chongqing University
Organizer: Zuo, Lei	Chang'an University

10:30-10:45, Paper ThBT2.1
Distributed Coverage Control for Multi-Polar Unknown Environments Based on GNN-MLP (I)

Xu, Jiangwen	Chang'an University
Zuo, Lei	Chang'an University
Xiong, Li	Chang'an University
Zhang, Ziheng	Chang'an University
Keywords: Multi-agent Systems, Learning-based Control, Sensor Networks Abstract: This paper investigates the distributed coverage control problem for multi-agent systems operating in complex environments characterized by unknown sensory density functions. To address this challenge, we propose a novel cooperative control framework that synergizes Graph Neural Networks (GNN) and Multi-Layer Perceptrons (MLP). Specifically, GNNs are employed to aggregate dynamic neighbor information, while MLPs approximate the local sensory density. This architecture enables agents to adaptively optimize their deployment based on estimated fields without requiring global prior knowledge. Lyapunov stability theory is employed to guarantee the convergence of the agents' positions, regarding the estimation errors of the density function. Numerical simulation shows that the proposed method outperforms the traditional Lloyd's algorithm by achieving performance comparable to benchmarks with access to the ground-truth density function. Furthermore, ablation studies validate the efficacy of the GNN component in enhancing both coverage performance and estimation accuracy.

10:45-11:00, Paper ThBT2.2
Improved Ant Colony Optimization for Revolving Path Planning (I)

Zhao, Bin	Chongqing University
Shang, Xiruo	Chongqing University
Deng, Zhichao	Chongqing University
Liu, Siyu	CRRC Qingdao Sifang Co., Ltd
Liu, Fuqiang	Chongqing University
Keywords: Robotics, Modeling and Control of Complex Systems, Nonlinear Systems and Control Abstract: To overcome the limitations of existing moxibustion robots, such as reliance on predefined paths, poor adaptability to body surface, and lack of temperature regulation, this paper proposes a revolving moxibustion path planning method based on improved Ant Colony Optimization (ACO). First, a revolving moxibustion path planning model is established by integrating the body surface normals and heat transfer model, which enhances surface adaptability and enables temperature prediction during treatment. Second, the ACO algorithm is improved by designing a path transition probability function that incorporates directional guidance and turning angle constraints, along with an adaptive pheromone enhancement strategy based on the Sigmoid function. These enhancements accelerate the convergence of the algorithm while balancing global exploration and local exploitation. Experimental results illustrate that the proposed method performs well in surface adaptability, moxibustion efficacy, and operational efficiency, providing a valuable reference for further research and applications of moxibustion robots.

11:00-11:15, Paper ThBT2.3
Robust UWB–IMU Localization under NLOS Conditions Using Huber-IRLS Trilateration and EKF Fusion (I)

Ramadan, Omar A.	Univeristy of Guelph
Xu, Binyan	University of Guelph
Al Janaideh, Mohammad	University of Guleph
Keywords: Sensor/Data Fusion, Estimation and Identification, Robotics Abstract: Ultra-wideband (UWB) ranging is a widely used solution for indoor positioning, but its accuracy degrades under non-line-of-sight (NLOS) conditions where range errors become biased, non-Gaussian, and heavy-tailed. In loosely coupled UWB--IMU fusion, these corrupted ranges first affect the nonlinear least-squares (LS) trilateration step, and the resulting erroneous position pseudo-measurements can destabilize a downstream extended Kalman filter (EKF). This paper proposes a robust trilateration front-end based on the Huber loss, solved via iteratively reweighted least squares (IRLS), and integrates it into a 15-state error-state inertial navigation EKF. The Huber-IRLS formulation bounds the influence of large residuals, improving robustness to NLOS-contaminated anchors without explicit NLOS detection or anchor rejection. Experiments in a controlled 3D indoor simulation with geometry-based NLOS modeling demonstrate that the robust method preserves nominal LOS accuracy while significantly improving NLOS performance. In the baseline NLOS scenario, the EKF position RMSE is reduced from 0.205 m (LS) to 0.131 m (Huber-IRLS), with the 95th-percentile error reduced from 0.376 m to 0.250 m. Moreover, the proposed Huber-IRLS solver achieves substantially lower runtime than lsqnonlin-based LS, reducing the total per-update computation from approximately 2.23 ms to 0.20 ms, supporting real-time deployment.

11:15-11:30, Paper ThBT2.4
Optimal Kinodynamic Motion Planning through Anytime Bidirectional Heuristic Search with Tight Termination Condition (I)

Wang, Yi	University of New Hampshire
Mu, Bingxian	University of Prince Edward Island
Shokouhi, Shahab	University of New Hampshire
Thein, May-Win	University of New Hampshire
Keywords: Robotics Abstract: This paper introduces Bidirectional Tight Informed Trees (BTIT), an asymptotically optimal kinodynamic sampling-based motion planning algorithm that integrates an anytime bidirectional heuristic search (Bi-HS) and ensures the meet-in-the-middle property (MMP) and optimality (MM-optimality). BTIT is the first anytime MEET-style algorithm to utilize termination conditions that are efficient to evaluate and enable early termination on-the-fly in batch-wise sampling-based motion planning. Experiments show that BTIT* achieves strongly faster time-to-first-solution and improved convergence than representative non-lazy informed batch planners on two kinodynamic benchmarks: a 4D double-integrator model and a 10D linearized Quadrotor. The source code is available href{https://github.com/yi213-robotic/Bidirectional-Tight-Informed-Trees}{textcolor{blue}{here}}.

11:30-11:45, Paper ThBT2.5
A Reinforcement Learning Framework for Real-Time Update of MPC Parameters with Ensured Stability (I)

Zhang, Xiangyu	University of New Hampshire
Li, Guowei	University of New Hampshire
Thein, May-Win	University of New Hampshire
Mu, Bingxian	University of Prince Edward Island
Keywords: Learning-based Control, Optimal Control Abstract: Model Predictive Control (MPC) is an established technique for controlling constrained dynamical systems, but its performance is highly sensitive to the tuning of its cost function weighting matrices and prediction horizon. Manual tuning is often time-consuming and yields sub-optimal performance for systems operating under varying conditions. This paper presents a novel framework that utilizes Reinforcement Learning (RL) for the online, simultaneous adaptation of these key parameters. By interacting with the closed-loop system, the RL agent learns a dynamic tuning policy aimed at maximizing a cumulative reward associated with tracking performance and control efficiency. To establish baseline safety, the framework first constructs a pre-certified stability-guaranteed set of MPC parameters offline. However, a critical challenge remains during online operation: dynamic parameter updates render the closed-loop system a switched system, where arbitrary switching can induce instability. To address this, we introduce a supervisory stability filter based on a Common Lyapunov Function (CLF). This mechanism rigorously enforces a monotonic decay in a fixed energy metric, functioning as a real-time safety gate that permits only stability-preserving updates from the RL agent. Finally, the framework's effectiveness is demonstrated through a comprehensive numerical evaluation comparing Deep Q-Network (DQN), Proximal Policy Optimization (PPO), and Trust Region Policy Optimization (TRPO), highlighting the superior tracking accuracy and stability of the proposed approach.

11:45-12:00, Paper ThBT2.6
Distributed Learning-Based MPC with QP Formulation for Platooning Control of Heterogeneous Autonomous Surface Vehicles (I)

Lin, Yingtao	Carleton University
Shen, Chao	Carleton University
Keywords: Learning-based Control, Optimal Control, Multi-agent Systems Abstract: This paper develops a distributed learning-based model predictive control (DMPC) framework with an explicit quadratic programming (QP) formulation for platooning control of heterogeneous autonomous surface vehicles (ASVs) under disturbances and modeling uncertainties. A composite control structure is adopted, in which the control signal consists of a nominal control term, a neuro-adaptive term, and a feedforward term. The nominal control term is obtained by solving a tube-based robust MPC problem, while a neuro-adaptive term estimates matched uncertainties online. The feedforward term is included to form a linear prediction model which facilitates the QP formulation. By integrating neural network (NN) based disturbance estimation into the tube-based MPC design, the proposed method preserves recursive feasibility, constraint satisfaction, and closed-loop stability. Simulation results demonstrate excellent control performance and improved disturbance rejection capability.

12:00-12:15, Paper ThBT2.7
Data-Driven Robust MPC for the Path Following Control of Wheeled Mobile Robots (I)

Zahid, Sana	Carleton University
Shen, Chao	Carleton University
Keywords: Optimal Control, Automated Guided Vehicles, Robotics Abstract: The data-driven model predictive control (MPC) approach is investigated for path following control applications of wheeled mobile robots (WMRs). By updating Hankel matrices continuously using input-output data, the robot motion dynamics can be encoded into an implicit linearized model, which facilitates the MPC controller design. A novel two-step path following control algorithm is proposed so that part of the control problem can be formulated into a standard form quadratic program (QP), which can be solved efficiently by off-the-shelf optimization software. The detailed path following control algorithm design is discussed for a linear equivalent model and the nonlinear unicycle model of WMRs. The linearization errors are estimated throughout the control sequence and explicitly included in a robust MPC problem formulation. Simulation results demonstrate excellent path following performance with the proposed data-driven method.


ThBT3	Room 267
Agile Planning and Intelligent Control for Autonomous Robots	Regular Session
Chair: Jin, Xin	Fudan University
Co-Chair: Miao, Zhiqiang	Hunan University
Organizer: Yong, Kenan	Nanjing University of Aeronautics and Astronautics
Organizer: Cai, Bo	Harbin Institute of Technology
Organizer: Ren, Lu	Anhui University
Organizer: Yin, Zeyang	Central South University
Organizer: Jin, Xin	Fudan University
Organizer: Pang, Bo	Northeastern University

10:30-10:45, Paper ThBT3.1
Neural Network-Based Integral Sliding Mode Control for Modular Reconfigurable Flight Arrays under Midair Separation Disturbance (I)

Zhou, Weichen	Kunming University of Science and Technology
Yang, Chunxi	Kunming University of Science and Technology
Zhang, Xiufeng	Kunming University of Science and Technology
Sun, Hongwei	Huazhong University of Science and Technology
Shi, Yu	Kunming University of Science and Technology
Keywords: Nonlinear Systems and Control, Robotics, Estimation and Identification Abstract: This paper proposes a neural-network-based integral sliding mode control (NN-ISMC) method to address abrupt variations during the midair reconfiguration of modular reconfigurable flight arrays (MRFAs). First, an integral sliding function is constructed to eliminate the reaching phase and enhance robustness at the moment of separation. Then, an online neural network is employed to estimate the equivalent disturbance from the sliding variable and system states for feedforward compensation. A Lyapunov-based adaptive law is derived to guarantee convergence of all closed-loop signals. Finally, comparative simulations are provided for several representative scenarios to demonstrate effectiveness of the proposed method.

10:45-11:00, Paper ThBT3.2
Semiglobal Exponential Attitude Consensus under Switching Topologies: A Hybrid System Approach (I)

Zhang, Manting	Fudan University
Jin, Xin	Fudan University
Tang, Yang	East China University of Science and Technology
Keywords: Multi-agent Systems, Networked Control Abstract: In this paper, we study attitude synchronization on SO(3) under switching communication topologies within a hybrid framework. A hybrid model is constructed to capture continuous attitude dynamics and discrete topology switching, and a distributed controller based on relative attitude information is adopted. Due to switching, the Lyapunov function may be discontinuous at switching instants. To address this issue, a trajectory-based bounding technique is developed. Under a minimum dwell-time condition, semiglobal exponential synchronization is established. Numerical simulations validate the theoretical results.

11:00-11:15, Paper ThBT3.3
Risk-Aware Smooth Reinforcement Learning for Fixed-Wing UAV Aggressive Maneuvering (I)

Zhu, Haojie	Nanjing University of Aeronautics and Astronautics
Chen, Mou	Nanjing University of Aeronautics and Astronautics
Yan, Chao	Nanjing University of Aeronautics and Astronautics
Yong, Kenan	Nanjing University of Aeronautics and Astronautics
Han, Zengliang	College of Automation Engineering, Nanjing University of Aeronautics and Astronautics
Keywords: Learning-based Control, Modeling and Control of Complex Systems, Nonlinear Systems and Control Abstract: Fixed-wing Unmanned Aerial Vehicles (UAVs) are essential for high-dynamic aerial missions, yet fully exploiting their mechanical capabilities remains a challenge. Unlocking their potential for aggressive maneuvers via Deep Reinforcement Learning (DRL) is hindered by control jitter and safety concerns. In this paper, a Risk-Aware Smooth Reinforcement Learning framework (RAS-RL) is proposed to achieve aggressive flight without compromising safety or stability. Specifically, a learnable spectral filter is designed to denoise the observation stream, and Jacobian regularization is incorporated to suppress actuator jitter in closed-loop execution. Furthermore, a model-predictive risk-aware reward shaping term is constructed by combining a nominal dynamics model with a supervised residual dynamics approximator. Lastly, a curriculum learning schedule is employed that progressively increases command aggressiveness, observation noise, and the risk weight to stabilize training under the shaped objective. In high-fidelity JSBSim simulations, RAS-RL achieves a 96% success rate on randomized command tests with 0.8% safety violations, while maintaining low action fluctuation comparable to smoothness-oriented baselines.

11:15-11:30, Paper ThBT3.4
Data-Driven H_infty Control for Satellite Orbit-Attitude System under Directional Jamming Via Off-Policy Reinforcement Learning (I)

Fu, Shuai	Fudan University
Jin, Xin	Fudan University
Keywords: Control Applications, Optimal Control, Robust and H infinity Control Abstract: This paper addresses the robust orbit-attitude control problem for satellites subject to directional jamming. Unlike conventional approaches that model interference as additive Gaussian noise, we formulate the anti-jamming problem as a zero-sum differential game based on the coupled nonlinear orbit-attitude dynamics, thereby transforming the problem from passive disturbance rejection to active geometry-aware anti-jamming control. A novel interference metric is introduced to characterize the intensity of directional jamming, which is explicitly determined by the beam-to-beam geometry between the target and the adversary. To solve the resulting Hamilton-Jacobi-Isaacs equation without requiring precise knowledge of the system dynamics, an off-policy integral reinforcement learning algorithm is developed. The proposed data-driven scheme iteratively approximates the Nash equilibrium solution using neural networks. Rigorous theoretical analysis guarantees the Uniformly Ultimate Boundedness of the weight approximation errors and the finite L_2-gain stability of the closed-loop system, thereby ensuring H_infty robustness. Numerical simulations validate the efficacy of the proposed method.

11:30-11:45, Paper ThBT3.5
Occlusion-Aware Reinforcement Learning for Agile Quadrotor Target Tracking in Cluttered Environments

Liu, Xingxun	Hunan University
Wang, Yaonan	Hunnan University
Song, Shichen	Hunan University
Yang, Zeyu	Hunan University
Miao, Zhiqiang	Hunan University
Keywords: Robotics, Learning Systems Abstract: Autonomous quadrotor target tracking in cluttered environments remains a critical challenge, requiring persistent tracking of dynamic targets using exclusively onboard perception. Existing approaches frequently struggle with agile maneuvers, sim-to-real gaps, and prolonged target occlusions. To address these limitations, we propose a robust reinforcement learning framework. For high-bandwidth maneuverability and a minimized sim-to-real gap, our policy directly outputs collective thrust and body rates, bypassing the delays inherent in standard position controllers. Furthermore, we design a specialized observation space encoding target kinematics and introduce a novel geometry-agnostic visibility reward to proactively anticipate occlusions and maintain line-of-sight (LOS). To facilitate stable convergence and overcome early local optima, we implement a progressive curriculum learning strategy that incrementally scales obstacle density. Comprehensive simulations demonstrate that our framework maintains robust tracking capabilities and achieves exceptionally high success rates, even under severe occlusion conditions.

11:45-12:00, Paper ThBT3.6
LV-Fusion Planner: Dynamic Motion Primitive-Based Multimodal Fusion for Low-Latency UAV Path Planning (I)

He, Weiming	Sichuan University
Qi, Qihan	Sichuan University
Yang, Xinsong	Sichuan University
Keywords: Learning-based Control, Motion Control Abstract: LiDAR-based and vision-based end-to-end path planning algorithms suffer from lighting variations and texture deficiency, leading to sudden trajectory planning failure and subsequent collision accidents. To address this problem, this paper proposes LV-Fusion Planner, a LiDAR-vision fusion end-to-end path planning algorithm based on the YOPO algorithm. The planner fuses LiDAR, vision and UAV self-state multimodal data. It combines dynamic motion primitive-based trajectory parameterization with end-to-end mapping from multimodal data to trajectory. It enables low-latency and fully autonomous navigation, and overcomes single-modal perception limitations in path planning.


ThBT4	Room 269
Cooperative Planning and Control for Unmanned Systems	Regular Session
Chair: Xin, Bin	Beijing Institute of Technology
Co-Chair: Ding, Yulong	Tongji University
Organizer: Ding, Yulong	Tongji University
Organizer: Xin, Bin	Beijing Institute of Technology
Organizer: Wang, Miao	Beijing Institute of Technology

10:30-10:45, Paper ThBT4.1
Smoothed Particle Hydrodynamics with Differential Interaction Potentials for Multi-Swarm Segregation and Coordination (I)

Li, Ruocheng	Beijing Institute of Technology
Xin, Bin	Beijing Institute of Technology
Zhang, Shuai	Hong Kong Polytechnic University
Liu, Xuchen	Chinese University of Hong Kong
Cui, Jinqiang	Pengcheng Laboratory
Chen, Ben M.	Chinese University of Hong Kong
Keywords: Multi-agent Systems, Control Applications, Robotics Abstract: In this paper, we present a distributed framework for coordinated motion among multiple robotic swarms. The proposed approach is built upon the Smoothed Particle Hydrodynamics (SPH) paradigm, in which each robot is modeled as a particle and computes its control input based on the relative positions of neighboring robots within a finite communication range. Inspired by the immiscibility phenomenon observed in water–oil mixtures, we incorporate a differential interaction mechanism into the SPH framework to encode heterogeneous inter-group behaviors. As a result, in scenarios involving multiple robot groups with distinct affiliations, the swarm can spontaneously evolve from random initial configurations into multiple well-segregated and uniformly organized formations. Simulation results validate the effectiveness of the proposed method.

10:45-11:00, Paper ThBT4.2
A Distributed Multi-Robot Herding Algorithm for Faster-Than-Herder Swarm Evaders (I)

Chen, Delong	Tongji University
Ding, Yulong	Tongji University
Chen, Jiayu	Tongji University
Yin, Zhen	Tongji University
Keywords: Multi-agent Systems, Robotics, Modeling and Control of Complex Systems Abstract: Multi-robot herding is of great importance for autonomous agricultural and livestock management systems. However, traditional methods often struggle to converge when dealing with evader swarms that are faster and have a strong tendency to disperse. This paper proposes a decentralized cooperative herding algorithm that combines a greedy strategy with dynamic convex hull envelopment. The algorithm uses a real-time dynamic convex hull to regulate the robot formation, while employing a greedy strategy that allows each robot to autonomously select the nearest target within its field of view, complemented by a rotational search mechanism to ensure target visibility. Based on a second-order dynamics model, a coverage angle constraint considering the speed ratio and a force-balancing control law are designed to ensure the dynamic stability of the encirclement. In a scenario with 6 herders managing 30 high-speed evaders, the proposed algorithm significantly reduces the average completion time and improves stability compared to a non-cooperative greedy strategy. The algorithm demonstrates strong robustness against obstacles, scalability to larger groups, and varying initial conditions, providing a feasible solution for practical ranch applications.

11:00-11:15, Paper ThBT4.3
Priority-Aware Multi-UAV Landing Scheduling with Yielding Strategy for Emergency Logistics (I)

Fan, Jiaxin	Tongji University
Ding, Yulong	Tongji University
Feng, Kairui	Tongji University
Tan, Yu Herng	National University of Singapore
Keywords: Multi-agent Systems, Optimal Control, Intelligent and AI Based Control Abstract: Multi-UAV systems have demonstrated significant potential in urban low-altitude transportation, particularly in disaster response and emergency supply delivery, owing to their high maneuverability and contactless delivery capabilities. However, under high-density arrival traffic and limited vertiport resources, conventional landing scheduling methods struggle to balance delivery timeliness for supplies with varying urgency against UAV battery safety. To address these challenges, this paper formulates a multi-UAV landing scheduling optimization problem for emergency logistics that jointly considers delivery efficiency with priority differentiation and low-battery risk, where the decision variables exhibit strong coupling. To solve this problem, we propose a priority-aware multi-UAV landing scheduling framework. The landing sequence is first determined by a genetic algorithm, followed by a Sequence-Based Approach with Yield strategy (SBAY) for approach control. The SBAY model introduces outer yield holding points to create expedited descent corridors for earlier-ranked UAVs in the landing sequence, thereby mitigating the conflict between strongly coupled decision variables and reducing low-battery risks. Simulation results demonstrate that the proposed method improves delivery efficiency for high-priority emergency supplies while ensuring battery safety, and effectively enhances overall landing scheduling performance.

11:15-11:30, Paper ThBT4.4
Dense Semantic 3D Gaussian Splatting SLAM Via Consistent Feature Distillation and Gradient Decoupling (I)

Ying, Ruobing	Taiyuan University of Technology
Qi, Zixi	Taiyuan University of Technology
Zhang, Zhe	Taiyuan University of Technology
Cheng, Lan	Taiyuan University of Technology
Keywords: Robotics, Estimation and Identification, Learning Systems Abstract: This paper addresses semantic boundary blurring and the conflict between geometric and semantic optimization in existing semantic visual SLAM systems based on Gaussian Splatting. Current semantic 3DGS methods typically employ simple feature regression strategies, which lead to averaged semantic features at object boundaries and often destabilize geometric tracking due to forced semantic constraints. To this end, we propose a novel 3DGS-SLAM system. First, we introduce Semantic Center Loss and Entropy Minimization Loss to replace conventional regression losses. Drawing upon metric learning principles, these losses consolidate intra-class features in the latent space and sharpen semantic boundaries. Second, we design a gradient decoupling mechanism that separates geometric updates from semantic optimization, preventing semantic gradients from interfering with camera pose estimation. Experiments on the Replica dataset demonstrate that our method achieves significantly improved semantic segmentation accuracy (mIoU) while maintaining high-precision trajectory estimation and superior rendering quality, outperforming current state-of-the-art approaches.

11:30-11:45, Paper ThBT4.5
Adaptive Exploration–Exploitation Balancing for Robotic Gas Source Seeking Via Time Progress and Spatial Dispersion (I)

Wang, Miao	Beijing Institute of Technology
Xin, Bin	Beijing Institute of Technology
Yun, Qu	Beijing Aerospace Automatic Control Institute
Keywords: Robotics, Automated Guided Vehicles, Motion Control Abstract: Gas source localization is critical for industrial safety monitoring, disaster rescue, and environmental protection. This paper presents an adaptive source-seeking planner that explicitly balances exploration and exploitation under uncertainty. At each step, the robot samples candidate goal points and scores them by combining (i) an exploitation term derived from a Gaussian-like dispersion model and (ii) an exploration term computed as frontier-based information gain. To avoid search stagnation, a time-dependent penalization is introduced to reduce the attractiveness of early sampled goals, discouraging long-distance revisits. Moreover, the exploration--exploitation weight is adapted online using the spatial variance of a high-probability candidate set: dispersed candidates trigger stronger exploration, while concentrated candidates promote rapid exploitation toward the source. Simulation and real-robot experiments demonstrate that the proposed algorithm improves search efficiency in complex environments.

11:45-12:00, Paper ThBT4.6
Global Formation Stabilization of Higher-Order Integrators Using Bearing-Only Measurements

Cheng, Haoshu	Nanyang Technological University
Yan, Yamin	Nanyang Technological University
Hu, Guoqiang	Nanyang Technological University
Keywords: Networked Control, Nonlinear Systems and Control, Multi-agent Systems Abstract: This paper investigates leaderless bearing-constrained formation control for higher-order integrator systems utilizing bearing-only measurements. Unlike existing results, which are typically limited to first- or second-order dynamics, this work addresses systems of arbitrary order. We first establish a baseline for bearing-constrained formation control using state feedback and bearing measurements. By designing a bearing-feedback sliding surface and employing the backstepping technique, a distributed control law is synthesized to guarantee global convergence of the bearing constraint errors to zero. Subsequently, by leveraging the separation principle and the certainty equivalence principle, a distributed output-feedback control law is synthesized to achieve the target formation. Finally, the performance of the developed control schemes is demonstrated through numerical simulations.

12:00-12:15, Paper ThBT4.7
Enhanced OC-SORT for UAV Swarm Tracking Via Direction Consistency and Trajectory Memory

Li, Tianyang	Beijing Institute of Technology
Peng, Zhihong	Beijing Institute of Technology
Li, Yukun	Beijing Institute of Technology
Zha, Wenzhong	Eastern Communications Group of China Electronics Technology Group Corporation
Keywords: Sensor/Data Fusion, Signal Processing Abstract: Multi-object tracking (MOT) in Unmanned Aerial Vehicle (UAV) swarm scenarios presents unique challenges due to dense formations, appearance similarity, frequent occlusion, and limited resolution. We propose a lightweight tracking framework based on Observation-Centric SORT (OC-SORT), specifically designed for appearance-free aerial tracking. We further introduce a Velocity Direction Consistency (VDC) module, which leverages temporally distant motion cues to construct a stable and discriminative descriptor. This helps suppress angular inconsistencies, improves association in dense formations, and significantly reduces ID switches when UAV trajectories overlap or targets undergo occlusion. Additionally, a trajectory memory module is employed to store recent observations and recover identities lost due to short-term occlusion without relying on visual features. Experimental results on the UAVSwarm Dataset demonstrate that our approach improves ID consistency and overall tracking accuracy compared to existing methods, while maintaining real-time performance.


ThBT5	Room 259
DT-Driven Smart Inspection and Diagnosis	Regular Session
Chair: Chen, Xi	The Chinese University of Hong Kong
Organizer: Hong, Wenxing	Xiamen University
Organizer: Liu, Chuanbin	University of Science and Technology of China
Organizer: Huo, Mengzhen	School of Automation Science and Electrical Engineering, Beihang University
Organizer: Zhang, Jihan	The Chinese University of Hong Kong
Organizer: Li, Yu	The Chinese University of Hong Kong
Organizer: Huang, Yijun	The Chinese University of Hong Kong
Organizer: Hong, Duanqin	Xiamen University
Organizer: Zhu, Jiacheng	Donghai Lab
Organizer: Xu, Jiwen	The Chinese University of Hong Kong

10:30-10:45, Paper ThBT5.1
An Unsupervised Underwater Image Restoration Framework with Adaptive Color Stretch and Compensation Strategy (I)

Zhu, Jiacheng	Donghai Lab
Ke, Cheng	Xiamen University
Tong, Yicheng	Zhejiang University
Mei, Lin	Donghai Lab
Zhang, Jihan	The Chinese University of Hong Kong
Hong, Wenxing	School of Aerospace Engineering, Xiamen University
Keywords: Signal Processing, Intelligent and AI Based Control, Learning Systems Abstract: Digital Twin (DT)-driven smart inspection and diagnosis in marine environments rely on reliable underwater visual perception for state estimation, anomaly analysis, and virtual-physical consistency. However, underwater images are severely degraded by wavelength-dependent absorption and scattering, which hinders accurate inspection of underwater assets, ecological targets, and robotic operations. To address this issue, this paper proposes ACSC-UIR, an unsupervised underwater image restoration framework with adaptive color stretch and compensation (ACSC). The ACSC module supplements missing red-channel information while suppressing overcompensation in severely degraded scenes, and a multi-branch fully convolutional network is further designed to learn multi-scale restoration features without paired supervision. Experiments on UIEB and LSUI show that ACSC-UIR achieves state-of-the-art unsupervised performance in peak signal-to-noise ratio and structural similarity, while maintaining a favorable trade-off between restoration quality and model complexity. These results indicate that the proposed method can serve as an effective visual front-end for DT-driven smart inspection in underwater scenarios.

10:45-11:00, Paper ThBT5.2
Environment-Aware Multi-UAV Search for Efficient Large-Scale Small Target Detection (I)

Xu, Jiwen	The Chinese University of Hong Kong
Hong, Duanqin	Xiamen University
Zhang, Jihan	The Chinese University of Hong Kong
Chen, Xi	The Chinese University of Hong Kong
Chen, Ben M.	Chinese University of Hong Kong
Keywords: Multi-agent Systems, Automated Guided Vehicles, Robotics Abstract: Searching for small targets in large-scale natural environments is challenging due to limited flight resources and the sparse spatial distribution of targets. Traditional coverage strategies often rely on uniform sweeping patterns, which can lead to redundant observations and inefficient exploration. In this paper, we propose an environment-aware multi-UAV search framework for efficient large-scale small target detection. The proposed approach leverages environmental sensing and prior information to construct a probabilistic spatial belief map that estimates the likelihood of target presence. Instead of exhaustively covering the entire search area, high-probability regions are extracted and clustered into representative anchor points, significantly reducing planning complexity. A cooperative multi-UAV searching strategy is then developed by solving a rewarddriven team orienteering problem under flight distance and safety constraints. Extensive simulations demonstrate that the proposed method improves search efficiency and information collection compared with conventional coverage strategies. Field experiments with multiple UAVs further validate the feasibility and effectiveness of the framework in real-world environments.

11:00-11:15, Paper ThBT5.3
Campus Safety: UAV-Based Autonomous Defect Detection in Minnan-Style Architecture (I)

Hong, Wenxing	School of Aerospace Engineering, Xiamen University
Zhang, Rui	Xiamen University
Wang, Xianyi	Xiamen University
Hong, Duanqin	Xiamen University
Zhang, Jihan	The Chinese University of Hong Kong
Luo, Kunhong	Xiamen University
Keywords: Smart Buildings Abstract: The preservation of traditional Minnan-style architecture, characterized by complex topologies such as upturned eaves and elaborate facades, poses significant challenges for manual structural inspections. To overcome the labor-intensive and subjective nature of traditional methods, this paper proposes an autonomous Unmanned Aerial Vehicle (UAV) inspection framework tailored for safe and highly efficient defect detection in high-traffic campus environments. The proposed system integrates a hybrid path planning strategy, combining Boustrophedon and A-star algorithms, to ensure collision-free navigation and high-fidelity image acquisition across both expansive exteriors and spatially constrained interiors. Field deployments at Xiamen University validate the superiority of this framework, which cuts flight duration by more than half while increasing surface coverage to over 96 percent compared to manual piloting. Furthermore, we developed XMU AI-DETECT, a closed-loop visual decision-support platform coupled with a comprehensive deep learning-based object detection framework for robust anomaly identification against intricate brickwork backgrounds. This framework effectively transforms raw aerial data into actionable maintenance strategies, establishing a highly scalable paradigm for the preventative conservation of complex cultural heritage sites.

11:15-11:30, Paper ThBT5.4
A Multi-Agent Digital Twin Framework for LLM-Driven Building Energy and Indoor Air Quality Co-Simulation (I)

Huang, Yijun	The Chinese University of Hong Kong
Li, Yu	The Chinese University of Hong Kong
Chen, Xi	The Chinese University of Hong Kong
Chen, Ben M.	Chinese University of Hong Kong
Keywords: Smart Buildings, Energy Efficiency Abstract: Simulation-based building thermal and indoor air quality analysis requires coupling building energy simulation (BES) with computational fluid dynamics (CFD), yet the associated multi-software configuration workflow remains fragmented, expertise-demanding, and error-prone. This paper presents a multi-agent co-simulation framework that transforms natural-language building-analysis requests into controlled BES-CFD workflows within a digital twin environment. The framework organizes an analysis chain comprising objective-profile extraction, case-bundle planning, shared research memory, domain-specific artifact generation with context-isolated sub-agents, guarded runtime execution with policy-constrained overrides, and evidence reporting. Retrieval-augmented generation supports each sub-agent, while a human-in-the-loop approval gate and a two-layer artifact management scheme separate candidate outputs from the validated runtime baseline to improve traceability, reproducibility, and execution safety. A sports hall occupancy-driven thermal and air quality analysis task is used to evaluate the framework, with a cross-backbone assessment across six frontier LLMs. All backbones complete the workflow and produce identical physical outputs under the guarded execution policy, while exhibiting substantial planning-level diversity in artifact expressiveness and scenario design.

11:30-11:45, Paper ThBT5.5
Frequency-Calibrated UNet with Optimized Compound Loss for Kelp Semantic Segmentation (I)

Hu, Xingzheng	Xiamen University
Chen, Binqiang	Xiamen University
Ke, Cheng	Xiamen University
Zhu, Jiacheng	Donghai Lab
Hong, Wenxing	School of Aerospace Engineering, Xiamen University
Keywords: Learning Systems, Fuzzy and Neural Systems, Process Automation Abstract: Semantic segmentation of kelp holds critical significance for intelligent aquaculture and automated phenotypic analysis. However, conventional models suffer from representation collapse due to the extreme pixel-level long-tailed distribution between the dominant kelp body and sparse holdfast structures. To address this, we propose a Frequency-Calibrated UNet (FCUNet) featuring an Adaptive Frequency-Calibrated Classifier (AFCC). By integrating Adaptive Feature Norm (AFN) and Channel Group Scaling (CGS), the AFCC recalibrates logit distributions to prevent the marginalization of kelp holdfast category. Furthermore, we introduce an optimized compound loss that synergistically couples pixel-wise hard-sample mining with global structural consistency. Evaluated on our self-collected land-based Macroalgae Phenotyping Image Dataset (MPID), the framework demonstrates superior robustness across five mainstream backbones. Notably, the proposed method achieves significant performance gains for the holdfast category, improving the IoU and F1 Score by 0.1897 and 0.1933 respectively in the UNet architecture, effectively ensuring structural integrity for macroalgae phenotyping.

11:45-12:00, Paper ThBT5.6
Path Planning for Mobile Robots Based on Continuous Cost Guidance (I)

Jiang, Yuze	Xiamen University
Dong, Zicheng	Xiamen University
Ke, Cheng	Xiamen University
Hong, Wenxing	School of Aerospace Engineering, Xiamen University
Keywords: Robotics, Optimal Control, Learning-based Control Abstract: To address the challenge of balancing initial solution efficiency and asymptotic convergence in complex environments, this paper proposes CG-RRT, a real-time path planning method guided by a continuous cost field. A lightweight neural network is employed to construct a global continuous cost field offline, effectively eliminating the high computational overhead of online inference. By adaptively adjusting the sampling radius based on the cost field and employing a two-stage strategy---integrating frontier guidance with ellipsoidal constraints---the algorithm achieves rapid exploration and stable convergence. Experimental results demonstrate that CG-RRT achieves a 100% success rate under a 2% optimality threshold across various complex environments. With an average planning time of only 84~ms, the proposed method is approximately 7 times faster than the state-of-the-art NIRRT* while maintaining comparable path quality. Its robust generalization capability is further validated in unseen complex scenarios and real-world ancient architecture inspection tasks.


ThBT6	Room 264
Embodied Multiagent Systems	Regular Session
Chair: Cui, Jinqiang	Pengcheng Laboratory
Organizer: Cui, Jinqiang	Pengcheng Laboratory
Organizer: Zhao, Shiyu	Westlake University
Organizer: Zhang, Hongwei	Harbin Institute of Technology

10:30-10:45, Paper ThBT6.1
A Bearing-Strength Method for Motion Estimation of Unknown Energy Emitters (I)

Chen, Haoyu	Westlake University
Ning, Zian	Westlake University
Zhang, Yin	Westlake University
Zhao, Shiyu	Westlake University
Keywords: Estimation and Identification, Robotics Abstract: This paper studies motion estimation of moving energy emitters using passive sensors. The emitters may be light, acoustic, or radio sources. While the bearing vector pointing from the sensor to the emitter can be easily obtained, existing approaches mainly rely on the bearing-only motion estimation method. However, this method suffers from a fundamental limitation that the sensor must have lateral motion to ensure observability. Unfortunately, this lateral motion requirement often conflicts with the sensor's desired motion in many tasks. In this paper, we point out that the received signal strength, which can also be obtained easily in many ways, can greatly enhance motion estimation. Surprisingly, this strength information has not been well explored so far. Here, we propose a new bearing-strength method to fully exploit both the bearing and strength measurements. Our theoretical analysis shows that the system observability is significantly enhanced in the sense that the lateral motion condition is not required anymore. Real-world experimental results verify the proposed method and the theoretical analysis. It is notable that the benefit of the proposed method comes with no additional cost since it simply utilizes the received strength information that has not been fully exploited in the past.

10:45-11:00, Paper ThBT6.2
Input-To-State Safety with High-Order Control Barrier Functions (I)

Wang, Xinyang	Harbin Institute of Technology
Zhang, Hongwei	Harbin Institute of Technology
Keywords: Nonlinear Systems and Control, Control Applications, Optimal Control Abstract: Input-to-state safety is a crucial property for evaluating the robustness of a safe set against unknown disturbances. To guarantee the input-to-state safety of the safe set, input-to-state safe control barrier functions (ISSf-CBFs) have been proposed and further extended to a high-order version, termed as input-to-state safe high-order control barrier functions (ISSf-HOCBFs) to handle safety constraints with arbitrary relative degree. However, existing results on ISSfHOCBFs are limited to matched disturbances, which cannot affect the high-order control barrier function design until the control input is applied. In this paper, we extend the input-tostate safety guarantee from CBFs to HOCBFs under unmacthed disturbances, and propose a new type of ISSf-HOCBFs to handle safety constraints with arbitrary relative degree. A case study of an adaptive cruise control system is provided to demonstrate the effectiveness of the proposed methods

11:00-11:15, Paper ThBT6.3
Fixed-Time Distributed Robust Coverage Control for Heterogeneous UAVs-UGVs Systems with Human Interaction Via LLM (I)

Wu, Hao	Beihang University
Duan, Haibin	Beihang University
Chang, Yingxiu	Pengcheng Laboratory
Cui, Jinqiang	Pengcheng Laboratory
Yang, Jiankun	Peng Cheng Laboratory
Keywords: Multi-agent Systems, Robotics, Man-machine Interactions Abstract: To address the area coverage challenge in heterogeneous UAVs-UGVs systems, this article proposes a fixed-time distributed robust coverage control method based on human-machine interaction via large language model (LLM). This approach is applicable to scenarios involving system uncertainties and external disturbances. First, the multi-dimensional dynamic models of UAVs and UGVs are unified and the Voronoi diagram partitioning method is used to obtain the area and centroid for each region. A fixed-time observer is developed to estimate system uncertainties and external disturbances. A distributed fixed-time controller based on dynamic fast terminal sliding mode (FTSMC) is developed to achieve rapid error convergence within the fixed time. For scenarios such as issuing commands during emergencies, a method is developed where LLM parses human input dialogues and subsequently issues coverage tasks to the system. The Lyapunov function is employed to prove system stability under the designed control framework. Simulation results show that the coverage control error converges to near-zero within the fixed time. The proposed coverage control method enables the heterogeneous UAVs-UGVs systems to achieve area coverage within the specified timeframe.

11:15-11:30, Paper ThBT6.4
A Hierarchical DRL-Based Planning and Navigation Framework for Complex Multi-Robot Missions Leveraging LLMs (I)

Liu, Lian	South China University of Technology; Peng Cheng Laboratory
Wang, Zhenmin	South China University of Technology
Liu, Xuchen	The Chinese University of Hong Kong
Cui, Jinqiang	Pengcheng Laboratory
Keywords: Learning Systems, Learning-based Control, Multi-agent Systems Abstract: With increasingly complex mission requirements, current multi-robot systems often struggle to achieve stable and effective cooperative task planning in open-ended, partially observable, and communication-constrained real-world scenarios while ensuring the executability of joint plan. To address these challenges, this paper proposes a learning-based general hierarchical planning framework. At the top layer, a large language model is introduced for task understanding and decomposition of complex missions. The middle layer constructs a modular cooperative capability library, in which policies are uniformly encapsulated as waypoint generation modules, thereby constraining high-level reasoning and enhancing cross-task transferability. The bottom layer develops a waypoint-guided deep reinforcement learning motion planner that performs online collision avoidance and stable navigation using only local sensing. Experimental results demonstrate that, under diverse open-ended instructions, the top-layer planner can reliably generate executable task decomposition and assignment schemes; across different environment structures, the proposed motion planner consistently completes waypoint navigation, validating the generality and deployability of the proposed framework in open-task and communication-constrained scenarios.

11:30-11:45, Paper ThBT6.5
PLAF: Pixel-Wise Language-Aligned Feature Extraction for Efficient 3D Scene Understanding (I)

Wen, Junjie	The Chinese University of Hong Kong
He, Junlin	Sun Yat-Sen University
Ma, Fei	University of Chinese Academy of Sciences
Cui, Jinqiang	Pengcheng Laboratory
Keywords: Learning Systems, Real-time Systems, Sensor/Data Fusion Abstract: Accurate open-vocabulary 3D scene understanding requires semantic representations that are both language-aligned and spatially precise at the pixel level, while remaining scalable when lifted to 3D space. However, existing representations struggle to jointly satisfy these requirements, and densely propagating pixel-wise semantics to 3D often results in substantial redundancy, leading to inefficient storage and querying in large-scale scenes. To address these challenges, we present emph{PLAF}, a Pixel-wise Language-Aligned Feature extraction framework that enables dense and accurate semantic alignment in 2D without sacrificing open-vocabulary expressiveness. Building upon this representation, we further design an efficient semantic storage and querying scheme that significantly reduces redundancy across both 2D and 3D domains. Experimental results show that emph{PLAF} provides a strong semantic foundation for accurate and efficient open-vocabulary 3D scene understanding. The codes are publicly available at https://github.com/RockWenJJ/PLAF.

11:45-12:00, Paper ThBT6.6
HiveNav: Hierarchical Semantic Planning for UAV Swarm Exploration (I)

Liu, Xuchen	The Chinese University of Hong Kong
Zhang, Weichen	Tsinghua University
Li, Ruocheng	Beijing Institute of Technology
Wei, Hejun	Southern University of Science and Technology
Huang, Shunyuan	Harbin Institute of Technology
Huang, JunSong	Harbin Institute of Technology, Shenzhen ; Peng Cheng Laboratory
Cui, Jinqiang	Pengcheng Laboratory
Keywords: Intelligent and AI Based Control, Robotics, Multi-agent Systems Abstract: This paper presents HiveNav, a distributed semantic exploration framework for multi-UAV target search in structured environments. Inspired by the hierarchical decomposition and memory-driven reasoning style of CityNavAgent~cite{zhang2025citynavagent}, the full stack is redesigned for swarm robotics with three core upgrades. First, simulated RGB-D sensing is replaced with a real RGB-LiDAR reconstruction pipeline based on FAST-LIVO2~cite{zheng2025fastlivo2}, and a colorized semantic point cloud map is constructed for planning. Second, centralized planning is replaced with distributed hierarchical semantic planning (HSP) agents that run asynchronously on each UAV, enabling parallel decision making under heterogeneous observations. Third, a swarm-memory mechanism is introduced to share real-time state, future intent, and reservation information, reducing duplicated exploration and inter-agent conflicts. On top of this semantic layer, an SPH-based swarm navigation planner~cite{li2025sphswarm} is integrated as the motion-level planner for decentralized collision-free trajectory generation. Simulated experiments in multiple indoor scenes show consistent gains over geometric and centralized baselines, including higher task success, faster first-target discovery, and lower exploration overlap, while maintaining practical planner latency in the LLM-driven decision loop. These results demonstrate that colorized semantic mapping, distributed agents, and swarm memory are complementary and jointly critical for robust multi-UAV exploration.

12:00-12:15, Paper ThBT6.7
Modeling and Control of a Swashplateless Rotor with a Teetering Hinge (I)

Wang, Biao	Nanjing University of Aeronautics and Astronautics
Wei, Xing	Nanjing University of Aeronautics and Astronautics
Tang, Chaoying	Nanjing University of Aeronautics and Astronautics
Keywords: Control Applications, Nonlinear Systems and Control, Modeling and Control of Complex Systems Abstract: To reduce the structural complexity and maintenance cost associated with conventional swashplate mechanisms, a swashplateless rotor with a teetering hinge is studied. The rotor achieves cyclic pitch control through periodic acceleration and deceleration of the motor combined with a tilted lag-pitch hinge. However, the blade motion and cyclic pitch generation mechanism of the rotor are complex, and its dynamic characteristics still lack systematic modeling and analysis. Therefore, the dynamic model of the blade is established based on Lagrangian mechanics and the blade element method. In addition, a closed-loop motor speed control scheme based on quasi-proportional resonant (QPR) control and active disturbance rejection control (ADRC) is designed. Simulation and bench experimental results demonstrate that the rotor can achieve effective cyclic pitch variation and adjustable thrust vector control, and that the theoretical analysis agrees well with the experimental results, providing a feasible solution for the simplification and lightweight design of rotor systems for micro aerial vehicles (MAVs).


ThCT1	Assembly Hall
Nonlinear Systems and Control	Regular Session
Chair: Guay, Martin	Queen's University

13:45-14:00, Paper ThCT1.1
Nonlinear Model Predictive Control with Time-Proportioning Vent and Ballast Actuation for an Autonomous Balloon Control System

Azhdari, Maryam	Queen's University
Guay, Martin	Queen's University
Keywords: Nonlinear Systems and Control, Control Applications, Automated Guided Vehicles Abstract: This paper presents a nonlinear model predictive control (NMPC) strategy for altitude regulation of a high- altitude balloon equipped with a vent valve and a ballast release mechanism. The control objective is to track commanded altitude references under time-varying atmospheric conditions while respecting actuator limits and the discrete nature of valve operation. To better reflect practical hardware, a duty-cycle actuation model is adopted: within each control interval, the controller allocates fractions of time for venting and ballast release, and the plant implements these commands through first- order valve dynamics. The proposed NMPC is implemented and evaluated in simulation with interpolated atmospheric data. Results on step changes in altitude reference demonstrate stable tracking and physically plausible on/off valve operation while avoiding the computational burden of mixed-integer predictive control.

14:00-14:15, Paper ThCT1.2
A Mathematical Description of Intelligence in Dynamical System

Liu, Shiqi	Tsinghua University
Zhang, Xiangteng	Tsinghua University
Yu, Zhouyang	Tsinghua University
Li, Shengbo Eben	Tsinghua University
Keywords: Nonlinear Systems and Control, Intelligent and AI Based Control, Modeling and Control of Complex Systems Abstract: From the origin of life to the formulation of thermodynamic laws, understanding whether physical systems can manifest intelligence has remained a central scientific challenge. The ability to quantify intelligence is essential for determining whether a system exhibits intelligent behavior, and prior theoretical work has highlighted deep connections between intelligence and the fundamental principles of living systems. Dynamical systems offer a foundational framework for modeling the evolution of physical processes; however, their potential to exhibit intelligence has not been systematically characterized. In this paper, we introduce a mathematical modeling framework that defines dynamical systems composed of two interacting subsystems as Meta-Coupled Systems (MCS) to analyze the emergence of intelligence. Leveraging the concept of state entropy, we establish the Intelligence Emerging Principle (IEP) and derive its necessary and sufficient conditions. We claim that an isolated system that satisfies the energy conservation and an increment of positive total entropy, along with an entropy-declining subsystem, is able to generate intelligent behaviors. Furthermore, we specialize IEP under linear dynamical systems with different energy forms, and find that linear systems can also exhibit intelligence in some extent. Through both linear and nonlinear illustrative simulations, we verify the effectiveness of this mathematical framework as a principled method of investigating intelligence in dynamical systems.

14:15-14:30, Paper ThCT1.3
Improving the Algorithms of Computing Control Inputs for a Class of Controllable Discrete-Time Bilinear Systems

Zhao, Wenyu	Beihang University
Tie, Lin	Beihang University (Beijing University of Aeronautics and Astronautics)
Keywords: Nonlinear Systems and Control, Linear Systems Abstract: For nonlinear systems, controllability is in general difficult to prove, let alone to compute the control inputs that achieve state transitions. In this paper, we consider a class of discrete-time bilinear systems which own not only algebraically verifiable controllability criteria but also algorithms of computing the required control inputs to achieve state transitions based on the root locus approach. However, since such control inputs are approximated rather than exact, there exist computation errors between the real terminal state and the given terminal state. How to minimize the errors becomes an important issue for the bilinear systems. We first analyze the errors and find through simulations that the selection of the root locus gain significantly influences the errors. As a result, under the constraint that all the control inputs have to be real numbers, the larger the root locus gains are, the smaller the errors will be. We then provide an efficient method to minimize the errors by adding one more control input, so that the locations of the open-loop transfer function’s zeros can be adjusted and a larger gain can be obtained. Simulations demonstrate our provided method and the previous algorithms to achieve transitions between arbitrarily given pairs of states can thus be improved.

14:30-14:45, Paper ThCT1.4
Data-Driven Nonlinear Min-Max Model Predictive Control with Measurement Errors

Wei, Yuzhou	Beijing Institute of Technology
Xiao, Wei	Beijing Institute of Technology
Wang, Linqi	Beijing Institute of Technology
Liu, Wenjie	Beijing Institute of Technology, Beijing, China
Wang, Gang	Beijing Institute of Technology
Keywords: Nonlinear Systems and Control, Optimal Control, Control Applications Abstract: This paper addresses a data-driven predictive control problem for discrete-time nonlinear systems subject to process disturbances and measurement errors. In the absence of an accurate system model, a lifting-based representation is employed to transform the nonlinear dynamics into a linearly parameterized form, enabling tractable controller design from data. To account for bounded uncertainties, a worst-case performance criterion is incorporated, leading to an optimization problem that can be reformulated as a semidefinite program (SDP). The resulting control strategy enforces constraint satisfaction while maintaining closed-loop stability. Numerical studies demonstrate that the proposed approach achieves reliable regulation under noisy measurements and outperforms existing data-driven control schemes in terms of performance.

14:45-15:00, Paper ThCT1.5
Multi-Factor Data Preparation and Scenario Assessment for Forecasting Freight Flows Along the Trans-Caspian International Transport Route

Gafiatullin, Farkhad	"Online Shopping" LLP
Mukhanova, Gulmira	Satbayev University
Antoni, Alfonz	Budapest Mtropolitan University
Imasheva, Gulnar	Satbayev University
Toregali, Nargiza	Satbayev University
Keywords: Nonlinear Systems and Control Abstract: Accurate forecasting of freight flows along international transport corridors has become increasingly vital amid global supply chain reconfiguration. This study focuses on the Trans-Caspian International Transport Route (TITR), a key Eurasian corridor linking China, Kazakhstan, the Caspian region, and Europe. The research aims to enhance the precision of freight forecasting by applying a multi-factor data preparation and scenario assessment framework grounded in artificial intelligence and data-driven analytics. The methodology integrates heterogeneous datasets from 2010–2024, sourced from national and international institutions such as Kazakh Invest, Kazakhstan Temir Zholy, the World Bank, and OECD. Five principal factor groups were identified—infrastructure investment, technological advancement and digitalization, market liberalization, environmental policy, and cargo structure transformation—each represented by quantitative indicators and derived variables. Data preprocessing included normalization, outlier treatment, interpolation of missing values, dimensionality reduction, and feature engineering to ensure consistency and analytical validity. The proposed multi-factor approach enables scenario-based evaluation of freight flow dynamics under varying development conditions. By combining theoretical principles of transport economics with systematic data preparation and factor integration, the study establishes a robust methodological basis for forecasting freight volumes and supporting strategic decisions within the TITR corridor.

15:00-15:15, Paper ThCT1.6
Reduced-Order Flow Estimation and Predictive Station-Keeping in Flows Governed by the Navier-Stokes Equations

Waterman, Adam	Queen's University
Guay, Martin	Queen's University
Keywords: Nonlinear Systems and Control, Automated Guided Vehicles, Estimation and Identification Abstract: This paper investigates predictive station--keeping of a passive vehicle navigating in environmental flow fields governed by the Navier--Stokes equations. A computational framework combining Proper Orthogonal Decomposition (POD) reduced--order models, moving--sensor flow estimation, and nonlinear model predictive control (MPC) is developed. The vehicle is assumed to possess limited actuation capability and can only modify its vertical velocity while horizontal motion is determined by the surrounding flow. The goal is to exploit altitude--dependent flow structures in order to regulate the vehicle toward a desired spatial region. The work should therefore be interpreted as a computational investigation of the integration of reduced--order flow estimation and predictive control rather than the development of a new MPC theory. Simulation studies illustrate how predictive control can exploit environmental flow structures for station--keeping tasks.


ThCT2	Room 256
Advanced Technologies for Robot Learning, Control, and Manipulation	Regular Session
Chair: Cai, Mingxue	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Co-Chair: Xu, Sheng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Organizer: Cai, Mingxue	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Organizer: Xu, Sheng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Organizer: Yang, Lidong	The Hong Kong Polytechnic University

13:45-14:00, Paper ThCT2.1
RRT*-Based Hybrid Path Planning Method and Automatic Navigation of Microswarms (I)

Jiang, Jialin	Shenyang Institute of Automation, Chinese Academy of Science
Keywords: Micro and Nano Systems, Motion Control, Robotics Abstract: Remotely controllable untethered microrobots with multiple functions and performances have attracted increasing research interests in recent years. Benefiting from the small sizes, these miniaturized agents have exhibited promising application potential in biomedicine, especially for therapy tasks in confined, tortuous, and vulnerable physiological environments. Targeted navigation is crucial for practical applications, where effective path planning methods and motion control algorithms are necessary for obstacle avoidance. Moreover, at in vivo scenarios, the peristalsis of biological tissue will lead to deviations between real-time environments and pre-registered environments. Thus, a planning method with online adjustment mechanism would worthy exploring. In this work, we proposed a rapidly-exploring random tree(RRT)-based hybrid planning scheme. In comparison to conventional RRT* methods, a potential field-based module is employed to modify the planned paths when possible collision may occur. Then, we designed a super twisting sliding mode controller (STSMC) to govern the motion control. A disturbance observer (DOB) was presented to estimate system states and lumped disturbances. Simulations and experiments were performed to validate our proposed schemes via a vortex microswarm. The results indicate that the hybrid scheme could effectively update the motion directions for the microswarm when new obstacles are added, and the motion controller is adequate to achieve precise trajectory tracking.

14:00-14:15, Paper ThCT2.2
A Learning-Based Viscoplastic Filament Manufacture Method Using Robotic Arm Based on Flow Matching Policy (I)

Chen, Guoqing	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Tang, Yifeng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Jiang, Guolai	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Xu, Sheng	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Keywords: Learning-based Control, Robotics, Motion Control Abstract: In traditional material laboratory operations, a special filament with double-helical structures is often manufactured via manual work. This manual manufacturing process uses two needles to pull viscous melt materials along curved paths and at the same time rotate the needles. Therefore, the quality of produced filaments is always affected by human operation factors. Such factors as motion consistency, rotation coordination, and process stability all impact the quality of the filament. Hence, in order to improve repeatability and reduce differences caused by different operators, this paper researches a demonstration-guided robot viscoplastic filament drawing system. In this system, a robot manipulator first learns drawing skills from human experts, then independently completes the drawing process under specific endpoint constraint conditions. Firstly, this filament drawing task is formulated as a goal-directed sequential control problem. It requires the end-effector to carry out translational and rotational movements in a coordinated way. Secondly, for understanding the multi-mode characteristic of possible expert movement paths, a strategy based on flow-matching is proposed. This strategy is applied to learn a conditional continuous-time action generation mechanism. Thirdly, the expert demonstration data is collected through simulation methods. Subsequently, the strategy undergoes offline training to approximate the velocity field of action paths in the task space. Finally, the effectiveness of the proposed method is verified by both simulation and real-world examples.

14:15-14:30, Paper ThCT2.3
LogCosh Super-Twisting Control for Robust Path Planning and Following (I)

Cai, Mingxue	Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Gao, Daan	Southern University of Science and Technology
Keywords: Nonlinear Systems and Control, Robotics, Control Applications Abstract: Output singularities and slow error convergence are two critical factors that cause sliding mode control failure. This paper proposes LogCosh Super-Twisting Control (LCSTC), a super-twisting sliding mode controller constructed with a nonsingular terminal sliding manifold that incorporates the LogCosh function. We also apply LCSTC to the path following task. LCSTC avoids singular behavior, achieves faster, finite-time convergence, and improves path-following accuracy. Based on this controller, we develop two planners. One is based on the upgraded RRT(URRT) algorithm for static scenes, and the other is based on the bi-robot APF(BiRAPF) algorithm for dynamic scenes. Both planners generate short paths while maintaining a safe distance, and they couple with LCSTC to form integrated plan-and-follow schemes. Simulations with LCSTC in representative two-dimensional scenarios show generally faster convergence and lower following RMSE than classical sliding mode control (SMC) and super-twisting control (STC). Experiments conducted on a magnetic microrobot further strengthen the dependable effects of the LCSTC controller in practical scenarios. The efficacy of two plan-and-follow schemes is also demonstrated by simulations.

14:30-14:45, Paper ThCT2.4
Physics-Informed Predictive Control for Isoline Tracking in Dynamic Scalar Fields

Huang, Nanxi	Beijing Institute of Technology
Li, Zhuo	Tsinghua University
Zhang, Yanjun	Beijing Institute of Technology
Sun, Jian	Beijing Institute of Technology
Gao, Zhanyu	New York University
Keywords: Intelligent and AI Based Control, Learning-based Control, Robotics Abstract: This paper addresses the problem of isoline tracking in dynamic spatiotemporal scalar fields. This task is challenging because the field is unknown, time-varying, and can only be sensed through local pointwise measurements. Most existing methods are designed for static scalar fields. We present a physics-informed (PI) predictive control scheme for a nonholonomic vehicle. A local predictor is proposed to estimate the local field strength near a potential arrival point from short trajectory history and query-point information. The predictor employs physical residual regularization and multi-step recursive error to better capture the dynamics of spatiotemporal field evolution. A PI Predictive controller is designed with incorporation of the predictor to generate a set of feasible finite-horizon tracking trajectories and to select the optimal one. Only the first step of the optimal trajectory is implemented. The effectiveness and advantages of our scheme are validated via isoline tracking simulation experiments conducted on dynamic scalar fields.

14:45-15:00, Paper ThCT2.5
Real-Time Mask-Conditioned Surgical Grasping Via Action Chunking Transformers

Duan, Wenxing	The Chinese University of Hong Kong
Li, Bin	The Chinese University of Hong Kong
Liu, Yunhui	Chinese University of Hong Kong
Keywords: Learning-based Control, Intelligent and AI Based Control, Robotics Abstract: Learning-based surgical manipulation often relies on accurate pose estimation and large annotated datasets, both of which are hard to obtain in dynamic surgical environments. We propose an end-to-end framework for learning mask-conditioned surgical grasping policies from teleoperated demonstrations. An online SAM+XMem pipeline generates target masks with minimal user input, while synchronized RGB images, masks, robot states, and actions are collected to train an Action Chunking Transformer (ACT) policy. Using RGB images and masks as dual visual streams helps the policy separate appearance from task-relevant spatial cues, improving robustness to occlusion. We evaluate the effectiveness of our method in grasp-centric surgical scenarios, specifically focusing on the Needle Pick task in both the SurRoL simulator and real-world settings. Results show that mask-conditioned input consistently improves task success, highlighting the value of explicit spatial guidance for reliable policy learning in visually challenging surgical scenes.

15:00-15:15, Paper ThCT2.6
PID-Type Distributed Iterative Learning Control for Wheeled Mobile Robots: A Two-Dimensional System Approach

Wang, Wenxian	Beihang University
Meng, Deyuan	Beihang University (BUAA)
Keywords: Learning-based Control, Multi-agent Systems, Robotics Abstract: This paper aims to address the distributed control problem within cooperative learning systems and achieve high-precision performance from the beginning to the end via an iterative learning process. Specifically, a PID-type distributed iterative learning control (DILC) approach is proposed for multi-agent networks to achieve learning-based output consensus. By formulating an augmented vector to incorporate all terms in the PID-type DILC, a rigorous necessary and sufficient condition is derived for consensus error convergence via two-dimensional system analysis. Notably, the presented theoretical results can be naturally generalized to conventional D-type or PD-type DILC approaches. Furthermore, simulations are conducted on wheeled mobile robots to demonstrate the effectiveness and superior performance of the proposed PID-type DILC.


ThCT3	Room 267
Autonomous Intelligence for Industry, Daily Services, and Rescue Operations	Regular Session
Chair: Hu, Songyu	Zhejiang University
Organizer: Chen, Haoyao	Harbin Institute of Technology, Shenzhen Graduate School
Organizer: Hu, Songyu	Zhejiang University

13:45-14:00, Paper ThCT3.1
Lite 2S-AGCN towards Skeleton Action Recognition for Mobile and Edge Devices (I)

Zheng, Jianshu	Shenzhen Institutes of Advanced Technology
Zheng, Zhiyuan	Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences
Wang, Can	Chinese Academy of Sciences
Wu, Xinyu	Shenzhen Institutes of Advanced Technology (SIAT), CAS
Keywords: Intelligent and AI Based Control, Estimation and Identification Abstract: Skeleton-based action recognition shows great potential in human-computer interaction and healthcare. However, mainstream high-precision models (e.g., 2S-AGCN) have a large parameter count (≈3.0M) and high computational complexity. This paper proposes a lightweight two-stream adaptive graph convolutional network (Lite 2S-AGCN), which simplifies the graph convolution structure and optimizes network depth, reducing parameters from 3.0M to 40K and model size from 11.7MB to 156KB. Experimental results show that compared with 2S-AGCN, Lite 2S-AGCN achieves 91.29% accuracy (vs. 58.37%) and 1.74ms average inference time (vs. 9.97ms) on CZU-MHAD (cross-person test), and 94.42% accuracy (vs. 35.58%) and 1.81ms average inference time (vs. 7.43ms) on UTD-MHAD. It outperforms the baseline significantly in inference speed with excellent recognition performance, making it suitable for resource-constrained edge and mobile devices.

14:00-14:15, Paper ThCT3.2
DeepPVE: Deep Learning-Based Point Visibility Estimation against Density Variation and Noise (I)

He, Rui	Harbin Institute of Technology, Shenzhen
Cui, Hongkang	Harbin Institute of Technology(Shenzhen)
Chen, Haoyao	Harbin Institute of Technology, Shenzhen Graduate School
Li, Peng	Harbin Institute of Technology, Shenzhen
Keywords: Sensor/Data Fusion, Robotics, Signal Processing Abstract: For a long time, many point visibility estimation methods based on traditional geometric analysis or surface reconstruction have been studied. However, they struggle to handle scenes with varying density and noise. In this work, a real-time voxel-based point visibility estimation network, namely DeepPVE, is proposed to address the above problems. The scene points are voxelized into a grid and input into the encoder-decoder structure with sparse 3D convolution to extract local structural information and spatial relationships from the point cloud in an efficient manner. Compared to point-based approaches, the proposed DeepPVE can extract features that are easier to distinguish, leading to higher accuracy in visibility estimation. Moreover, it is robust against variations in point cloud density and noise. To overcome the lack of datasets, we propose an automatic point visibility data generator based on ray tracing which enables the self-supervised training of the proposed network, and a universal benchmark to evaluate the proposed estimation methods, enabling a comprehensive comparison. The visibility estimation tests, involving 600 objects and 300 pre-sampled viewpoints, confirm the generalization ability, adaptation to density variations, and robustness against noise of the DeepPVE in comparison with recent well-known methods. Furthermore, the relevant code will be released to promote the development of point visibility estimation.

14:15-14:30, Paper ThCT3.3
MSCN-LSTM: A Multi-Scale CNN and BiLSTM-Based Model for Multi-Modal Gesture Recognition Using EMG and IMU Signals (I)

He, Xiaoyao	Nanjing University of Aeronautics and Astronautics
Xie, Mingyang	Nanjing University of Aeronautics and Astronautics
Yao, Taike	Aero Engine Corporation of China Control System Research Institute
Zhou, Qi	Shanghai Jiaotong University
Bi, Youyi	Shanghai Jiao Tong University
Keywords: Sensor/Data Fusion Abstract: Gesture recognition plays a crucial role in human-computer interaction, robotics, and assistive technologies. In this study, we propose a novel framework, MSCN-LSTM, for gesture recognition using EMG and IMU signals. The model combines Multi-Scale Convolutional Neural Networks (Multi-Scale CNN) with Bidirectional Long Short-Term Memory (BiLSTM) networks to integrate spatial features from EMG signals with temporal dependencies from IMU signals. Extensive experiments demonstrate that MSCN-LSTM outperforms traditional models like GRU, LSTM, and other multi-modal models such as DSC-GRU and Multimodal CNN across accuracy, precision, recall, and F1 score. Achieving a recognition accuracy of 97.41%, MSCN-LSTM excels in both robustness and precision. Additionally, the model demonstrates real-time applicability, with an inference time of 52.97 ms per sample, making it suitable for dynamic, real-time applications.

14:30-14:45, Paper ThCT3.4
High-Precision Vascular Network Mapping for Active Targeting Microrobots Navigation In-Vivo (I)

Huang, Renqiang	Soochow University
Zhang, WenKai	Soochow University
Pan, Hao	Soochow University
Zhang, Xinyue	Soochow University
Chen, Xuanhan	College of Electricial and Engineering, Soochow University
Sun, Lining	Soochow University
Li, Xiangpeng	Soochow University
Keywords: Robotics Abstract: Microrobots hold significant promise for targeted drug delivery, yet their precise navigation through vascular networks remains challenging due to the lack of accurate, high-resolution in vivo vascular maps. Existing imaging modalities are largely limited to ex vivo or phantom-based mapping, lacking the spatial resolution and penetration depth required for in situ guidance. Here, we present a methodology for constructing high-resolution 3D vascular network models in vivo using a reflection-mode photoacoustic microscopy system. Our approach achieves imaging of microvascular structures down to capillary levels in mouse ears and subcutaneous tumors. Through a dedicated image processing pipeline, raw volumetric data are transformed into quantifiable, navigable vascular maps that preserve topological fidelity. This work addresses the critical bottleneck of in situ map construction, providing essential support for microrobot navigation and targeted therapeutic delivery.

14:45-15:00, Paper ThCT3.5
Magnetically Controlled Patterning of Liquid Metals Via Superwetting of Fe Liquid Metal Composite Coatings for Flexible Electronics (I)

Sun, Xiaoqin	Northwestern Polytechnical University
Chen, Xuanhan	College of Electricial and Engineering, Soochow University
Deng, Yuguo	School of Mechatronics Engineering, Harbin Institute of Technology
Gan, Kun	Soochow University
Bao, Wanlin	Soochow University
Bing, Xiao	School of Automation, Northwestern Polytechnical University
Li, Xiangpeng	Soochow University
Keywords: Sensor/Data Fusion Abstract: Gallium-based liquid metals (LMs) that remain fluid at room temperature combine fluidic mobility with high metallic electrical conductivity, emerging as highly attractive transformative materials in flexible electronics. However, constrained by their extremely high surface tension, liquid metals struggle to achieve reliable wetting and adhesion on flexible substrates, which severely hinders their widespread application in flexible electronics. Here, we report a magnetically controlled superwetting strategy that enables the construction of patterned liquid metal circuits on flexible substrates. Our approach is based on an iron-liquid metal composite (FeLMC), which effectively overcomes the high surface tension limits of liquid metals, enabling the LM to undergo ultrafast spreading on flexible substrates such as Ecoflex, thereby exhibiting remarkable superwetting performance. Even if prolonged exposure leads to a degradation in wetting performance, the coating's superwetting properties can be regenerated through simple mechanical stirring. We designed and fabricated a liquid metal based flexible strain sensor by using FeLMC as the wetting layer to monitor bending angle of a human finger. It exhibits a highly linear response in its relative resistance change with respect to the finger bending angle. The proposed method offers a highly promising way for advancing the low-cost and scalable manufacturing of liquid metal based flexible electronic devices and wearable systems.

15:00-15:15, Paper ThCT3.6
An RCM-Constrained Robot Path Planning Method Based on Two-Stage Sampling (I)

Xu, Jiahang	Zhejiang University
Hu, Jiaying	Zhejiang University
Li, Jiayi	ZheJiang University
Hu, Songyu	Zhejiang University
Fu, Jianzhong	Zhejiang University
Keywords: Robotics, Motion Control Abstract: The Remote Center of Motion (RCM) constraint plays a critical role in robot-assisted minimally invasive surgery, ensuring that surgical instruments move around a fixed point and thereby reducing tissue damage near the surgical incision. RCM constraint methods can be categorized into mechanical and algorithmic approaches. Mechanical methods provide high accuracy but lack flexibility, while algorithmic methods offer greater adaptability; however, existing algorithmic approaches typically maintain RCM constraints in real time at the velocity level, which can fail under task conflicts or in complex environments. To address this issue, this paper proposes a novel robot path planning method based on two-stage sampling, which enforces RCM constraints through geometric construction rather than real-time correction, explicitly considering the global feasibility of the path. First, the robot performs a collision-free first-stage sampling in the task space to generate an initial path of the working points. Then, based on geometric relationships, all initial path points are mapped through the RCM point to compute the corresponding robot joint-end poses. Finally, a second-stage sampling is performed on the obtained joint-end poses to generate the final path in the robot’s joint or Cartesian space. To address sudden pose changes and increased RCM errors caused by initial path points being close to the RCM point, a distance-checking mechanism is introduced, and helical interpolation is applied to poses exceeding the threshold, ensuring path continuity and satisfaction of the RCM constraint. Experimental results demonstrate that the proposed method produces feasible paths that meet the precision requirements of medical robots under RCM constraints.


ThCT4	Room 269
Distributed Control and Decision-Making for Multi-Agent Systems with Safety and Security Guarantees	Regular Session
Chair: Dong, Yi	Tongji University
Co-Chair: Liu, Tao	Southern University of Science and Technology
Organizer: Dong, Yi	Tongji University
Organizer: Liu, Tao	Southern University of Science and Technology

13:45-14:00, Paper ThCT4.1
Data-Driven Cooperative Output Regulation of Singular Linear Multi-Agent Systems (I)

Cai, Jialei	University of Shanghai for Science and Technology
Zhou, Chi	University of Shanghai for Science and Technology
Liang, Dong	University of Shanghai for Science and Technology
Dong, Yi	Tongji University
Wang, Chaoli	Univ. of Shanghai for Sci. & Tech
Sun, Yuanyingyi	Shanghai Linksai Technology Co. Ltd
Keywords: Multi-agent Systems, Linear Systems Abstract: Cooperative output regulation of singular multi-agent systems has wide applications in unmanned systems, smart grids, and other fields. Prior study provided the pioneering work on the cooperative output regulation with deterministic models by introducing the novel distributed observers. However, when the mathematical models of follower agents are unknown, traditional model-based design methods will no longer be applicable. This study extends the existing results to more general linear singular discrete-time multi-agent systems with unknown system matrices. To deal with the external disturbances, a coordinate transformation is employed to transfer the original tracking problem to a simplified stabilization problem. By using input and state data satisfying the full rank requirement, the feedback gain for each agent can be designed by solving data-based linear matrix inequalities. Then, a data-driven distributed control scheme is proposed to solve the problem under some mild conditions. Finally, an illustrative example is provided to verify the correctness of the proposed scheme.

14:00-14:15, Paper ThCT4.2
Control of a Reaction-Diffusion PDE–ODE System with an Actuator Delay (I)

Wu, Tao	Southern University of Science and Technology
Xu, Xiang	Southern University of Science and Technology
Wu, Xuyang	Southern University of Science and Technology
Keywords: Linear Systems Abstract: Controlling a reaction-diffusion PDE–ODE system presents a formidable challenge due to the inherent instability of both the open-loop PDE and open-loop ODE components. Consequently, a single controller must compensate for the instability of both the PDE and ODE states simultaneously. In this study, we address this challenging control problem inherent in reaction-diffusion PDE–ODE systems. Moreover, we extend our investigation to encompass a scenario where the connection between the PDE and ODE is subject to distributed delays, further complicating the control task. To tackle this formidable challenge, we employ a combination of predictor feedback and infinite-dimensional backstepping techniques to design an effective controller. We rigorously analyze the stability properties of the resulting closed-loop system, considering both L2 and L∞ norms, and establish proofs based on Lyapunov construction. Finally, we illustrate the effectiveness of our proposed controller through a simulation example.

14:15-14:30, Paper ThCT4.3
ADMM-Based Distributed Formation Control for Multi-Parafoil Systems (I)

Zhou, Tianyi	Shanghai Jiao Tong University
Zhou, Tianyi	Shanghai Jiao Tong University
Li, Yuanlong	Shanghai Jiao Tong University
Shi, Liangren	Shanghai Jiao Tong University
Keywords: Control Applications, Nonlinear Systems and Control, Multi-agent Systems Abstract: 本文探讨了协同形成控制多伞翼系统的问题，涉及大量滑翔翼和非凸翼的非线性飞行动力学耦合约束包括碰撞避免和队形维护。为了解决这些困难，一基于交替方向的分布式控制器提出了乘法法（ADMM），其中子问题通过使用微分动力学高效求解规划（DDP）、封闭形式投影和半正定投影松弛（SDR）。受益于的并行结构 ADMM 与约束的有效处理，所提方法适用于实时控制和具有良好的可扩展性，且对副翼。多个副翼的模拟结果阵型切换验证拟议方法。

14:30-14:45, Paper ThCT4.4
Position Synchronization of Multiple PMSM Systems: A Distributed Internal Model Approach (I)

Guo, Jiayu	Hefei University of Technology
Ping, Zhaowu	Hefei University of Technology
Zhang, Hongwei	Harbin Institute of Technology
Keywords: Multi-agent Systems, Motion Control, Control Applications Abstract: This paper investigates a position synchronization problem of multiple permanent magnet synchronous motor (PMSM) systems under parametric uncertainties and directed communication network. Based on the distributed internal model approach, a distributed position synchronization controller is proposed, which can simultaneously achieve position tracking, disturbance rejection, and robustness. Moreover, it can lead to superior transient performance. The simulation results demonstrate the effectiveness of our design.

14:45-15:00, Paper ThCT4.5
A Scalable Distributed Algorithm for Solving Linear Equations Over Double-Layered Networks with Jointly Connected Clusters (I)

Chen, Chen	Southern University of Science and Technology
Wang, Lili	Southern University of Science and Technology
Liu, Tao	Southern University of Science and Technology
Keywords: Linear Systems, Multi-agent Systems Abstract: This paper studies a scalable distributed algorithm for solving linear equations over double-layered multi-agent networks. The network is divided into clusters, each consisting of an aggregator and a group of agents. Agents exchange local information within clusters, while aggregators coordinate information across clusters. Unlike existing results that require the inter-cluster communication graph to remain connected at all times, this paper considers a switching inter-cluster communication graph that may be disconnected at any time instant, as long as it satisfies a jointly connected condition. By modeling the resulting error dynamics as a linear switched system and exploiting its uniform complete observability (UCO) property, we establish exponential convergence of the distributed algorithm to the unique solution of a linear equation.

15:00-15:15, Paper ThCT4.6
Active Vibration Control with Frequency-Switching Excitation: A Sparse Autoencoder and Reinforcement Learning Based Approach (I)

Wang, Hongman	Tongji University
Dong, Yi	Tongji University
Li, Rongyan	Tongji University
Xin, Bin	Beijing Institute of Technology
Wang, Qing	Beijing Institute of Technology
Chen, Xi	The Chinese University of Hong Kong
Keywords: Intelligent and AI Based Control, Learning-based Control, Signal Processing Abstract: This paper proposes an integrated frequency identification and reinforcement learning based approach for active vibration control system to especially address the challenge from the frequency-switching excitation. Technically, we first propose a sparse autoencoder based method to identify frequencies, capable of directly extracting feature and achieving fast inference speeds, and then design an optimal frequency-specific control policy with the aid of reinforcement learning technique. In particular, the Beta distribution is adopted in the construction of FxLMS-based reinforcement learning environment and used to model the action in Markov decision process due to its bounded property, which not only results in a stable training process, but also accelerates the convergence of policy optimization. Our integrated method requires no manual parameter tuning when the frequency of the excitation dramatically changes, and effectively minimizes the total error energy under different frequencies of the exciter, demonstrated in the experiment of 8-channel AVC system.


ThCT5	Room 259
Dynamic and Intelligent Decision-Making for Autonomous Systems	Regular Session
Chair: Zeng, Xianlin	Beijing Institute of Technology
Co-Chair: Dong, Xiwang	Beihang University
Organizer: Zeng, Xianlin	Beijing Institute of Technology
Organizer: Xu, Jinming	Zhejiang University
Organizer: Wang, Qing	BUAA

13:45-14:00, Paper ThCT5.1
Momentum-Based Gradient-Free Algorithm for Nonsmooth Nonconvex Compositional Optimization without Large Outer Batches (I)

Hou, Jie	Beijing Institute of Technology
Zeng, Xianlin	Beijing Institute of Technology
Xu, Jinming	Zhejiang University
Keywords: Intelligent and AI Based Control, Multi-agent Systems, Robotics Abstract: Stochastic compositional optimization (SCO) arises in a wide range of applications, including risk management and reinforcement learning. However, most existing methods for SCO rely on the smoothness assumption and require large batch sizes. To overcome these limitations, we propose a novel stochastic gradient-free algorithm, called SRMGF, designed for general nonsmooth nonconvex SCO. Unlike prior large-batch methods, SRMGF uses an increasing batch size for inner function evaluations, while employing only a single outer sample per iteration for gradient estimations. We further incorporate a momentum-based variance-reduction scheme to stabilize estimations. Theoretically, we prove that SRMGF converges to a (delta, epsilon)-Goldstein stationary point with a rate of mathcal{O}(delta^{-1/2}T^{-1/4}), achieving a total function query complexity of mathcal{O}(delta^{-4} epsilon^{-6}). Notably, SRMGF is the first algorithm to guarantee convergence without the need for large outer batches in nonsmooth SCO. Finally, numerical experiments on real-world tasks demonstrate the efficiency of SRMGF.

14:00-14:15, Paper ThCT5.2
A Weighted Bundle Method of Multipliers with Improved Computational Efficiency (I)

Zheng, Zhuoqing	Southern University of Science and Technology
Li, Cong	Southern University of Science and Technology
Xu, Xiang	Southern University of Science and Technology
Wu, Xuyang	Southern University of Science and Technology
Keywords: Optimal Control, Learning-based Control Abstract: The method of multipliers (MM) is a standard approach for equality-constrained convex optimization, but it often suffers from high computational cost in the primal update, and its dual update amounts to a gradient ascent step that can result in slow convergence. The bundle method of multipliers (BMM) mitigates these issues by incorporating a proximal bundle model into both the primal and dual updates, leading to a cheaper primal update and a more effective dual update. Nevertheless, the primal subproblem in BMM may still be expensive to solve. To improve computational efficiency, we propose a weighted BMM, which introduces a weighted proximal term to diagonalize the quadratic component of the primal subproblem. With this modification, the subproblem becomes significantly more tractable when solved using gradient-based dual approaches. We establish convergence of the proposed method under standard assumptions and demonstrate its superior time efficiency through numerical experiments.

14:15-14:30, Paper ThCT5.3
KL-Prior Regularized PPO: Integrating LLM Tactical Knowledge into Reinforcement Learning for Beyond-Visual-Range Air Combat (I)

Guo, Zheng	Beihang University
Yu, Jianglong	Beihang University
Chen, Yiming	Beihang University
Ren, Zhang	Beihang University
Keywords: Intelligent and AI Based Control, Multi-agent Systems, Robotics Abstract: Beyond-visual-range (BVR) air combat is a complex sequential decision-making problem. Reinforcement learning (RL) agents often struggle to discover multi-phase tactics in such high-dimensional spaces. While large language models (LLMs) offer strong strategic reasoning, integrating them directly into RL training remains an open challenge. To address this, a novel framework named KL-Prior Regularized PPO is proposed in this paper. This training-time-only framework incorporates a data-driven LLM tactical prior into the Proximal Policy Optimization (PPO) objective via a decaying KL-divergence penalty. This prior is constructed by analyzing 299 winning engagements from a baseline RL agent, successfully encoding the ``fire-and-extend" BVR doctrine into soft action distributions. Unlike inference-time fusion methods, actions in this approach are sampled exclusively from the PPO policy. This design preserves the stationary Markov Decision Process (MDP) assumption, which is critical for stable value function learning. The proposed method is evaluated in a high-fidelity 3D BVR simulator featuring F-16 aircraft, realistic sensors, and AIM-120C missile dynamics. Over 50,000 training episodes, KL-Prior PPO achieves a 28.8% accumulated win rate, outperforming the pure PPO baseline (25.0%). Furthermore, three failed inference-time fusion variants are analyzed. Entropy collapse and value function poisoning are identified as their fundamental failure mechanisms. These insights provide practical design guidance for robust LLM-RL integration in complex control domains.

14:30-14:45, Paper ThCT5.4
Koopman-Based Linear MPC with INDI for Quadrotor Trajectory Tracking Control (I)

Lv, Xiaokang	Beihang University
Wang, Qing	Beihang University
Wang, Shimin	Massachusetts Institute of Technology
Dong, Xiwang	Beihang University
Keywords: Optimal Control, Real-time Systems, Estimation and Identification Abstract: This paper proposes an analytical Koopman-based linear model predictive control (MPC) method for real-time quadrotor trajectory tracking. While linear MPC offers computational efficiency, it sacrifices modeling fidelity; nonlinear MPC solved via sequential quadratic programming achieves high accuracy but requires multiple iterations at each control step. We develop a systematic procedure to derive Koopman observables that lift the dynamics into a quasi-linear model with state-dependent control matrix. An assumed state trajectory converts this to a linear time-varying system at each control period, enabling quadratic program formulation with guaranteed real-time solvability. An incremental nonlinear dynamic inversion (INDI)-based robust control allocation scheme is proposed, which requires no precise control effectiveness model. Simulation results demonstrate tracking performance comparable to nonlinear MPC with deterministic computation times. The proposed method requires no training data collection, making it straightforward to implement.

14:45-15:00, Paper ThCT5.5
Local Generalization Analysis for Decentralized Personalized Federated Learning (I)

Chen, Xing	Southeast University
Yu, Yuanyuan	Southeast University
Yang, Shaofu	Southeast University
Xu, Wenying	Southeast University
Keywords: Learning Systems, Multi-agent Systems, Networked Control Abstract: Generalization analysis in personalized federated learning (PFL) is crucial for the design of learning algorithms. Existing theoretical studies primarily focus on characterizing the generalization performance of the global averaged model, while the performance of individual agents’ local models remains largely unexplored. To address this gap, we first establish agent-specific generalization error bounds for decentralized personalized federated learning (D-PFL) based on the notion of algorithmic stability. By reformulating the iteration process of D-PFL as a linear dynamical system and leveraging matrix decomposition techniques, we explicitly characterize how data perturbations propagate across the shared and personalized layers. Our analysis reveals the connection between network topology and agent-level generalization: agents with higher topological centrality benefit from more effective diffusion of perturbations through the communication network and consequently achieve improved generalization performance. In contrast, peripheral agents experience weaker perturbation dissipation and are therefore more susceptible to overfitting. Numerical experiments are provided to validate the theoretical findings.

15:00-15:15, Paper ThCT5.6
Spatial-Temporal Hierarchical Planning and Incremental NMPC Tracking for Fixed-Wing UAVs in Cluttered Environments (I)

Deng, Shijie	Beihang University
Yu, Jianglong	Beihang University
Feng, Zhi	Beihang University
Dong, Xiwang	Beihang University
Keywords: Optimal Control, Control Applications, Motion Control Abstract: Fixed-wing UAV navigation in cluttered environments requires robust spatial-temporal planning and high-fidelity tracking under non-holonomic constraints. To address these challenges, this paper presents a hierarchical framework tightly coupling a kinematic-aware front-end, a MINCO-based trajectory optimizer, and an incremental Nonlinear Model Predictive Control (NMPC) scheme. To strike an optimal balance between computational tractability and geometric tightness, an adaptive Safe Flight Corridor (SFC) generation strategy is developed based on a 3D Dubins search. Furthermore, a novel Kinematics-Aware Terminal Regularization (KATR) mechanism is introduced. It effectively resolves kinematic boundary conflicts caused by naive zero-acceleration assumptions, preventing forced zero-curvature terminal states while unlocking tangential acceleration degrees of freedom. For robust execution, the incremental NMPC utilizes a physics-informed Zero-Control Invariant Set Extrapolation to eradicate terminal predictive conflicts, ensuring exact terminal convergence. ROS and PX4 SITL simulations demonstrate that the framework achieves millisecond-level online planning and high-fidelity 3D maneuver tracking with sub-meter root-mean-square error (RMSE).

15:15-15:30, Paper ThCT5.7
DP-KF: A Dual-Stage Personalization Kalman Filter for Human Joint-Angle Prediction (I)

Wang, Nan	Nanjing University of Posts and Telecommunications
Shao, Xiaojuan	Nanjing University of Posts and Telecommunications
Lulu, Song	College of Automation, Nanjing University of Posts and Telecommunications, Nanjing, China
Ling, Ren	College of Automation, Nanjing University of Posts and Telecommunications, Nanjing China
Sha, Fan	College of Automation, Nanjing University of Posts and Telecommunications, Nanjing China
Chao, Deng	Nanjing University of Posts and Telecommunications
Keywords: Estimation and Identification, Man-machine Interactions Abstract: Real-time and accurate joint angle prediction is essential for human-machine interaction systems, yet it remains difficult due to substantial inter-subject variability and diverse locomotion tasks. While advances in neural networks have boosted prediction accuracy, data-driven approaches often struggle to achieve low-latency estimation in real-world online settings. To address this gap, we propose a dual-stage personalization Kalman filter (DP-KF) framework that couples long-term personalization with short-term dynamic correction. In the first stage, human physiological characteristics are incorporated as individualized prior conditions, and a conditional variational autoencoder (CVAE) is employed to generate a long-timescale baseline trajectory of joint angles. Meanwhile, a plantar-pressure matrix is fused to enrich gait, which improves the identifiability and stability of the learned latent variables. In the second stage, a Kalman filter (KF) operates under a system dynamics model and applies recursive estimation. It performs online short-timescale prediction of the residual between the long-timescale baseline trajectory generated in the first stage and the real-time observed joint angles. Finally, comparative experiments and ablation studies demonstrate that the proposed method achieves superior accuracy, while maintaining real-time performance.


ThCT6	Room 264
Embodied Perception, Decision, and Control for Autonomous Unmanned Systems	Regular Session
Chair: Wu, Wentao	The Hong Kong Polytechnic University
Co-Chair: Huang, Hailong	Hong Kong Polytechnic University
Organizer: Wu, Wentao	The Hong Kong Polytechnic University
Organizer: Wang, Huiting	The Hong Kong Polytechnic University
Organizer: Xu, Lei	KTH Royal Institute of Technology
Organizer: Chen, Wen-Hua	Loughborough University
Organizer: Zhang, Weidong	Shang Hai Jiaotong University

13:45-14:00, Paper ThCT6.1
Safety-Critical Accelerated Fixed-Time Convergence Learning-Based Control Via Dual Objective Synthesis (I)

Tan, Junkai	The Hong Kong Polytechnic University
Wu, Wentao	The Hong Kong Polytechnic University
Chen, Wen-Hua	Loughborough University
Zuo, Zhiqiang	Tianjin University
Wang, Yijing	Tianjin University
Keywords: Adaptive Control, Optimal Control, Learning-based Control Abstract: This paper proposes a safety-critical approximate optimal control framework, which synthesizes dual control for safety and optimality using Nesterov accelerated gradients with fixed-time stability (DCSO-NAG-FxT). For both safety and optimality objectives, a dual-objective function is constructed, and the Nesterov acceleration-based fixed-time learning law is applied to update the learning weights. Rigorous theoretical analysis establishes practical fixed-time stability for the closed-loop system, ensuring that the settling time of weights is uniformly bounded. By deriving explicit conditions for the bi-homogeneous weight updates, this work bridges the gap between accelerated gradient methods and fixed-time stability in adaptive dynamic programming. Furthermore, a safety-aware synthesis mechanism is integrated to guarantee the forward invariance of safe sets during the learning transient. Simulations on a nonlinear benchmark demonstrate faster convergence, lower cost, and zero safety violations.

14:00-14:15, Paper ThCT6.2
Trajectory Tracking of Autonomous Ground Vehicles Based on Robust MPC with Gain Scheduling (I)

Zhang, Kunwu	China University of Geosciences
Wang, Huiting	The Hong Kong Polytechnic University
Cao, Weihua	China University of Geosciences, Wuhan, China
Keywords: Control Applications, Motion Control, Optimal Control Abstract: With the rapid advancement of information and automation technologies, autonomous ground vehicles (AGVs) have been widely deployed across various industrial and domestic applications. Trajectory tracking of AGVs remains a challenging task in practice due to the presence of external disturbances and physical constraint. To address this challenge, this extended abstract presents a tube-based model predictive control (MPC) framework. The key idea is to reformulate the original tracking problem as a regulation problem by analyzing the dynamics of the tracking error. A robust positively invariant set is then computed to capture the influence of external disturbances on the evolution of the tracking error. Furthermore, a gain-scheduling mechanism is developed to reduce the conservatism arising from the invariant set. The resulting tube-based MPC scheme ensures constraint satisfaction and tracking accuracy while effectively handling external disturbances.

14:15-14:30, Paper ThCT6.3
Closed-Chain Sim2Sim Gait Transfer for Linear-Actuator Driven Humanoid Robot on Complex Terraims (I)

Ding, Tao	Huazhong University of Science and Technology
Liu, Zexu	Huazhong University of Science and Technology
Zhang, Yuhao	Huazhong University of Science and Technology
Zhu, Qingmiao	Huazhong University of Science and Technology
Zhao, Xingwei	Huazhong University of Science and Technology
Bo, Tao	Huazhong University of Science and Technology
Shi, Yang	Canada
Keywords: Robotics, Intelligent and AI Based Control, Control Applications Abstract: Linear-actuator driven full-size humanoid robots (LF-HRs) provide high payload capacity and low energy consumption during static standing. Nevertheless, the coupled closed-chain kinematics and linear actuator dynamics introduce a deployment gap, making reinforcement learning (RL) locomotion policies trained on a 12-DoF opened-chain model difficult to transfer and deploy. To address this gap, we propose a robust two-stage closed-chain Sim2Sim gait transfer controller, which geometrically correlates rotational joint angles with linear actuator displacements through a high-fidelity virtual-physical joint space mapping. Specifically, sagittal-plane projection is employed to model the hip and knee linkages, and a Jacobian-based iterative solver is developed to accurately handle the parallel ankle mechanism, thereby guaranteeing coordinated and kinematically feasible motion under closed-chain constraints. Building on this mapping, we combine potential-based reward shaping and domain randomization to transfer the learned policy from the opened-chain training environment to a high-fidelity closed-chain model, followed by two-stage validation on unstructured terrains, including height field and rough ground. The proposed controller provides an effective pathway for the high-performance deployment of LF-HRs.

14:30-14:45, Paper ThCT6.4
Tether-Connected Tilt-Rotor Quadrotor Robot for Building Maintenance (I)

Su, Jiangcheng	Hong Kong Polytechnic University
Zhou, Guanzhong	Hong Kong Polytechnic University
Haoyang, Yang	Hong Kong Polytechnic University
Cheng, Li	Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Huang, Hailong	Hong Kong Polytechnic University
Keywords: Robotics, Modeling and Control of Complex Systems, Motion Control Abstract: This paper explores a robotic system that synergizes the adhesion capability of wall-climbing robots with the agility of aerial vehicles for building maintenance applications. While wall-climbing robots can adhere to vertical surfaces, they suffer from limited obstacle-crossing abilities; conversely, unmanned aerial vehicles offer flexible maneuverability around walls but face challenges including limited proximity to surfaces, short endurance, and operational risks. To address these limitations, we propose a tether-connected tilt-rotor quadrotor robot that leverages the tensile force from the tether to enhance endurance and flexibility. The integration of a tilt-rotor mechanism with the tether enables decoupled position and attitude control, allowing the robot to approach walls at arbitrary orientations and substantially improving adaptability to complex environments. The dynamic model of this tethered tilt-rotor quadrotor is analyzed, with system identification employed to determine key parameters. Controller design and parameter tuning are subsequently performed based on the established model and identified parameters. Experimental results validate the effectiveness of the controller in angle command tracking and demonstrate the system's potential for wall inspection applications.

14:45-15:00, Paper ThCT6.5
Graph-MAPPO for Self-Organized Airship Encirclement and Hotspot Surveillance

Pei, Wenyuan	Beihang University
Liu, Simin	Beihang University
Zheng, Zewei	Beihang University
Zou, Yuxuan	Beihang University
Keywords: Multi-agent Systems, Intelligent and AI Based Control, Learning-based Control Abstract: This paper studies a coordination-layer problem for self-organized airship encirclement and hotspot surveillance. A team of airships moves on a prescribed standoff ring around a protected location, while its angular distribution is adjusted according to a hotspot-induced directional surveillance demand. To characterize this task, a density-based equal-mass criterion is introduced on the angular domain, where the hotspot increases the task value of the facing sector without removing the coverage demand on the remaining ring. Based on this formulation, Graph-MAPPO is developed by embedding a lightweight ring message-passing encoder into the MAPPO actor--critic architecture. The proposed method preserves the standard PPO optimization backbone, but injects a graph inductive bias matched to the fixed neighbor-only communication topology, so that each agent can aggregate multi-hop neighborhood information for local mass equalization. The model is intended for the tangential redistribution layer, with radial regulation and low-level stabilization assumed to be handled by an inner-loop controller. Comparative simulations, including multi-seed training, message-passing-depth ablation, random-initial-formation tests, and spacing-dynamics evaluation, demonstrate the advantages of Graph-MAPPO over an MLP-based MAPPO baseline and a handcrafted local controller in convergence and hotspot-oriented redistribution.

15:00-15:15, Paper ThCT6.6
GIGA: A Generative-Initialized Gated-Adaptation Framework for Multi-Agent Adversarial Games

Wang, Xiaoxiao	Tongji University
Zhao, Yihang	Tongji University
Lei, Jinlong	Tongji University
Hong, Yiguang	Chinese Academy of Sciences
Keywords: Multi-agent Systems, Learning-based Control, Intelligent and AI Based Control Abstract: Multi-agent adversarial games with pre-protection points are challenging due to partial observability, sparse rewards, delayed feedback, and dynamically evolving opponent strategies. In such settings, pure centralized training with decentralized execution (CTDE) reinforcement learning often suffers from low sample efficiency and fragile coordination, while standard behavior cloning struggles to represent multimodal expert policies and may lead to an unstable transition from imitation to autonomous exploration. To address these issues, we propose Generative-Initialized Gated-Adaptation (GIGA), a two-stage framework that bootstraps multi-agent reinforcement learning with generative expert priors. In Stage I, a conditional variational autoencoder is trained to model the multimodal distribution of expert behaviors, yielding a structured latent prior for policy initialization. In Stage II, a bias-initialized gating mechanism adaptively combines the frozen expert prior with a learnable residual policy under the CTDE paradigm, enabling a smooth transition from expert-guided behavior to maximum-entropy reinforcement learning. Together with twin centralized critics, GIGA improves convergence speed, final performance, and robustness across multiple training regimes, outperforming strong multi-agent reinforcement learning baselines and the expert policy used for initialization.


ThPT7	Assembly Hall
Smart Systems: The Impact of Generative and Adaptive AI on Modern Automation	Plenary Session
Chair: Chen, Ben M.	Chinese University of Hong Kong

Technical Program for Thursday June 18, 2026