Last updated on November 29, 2024. This conference program is tentative and subject to change.
Technical Program for Sunday November 24, 2024
SuO_1P Regular, Amphitheatre 450-850
Oral Session 3
Chair: Okada, Kei | The University of Tokyo
Co-Chair: Lee, Dongheui | Technische Universität Wien (TU Wien)
09:00-09:10, Paper SuO_1P.1
Not Only Rewards but Also Constraints: Applications on Legged Robot Locomotion
Kim, Yunho | Neuromeka
Oh, Hyunsik | Korea Advanced Institute of Science and Technology
Lee, Jeong Hyun | Korea Advanced Institute of Science & Technology (KAIST)
Choi, Jinhyeok | Korea Advanced Institute of Science and Technology
Ji, Gwanghyeon | Korea Advanced Institute of Science and Technology
Jung, Moonkyu | Korea Advanced Institute of Science and Technology
Youm, Donghoon | Korea Advanced Institute of Science and Technology
Hwangbo, Jemin | Korea Advanced Institute of Science and Technology
Keywords: Legged Robots, Reinforcement Learning, Deep Learning in Robotics and Automation, AI-Based Methods
Abstract: Several earlier studies have shown impressive control performance in complex robotic systems by designing the controller using a neural network and training it with model-free reinforcement learning. However, these outstanding controllers with natural motion style and high task performance are developed through extensive reward engineering, a highly laborious and time-consuming process of designing numerous reward terms and determining suitable reward coefficients. In this work, we propose a novel reinforcement learning framework, consisting of both rewards and constraints, for training neural network controllers for complex robotic systems. The learning framework is applied to train locomotion controllers for several legged robots with different morphologies and physical attributes to traverse challenging terrains. Extensive simulation and real-world experiments demonstrate that performant controllers can be trained with significantly less reward engineering, by tuning only a single reward coefficient. Furthermore, a more straightforward and intuitive engineering process can be utilized, thanks to the interpretability and generalizability of constraints.
09:10-09:20, Paper SuO_1P.2
Robust Quadrupedal Jumping with Impact-Aware Landing: Exploiting Parallel Elasticity
Ding, Jiatao | Delft University of Technology
Atanassov, Vassil | University of Oxford
Panichi, Edoardo | Technische Universiteit Delft
Kober, Jens | TU Delft
Della Santina, Cosimo | TU Delft
Keywords: Legged Robots, Optimization and Optimal Control, Compliance and Impedance Control, Motion Control
Abstract: Introducing parallel elasticity in the hardware design endows quadrupedal robots with the ability to perform explosive and efficient motions. However, for this kind of articulated soft quadruped, realizing dynamic jumping that is robust against system uncertainties remains a challenging problem. To achieve this, we propose an impact-aware jumping planning and control approach. Specifically, an offline kino-dynamic trajectory optimizer is first formulated to achieve compliant 3D jumping motions, using a novel actuated spring-loaded inverted pendulum (SLIP) model. Then, an optimization-based online landing strategy, including pre-impact leg motion modulation in the air and post-impact landing recovery after touch-down, is designed. The actuated SLIP model, with the capability of explicitly characterizing parallel elasticity, captures the jumping and landing dynamics, making the problem of motion generation/regulation more tractable. Finally, a hybrid torque control consisting of a feedback tracking loop and a feedforward compensation loop is employed for motion control. Experiments demonstrate the ability to accomplish robust 3D jumping motions with stable landing and recovery.
09:20-09:30, Paper SuO_1P.3
NAS: N-Step Computation of All Solutions to the Footstep Planning Problem
Wang, Jiayi | The University of Edinburgh
Samadi, Saeid | University of Edinburgh
Wang, Hefan | University of Edinburgh
Fernbach, Pierre | LAAS-CNRS
Stasse, Olivier | LAAS, CNRS
Vijayakumar, Sethu | University of Edinburgh
Tonneau, Steve | The University of Edinburgh
Keywords: Humanoid and Bipedal Locomotion, Motion and Path Planning, Legged Robots
Abstract: How many ways are there to climb a staircase in a given number of steps? Infinitely many, if we focus on the continuous aspect of the problem. A finite, possibly large number if we consider the discrete aspect, i.e., on which surface which effectors are going to step and in what order. We introduce NAS, an algorithm that considers both aspects simultaneously and computes all the possible solutions to such a contact planning problem, under standard assumptions. To our knowledge, NAS is the first algorithm to produce a globally optimal policy, efficiently queried in real time for planning the next footsteps of a humanoid robot. Our empirical results (in simulation and on the Talos platform) demonstrate that, despite the theoretical exponential complexity, optimisations reduce the practical complexity of NAS to a manageable bilinear form, maintaining completeness guarantees and enabling efficient GPU parallelisation. NAS is demonstrated on a variety of scenarios for the Talos robot, both in simulation and on the hardware platform. Future work will focus on further reducing computation times and extending the algorithm's applicability beyond gaited locomotion.
09:30-09:40, Paper SuO_1P.4
Online DNN-Driven Nonlinear MPC for Stylistic Humanoid Robot Walking with Step Adjustment
Romualdi, Giulio | Istituto Italiano Di Tecnologia
Viceconte, Paolo Maria | Lab0 SRL
Moretti, Lorenzo | Istituto Italiano Di Tecnologia
Sorrentino, Ines | Istituto Italiano Di Tecnologia
Dafarra, Stefano | Istituto Italiano Di Tecnologia
Traversaro, Silvio | Istituto Italiano Di Tecnologia
Pucci, Daniele | Italian Institute of Technology
Keywords: Humanoid and Bipedal Locomotion, Whole-Body Motion Planning and Control, Humanoid Robot Systems
Abstract: This paper presents a three-layered architecture that enables stylistic locomotion with online contact location adjustment. Our method combines an autoregressive Deep Neural Network (DNN) acting as a trajectory generation layer with model-based trajectory adjustment and trajectory control layers. The DNN produces centroidal and postural references serving as an initial guess and regularizer for the other layers. Because the DNN is trained on human motion capture data, the resulting robot motion exhibits locomotion patterns resembling a human walking style. The trajectory adjustment layer utilizes non-linear optimization to ensure dynamically feasible center of mass (CoM) motion while addressing step adjustments. We compare two implementations of the trajectory adjustment layer: one as a receding horizon planner (RHP) and the other as a model predictive controller (MPC). To enhance MPC performance, we introduce a Kalman filter to reduce measurement noise. The filter parameters are automatically tuned with a Genetic Algorithm. Experimental results on the ergoCub humanoid robot demonstrate the system's ability to prevent falls, replicate human walking styles, and withstand disturbances of up to 68 N.
09:40-09:50, Paper SuO_1P.5
Guiding Collision-Free Humanoid Multi-Contact Locomotion Using Convex Kinematic Relaxations and Dynamic Optimization
Gonzalez Bolivar, Carlos Isaac | The University of Texas at Austin
Sentis, Luis | The University of Texas at Austin
Keywords: Multi-Contact Whole-Body Motion Planning and Control, Collision Avoidance, Motion and Path Planning
Abstract: Humanoid robots rely on multi-contact planners to navigate a diverse set of environments, including those that are unstructured and highly constrained. To synthesize stable multi-contact plans within a reasonable time frame, most planners assume statically stable motions or rely on reduced-order models. However, these approaches can also render the problem infeasible in the presence of large obstacles or when operating near kinematic and dynamic limits. To address these limitations, we propose a new multi-contact framework that leverages recent advancements in relaxing collision-free path planning into a convex optimization problem, extending them to humanoid multi-contact navigation. Our approach generates near-feasible trajectories that are used as guides in a dynamic trajectory optimizer. We evaluate our approach in simulation, showcasing three different-sized humanoid robots traversing a high-raised naval knee-knocker door using the proposed framework. Our approach can generate motion plans consisting of several multi-contact states within a few seconds while ensuring dynamic feasibility in joint space.
09:50-10:00, Paper SuO_1P.6
Delay Robust Model Predictive Control for Whole-Body Torque Control of Humanoids
Subburaman, Rajesh | LAAS-CNRS
Stasse, Olivier | LAAS, CNRS
Keywords: Optimization and Optimal Control, Whole-Body Motion Planning and Control, Humanoid Robot Systems
Abstract: Whole-body model predictive control (WBMPC) is a powerful tool to generate complex robotic motions. Despite the recent increase in computational capabilities with new processors such as the Apple chipsets (M1 to M3) or GPUs, WBMPC algorithms need a significant amount of computation time. This induces delay, which, if not properly accounted for, can have detrimental effects on the controller's performance. This paper conducts a detailed study to understand the impact of delay on WBMPC and proposes an efficient solution to handle it effectively. In this regard, a whole-body control task is formulated as an optimal control problem and solved using the Crocoddyl library. Extensive numerical studies are carried out to understand the nature of the problem and thereby devise an effective solution. The proposed solution is found to be effective numerically, and it has been experimentally verified with the humanoid TALOS. Both numerical and experimental results are presented and discussed in this work to provide valuable insights.
SuI_1P Interactive, Foyer 850
Interactive Session 3
10:30-11:30, Paper SuI_1P.1
RL-Augmented MPC Framework for Agile and Robust Bipedal Footstep Locomotion Planning and Control
Bang, Seung Hyeon | University of Texas at Austin
Arribalzaga Jové, Carlos | Universitat Politècnica De Catalunya
Sentis, Luis | The University of Texas at Austin
Keywords: Humanoid and Bipedal Locomotion, Reinforcement Learning, Optimization and Optimal Control
Abstract: This paper proposes an online bipedal footstep planning strategy that combines model predictive control (MPC) and reinforcement learning (RL) to achieve agile and robust bipedal maneuvers. While MPC-based foot placement controllers have demonstrated their effectiveness in achieving dynamic locomotion, their performance is often limited by the use of simplified models and assumptions. To address this challenge, we develop a novel foot placement controller that leverages a learned policy to bridge the gap between the simplified model and the more complex full-order robot system. Specifically, our approach employs a unique combination of an ALIP-based MPC foot placement controller for sub-optimal footstep planning and a learned policy for refining footstep adjustments, enabling the resulting footstep policy to capture the robot's whole-body dynamics effectively. This integration synergizes the predictive capability of MPC with the flexibility and adaptability of RL. We validate the effectiveness of our framework through a series of experiments using the full-body humanoid robot DRACO 3. The results demonstrate significant improvements in dynamic locomotion performance, including better tracking of a wide range of walking speeds, reliable turning, and traversal of challenging terrains, while preserving the robustness and stability of the walking gaits compared to the baseline ALIP-based MPC approach.
10:30-11:30, Paper SuI_1P.2
Surena-V: A Humanoid Robot for Human-Robot Collaboration with Optimization-Based Control Architecture
Bazrafshani, Mohammad Ali | Center of Advanced Systems and Technologies (CAST), University of Tehran
Yousefi-Koma, Aghil | Faculty of Mechanical Engineering, University of Tehran
Amani, Amin | Center of Advanced Systems and Technologies (CAST), School of Mechanical Engineering
Maleki, Behnam | Center of Advanced Systems and Technologies (CAST), School of Mechanical Engineering
Batmani, Shahab | University of Tehran
Dehestani Ardakani, Arezoo | University of Tehran
Taheri, Sajedeh | University of Tehran
Yazdankhah, Parsa | University of Tehran
Nozari, Mahdi | University of Tehran
Mozayyan, Amin | University of Tehran
Naeini, Alireza | University of Tehran
Shafiee, Milad | EPFL
Vedadi, Amirhosein | University of Tehran
Keywords: Human-Robot Collaboration, Humanoid Robot Systems, Reactive and Sensor-Based Planning
Abstract: This paper presents Surena-V, a humanoid robot designed to enhance human-robot collaboration capabilities. The robot features a range of sensors, including barometric tactile sensors in its hands, to facilitate precise environmental interaction. This is demonstrated through an experiment showcasing the robot's ability to control a medical needle's movement through soft material. Surena-V's operational framework emphasizes stability and collaboration, employing various optimization-based control strategies such as Zero Moment Point (ZMP) modification through upper body movement and stepping. Notably, the robot's interaction with the environment is improved by detecting and interpreting external forces at their point of effect, allowing for more agile responses compared to methods that control overall balance based on external forces. The efficacy of this architecture is substantiated through an experiment illustrating the robot's collaboration with a human in moving a bar. This work contributes to the field of humanoid robotics by presenting a comprehensive system design and control architecture focused on human-robot collaboration and environmental adaptability.
10:30-11:30, Paper SuI_1P.3
Reliability of Single-Level Equality-Constrained Inverse Optimal Control
Becanovic, Filip | University of Belgrade
Jovanovic, Kosta | University of Belgrade
Bonnet, Vincent | University Paul Sabatier
Keywords: Optimization and Optimal Control, Learning from Demonstration, Modeling and Simulating Humans
Abstract: Inverse optimal control (IOC) allows the retrieval of optimal cost function weights, or behavioral parameters, from human motion. The literature on IOC uses methods that are either based on a slow bilevel process or on a fast but noise-sensitive minimization of optimality conditions. Assuming equality-constrained optimal control models of human motion, this article presents a faster approach to solving IOC, based on a single-level reformulation of the bilevel method, that yields equivalent results. Through numerical experiments in simulation, we compare the robustness to noise of the proposed single-level reformulation with that of the bilevel IOC formulation on a human-like planar reaching task used across recent studies. The approach shows resilience to very large levels of noise and reduces the computation time of IOC on this task by a factor of 15 compared to a classical bilevel implementation.
10:30-11:30, Paper SuI_1P.4
MEVIUS: A Quadruped Robot Easily Constructed through E-Commerce with Sheet Metal Welding and Machining
Kawaharazuka, Kento | The University of Tokyo
Inoue, Shintaro | The University of Tokyo
Suzuki, Temma | The University of Tokyo
Yuzaki, Sota | The University of Tokyo
Sawaguchi, Shogo | The University of Tokyo
Okada, Kei | The University of Tokyo
Inaba, Masayuki | The University of Tokyo
Keywords: Hardware-Software Integration in Robotics, Legged Robots, Software-Hardware Integration for Robot Systems
Abstract: Quadruped robots that individual researchers can build by themselves are crucial for expanding the scope of research due to their high scalability and customizability. These robots must be easy to order and assemble through e-commerce or DIY methods, have a low number of components for easy maintenance, and be durable enough to withstand experiments in diverse environments. Various quadruped robots have been developed so far, but most that can be built by research institutions are relatively small and made of plastic using 3D printers. Such robots cannot withstand experiments in external environments such as mountain trails or rubble, and they easily break under intense movements. Although being able to print parts yourself is an advantage, the large number of components makes replacing broken parts and maintenance very cumbersome. Therefore, in this study, we develop a metal quadruped robot, MEVIUS, that can be constructed and assembled using only materials ordered through e-commerce. We have considered the minimum set of components required for a quadruped robot, employing metal machining, sheet metal welding, and off-the-shelf components only. We have also achieved a simple circuit and software configuration. Considering the communication delay due to this simple configuration, we experimentally demonstrate that MEVIUS, utilizing reinforcement learning and Sim2Real, can traverse diverse rough terrains and withstand outdoor experiments. All hardware and software components can be obtained from https://github.com/haraduka/mevius.
10:30-11:30, Paper SuI_1P.5
Diffusion-Based Learning of Contact Plans for Agile Locomotion
Dhédin, Victor | Technical University of Munich
Chinnakkonda Ravi, Adithya Kumar | Max Planck Institute for Intelligent Systems
Jordana, Armand | New York University
Zhu, Huaijiang | New York University
Meduri, Avadesh | New York University
Righetti, Ludovic | New York University
Schölkopf, Bernhard | Max Planck Institute for Intelligent Systems
Khadiv, Majid | Technical University of Munich
Keywords: Legged Robots, Multi-Contact Whole-Body Motion Planning and Control, Imitation Learning
Abstract: Legged robots have become capable of performing highly dynamic maneuvers in the past few years. However, agile locomotion in highly constrained environments such as stepping stones is still a challenge. In this paper, we propose a combination of model-based control, search, and learning to design efficient control policies for agile locomotion on stepping stones. In our framework, we use nonlinear model predictive control (NMPC) to generate whole-body motions for a given contact plan. To efficiently search for an optimal contact plan, we propose to use Monte Carlo tree search (MCTS). While the combination of MCTS and NMPC can quickly find a feasible plan for a given environment (a few seconds), it is not yet suitable to be used as a reactive policy. Hence, we generate a dataset of optimal contact plans for a given scene and learn a goal-conditioned policy through supervised learning. In particular, we leverage the power of diffusion models in handling multi-modality in the dataset. We test our proposed framework on a scenario where our quadruped robot Solo12 successfully jumps to different goals in a highly constrained environment.
10:30-11:30, Paper SuI_1P.6
Imitation of Human Motion Achieves Natural Head Movements for Humanoid Robots in an Active-Speaker Detection Task
Ding, Bosong | Tilburg University
Kirtay, Murat | Tilburg University
Spigler, Giacomo | Tilburg University
Keywords: Gesture, Posture and Facial Expressions, Human and Humanoid Motion Analysis and Synthesis, Datasets for Human Motion
Abstract: Head movements are crucial for social human-human interaction. They can transmit important cues (e.g., joint attention, speaker detection) that cannot be achieved with verbal interaction alone. This advantage also holds for human-robot interaction. Even though modeling human motions through generative AI models has become an active research area within robotics in recent years, the use of these methods for producing head movements in human-robot interaction remains underexplored. In this work, we employed a generative AI pipeline to produce human-like head movements for a Nao humanoid robot. In addition, we tested the system on a real-time active-speaker tracking task in a group conversation setting. Overall, the results show that the Nao robot successfully imitates human head movements in a natural manner while actively tracking the speakers during the conversation. Code and data from this study are available at https://bosongding.github.io/Humanoids2024Air-Lab/.
10:30-11:30, Paper SuI_1P.7
ARI Humanoid Robot Imitates Human Gaze Behaviour Using Reinforcement Learning in Real-World Environments
Ghamati, Khashayar | University of Hertfordshire
Zaraki, Abolfazl | University of Hertfordshire
Amirabdollahian, Farshid | The University of Hertfordshire
Keywords: Reinforcement Learning, Humanoid Robot Systems, Social HRI
Abstract: This paper presents a novel approach to enhance the social interaction capabilities of the ARI humanoid robot using reinforcement learning. We focus on enabling ARI to imitate human attention/gaze behaviour by identifying salient points in dynamic environments, employing the Zero-Shot Transfer technique combined with domain randomisation and generalisation. Our methodology uses the Proximal Policy Optimisation algorithm, training the reinforcement learning agent in a simulated environment to maximise robustness in real-world scenarios. We demonstrated the efficacy of our approach by deploying the trained agent on the ARI humanoid and validating its performance in human-robot interaction scenarios. The results indicated that using the developed model, ARI can successfully identify and respond to salient points, exhibiting human-like attention/gaze behaviours, which is an important step towards acceptability and efficiency in human-robot interactions. This research contributes to advancing the capabilities of social robots in dynamic and unpredictable environments, highlighting the potential of combining Zero-Shot Transfer with domain randomisation and generalisation for robust real-world applications.
10:30-11:30, Paper SuI_1P.8
Compliant Contacts Balance-Force Controller for Legged Robots
Hamze, Marwan | Tokyo University of Science
Benallegue, Mehdi | AIST Japan
Cisneros Limon, Rafael | National Institute of Advanced Industrial Science and Technology
Benallegue, Abdelaziz | University of Versailles St Quentin En Yvelines
Keywords: Body Balancing, Dynamics, Contact Modeling
Abstract: In this paper, we propose a controller for legged robots that takes into account individual contact compliance. This controller achieves both balance stabilization and force control in the same task-space loop, thanks to an explicit contact flexibility model and simplified centroidal dynamics that allow exploiting the redundancy of the kinematic and force feedback while making multiple contacts with the environment. The control problem is formulated as an LQR based on a linearization of the reduced non-linear model. The controller's robustness to modeling errors and external perturbations has been tested in experiments on the position-controlled robot HRP-2KAI, in both legged-contact and multi-contact modes.
10:30-11:30, Paper SuI_1P.9
Insole-Type Walking Support Device Equipped with a Control Method to Eliminate Rattling
Hirota, Ryuichi | Aoyama Gakuin University
Ishii, Yuta | Aoyama Gakuin University
Yoshihara, Masahiro | Aoyama Gakuin University
Itami, Taku | Meiji University
Iwase, Masakatsu | AISIN CORPORATION
Oi, Yoichi | AISIN CORPORATION
Ebisu, Koji | AISIN CORPORATION
Aoki, Takaaki | Gifu University
Keywords: Prosthetics and Exoskeletons, Actuation and Joint Mechanisms, Wearable Robotics
Abstract: Inducing proper ankle joint alignment at heel contact is important in the gait cycle for smooth weight transfer and reduced load on the knees and hips. An insole-type device that brings the ankle angle to neutral during heel contact has therefore been developed. However, backlash in the drive mechanism caused rattling at heel contact, degrading the accuracy of the controlled angle and causing discomfort. In this study, we propose an insole-type walking support device equipped with a control method to eliminate this rattling. Its effectiveness was validated through functionality and durability experiments, which confirmed that the ankle angle could be controlled within 375 ms with an accuracy of 0.1 deg or better, and that there were no durability problems. In addition, a foot-stamping experiment with one healthy male showed that use of the device improved knee alignment.
10:30-11:30, Paper SuI_1P.10
Real-Time Detailed Self-Collision Avoidance in Whole-Body Model Predictive Control
Jin, Takanori | National Institute of Informatics/SOKENDAI
Kobayashi, Taisuke | National Institute of Informatics
Doi, Masahiro | Toyota Motor Corporation
Keywords: Whole-Body Motion Planning and Control, Humanoid Robot Systems, Collision Avoidance
Abstract: This paper proposes a novel self-collision avoidance (SCA) scheme for whole-body model predictive control (WB-MPC). Since WB-MPC solves a large-scale optimization problem, which grows with the robot's degrees of freedom, the derivatives of the dynamics and cost functions must be computed as fast as possible. Because SCA using detailed collision bodies is computationally expensive, embedding SCA in WB-MPC under such a requirement is challenging. One way to address this open issue is to approximate the robot model with primitive shapes, but this accumulates modeling errors. To make derivative calculations of a detailed collision model fast enough for real time, we develop a frame-distance predictor using a deep neural network (DNN). The DNN-based frame-distance predictor estimates the minimum distance between all body frames from the whole-body joint angles. The estimated minimum distance can be embedded into WB-MPC as one of the cost functions in a differentiable manner. As a result, WB-MPC with the DNN-based frame-distance predictor can plan long-term self-collision-aware motions in real time. The proposed method is evaluated using a dual-arm robot in both simulation and a real environment. The results show that the proposed method achieves more accurate tracking motions than the approximated collision model while satisfying SCA.
10:30-11:30, Paper SuI_1P.11
Accessibility in Senior-Robot Interactions within Care Homes
Karhu, Natalia | Tampere University
Ahtinen, Aino | Tampere University
Siirtola, Harri | Tampere University
Chowdhury, Aparajita | Tampere University
Valokivi, Heli | University of Jyväskylä
Kiuru, Hilla | University of Jyväskylä
Raisamo, Roope | Tampere University
Keywords: Human-Centered Robotics, Social HRI, Multi-Modal Perception for HRI
Abstract: Socially Assistive Robots (SARs) integrate technology with social interaction to provide personalized assistance, especially in healthcare and social care settings. Seniors can use SARs to enhance their understanding of robots by interacting with them and learning about their different features and capabilities. However, ensuring effective senior-robot interactions necessitates addressing the specific needs and accessibility requirements of all users, an area that remains under-explored in Human-Robot Interaction (HRI). This research involved conducting one-hour interactive sessions in two care homes (N=12), where seniors engaged with SoftBank Robotics’ Pepper robot for playing bingo and LuxAI’s QTrobot for discussing data privacy. Following these senior-robot interactions, group interviews were conducted, and all sessions were video recorded for analysis. The study aimed to identify the key attributes that enhance the comprehensibility of the robot’s speech, and to assess the impact of combining speech and supporting pictures on senior-robot interactions. The study highlighted the importance of optimizing the robot’s speech characteristics, such as speed, volume, vocabulary, and pitch, to ensure better understanding and more effective communication. Additionally, findings indicated that supplementing speech with supporting pictures enhances seniors’ task independence.
10:30-11:30, Paper SuI_1P.12
Design and Evaluation of Finger-Operated Teleimpedance Interface Enabling Simultaneous Control of 3D Aspects of Stiffness Ellipsoid
Kraakman, Frank | Delft University of Technology
Peternel, Luka | Delft University of Technology
Keywords: Compliance and Impedance Control, Telerobotics and Teleoperation, Design and Human Factors
Abstract: In this paper, we present the design and evaluation of a novel finger-operated teleimpedance interface for commanding stiffness ellipsoids to a remote robot. The proposed interface provides a practical alternative to the state-of-the-art teleimpedance interfaces based on physiological signals, which can be impractical in daily use. On the other hand, as opposed to existing practical interfaces that are limited in controlled degrees of freedom, the proposed interface enables control of 3D aspects of the ellipsoid. The remote robot stiffness ellipsoid is controlled with a single hand using the thumb, index, and middle fingers to operate two scroll wheels, a joystick, and a force sensor. Combinations of these inputs can be mapped to control different aspects of the stiffness ellipsoid, i.e., its orientation and shape/size. To investigate different modes of input mapping, we performed a human factors experiment to evaluate the performance and user acceptance of the proposed interface modes. The results of the experiments indicate that participants can successfully operate the interface to complete 3D stiffness configuration alignment tasks in different modes. To further demonstrate the functionality of the proposed teleimpedance interface, we performed an additional experiment using a Force Dimension Sigma7 haptic device to control the motion of a KUKA LBR iiwa robotic arm while performing a complex physical interaction task.
10:30-11:30, Paper SuI_1P.13
Compact Multi-Object Placement Using Adjacency-Aware Reinforcement Learning
Kreis, Benedikt | University of Bonn
Dengler, Nils | University of Bonn
de Heuvel, Jorge | University of Bonn
Menon, Rohit | University of Bonn
Perur, Hamsa Datta | University of Bonn
Bennewitz, Maren | University of Bonn
Keywords: Manipulation Planning, Reinforcement Learning
Abstract: Close and precise placement of irregularly shaped objects requires a skilled robotic system. The manipulation of objects that have sensitive top surfaces and a fixed set of neighbors is particularly challenging. To avoid damaging the surface, the robot has to grasp them from the side, and during placement, it has to maintain the spatial relations with adjacent objects, while considering the physical gripper extent. In this work, we propose a framework to learn an agent based on reinforcement learning that generates end-effector motions for placing objects as closely as possible to one another. During the placement, our agent considers the spatial constraints with neighbors defined in a given layout of the objects while avoiding collisions. Our approach learns to place compact object assemblies without the need for predefined spacing between objects, as required by traditional methods. We thoroughly evaluated our approach using a two-finger gripper mounted on a robotic arm with six degrees of freedom. The results demonstrate that our agent significantly outperforms two baseline approaches in object assembly compactness, thereby reducing the space required to position the objects while adhering to specified spatial constraints.
|
|
10:30-11:30, Paper SuI_1P.14 | Add to My Program |
SUSTAINA-OP2™: Customizable Kid-Sized Open Hardware Platform Humanoid Robot for Research and Competition |
|
Kubotera, Masato | Chiba Institute of Technology |
Hayashibara, Yasuo | Chiba Institute of Technology |
Keywords: Education Robotics, Product Design, Development and Prototyping
Abstract: This research focuses on developing a humanoid robot hardware platform that improves accessibility by allowing most mechanical and electronic components to be purchased from e-commerce sites or obtained as custom-made parts through external services. The SUSTAINA-OP™ series, developed through research and competition activities in the RoboCup Humanoid League, is designed as a robust robot using metal and FRP and has influenced the development of various robots in RoboCup and other international competitions. The public availability of design data allows researchers to efficiently build and customize robots for diverse applications. The successor model, SUSTAINA-OP2™, adopts official developer kits for its computer boards and sensor modules to reduce the burden of software development and features new actuators that further enhance the robot’s robustness. All design data is available at github.com/SUSTAINA-OP2. The SUSTAINA-OP2™ was evaluated at RoboCup 2024, held in Eindhoven in July 2024, where it secured first place in the Humanoid KidSize class soccer competition.
|
|
10:30-11:30, Paper SuI_1P.15 | Add to My Program |
On the Development of a Multi-Modal, Selectively Lockable, Compact, Affordable Knee Joint Assembly for Bipedal Robots |
|
Liu, Cheng-Yueh | University of Canterbury |
Dhupia, Jaspreet Singh | The University of Auckland |
Liarokapis, Minas | The University of Auckland |
Lin, Pei-Chun | National Taiwan University |
Keywords: Actuation and Joint Mechanisms, Passive Walking, Legged Robots
Abstract: Contemporary advances in humanoid bipeds include complicated mechanisms supporting fully actuated robots. While these solutions offer unprecedented versatility that relies on sophisticated locomotion systems, the efficient traversing capabilities of natural, passive-inspired gait may be overlooked. In this paper, we focus on the development of a disengageable, selectively lockable, affordable knee joint assembly suitable for either active or underactuated biped operation. The multi-modal nature of the design facilitates actuated bidirectional rotation as well as disengaged operation, while the knee lock ensures the legs remain rigid during the stance phase. Experimental validation of the knee assembly demonstrated controllable knee joint actuation, fully passive rotation, the various modes of operation, and its torque performance. Further investigations include validation testing of operational and locking performance on a physical bipedal robot. System parameter optimizations can reduce energy consumption. The design proved feasible and is open-sourced to facilitate replication and co-development with others.
|
|
10:30-11:30, Paper SuI_1P.16 | Add to My Program |
A Biomechanics-Inspired Approach to Soccer Kicking for Humanoid Robots |
|
Marew, Daniel | University of Massachusetts Amherst |
Perera, Kankanige Nisal Minula | University of Massachusetts Amherst |
Yu, Shangqun | University of Massachusetts Amherst |
Roelker, Sarah | University of Massachusetts Amherst |
Kim, Donghyun | University of Massachusetts Amherst |
Keywords: Bioinspired Robot Learning, Imitation Learning, Legged Robots
Abstract: Soccer kicking is a complex whole-body motion that requires intricate coordination of various motor actions. To accomplish such dynamic motion in a humanoid robot, the robot needs to simultaneously: 1) transfer high kinetic energy to the kicking leg, 2) maintain balance and stability of the entire body, and 3) manage the impact disturbance from the ball during the kicking moment. Prior studies on robotic soccer kicking often prioritized stability, leading to overly conservative quasi-static motions. In this work, we present a biomechanics-inspired control framework that leverages trajectory optimization and imitation learning to facilitate highly dynamic soccer kicks in humanoid robots. We conducted an in-depth analysis of human soccer kick biomechanics to identify key motion constraints. Based on this understanding, we designed kinodynamically feasible trajectories that are then used as a reference in imitation learning to develop a robust feedback control policy. We demonstrate the effectiveness of our approach through a simulation of an anthropomorphic 25 DoF bipedal humanoid robot, named PresToe, which is equipped with 7 DoF legs, including a unique actuated toe. Using our framework, PresToe can execute dynamic instep kicks, propelling the ball at speeds exceeding 11 m/s in full dynamic simulation.
|
|
10:30-11:30, Paper SuI_1P.17 | Add to My Program |
Real-Time Feedback on Older Adults Exercise: A Socially Assistive Robot Coaching System |
|
Nunez Sardinha, Emanuel | Bristol Robotics Lab, University of the West of England |
Sarajchi, Mohammadhadi | University of Kent |
Xu, Kyle | University of Bristol |
Insuasty Pineda, Maria | Faculty of Rehabilitation Medicine, University of Alberta |
Cifuentes, Carlos A. | University of the West of England, Bristol |
Munera, Marcela | University of the West of England |
Keywords: Social HRI
Abstract: Physical exercise is crucial for promoting and maintaining the health of middle-aged and older adults. As the elderly population grows, effective coaching methods are increasingly necessary. Most current coaching systems lack clear reactive and objective feedback options. This study introduces a multimodal system featuring a socially assistive robot that guides individuals through exercise routines while providing encouragement and feedback. The system includes a heart rate sensor and physical motion monitoring, allowing the robot to offer real-time suggestions based on user performance. A graphical interface mirrors the user’s movements and displays information on heart rate, kinematic data, scores, and next steps. This study evaluates the performance of two robots (NAO and Pepper). It investigates whether different robot embodiments—such as physical appearance and size—influence users’ perceptions through a within-subjects approach. Questions from the Robotic Social Attributes Scale (RoSAS) assessed users’ feelings of competence, warmth, and discomfort towards the robots. Elements from the Unified Theory of Acceptance and Use of Technology (UTAUT) measured performance expectancy, effort expectancy, and social influence. Nineteen adults aged over 40 completed a series of upper-limb exercises. Both robots effectively communicated the exercises and corrected participants’ movements. Participants found the system engaging and anticipated that it would be well-received by others, regardless of the robot used.
|
|
10:30-11:30, Paper SuI_1P.18 | Add to My Program |
Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models |
|
Obinata, Yoshiki | The University of Tokyo |
Jia, Haoyu | The University of Tokyo |
Kawaharazuka, Kento | The University of Tokyo |
Kanazawa, Naoaki | The University of Tokyo |
Okada, Kei | The University of Tokyo |
Keywords: Robotics and Automation in Life Sciences, Telerobotics and Teleoperation, Semantic Scene Understanding
Abstract: Robot systems capable of executing tasks based on language instructions have been actively researched. However, it is challenging to convey to the robot, in a single language instruction, uncertain information that can only be determined on-site. In this study, we propose a system that includes ambiguous parts as template variables in language instructions to communicate the information to be collected and the options to be presented to the robot for predictable uncertain events. This study implements prompt generation for each robot action function based on template variables to collect information, and a feedback system for presenting and selecting options based on template variables for user-to-robot communication. The effectiveness of the proposed system was demonstrated through its application to real-life support tasks performed by the robot.
|
|
10:30-11:30, Paper SuI_1P.19 | Add to My Program |
Enhancing Exoskeleton Transparency with Motion Prediction: An Experimental Study |
|
Oliveira Souza, Alexandre | INRIA |
Grenier, Jordane | Safran Electronics & Defense |
Charpillet, Francois | Université De Lorraine, CNRS, Inria, LORIA, F-54000 Nancy, France |
Ivaldi, Serena | INRIA |
Maurice, Pauline | CNRS - LORIA |
Keywords: Wearable Robotics, Intention Recognition, Physical Human-Robot Interaction
Abstract: Controlling active exoskeletons for occupational assistance is a challenge. Unlike for rehabilitation exoskeletons, electromyography (EMG) sensors can hardly be used for control in an industrial environment. The control of assistive exoskeletons therefore needs to rely on onboard sensors to follow the human and assist when needed. This study explores the use of motion prediction to enhance exoskeleton control in the absence of payloads. When no payloads are involved, the exoskeleton should be transparent, meaning that the interaction forces between the exoskeleton and the user should be minimal. We conducted an experiment using a 3D-printed active elbow exoskeleton and compared exoskeleton control methodologies based on dynamic modeling and human motion prediction. Fifteen participants performed a repetitive pointing task under a baseline, two non-predictive controllers, and two predictive controllers. The results demonstrated a significant reduction in interaction forces—up to 45%—with predictive controllers compared to non-predictive controllers. While motion prediction enhanced exoskeleton transparency, the force magnitudes in this study were small, so users could hardly discern the improvement. Future research will investigate motion prediction for exoskeleton control in the context of load-handling assistance.
|
|
10:30-11:30, Paper SuI_1P.20 | Add to My Program |
Low-Cost and Easy-To-Build Soft Robotic Skin for Safe and Contact-Rich Human-Robot Collaboration |
|
Park, Kyungseo | Daegu Gyeongbuk Institute of Science and Technology (DGIST) |
Shin, Kazuki | University of Illinois at Urbana-Champaign |
Yamsani, Sankalp | University of Illinois Urbana-Champaign |
Gim, Kevin | University of Illinois, Urbana-Champaign |
Kim, Joohyung | University of Illinois at Urbana-Champaign |
Keywords: Force and Tactile Sensing, Physical Human-Robot Interaction, Robot Safety, Additive Manufacturing
Abstract: Although many soft robotic skins have been introduced, their use has been hindered by practical limitations such as difficulties in manufacturing, poor accessibility, and cost inefficiency. To address this, we present a low-cost, easy-to-build soft robotic skin utilizing air-pressure sensors and 3D-printed pads. In our approach, we utilized digital fabrication and ROS to facilitate the creation and use of the robotic skin. The skin pad was fabricated by printing thermoplastic urethane (TPU) and post-processed with an organic solvent to ensure air-tightness. Each pad consists of a TPU shell and infill, so the internal air pressure changes in response to tactile stimuli such as force and vibration. The internal pressure is measured and processed by a microcontroller and transmitted to a PC via a serial bus. We conducted experiments to investigate the characteristics of the skin pads, and the results showed that the developed robotic skins are capable of perceiving interaction forces and dynamic stimuli. Finally, we developed dedicated soft robotic skins for our custom in-house robot and demonstrated safe and intuitive physical human-robot interaction.
|
|
10:30-11:30, Paper SuI_1P.21 | Add to My Program |
Wheeled Humanoid Bilateral Teleoperation with Position-Force Control Modes for Dynamic Loco-Manipulation |
|
Purushottam, Amartya | University of Illinois, Urbana-Champaign |
Yan, Huihan | University of Illinois at Urbana-Champaign |
Xu, Christopher | University of Illinois Urbana-Champaign |
Sim, Youngwoo | University of Illinois at Urbana-Champaign |
Ramos, Joao | University of Illinois at Urbana-Champaign |
Keywords: Telerobotics and Teleoperation, Mobile Manipulation, Whole-Body Motion Planning and Control
Abstract: Remote-controlled humanoid robots can revolutionize manufacturing, construction, and healthcare industries by performing complex or strenuous manual tasks traditionally done by humans. We refer to these behaviors as Dynamic Loco-Manipulation (DLM). To successfully complete these tasks, humans control the position of their bodies and contact forces at their hands. To enable similar whole-body control in humanoids, we introduce loco-manipulation retargeting strategies with switched position and force control modes in a bilateral teleoperation framework. Our proposed locomotion mappings use the pitch and yaw of the operator's torso to control robot position or acceleration. The manipulation retargeting maps the operator's arm movements to the robot's arms for joint-position or impedance control of the end-effector. A Human-Machine Interface captures the teleoperator's motion and provides haptic feedback to their torso, enhancing their awareness of the robot's interactions with the environment. In this paper, we demonstrate two forms of DLM. First, we show the robot slotting heavy boxes (5-10.5 kg), weighing up to 83% of the robot's weight, into desired positions. Second, we show human-robot collaboration for carrying an object, where the robot and teleoperator take on leader and follower roles.
|
|
10:30-11:30, Paper SuI_1P.22 | Add to My Program |
Designing a Haptic Interface for Enhanced Non-Verbal Human-Robot Interaction: Integrating Heart and Lung Emotional Feedback |
|
Saood, Adnan | ENSTA Paris - Institut Polytechnique De Paris |
Liu, Yang | Télécom Paris, IP Paris |
Zhang, Heng | ENSTA Paris, Institut Polytechnique De Paris |
Tapus, Adriana | ENSTA Paris, Institut Polytechnique De Paris |
Keywords: Haptics and Haptic Interfaces, Human-Centered Robotics, Social HRI
Abstract: Using non-verbal cues such as breathing and heartbeat signals as media for Human-Robot Interaction (HRI) enables subtle exchanges between humans and robots beyond explicit verbal or visual cues. This study describes the development of a novel pneumatic haptic interface that replicates human breathing and heartbeat animations for a humanoid robot and investigates its application for enhancing non-verbal HRI. The experimental results validate the effectiveness of this haptic device and highlight the significance of non-verbal cues in conveying emotional states. Introducing the haptic interface as a means for delivering involuntary, non-verbal emotional signals significantly enhances human-robot interaction compared to using gestures alone.
|
|
10:30-11:30, Paper SuI_1P.23 | Add to My Program |
Semi-Autonomous Teleimpedance Based on Visual Detection of Object Geometry and Material and Its Relation to Environment |
|
Siegemund, Georg | Technische Universität Berlin |
Díaz Rosales, Alejandro | CERN; Delft University of Technology |
Glodde, Arne | Technische Universität Berlin |
Dietrich, Franz | Technische Universität Braunschweig |
Peternel, Luka | Delft University of Technology |
Keywords: Telerobotics and Teleoperation, Compliance and Impedance Control, RGB-D Perception
Abstract: This paper presents a method for semi-autonomous teleimpedance where the control is shared between the human operator and the robot. The human commands the position of the teleoperated robotic arm end-effector while the robot autonomously adjusts the impedance depending on the object with which the end-effector interacts. We developed a vision system that calculates the appropriate robot stiffness based on the detected object geometry and material and the object's relation to the environment. This system uses an RGB-D camera near the robot's end-effector to capture different perspectives of the scene. To validate the proposed method, we conducted experiments on a teleoperation system where a Force Dimension Sigma7 haptic device was used to operate a KUKA LBR iiwa robotic arm. At the same time, an Intel RealSense D455 depth camera provided the visual input. We examined two practical tasks: engaging with bolts on a plate and polishing a stripe.
|
|
10:30-11:30, Paper SuI_1P.24 | Add to My Program |
Guidelines for Optimal Human Mesh Generation Using Deep Learning-Driven Avatar Reconstruction for Gait Analysis |
|
Stavrakakis, Odysseas | National Technical University of Athens |
Mastrogeorgiou, Athanasios | National Technical University of Athens |
Smyrli, Aikaterini | National Technical University of Athens, Athena Research Center |
Papadopoulos, Evangelos | National Technical University of Athens |
Keywords: Human and Humanoid Motion Analysis and Synthesis, Modeling and Simulating Humans, Humanoid and Bipedal Locomotion
Abstract: Gait analysis is essential in many scientific fields; to study it, marker-based or markerless motion capture (MoCap) techniques are used. The latter have significantly benefited from the recent rise of research in deep learning (DL) and its applications to human mesh generation. However, insufficient and suboptimal camera viewpoint selections often lead to low-grade human mesh geometries. This paper presents an approach to consistently obtain accurate human meshes using DL-based avatar reconstruction algorithms (ARAs). Our framework provides a systematic approach, utilizing a simulated environment to inform decisions on the number of cameras and their spatial configuration to achieve optimal reconstruction results. These results are enhanced through mesh evaluation, mesh alignment, and surface reconstruction to remove poorly formed geometries and artifacts. Additionally, we present a gait analysis tool, tested in simulation and reality, that detects gait phase changes, extracts significant human body joint angles, and recreates the animation of the gait cycle in 3D space. The proposed approach is open-source, adjustable, and applicable to various research contexts where gait analysis is essential.
|
|
10:30-11:30, Paper SuI_1P.25 | Add to My Program |
Fatigue Mitigation through Planning in Human-Robot Repetitive Co-Manipulation: Automatic Extraction of Relevant Action Sets |
|
Yaacoub, Aya | LORIA-CNRS |
Thomas, Vincent | LORIA - Universite De Lorraine |
Colas, Francis | Inria Nancy Grand Est |
Maurice, Pauline | CNRS - LORIA |
Keywords: Physical Human-Robot Interaction, Human-Centered Robotics, Task and Motion Planning
Abstract: Work-related musculoskeletal disorders (WMSDs) are among the most common injuries associated with industrial tasks. Repetitive tasks are a major WMSDs risk factor, because they load the same human joints over and over again. Collaborative robots can be used to induce movement variability in highly repetitive co-manipulation tasks by changing the position of the co-manipulated object through time, thereby distributing the physical load over different body parts and reducing fatigue accumulation. This is even more beneficial when long-term consequences of the robot actions are considered. However, selecting the optimal action within the continuous robot workspace is not compatible with the time constraints imposed by online planning in highly repetitive tasks, especially when the planning horizon increases. In this work, we therefore propose an approach to automatically extract, from the continuous workspace, a set of actions that combines two properties: planning speed (i.e. a reduced number of actions in the set) and the ability to induce a variety of fatigue distributions over the different human joints. The proposed approach combines a digital human simulation to estimate the fatigue induced by possible actions with a repeated short-term planning (greedy-based selection) phase that explores the fatigue space and simultaneously identifies optimal actions from a large space for each visited state. By retaining actions used in the short-term planning, this process makes it possible to extract a subset of relevant actions. We evaluate our approach in a simulated co-manipulation scenario, and show that the resulting action set robustly outperforms action sets extracted with benchmark methods, both in terms of planning time and human fatigue mitigation.
|
|
10:30-11:30, Paper SuI_1P.26 | Add to My Program |
KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments |
|
Younes, Abdelrahman | KIT |
Asfour, Tamim | Karlsruhe Institute of Technology (KIT) |
Keywords: Data Sets for Robotic Vision, Object Detection, Segmentation and Categorization, Perception for Grasping and Manipulation
Abstract: Despite the recent progress on 6D object pose estimation methods for robotic grasping, a substantial performance gap persists between the capabilities of these methods on existing datasets and their efficacy in real-world grasping and mobile manipulation tasks, particularly when robots rely solely on their monocular egocentric field of view (FOV). Existing real-world datasets primarily focus on table-top grasping scenarios, where a robot arm is placed in a fixed position and the objects are centralized within the FOV of fixed external camera(s). Assessing performance on such datasets may not accurately reflect the challenges encountered in everyday grasping and mobile manipulation tasks within kitchen environments, such as retrieving objects from higher shelves, sinks, dishwashers, ovens, refrigerators, or microwaves. To address this gap, we present KITchen, a novel benchmark designed specifically for estimating the 6D poses of objects located in diverse positions within kitchen settings. For this purpose, we recorded a comprehensive dataset comprising around 205k real-world RGBD images of 111 kitchen objects captured in two distinct kitchens, utilizing a humanoid robot with its egocentric perspectives. Subsequently, we developed a semi-automated annotation pipeline to streamline the labeling process of such datasets, resulting in the generation of 2D object labels, 2D object segmentation masks, and 6D object poses with minimal human effort. The benchmark, the dataset, and the annotation pipeline are publicly available upon acceptance at https://kitchen-dataset.github.io/KITchen.
|
|
SuO_2P Regular, Amphitheatre 450-850 |
Add to My Program |
Oral Session 4 |
|
|
Chair: Stasse, Olivier | LAAS, CNRS |
|
-, Paper SuO_2P.1 | Add to My Program |
Robots Can Multitask Too: Integrating a Memory Architecture and LLMs for Enhanced Cross-Task Robot Action Generation |
|
Ali, Hassan | University of Hamburg |
Allgeuer, Philipp | University of Hamburg |
Mazzola, Carlo | Istituto Italiano Di Tecnologia |
Belgiovine, Giulia | Istituto Italiano Di Tecnologia |
Kaplan, Burak Can | University of Hamburg |
Gajdošech, Lukáš | Comenius University |
Wermter, Stefan | University of Hamburg |
Keywords: Human-Robot Collaboration, Cognitive Control Architectures
Abstract: Large Language Models (LLMs) have been recently used in robot applications for grounding LLM common-sense reasoning with the robot's perception and physical abilities. In humanoid robots, memory also plays a critical role in fostering real-world embodiment and facilitating long-term interactive capabilities, especially in multi-task setups where the robot must remember previous task states, environment states, and executed actions. In this paper, we address incorporating memory processes with LLMs for generating cross-task robot actions, while the robot effectively switches between tasks. Our proposed dual-layered architecture features two LLMs, utilizing their complementary skills of reasoning and following instructions, combined with a memory model inspired by human cognition. Our results show a significant improvement in performance over a baseline across five robotic tasks, demonstrating the potential of integrating memory with LLMs for combining the robot's action and perception for adaptive task execution.
|
|
-, Paper SuO_2P.2 | Add to My Program |
TactileMemory: Multi-Fingered Simultaneous Shape and Pose Identification Using Contact Traces |
|
Abubucker, Mohammed Shameer | Bielefeld University |
Meier, Martin | Bielefeld University |
Haschke, Robert | Bielefeld University |
Ritter, Helge Joachim | Bielefeld University |
Keywords: Multifingered Hands, Incremental Learning
Abstract: We propose a model of Tactile Memory that integrates sequential touches to identify the shape and pose of an object. The memory also controls the next tactile action to approximately optimize its information gain at each step. The information fusion and the determination of the next tactile action are achieved with the aid of an ensemble of hypotheses, each of which represents a possible shape and pose of the object. We assume a first touch event has already occurred, and focus on the process of shape and pose identification. In order to minimize the number of tactile actions required, the proposed method combines: 1) Tactile Memory, a record of the tactile event history from multiple fingers of a Shadow Hand along with the contact traces and hand location; 2) the hypothesis ensemble, a distributed representation of the remaining shape and pose uncertainty; and 3) Explorative Tactile Actions, a set of tactile event-specific heuristics that create proposals for the hand location based on the tactile feedback. We analyze our approach in simulation and quantify its improvement in exploration over a baseline algorithm that does not use the contact traces. We also compare Explorative Tactile Actions with a baseline that uses random hand locations. Finally, we demonstrate our algorithm on a robot with a Shadow Hand to show that we can estimate the shape and pose of an object in about ten tactile actions.
|
|
-, Paper SuO_2P.3 | Add to My Program |
APriCoT: Action Primitives Based on Contact-State Transition for In-Hand Tool Manipulation |
|
Saito, Daichi | Tokyo Institute of Technology |
Kanehira, Atsushi | Microsoft |
Sasabuchi, Kazuhiro | Microsoft |
Wake, Naoki | Microsoft |
Takamatsu, Jun | Microsoft |
Koike, Hideki | Tokyo Institute of Technology |
Ikeuchi, Katsushi | Microsoft |
Keywords: In-Hand Manipulation, Multifingered Hands, Grasping
Abstract: In-hand tool manipulation is an operation that not only manipulates a tool within the hand (i.e., in-hand manipulation) but also achieves a grasp suitable for a task after the manipulation. This study aims to achieve an in-hand tool manipulation skill through deep reinforcement learning. Learning this skill is difficult because the manipulation requires (A) exploring long-term contact-state changes to achieve the desired grasp and (B) highly varied motions depending on the contact-state transition. (A) leads to sparsity of the reward for a successful grasp, and (B) requires an RL agent to explore widely within the state-action space to learn highly varied actions, leading to sample inefficiency. To address these issues, this study proposes Action Primitives based on Contact-state Transition (APriCoT). APriCoT decomposes the manipulation into short-term action primitives by describing the operation as a contact-state transition based on three action representations (detach, crossover, attach). In each action primitive, the fingers are required to perform short-term, similar actions. By training a policy for each primitive, we can mitigate the issues arising from (A) and (B). This study focuses on a fundamental operation as an example of in-hand tool manipulation: rotating an elongated object grasped with a precision grasp by half a turn to achieve the initial grasp. Experimental results demonstrated that our method succeeded in both the rotation and the achievement of the desired grasp, unlike existing studies. Additionally, the policy was found to be robust to changes in object shape.
|
|
-, Paper SuO_2P.4 | Add to My Program |
Adaptive Motion Planning for Multi-Fingered Functional Grasp Via Force Feedback |
|
Tian, Dongying | Dalian University of Technology, Shenyang Institute of Automation |
Lin, Xiangbo | Dalian University of Technology |
Sun, Yi | Dalian University of Technology |
Keywords: Multifingered Hands, Force and Tactile Sensing, Reinforcement Learning
Abstract: Enabling multi-fingered robots to grasp and manipulate objects with human-like dexterity is especially challenging during dynamic, continuous hand-object interactions. Closed-loop feedback control is essential for dexterous hands to dynamically fine-tune hand poses when performing precise functional grasps. This work proposes an adaptive motion planning method based on deep reinforcement learning that adjusts grasping poses according to real-time feedback from joint torques, from pre-grasp to goal grasp. We find that the multi-joint torques of the dexterous hand can sense object positions through contacts and collisions, enabling real-time adjustment of grasps to generate varying grasping trajectories for objects in different positions. In our experiments, the performance gap with and without force feedback reveals the important role of force feedback in adaptive manipulation. Our approach, utilizing force feedback, preliminarily exhibits human-like flexibility, adaptability, and precision.
|
|
-, Paper SuO_2P.5 | Add to My Program |
RoPotter: Toward Robotic Pottery and Deformable Object Manipulation with Structural Priors |
|
Yoo, Uksang | Carnegie Mellon University |
Hung, Adam Joshua | University of Michigan |
Francis, Jonathan | Bosch Center for Artificial Intelligence |
Oh, Jean | Carnegie Mellon University |
Ichnowski, Jeffrey | Carnegie Mellon University |
Keywords: Art and Entertainment Robotics, Perception for Grasping and Manipulation, Deep Learning in Grasping and Manipulation
Abstract: Humans are capable of continuously manipulating a wide variety of deformable objects into complex shapes. This is made possible by our intuitive understanding of material properties and mechanics of the object, for reasoning about object states even when visual perception is occluded. These capabilities allow us to perform diverse tasks ranging from cooking with dough to expressing ourselves with pottery-making. However, developing robot systems to robustly perform similar tasks remains challenging, as current methods struggle to effectively model volumetric deformable objects and reason about the complex behavior they typically exhibit. To study the robot systems and algorithms capable of deforming volumetric objects, we introduce a novel robot task of continuously deforming clay on a pottery wheel. We propose a pipeline for perception and pottery skill-learning, called RoPotter, wherein we demonstrate that structural priors specific to the task of pottery-making can be exploited to simplify the pottery skill-learning process. Namely, we can project the cross-section of the clay to a plane to represent the state of the clay, reducing dimensionality. We also demonstrate a mesh-based method of occluded clay state recovery, toward robot agents capable of continuously deforming clay. Our experiments show that by using the reduced representation with structural priors based on the deformation behaviors of the clay, RoPotter can perform the long-horizon pottery task with 44.4% lower final shape error compared to the state-of-the-art baselines. Supplemental materials, experiment data, and visualizations are available at https://robot-pottery.github.io.
|
|
-, Paper SuO_2P.6 | Add to My Program |
Vlimb: A Wire-Driven Wearable Robot for Bodily Extension, Balancing Powerfulness and Reachability |
|
Sawaguchi, Shogo | The University of Tokyo |
Suzuki, Temma | The University of Tokyo |
Miki, Akihiro | The University of Tokyo |
Kawaharazuka, Kento | The University of Tokyo |
Yuzaki, Sota | The University of Tokyo |
Yoshimura, Shunnosuke | The University of Tokyo |
Ribayashi, Yoshimoto | The University of Tokyo |
Okada, Kei | The University of Tokyo |
Inaba, Masayuki | The University of Tokyo |
Keywords: Wearable Robotics, Tendon/Wire Mechanism, Mechanism Design
Abstract: Numerous wearable robots have been developed to meet the demands of physical assistance and entertainment. These wearable robots range from body-enhancing types that assist human arms and legs to body-extending types that have extra arms. This study focuses specifically on wearable robots of the latter category, aimed at bodily extension. However, such robots have not yet achieved powerfulness and reachability equivalent to those of human limbs, limiting their application to entertainment and to manipulation tasks involving lightweight objects. Therefore, in this study, we develop a body-extending wearable robot, Vlimb, which is powerful enough to lift a human and can perform manipulation. Leveraging the advantages of tendon-driven mechanisms, Vlimb incorporates a wire routing mechanism capable of accommodating both delicate manipulation and robust lifting tasks. Moreover, by introducing a passive ring structure to overcome the limited reachability inherent in tendon-driven mechanisms, Vlimb achieves both powerfulness and reachability comparable to those of human limbs. This paper outlines the design methodology of Vlimb, conducts preliminary manipulation and lifting tasks, and verifies its effectiveness.
|
|
SuI_2P Interactive, Foyer 850 |
Add to My Program |
Interactive Session 4 |
|
|
|
-, Paper SuI_2P.1 | Add to My Program |
DecisioNova: An Open-Source Miniaturized Development Board for EIT-Based Robotic Skins |
|
Arezoomand, Arman | University of Toronto |
Baltzer, Heather | University of Toronto |
Azhari, Fae | University of Toronto |
Keywords: Soft Sensors and Actuators, Wearable Robotics, Performance Evaluation and Benchmarking
Abstract: Research and development on large-area robotic skins has seen considerable progress recently. By providing high-resolution, non-invasive imaging of conductivity changes in a material, electrical impedance tomography (EIT) has been pivotal in developing robotic skins that can detect strain, pressure, and other stimuli. Despite these advances, EIT-based robotic skins have been limited by bulky and complex electronics. In this paper, we introduce DecisioNova, a compact battery-powered development board for advanced sensor data acquisition and processing. DecisioNova enables fast and accurate EIT measurements, making EIT-based robotic skins viable for use in robots and prosthetic devices that demand compact and low-energy electronics. To examine DecisioNova's performance, we fabricated a hydrogel robotic skin and conducted both conventional and data-driven EIT for pressure mapping. We evaluated the system's performance in contact localization and force quantification. The system achieved a temporal resolution of 0.25 seconds, with data-driven EIT yielding a contact localization error of 0.57 mm over a 70×70 mm sensing area. Future work will focus on practical applications of DecisioNova as a wearable device for smart prostheses. Source files are available on GitHub: https://github.com/Decisionics/DecisioNova
|
|
15:00-16:00, Paper SuI_2P.2 | Add to My Program |
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing |
|
Bin, Teng | Harbin Engineering University |
Yao, Jianming | Guangdong University of Technology |
Lam, Tin Lun | The Chinese University of Hong Kong, Shenzhen |
Zhang, Tianwei | The University of Tokyo |
Keywords: Mapping, Visual-Inertial SLAM, Humanoid Robot Systems
Abstract: We present a novel algorithm for real-time planar semantic mapping tailored for humanoid robots navigating complex terrains such as staircases. Our method is adaptable to any odometry input and leverages GPU-accelerated processes for planar extraction, enabling the rapid generation of globally consistent semantic maps. We utilize an anisotropic diffusion filter on depth images to effectively minimize noise from gradient jumps while preserving essential edge details, enhancing the accuracy and smoothness of normal vector images. Both the anisotropic diffusion and the RANSAC-based plane extraction processes are optimized for parallel processing on GPUs, significantly enhancing computational efficiency. Our approach achieves real-time performance, processing single frames at rates exceeding 30 Hz, which enables swift and efficient plane extraction and map management. Extensive testing underscores the algorithm's capabilities in real-time scenarios and demonstrates its practical application in humanoid robot gait planning, significantly improving the robot's ability to navigate dynamic environments.
|
|
15:00-16:00, Paper SuI_2P.3 | Add to My Program |
Lower Limbs 3D Joint Kinematics Estimation from Force Plates Data and Machine Learning |
|
Chalabi, Kahina | Université De Paul Sabatier, LAAS-CNRS |
Adjel, Mohamed | LISSI, Université De Paris-Est Créteil |
Bousquet, Thomas | Université De Paul Sabatier, LAAS-CNRS |
Sabbah, Maxime | LAAS-CNRS |
Watier, Bruno | LAAS, CNRS, Université Toulouse 3 |
Bonnet, Vincent | University Paul Sabatier |
Keywords: Robotics and Automation in Life Sciences, Human Detection and Tracking
Abstract: This study investigated the possibility of using machine learning to estimate 3D lower-limb joint kinematics during a rehabilitation squat exercise from force plate data, which can be collected very simply outside of a laboratory and does not pose privacy issues. The proposed approach is based on a bidirectional Long Short-Term Memory (Bi-LSTM) network associated with a Multi-Layer Perceptron (MLP) model. The use of the MLP allows fast training and evaluation times. The model was trained and validated on nineteen healthy young volunteers using a stereophotogrammetric motion capture system to collect ground-truth data. Volunteers performed squats in normal conditions and using an ankle brace to simulate pathological motion. Additional loads were also added onto lower-limb segments to study the influence of atypical mass distribution. The root mean square differences between the estimated joint angles and those reconstructed with the stereophotogrammetric system were lower than 6 deg, with correlation coefficients higher than 0.9 on average. Furthermore, the inference time of the proposed approach was as low as 12 µs, paving the way for future reliable real-time measurement tools.
|
|
15:00-16:00, Paper SuI_2P.4 | Add to My Program |
Tailoring Solution Accuracy for Fast Whole-Body Model Predictive Control of Legged Robots |
|
Khazoom, Charles | Massachusetts Institute of Technology |
Hong, Seungwoo | MIT (Massachusetts Institute of Technology) |
Chignoli, Matthew | Massachusetts Institute of Technology |
Stanger-Jones, Elijah | Massachusetts Institute of Technology |
Kim, Sangbae | Massachusetts Institute of Technology |
Keywords: Legged Robots, Whole-Body Motion Planning and Control, Optimization and Optimal Control
Abstract: Thanks to recent advancements in accelerating nonlinear model predictive control (NMPC), it is now feasible to deploy whole-body NMPC at real-time rates for humanoid robots. However, enforcing inequality constraints in real time for such high-dimensional systems remains challenging due to the need for additional iterations. This paper presents an implementation of whole-body NMPC for legged robots that provides low-accuracy solutions to NMPC with general equality and inequality constraints. Instead of aiming for highly accurate optimal solutions, we leverage the alternating direction method of multipliers to rapidly provide low-accuracy solutions to quadratic programming subproblems. Our extensive simulation results indicate that real robots often cannot benefit from highly accurate solutions due to dynamics discretization errors, inertial modeling errors, and delays. We incorporate control barrier functions (CBFs) at the initial timestep of the NMPC for the self-collision constraints, resulting in up to a 26-fold reduction in the number of self-collisions without adding computational burden. The controller is reliably deployed on hardware at 90 Hz for a problem involving 32 timesteps, 2004 variables, and 3768 constraints. The NMPC delivers sufficiently accurate solutions, enabling the MIT Humanoid to plan complex crossed-leg and arm motions that enhance stability when walking and recovering from significant disturbances.
|
|
15:00-16:00, Paper SuI_2P.5 | Add to My Program |
Network-Aware Shared Autonomy in Bilateral Teleoperation |
|
Chen, Xiao | Technical University of Munich |
Michel, Youssef | Technical University of Munich |
Sadeghian, Hamid | Technical University of Munich |
Haddadin, Sami | Technical University of Munich |
Keywords: Telerobotics and Teleoperation, Networked Robots, Autonomous Agents
Abstract: In this paper, an autonomy allocation approach is proposed for bilateral teleoperation systems to improve user performance under suboptimal communication networks. To achieve this, time-varying communication quality metrics such as delay and jitter are continuously monitored on the teleoperated side, and the autonomy level is dynamically adjusted based on the communication network quality. An autonomous agent is also deployed on the teleoperated side, leveraging pre-existing task knowledge for shared autonomy. Additionally, a time-domain passivity approach is employed to maintain communication channel passivity, mitigating the impact of adverse network behavior on task performance. The proposed approach is validated through extensive experiments and user studies, and the results show that our approach significantly improved the performance of the subjects (p < 0.01).
|
|
15:00-16:00, Paper SuI_2P.6 | Add to My Program |
Shape-Changing Soft Robotic Skin with Vision-Based Tactile Sensing for Human-Robot Interaction |
|
Dam, Phuong Nam | Japan Advanced Institute of Science and Technology
Luu, Quan | Purdue University |
Ho, Van | Japan Advanced Institute of Science and Technology |
Keywords: Physical Human-Robot Interaction, Touch in HRI, Soft Robot Materials and Design
Abstract: Recent developments of vision-based tactile sensors primarily focus on small-sized and fixed-structured robot bodies, while the application of this sensing technique to whole-body, shape-changing robots remains a challenge. To address this problem, this study introduces a soft robot with a changeable structure and integrated whole-body tactile sensing. The proposed robot system features a compact mechanism for adjusting the robot's height and a camera system for vision-based tactile sensing, which are collectively covered by a marker-embedded soft robot skin. Additionally, a data-driven tactile sensing approach is developed to enable the robot to detect contacts across the entire robot body at varying heights. The preliminary results and showcases demonstrate the promise of the proposed system in facilitating robot operations in unstructured environments and enhancing human-robot interaction scenarios.
|
|
15:00-16:00, Paper SuI_2P.7 | Add to My Program |
An Underactuated Active Transfemoral Prosthesis with Series Elastic Actuators Enables Multiple Locomotion Tasks |
|
Fagioli, Ilaria | Scuola Superiore Sant'Anna |
Lanotte, Francesco | Northwestern University |
Fiumalbi, Tommaso | Scuola Superiore Sant'Anna |
Baldoni, Andrea | Istituto Di Biorobotica |
Mazzarini, Alessandro | Scuola Superiore Sant'Anna |
Dell'Agnello, Filippo | Scuola Superiore Sant'Anna |
Eken, Huseyin | Sant’Anna School of Advanced Studies |
Papapicco, Vito | SSSA |
Ciapetti, Tommaso | IRCSS Fondazione Don Carlo Gnocchi |
Maselli, Alessandro | Dipartimento Delle Professioni Tecnico Sanitarie, Della Riabilit |
Macchi, Claudio | IRCSS Fondazione Don Carlo Gnocchi |
Dalmiani, Sofia | Scuola Superiore Sant'Anna |
Davalli, Angelo | INAIL Prosthesis Center |
Gruppioni, Emanuele | INAIL Prosthesis Center |
Trigili, Emilio | Scuola Superiore Sant'Anna |
Crea, Simona | Scuola Superiore Sant'Anna, the BioRobotics Institute |
Vitiello, Nicola | Scuola Superiore Sant Anna |
Keywords: Prosthetics and Exoskeletons, Underactuated Robots, Wearable Robots, Series Elastic Actuator (SEA)
Abstract: Robotic lower-limb prostheses have the power to revolutionize mobility by enhancing gait efficiency and facilitating movement. While several design approaches have been explored to create lightweight and energy-efficient devices, the potential of underactuation remains largely untapped in lower-limb prosthetics. Taking inspiration from the natural harmony of walking, we have developed an innovative active transfemoral prosthesis. By incorporating underactuation, our design uses a single power actuator placed near the knee joint and connected to a differential mechanism to drive both the knee and ankle joints. We conducted comprehensive benchtop tests and evaluated the prosthesis with three individuals who have above-knee amputations, assessing its performance in walking, stair climbing, and transitions between sitting and standing. Our evaluation focused on gathering position and torque data recorded from sensors integrated into the prosthesis and comparing these measurements to biomechanical data of able-bodied locomotion. Our findings highlight the promise of underactuation in advancing lower-limb prosthetics and demonstrate the feasibility of our knee-ankle design.
|
|
15:00-16:00, Paper SuI_2P.8 | Add to My Program |
DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation |
|
Feng, Qian | Technical University of Munich |
Martinez Lema, David Sebastian | Technical University of Munich |
Malmir, Mohammadhossein | Technical University of Munich |
Li, Hang | Technical University of Munich |
Feng, Jianxiang | Technical University of Munich (TUM) |
Chen, Zhaopeng | University of Hamburg |
Knoll, Alois | Tech. Univ. Muenchen TUM |
Keywords: Grasping, Multifingered Hands, Deep Learning in Grasping and Manipulation
Abstract: We introduce DexGanGrasp, a dexterous grasping synthesis method that generates and evaluates grasps with a single view in real time. DexGanGrasp comprises a Conditional Generative Adversarial Network (cGAN)-based DexGenerator to generate dexterous grasps and a discriminator-like DexEvaluator to assess the stability of these grasps. Extensive simulation and real-world experiments showcase the effectiveness of our proposed method, outperforming the baseline FFHNet with an 18.57% higher success rate in real-world evaluation. We further extend DexGanGrasp to DexAfford-Prompt, an open-vocabulary affordance grounding pipeline for dexterous grasping leveraging Multimodal Large Language Models (MLLMs) and Vision Language Models (VLMs), to achieve task-oriented grasping with successful real-world deployments. For the code and data, visit our website: https://david-s-martinez.github.io/DexGANGrasp.io
|
|
15:00-16:00, Paper SuI_2P.9 | Add to My Program |
Unsupervised Skill Discovery for Robotic Manipulation through Automatic Task Generation |
|
Jansonnie, Paul | Technische Universität Darmstadt |
Wu, Bingbing | Naver Labs Europe |
Perez, Julien | Naver Labs Europe |
Peters, Jan | Technische Universität Darmstadt |
Keywords: Deep Learning in Grasping and Manipulation, Autonomous Agents, Reinforcement Learning
Abstract: Learning skills that interact with objects is of major importance for robotic manipulation. These skills can indeed serve as an efficient prior for solving various manipulation tasks. We propose a novel Skill Learning approach that discovers composable behaviors by solving a large and diverse number of autonomously generated tasks. Our method learns skills allowing the robot to consistently and robustly interact with objects in its environment. The discovered behaviors are embedded in primitives which can be composed with Hierarchical Reinforcement Learning to solve unseen manipulation tasks. In particular, we leverage Asymmetric Self-Play to discover behaviors and Multiplicative Compositional Policies to embed them. We compare our method to Skill Learning baselines and find that our skills are more interactive. Furthermore, the learned skills can be used to solve a set of unseen manipulation tasks, in simulation as well as on a real robotic platform.
|
|
15:00-16:00, Paper SuI_2P.10 | Add to My Program |
Robotic State Recognition with Image-To-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization |
|
Kawaharazuka, Kento | The University of Tokyo |
Obinata, Yoshiki | The University of Tokyo |
Kanazawa, Naoaki | The University of Tokyo |
Okada, Kei | The University of Tokyo |
Inaba, Masayuki | The University of Tokyo |
Keywords: AI-Based Methods, Recognition, Mobile Manipulation
Abstract: State recognition of the environment and objects, such as the open/closed state of doors and the on/off state of lights, is indispensable for robots that perform daily life support and security tasks. Until now, state recognition methods have been based on training neural networks from manual annotations, preparing special sensors for the recognition, or manually programming feature extraction from point clouds or raw images. In contrast, we propose a robotic state recognition method using a pre-trained vision-language model capable of Image-to-Text Retrieval (ITR) tasks. We prepare several kinds of language prompts in advance, calculate the similarity between these prompts and the current image by ITR, and perform state recognition. By applying an optimal weighting to each prompt using black-box optimization, state recognition can be performed with higher accuracy. Experiments show that this approach enables a variety of state recognitions by simply preparing multiple prompts, without retraining neural networks or manual programming. In addition, since only prompts and their weights need to be prepared for each recognizer, there is no need to prepare multiple models, which facilitates resource management. Through language alone, it is possible to recognize the open/closed state of transparent doors, whether water is running from a faucet, and even the qualitative state of whether a kitchen is clean, all of which have been challenging so far.
|
|
15:00-16:00, Paper SuI_2P.11 | Add to My Program |
Learning-Based Force Control of Twisted String Actuators Using a Neural Network-Based Inverse Model |
|
Kwon, Hyeokjun | Kyungpook National University |
Kim, Sung-Woo | Samsung Electronics |
Joe, Hyun-Min | Kyungpook National University |
Keywords: Force Control, Machine Learning for Robot Control, Model Learning for Control
Abstract: In this study, we propose learning-based force control of twisted string actuators (TSAs) using a neural network-based inverse model. A learning-based force controller is designed using the input and output data of TSAs without a dynamic model of TSAs. Furthermore, the neural network-based inverse model is utilized to reduce model errors and handle nonlinearities between the inputs and outputs of the TSAs. The trained neural network-based inverse model is directly implemented as a force controller for the TSAs. Additionally, we propose data collection methods utilizing three types of inputs to improve the performance of the proposed controller. To verify the improved performance resulting from the proposed data collection methods, we compared the performance of the learning-based force controller for each dataset in the TSA hardware. We then selected the dataset with the best performance among the proposed inputs through experiments. Additionally, to verify the performance of the learning-based force controller, a reference force tracking experiment was performed and compared with a proportional-integral-derivative (PID) controller and feedback linearization. The learning-based force controller utilizing the selected input-based dataset demonstrated higher force tracking performance than the other controllers. Consequently, the TSA’s learning-based force control demonstrates robust force control performance.
|
|
15:00-16:00, Paper SuI_2P.12 | Add to My Program |
On the Feasibility of a Mixed-Method Approach for Solving Long Horizon Task-Oriented Dexterous Manipulation |
|
Mehta, Shaunak | Virginia Tech |
Soltani Zarrin, Rana | Honda Research Institute - USA |
Keywords: Dexterous Manipulation, Manipulation Planning, Multifingered Hands
Abstract: In-hand manipulation of tools using dexterous hands in the real world is an underexplored problem in the literature. In addition to the more complex geometry and larger size of tools compared to more commonly used objects like cubes or cylinders, task-oriented in-hand tool manipulation involves many sub-tasks to be performed sequentially. These may include reaching for the tool, picking it up, reorienting it in hand with or without regrasping to reach a desired final grasp appropriate for the tool's usage, and carrying the tool to the desired pose. Research on long-horizon manipulation using dexterous hands is rather limited, and existing work focuses on learning the individual sub-tasks using a method like reinforcement learning (RL) and combining the policies for different sub-tasks to perform a long-horizon task. However, in general, a single method may not be the best for all sub-tasks, and this can be more pronounced when dealing with multi-fingered hands manipulating objects with complex geometry, such as tools. In this paper, we investigate the use of a mixed-method approach to solve the long-horizon task of tool usage, using imitation learning, reinforcement learning, and model-based control. We also discuss a new RL-based teacher-student framework that incorporates real-world data into offline training. We show that our proposed approach for each sub-task outperforms the commonly adopted reinforcement learning approach across different sub-tasks and in performing the long-horizon task in simulation. Finally, we show successful transferability to the real world.
|
|
15:00-16:00, Paper SuI_2P.13 | Add to My Program |
I-GRIP, a Grasping Movement Intention Estimator for Intuitive Control of Assistive Devices |
|
Moullet, Etienne | INRIA |
Carpentier, Justin | INRIA |
Azevedo Coste, Christine | INRIA |
Bailly, François | INRIA, Université De Montpellier |
Keywords: Human-Robot Collaboration, Grasping, Physically Assistive Devices
Abstract: This study introduces i-GRIP, an innovative movement goal estimator designed to facilitate the control of assistive devices for grasping tasks in individuals with upper-limb impairments. The algorithm operates within a collaborative control paradigm, eliminating the need for specific user actions apart from naturally moving their hand toward a desired object. i-GRIP analyzes the hand's movement in an object-populated scene to determine its target and select an appropriate grip. In an experimental study involving 11 healthy participants, i-GRIP exhibited promising estimation performances (success rates of 89.9% for target identification and 94.8% for grip selection) and responsiveness (mean delays of 0.53s for target identification and 0.39s for grip selection), showing its potential to facilitate the daily use of grasping assistive devices for individuals with upper-limb impairments.
|
|
15:00-16:00, Paper SuI_2P.14 | Add to My Program |
Improving Operational Accuracy of a Mobile Manipulator by Modeling Geometric and Non-Geometric Parameters |
|
Nguyen, Thanh D. V. | LAAS-CNRS |
Bonnet, Vincent | University Paul Sabatier |
Fernbach, Pierre | CNRS - LAAS
Flayols, Thomas | LAAS, CNRS |
Lamiraux, Florent | CNRS |
Keywords: Calibration and Identification, Kinematics, Mobile Manipulation
Abstract: This paper aims to address two intrinsic phenomena encountered in mobile manipulator robots, but often neglected, with the objective of improving the overall accuracy of end-effector pose estimation. First, after performing state-of-the-art geometric calibration of the arm, we propose two identifiable mathematical models to account for non-geometric effects: a model for the mobile base suspension system and a model of non-linear inaccuracies in joint angle estimates, the latter due to backlash and misaligned encoder mounting. The proposed models were then experimentally validated on the mobile manipulator TIAGo using a stereophotogrammetric system. Overall, end-effector pose accuracy was improved by 60% when compared to the nominal manufacturer model, with root mean square errors (RMSE) of 5.7 mm and 2.7 deg for positional and orientational errors, respectively.
|
|
15:00-16:00, Paper SuI_2P.15 | Add to My Program |
Learning Time-Optimal and Speed-Adjustable Tactile In-Hand Manipulation |
|
Pitz, Johannes | German Aerospace Center |
Röstel, Lennart | German Aerospace Center (DLR) |
Sievers, Leon | German Aerospace Center |
Bäuml, Berthold | Technical University of Munich |
Keywords: In-Hand Manipulation, Reinforcement Learning, Multifingered Hands
Abstract: In-hand manipulation with multi-fingered hands is a challenging problem that recently became feasible with the advent of deep reinforcement learning methods. While most contributions to the task brought improvements in robustness and generalization, this paper addresses the critical performance measure of the speed at which an in-hand manipulation can be performed. We present reinforcement learning policies that can perform in-hand reorientation significantly faster than previous approaches for the complex setting of goal-conditioned reorientation in SO(3) with permanent force closure and tactile feedback only (i.e., using the hand's torque and position sensors). Moreover, we show how policies can be trained to be speed-adjustable, allowing the average orientation speed of the manipulated object to be set during deployment. To this end, we present suitable and minimalistic reinforcement learning objectives for time-optimal and speed-adjustable in-hand manipulation, as well as an analysis based on extensive experiments in simulation. We also demonstrate the zero-shot transfer of the learned policies to the real DLR-Hand II with a wide range of target speeds and the fastest dexterous in-hand manipulation without visual inputs.
|
|
15:00-16:00, Paper SuI_2P.16 | Add to My Program |
Fundamental Three-Dimensional Configuration of Wire-Wound Muscle-Tendon Complex Drive |
|
Ribayashi, Yoshimoto | The University of Tokyo |
Sahara, Yuta | The University of Tokyo |
Sawaguchi, Shogo | The University of Tokyo
Miyama, Kazuhiro | The University of Tokyo |
Miki, Akihiro | The University of Tokyo |
Kawaharazuka, Kento | The University of Tokyo |
Okada, Kei | The University of Tokyo |
Inaba, Masayuki | The University of Tokyo |
Keywords: Biologically-Inspired Robots, Tendon/Wire Mechanism, Soft Robot Materials and Design
Abstract: For robots to become more versatile and expand their areas of application, their bodies need to be suitable for contact with the environment. When the human body comes into contact with the environment, it is possible for it to continue to move even if the positional relationship between muscles or the shape of the muscles changes. We have already focused on the effect of geometric deformation of muscles and proposed a drive system called wire-wound Muscle-Tendon Complex (ww-MTC), an extension of the wire drive system. Our previous study using a robot with a two-dimensional configuration demonstrated several advantages: reduced wire loosening, interference, and wear; improved robustness during environmental contact; and a muscular appearance. However, this design had some problems, such as excessive muscle expansion that hindered inter-muscle movement, and confinement to planar motion. In this study, we develop the ww-MTC into a three-dimensional shape. We present a fundamental construction method for a muscle exterior that expands gently and can be contacted over its entire surface. We also apply the three-dimensional ww-MTC to a 2-axis 3-muscle robot, and confirm that the robot can continue to move while adapting to its environment.
|
|
15:00-16:00, Paper SuI_2P.17 | Add to My Program |
Multi-Contact Whole-Body Force Control for Position-Controlled Robots |
|
Rouxel, Quentin | INRIA |
Ivaldi, Serena | INRIA |
Mouret, Jean-Baptiste | Inria |
Keywords: Multi-Contact Whole-Body Motion Planning and Control, Whole-Body Motion Planning and Control, Humanoid Robot Systems
Abstract: Many humanoid and multi-legged robots are controlled in positions rather than in torques, which prevents direct control of contact forces, and hampers their ability to create multiple contacts to enhance their balance, such as placing a hand on a wall or a handrail. This paper introduces the SEIKO (Sequential Equilibrium Inverse Kinematic Optimization) pipeline, and proposes a unified formulation that exploits an explicit model of flexibility to indirectly control contact forces on traditional position-controlled robots. SEIKO formulates whole-body retargeting from Cartesian commands and admittance control using two quadratic programs solved in real time. Our pipeline is validated with experiments on the real, full-scale humanoid robot Talos in various multi-contact scenarios, including pushing tasks, far-reaching tasks, stair climbing, and stepping on sloped surfaces. Code and videos are available at https://hucebot.github.io/seiko_controller_website/.
|
|
15:00-16:00, Paper SuI_2P.18 | Add to My Program |
Automatic Gain Tuning for Humanoid Robots Walking Architectures Using Gradient-Free Optimization Techniques |
|
Sartore, Carlotta | Istituto Italiano Di Tecnologia |
Rando, Marco | Universita Degli Studi Di Genova |
Romualdi, Giulio | Istituto Italiano Di Tecnologia |
Molinari, Cesare | UniGe |
Rosasco, Lorenzo | Istituto Italiano Di Tecnologia & Massachusetts Institute of Techn
Pucci, Daniele | Italian Institute of Technology |
Keywords: Humanoid and Bipedal Locomotion, Legged Robots, Optimization and Optimal Control
Abstract: Developing sophisticated control architectures has endowed robots, particularly humanoid robots, with numerous capabilities. However, tuning these architectures remains a challenging and time-consuming task that requires expert intervention. In this work, we propose a methodology to automatically tune the gains of all layers of a hierarchical control architecture for walking humanoids. We tested our methodology by employing different gradient-free optimization methods: Genetic Algorithm (GA), Covariance Matrix Adaptation Evolution Strategy (CMA-ES), Evolution Strategy (ES), and Differential Evolution (DE). We validated the parameters found both in simulation and on the real ergoCub humanoid robot. Our results show that GA achieves the fastest convergence (10 × 10^3 function evaluations vs 25 × 10^3 needed by the other algorithms) and a 100% success rate in completing the task, both in simulation and when transferred to the real robotic platform. These findings highlight the potential of our proposed method to automate the tuning process, reducing the need for manual intervention.
|
|
15:00-16:00, Paper SuI_2P.19 | Add to My Program |
Humanoid Dance Simulation Using Hybrid Model Predictive Control |
|
Tazaki, Yuichi | Kobe University |
Keywords: Whole-Body Motion Planning and Control, Optimization and Optimal Control, Simulation and Animation
Abstract: This paper proposes a method that realizes dynamic dancing motion of humanoid robots based on hybrid model predictive control. The proposed control method runs two types of model predictive controllers with different fidelity and time scale in parallel; one performs long-horizon prediction by making use of a closed-form solution of the centroidal dynamics, and the other performs short-horizon prediction based on the whole-body dynamics. A reference key-pose sequence of more than 100 key frames including stepping and fast upper-body movement was edited using Choreonoid and input to the controller. In closed-loop simulation of a torque-controlled 32-DoF humanoid robot, the controller was able to track the reference sequence by attenuating large angular momentum.
|
|
15:00-16:00, Paper SuI_2P.20 | Add to My Program |
Gait Optimization for Legged Systems through Mixed Distribution Cross-Entropy Optimization |
|
Tsikelis, Ioannis | Inria |
Chatzilygeroudis, Konstantinos | University of Patras |
Keywords: Legged Robots, Humanoid and Bipedal Locomotion
Abstract: Legged robotic systems can play an important role in real-world applications due to their superior load-bearing capabilities, enhanced autonomy, and effective navigation on uneven terrain. They offer an optimal trade-off between mobility and payload capacity, excelling in diverse environments while maintaining efficiency in transporting heavy loads. However, planning and optimizing gaits and gait sequences for these robots presents significant challenges due to the complexity of their dynamic motion and the numerous optimization variables involved. Traditional trajectory optimization methods address these challenges by formulating the problem as an optimization task that aims to minimize cost functions and to automatically discover contact sequences. Despite their structured approach, optimization-based methods face substantial difficulties, particularly because such formulations result in highly nonlinear and difficult-to-solve problems. To address these limitations, we propose CrEGOpt, a bi-level optimization method that combines traditional trajectory optimization with a black-box optimization scheme. At the higher level, CrEGOpt employs the Mixed Distribution Cross-Entropy Method to optimize both the gait sequence and the phase durations, thus simplifying the lower-level trajectory optimization problem. This approach allows for fast solutions to complex gait optimization problems. Extensive evaluation in simulated environments demonstrates that CrEGOpt can find solutions for biped, quadruped, and hexapod robots in under 10 seconds. This novel bi-level optimization scheme offers a promising direction for future research in automatic contact scheduling.
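CrEGOpt's upper level is built on the Cross-Entropy Method. The paper's variant samples from a mixed discrete/continuous distribution over gait sequences and phase durations, with the lower-level trajectory optimizer supplying the cost; the sketch below shows only the standard continuous-Gaussian case, with a toy quadratic standing in for that lower-level cost:

```python
import numpy as np

def cross_entropy_method(cost_fn, mu, sigma, n_samples=64, n_elite=8, n_iters=30, seed=0):
    """Minimize cost_fn over continuous parameters with the cross-entropy method."""
    rng = np.random.default_rng(seed)
    for _ in range(n_iters):
        # Sample candidates from the current Gaussian search distribution.
        samples = rng.normal(mu, sigma, size=(n_samples, mu.size))
        costs = np.array([cost_fn(s) for s in samples])
        # Refit the distribution to the lowest-cost "elite" samples.
        elites = samples[np.argsort(costs)[:n_elite]]
        mu, sigma = elites.mean(axis=0), elites.std(axis=0) + 1e-6
    return mu

# Toy usage: recover the minimizer of a shifted quadratic.
best_x = cross_entropy_method(lambda x: np.sum((x - 3.0) ** 2),
                              mu=np.zeros(2), sigma=np.ones(2) * 2.0)
```

Extending this to the mixed setting amounts to maintaining a categorical distribution over discrete choices (here, contact sequences) alongside the Gaussian, and refitting both from the elites.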
|
|
15:00-16:00, Paper SuI_2P.21 | Add to My Program |
Puppeteer Your Robot: Augmented Reality Leader-Follower Teleoperation |
|
van Haastregt, Jonne | KTH Royal Institute of Technology |
Welle, Michael C. | KTH Royal Institute of Technology |
Zhang, Yuchong | KTH Royal Institute of Technology |
Kragic, Danica | KTH |
Keywords: Telerobotics and Teleoperation, Virtual Reality and Interfaces, Human-Robot Teaming
Abstract: High-quality demonstrations are necessary when learning complex and challenging manipulation tasks. In this work, we introduce an approach to puppeteer a robot by controlling a virtual robot in an augmented reality setting. Our system retains the intuitiveness of a physical leader-follower setup while avoiding the need for an expensive physical installation. In addition, the user is provided with additional information through augmented reality. We validate our system with a pilot study (n=10) on block stacking and rice scooping tasks, in which the majority of participants rate the system favorably. The Oculus app and corresponding ROS code are available on the project website: https://ar-puppeteer.github.io/.
|
|
15:00-16:00, Paper SuI_2P.22 | Add to My Program |
Design and Control Scheme of a Rigid-Flexible Coupled Dual-Arm Hug Robot |
|
Wang, Wenbiao | Zhejiang University of Technology |
Bao, Guanjun | Zhejiang University of Technology, China |
Wang, Yulong | Zhejiang University of Technology |
Keywords: Emotional Robotics, Human-Robot Collaboration
Abstract: The humanoid design of robots can maximize the imitation of real human work, facilitate integrated control with other devices, and offer good safety and human-machine interaction. These attributes have led to their widespread application in various fields. The most critical component of the robot’s structure, the arm, can be categorized into three types based on stiffness characteristics: rigid robotic arms, flexible robotic arms, and rigid-flexible coupled robotic arms. Rigid robotic arms are characterized by high load capacity, high precision, high repeatability, and high speed. Their development is relatively mature across different fields. However, due to their low degrees of freedom, poor environmental adaptability, and unsafe human-machine interaction, they are difficult to apply in complex, unstructured scenarios. Currently, the design and manufacturing of flexible robotic arms have become a research hotspot in many engineering disciplines. Compared to traditional bulky rigid robotic arms, flexible robotic arms offer advantages such as smaller size, lower energy consumption, and reduced cost. However, their lower stiffness can lead to structural fatigue, and their precision is affected by elastic deformation. During motion or when subjected to external disturbances, flexible robotic arms are prone to vibration issues, which can reduce control accuracy and compromise the stability of the system. This paper focuses on the scenario of human-robot hugging and designs a safe and reliable rigid-flexible coupled dual-arm robot that can provide psychological comfort. The rigid-flexible coupled robotic arm primarily consists of a rigid skeleton, variable stiffness joints, and flexible muscles. Driven by pneumatic actuators, it can achieve sensing and human-machine interaction. The rigid-flexible coupled design ensures both the end-effector output force and safety. The research content mainly includes the structural design of the dual-arm robot, model establishment, simulation and control scheme of the hugging motion, and human-robot interaction hugging experiments.
|
|
15:00-16:00, Paper SuI_2P.23 | Add to My Program |
Robot Embodied Dynamic Tactile Perception of Liquid in Containers |
|
Xu, Yingtian | Chinese University of Hong Kong, Shenzhen
Ma, Yichen | Shenzhen University |
Lin, Waner | Shanghai Jiao Tong University |
Sun, Zhenglong | Chinese University of Hong Kong, Shenzhen |
Zhang, Tianwei | The University of Tokyo |
Wang, Ziya | Shenzhen University |
Keywords: Perception for Grasping and Manipulation, Force and Tactile Sensing, Bioinspired Robot Learning
Abstract: Humans perceive objects and environments actively and dynamically using tactile sensing facilitated by the skin. However, due to the gap in sensing capabilities between electronic skin and human skin, it remains a challenge for robots to achieve intricate tactile perception. For example, the task of liquid property estimation within containers demands sensing and comprehension of complex dynamic tactile signals during contact. This paper introduces a novel tactile fingertip, inspired by human skin, that enables both static tactile sensing, facilitated by a layer of porous piezoresistive elastomer, and dynamic tactile sensing, enabled by condenser microphones. A data-driven methodology is employed to analyze liquids through multimodal signals without physical modelling. The proposed robotic system shakes various bottles filled with different volumes of water, allowing the tactile fingertip to detect changes in grasping force caused by shaking and small vibrations caused by the collision of the container with the liquid. Five machine learning models trained on these tactile signals are analyzed and compared. Experimental results demonstrate the proficiency of the tactile system in distinguishing liquid fill percentages across bottles of different shapes and capacities with a remarkable accuracy of 98%. This work holds promise for advancing the embodied intelligence of robots, enhancing their ability to perceive mixed tactile signals dynamically and thereby facilitating tasks such as liquid estimation and other intricate tactile operations in environments like kitchens and hospitals.
|
|
15:00-16:00, Paper SuI_2P.24 | Add to My Program |
Impact of Verbal Instructions and Deictic Gestures of a Cobot on the Performance of Human Coworkers |
|
Younes, Rami | Gipsa-Lab, Lig-Lab |
Elisei, Frédéric | GIPSA-Lab |
Pellier, Damien | Laboratoire d'Informatique De Grenoble - CNRS |
Bailly, Gérard | GIPSA-Lab |
Keywords: Human-Robot Collaboration, Gesture, Posture and Facial Expressions
Abstract: This paper investigates the effectiveness and efficiency of incorporating pointing gestures as well as hand-speech synchronization policies into instruction delivery, as would be used in an industrial setting with a cobot. Through brick assembly tasks, our study explores the integration of pointing gestures into human-robot interaction, extending prior research on verbal instruction efficacy. Results show that pointing gestures significantly reduce errors compared to verbal instructions alone, especially for complex tasks. However, this improvement comes at the cost of increased task completion time. We also show that, depending on this synchronization, the user may delay their action until all information has been presented instead of exploiting the information as it arrives. This study emphasizes the potential of pointing gestures and hand-speech synchronization in improving human-robot interaction and suggests further research for optimal integration.
|
|
15:00-16:00, Paper SuI_2P.25 | Add to My Program |
Learning Generic and Dynamic Locomotion of Humanoids across Discrete Terrains |
|
Yu, Shangqun | University of Massachusetts Amherst |
Perera, Kankanige Nisal Minula | University of Massachusetts Amherst |
Marew, Daniel | University of Massachusetts Amherst |
Kim, Donghyun | University of Massachusetts Amherst |
Keywords: Humanoid and Bipedal Locomotion, Legged Robots, Reinforcement Learning
Abstract: This paper addresses the challenge of terrain-adaptive dynamic locomotion in humanoid robots, traditionally tackled by optimization-based methods or reinforcement learning (RL). Optimization-based methods, such as model-predictive control, excel in finding optimal reaction forces and achieving agile locomotion, but struggle with the nonlinear hybrid dynamics of legged systems and the real-time computation of step location, timing, and reaction forces. Conversely, RL-based methods show promise in navigating dynamic and rough terrains but are limited by their extensive data requirements. We introduce a novel locomotion architecture that integrates a neural network policy, trained through RL in simplified environments, with a state-of-the-art motion controller combining model-predictive control (MPC) and whole-body impulse control (WBIC). The policy efficiently learns high-level locomotion strategies, such as gait selection and step positioning, without the need for full dynamics simulations. This control architecture enables humanoid robots to dynamically navigate discrete terrains, making strategic locomotion decisions (e.g., walking, jumping, and leaping) based on ground height maps. Our results demonstrate that this integrated control architecture achieves dynamic locomotion with significantly fewer training samples than conventional RL-based methods and can be transferred to different humanoid platforms without additional training. The control architecture has been extensively tested in dynamic simulations, accomplishing terrain height-based dynamic locomotion for three different robots.
|
|
15:00-16:00, Paper SuI_2P.26 | Add to My Program |
Novel Series Elastic Actuator towards High Torque Capacity with High Sensitive Torque Measurement |
|
Yun, WonBum | Korea Institute of Robotics and Technology Convergence |
Kim, Junyoung | KIRO (Korea Institute of Robotics & Technology Convergence)
Oh, Sehoon | DGIST |
Keywords: Compliant Joints and Mechanisms, Actuation and Joint Mechanisms, Robust/Adaptive Control
Abstract: Series Elastic Actuators (SEAs) have been widely used in various robotic applications due to their ability to provide safe and accurate force. Conventionally, however, securing high torque sensitivity in the SEA's spring is challenging due to limitations on natural frequency and torque capacity, making SEAs less applicable in certain situations. Therefore, this paper proposes a novel Series Spring-Embraced Elastic Actuator (SSE-EA), which uses a Transmitted Force-Sensing SEA (TFSEA) structure and a vertical torsional spring to achieve high torque sensitivity. A design method for the spring is proposed to maximize the advantages of this structure. In addition, mechanisms and controller designs are presented to achieve sensitive torque control for heavy-load tasks. Through several validations, we evaluate the performance and applicability of the SSE-EA.
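For context, the sensing principle every SEA variant builds on is Hooke's law: transmitted torque is inferred from the measured deflection of the elastic element, and a softer spring yields more deflection per Nm (higher sensitivity) at the cost of a lower natural frequency. The snippet below is a generic textbook sketch of that trade-off, not the proposed SSE-EA/TFSEA design:

```python
import math

def sea_torque(theta_motor, theta_load, k_spring):
    """Transmitted torque from spring deflection (Hooke's law): tau = k * (q_m - q_l)."""
    return k_spring * (theta_motor - theta_load)

def natural_frequency(k_spring, load_inertia):
    """Natural frequency (rad/s) of the spring-load pair: a softer spring raises
    deflection per Nm (sensitivity) but lowers this bandwidth limit."""
    return math.sqrt(k_spring / load_inertia)

# Toy usage: a 300 Nm/rad spring deflected by 0.01 rad transmits about 3 Nm.
tau = sea_torque(0.11, 0.10, 300.0)
```

The paper's contribution can be read against this tension: obtaining high deflection sensitivity without giving up torque capacity or bandwidth.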
|
|
15:00-16:00, Paper SuI_2P.27 | Add to My Program |
Trajectory Optimization under Contact Timing Uncertainties |
|
Zhao, Haizhou | Technical University of Munich |
Khadiv, Majid | Technical University of Munich |
Keywords: Optimization and Optimal Control
Abstract: Most interesting problems in robotics (e.g., locomotion and manipulation) are realized through intermittent contact with the environment. Due to perception and modeling errors, assuming an exact time for establishing contact with the environment is unrealistic. On the other hand, handling uncertainties in contact timing is notoriously difficult, as it gives rise to either handling uncertain complementarity systems or solving combinatorial optimization problems at run-time. This work presents a novel optimal control formulation to find robust control policies under contact timing uncertainties. Our main novelty lies in casting the stochastic problem as a deterministic optimization over the uncertainty set that ensures the robustness criterion is satisfied for candidate pre-contact states and optimizes for contact-relevant objectives. This way, we only need to solve a manageable standard nonlinear programming problem without complementarity constraints or combinatorial explosion. Our simulation results on multiple simplified locomotion and manipulation tasks demonstrate the robustness of our uncertainty-aware formulation compared to the nominal optimal control formulation.
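One way to read the core idea: instead of treating contact time as a random variable inside the optimization, discretize the timing uncertainty set and require the chosen pre-contact state to perform acceptably against every timing in it. The toy below is only illustrative (all names and the impact-velocity cost are hypothetical, and a coarse grid search stands in for the paper's nonlinear program); it makes a min-max choice over sampled timing deviations:

```python
import numpy as np

def robust_choice(candidates, timing_set, cost_fn):
    """Pick the candidate minimizing the worst-case cost over the timing uncertainty set."""
    worst = [max(cost_fn(c, t) for t in timing_set) for c in candidates]
    return candidates[int(np.argmin(worst))]

# Toy usage: choose a pre-contact velocity robust to +/-50 ms contact-timing error,
# penalizing the (squared) velocity accumulated under gravity at the true contact time.
g = 9.81
candidates = np.linspace(-1.0, 0.0, 101)   # candidate pre-contact velocities (m/s)
timing_set = np.linspace(-0.05, 0.05, 11)  # sampled timing deviations (s)
best_v = robust_choice(candidates, timing_set, lambda v, t: (v + g * t) ** 2)
```

The min-max structure is what lets the stochastic problem become a single deterministic one: no complementarity constraints or per-scenario branching appear in the decision variable.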
|
|
15:00-16:00, Paper SuI_2P.28 | Add to My Program |
Fusing Dynamics and Reinforcement Learning for Control Strategy: Achieving Precise Gait and High Robustness in Humanoid Robot Locomotion |
|
Zhao, Zida | Harbin Institute of Technology (Shenzhen)
Huang, Haodong | Harbin Institute of Technology (Shenzhen)
Sun, Shilong | Harbin Institute of Technology, Shenzhen
Li, ChiYao | Harbin Institute of Technology, Shenzhen
Xu, Wenfu | Harbin Institute of Technology, Shenzhen
Keywords: Humanoid and Bipedal Locomotion, Legged Robots, Reinforcement Learning
Abstract: Achieving precise gait planning and high robustness in locomotion control is crucial for the development and application of humanoid robots. In this paper, a novel control strategy is proposed, which combines dynamics control and reinforcement learning (RL), leveraging the precision of dynamics control and the robustness of RL. Specifically, foot placements for each step of the humanoid robot are designed, and the trajectories of the center of mass (CoM) and feet are obtained using a 3D linear inverted pendulum model (3D LIPM). Subsequently, joint angles during motion are calculated based on the trajectories of the CoM and feet using inverse kinematics equations. Finally, the obtained joint angles are trained as baseline actions using RL algorithms. Parameter domain randomization is introduced during the training process to enhance control robustness. By employing this control strategy, simulations of various single-step gaits, such as walking forward, walking to the right, and making right turns, are achieved. Additionally, trajectory tracking, locomotion tests on different terrains, and disturbance resistance are conducted. The simulation results demonstrate that the proposed control strategy enables precise gait control and exhibits strong robustness in humanoid robots.
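The 3D LIPM used above for CoM planning has a well-known closed-form solution: with the CoM held at constant height z_c, each horizontal axis evolves independently as x'' = (g/z_c)·x relative to the support point. Below is a minimal sketch of that textbook solution only (not the authors' full pipeline; foot-placement design, inverse kinematics, and the RL stage are omitted):

```python
import math

def lipm_com(x0, v0, z_c, t, g=9.81):
    """Closed-form CoM state of the linear inverted pendulum, per horizontal axis,
    measured relative to the support (pivot) point."""
    T_c = math.sqrt(z_c / g)  # pendulum time constant
    x = x0 * math.cosh(t / T_c) + T_c * v0 * math.sinh(t / T_c)
    v = (x0 / T_c) * math.sinh(t / T_c) + v0 * math.cosh(t / T_c)
    return x, v

# Toy usage: a CoM starting 5 cm ahead of the pivot at rest accelerates away from it.
x, v = lipm_com(0.05, 0.0, 0.8, 0.2)
```

Chaining such segments between planned foot placements yields the CoM trajectory that the inverse kinematics step then converts into the joint-angle baseline actions for RL.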
|
| |