PhD candidate in planning for robotics at the Department of Cognitive Robotics (CoR), TU Delft (Netherlands). I am supervised by Professor Javier Alonso-Mora and Professor Christian Pek. My research focuses on task and motion planning in multi-agent systems accounting for interaction and cooperation between other agents and humans. Feel free to contact me at A.MatosesGimenez (at) tudelft.nl
Before coming to Delft, I obtained my bachelor’s degree in aerospace engineering from Universitat Politècnica de València (Spain), and a master’s degree from KTH Royal institute of Technology (Sweden). After my MSc, I worked as a research engineer at KTH for Professor Dimos Dimarogonas’s group.
We use a parallelized physics simulator and cross entropy optimization to find optimal realizations of task and motion planning (TAMP) problems. This allows us to consider dynamics, contacts and the low level controllers that the real robot has, and their impact on the optimality and feasibility of the solutions.
Sadcher is a real-time, imitation-learned task assignment framework for heterogeneous multi-robot teams with dynamic coalitions and task precedence. It predicts robot-task rewards using graph attention and transformers, then applies relaxed bipartite matching to produce feasible, high-quality schedules that scale and outperform learning/heuristic baselines; we also release a dataset of 250k optimal schedules.
Decentralized MARL for 6-DoF control of a cable-suspended load with multiple MAVs, requiring no inter-drone communication and enabling scalable, onboard deployment. Combined with low-level controllers, it transfers robustly from simulation to reality, matching centralized performance while tolerating uncertainties and even loss of a drone.
Implemented a ROS based linear temporal logic (LTL) planner for the CANOPIES ERC: Collaborative Paradigm for Human Workers and Multi-Robot Teams in Precision Agriculture Systems.
Scalable multi-agent reinforcement learning for formation control with collision avoidance. This work was my Master's Thesis while at KTH Royal Institute of Technology. The proposed method exploits the reward structure to enable local approximation of Q-functions and policy gradients, allowing for scalable training. We compare discrete and continuous policies and analyze the impact of the sensing radius on performance and collision avoidance.
Design of an autonomous high altitude long endurance UAV to study optical phenomena in the upper atmosphere using scientific imaging instruments. Project under the KTH Space Physics department. Paper presented at ICAS 2022.