I am a first-year Master’s student at the Key Laboratory of Autonomous Intelligent Unmanned Systems(AIUS), Harbin Institute of Technology (HIT), China.
My research interests lie in Embodied AI and Robotics, especially learning-based manipulation / dexterous skills and reinforcement learning for robot control. I am also actively exploring vision-language-action (VLA) models and other learning-based control approaches for generalizable manipulation.
I am seeking a visiting research internship throughout 2026(6–12 months). I hope to work under close mentorship, contribute to a strong research direction, and deliver high-quality, publishable results, with the goal of making one top-tier conference submission in 2026.
If you are interested in collaboration or hosting, please feel free to contact me at chentingjia1209@163.com. You can also download my CV here.
🔥 News
- 2026.02: 📨 Submitted ResMerge to ICML 2026 (under review).
- 2025.11: 🔥 Started my research intership at the PKU-PsiBot Joint Lab(Peking).
- 2025.08: 🎉 As an operator and algorithm developer, won a Silver Medal in the material sorting track of the 2025 World Humanoid Robot Games–Competition.
- 2025.07: 🔥 Started my Embodied AI internship at Standard Robots(Shenzhen)
- 2025.03: 🔥 Started my engineering project at Harbin Institute of Technology (Suzhou) Research Institute
🤖 Projects
Role: End-to-end fine-tuning + evaluation infrastructure · Period: 2026
- Parameter-efficient training: froze vision encoder + LLM, trained projector + diffusion decoder (~40% trainable params)
- Config-driven parallel evaluation (5 envs) with automatic video recording and CSV summaries
- Released 4 validated fine-tuned checkpoints on HuggingFace (Spatial/Object/Goal/LIBERO-10)
Role: Lead implementer (data → training → evaluation) · Period: 2025
- Implemented LIBERO HDF5 → RDT input pipeline (state/action remapping, SigLIP-style image preprocessing, augmentation)
- Added T5-XXL language embedding caching to avoid repeated encoding (~30% training speedup)
- Systematic comparison: Full fine-tuning reaches 76–94% (suite-dependent) while LoRA saturates around ~20%
Role: Core implementer (model + env interface + evaluation) · Period: 2026
- Wrapped RDT as an RLinf policy with DDPM-based action generation and action-space extraction for LIBERO execution
- Extended LIBERO env + RLinf I/O to include 9-DoF joint proprioception required by RDT (arm + gripper)
- Identified the key RL blocker: diffusion policies do not directly provide action log-probabilities required by PPO-style algorithms (implementation in progress)
Role: System Integration + ROS2/C++ Development · Period: 2025
- Bridged OptiTrack Motive (NatNet) rigid-body tracking into ROS2 as PoseStamped / TF for downstream robotics pipelines
- Enabled RViz/MoveIt2-side pose mapping and visualization for rapid iteration and debugging of teleop demonstrations
- Supports dataset collection by recording the pose stream (e.g., rosbag2) as supervised signals for imitation learning
💻 Internships
- Embodied AI research internship.
- Worked on robotics engineering tasks.
📝 Publications
🎖 Honors & Awards
- 2025.09: 2024-2025 HIT National Special Scholarship
- 2025.08: 2025 World Humanoid Robot Games–Competition Silver Medal
- 2025.03: 2024-2025 HIT Excellent Student Worker
- 2024.08: National First Prize and Ranked Second in National University Students Intelligent Car Race
- 2024.08: National Second Prize and Patent Publication(second inventor) in China University Intelligent Robot Creative Competition
- 2024.07: National College Student Innovation and Entrepreneurship Project
- others: Outstanding student(2022-2023,2023-2024),Second Prize Scholarship(2022-2023,2023-2024,2024-2025)