Blog / Docs • Email • ORCID
I am currently a Master's student in Artificial Intelligence at Shanghai Normal University. My research and engineering efforts are heavily focused on Vision-Language-Action (VLA) models and Reinforcement Learning for physical robotic manipulation. I bridge the gap between open-world semantic understanding and contact-rich precision control.
- VLA Deployment: Fine-tuning and deploying VLA models (
SmolVLA,LingBot-VLA) on SOARM-101 and AgiBot G1 robot tasks. - Robot Data Flywheel: Building teleoperation datasets, SFT post-training loops, and offline-to-online RL data replay from failures and human interventions.
- Hardware-Software Co-design: Optimizing client-server inference and closed-loop robot execution pipelines.
- Embodied AI & Robotics: LeRobot | SOARM-101 | AgiBot G1 | LingBot-VLA | Action Chunking
- Reinforcement Learning: Offline-to-Online RL | TD3 | PPO | GRPO | Behavior Cloning
- Deep Learning: PyTorch | HuggingFace | CUDA | 3D Gaussian Splatting
- Simulation: MuJoCo | Isaac Lab / Isaac Sim
- DevOps & Backend: Linux | Docker | FastAPI | Git | Claude Agentic Workflow
- 📄 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting (1st Author)
- Formulated camera pose estimation as a gradient optimization problem to minimize translation and rotation errors in high-resolution 3DGS environments.
- 🦾 LeRobot (VLA Pipeline SOPs)
- Maintaining comprehensive Chinese engineering docs for
SO-ARM101 × VLA. Covering data collection, 3D-2D mask projection filtering, LoRA fine-tuning, and asynchronous inference loops.
- Maintaining comprehensive Chinese engineering docs for
- ⚡ fastapi-supabase-template
- A highly concurrent async backend scaffolding boilerplate, deeply integrated with Docker and built via agentic workflows.
"Replacing pure imitation with physical reflexes."




