Supervisor of Master's Candidates
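Haonan Luo (罗皓楠) received his PhD through a joint training program between Nanyang Technological University, Singapore, and Nanjing University of Science and Technology. He is currently an associate professor and master's supervisor at the School of Computing, Southwest Jiaotong University. Over the past five years he has published more than 20 papers in leading journals and conferences in the field, including TPAMI (CCF A, SCI Q1 Top), ICCV (CCF A), AAAI (CCF A), TNNLS (SCI Q1 Top), and TCSVT (SCI Q1 Top). He has led 10 projects as principal investigator, including a National Natural Science Foundation of China (NSFC) Young Scientists Fund project, a Sichuan Natural Science Foundation youth project, a China Postdoctoral Science Foundation general project, a science and technology innovation project for central universities, a high-level education project, a sub-topic of a major teaching reform project, and industry collaboration projects. He has also been a key participant in 8 projects, including an advance research project of the Equipment Development Department of the Central Military Commission, NSFC projects, and key R&D projects of the Sichuan Provincial Department of Science and Technology. He has contributed to 2 textbooks, received a Best Paper Award at the DICTA international conference, and serves as a guest editor for the international journal Image and Vision Computing.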
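Master's and PhD students who are passionate about research are welcome to work with me. Two research directions are available:
- Intelligent robotics: open-world robot navigation, embodied intelligence
- Large-model agents: multimodal large models, multi-agent games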
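Students supervised (including those co-supervised with 李天瑞):
- PhD students: 沈植铉 (enrolled 2024), 罗圣祥 (enrolled 2022), 陈曦 (enrolled 2023)
- Master's students: 曾欣 (enrolled 2022), 李思佳 (enrolled 2022), 杨龙 (enrolled 2023), 陈柯汛 (enrolled 2023), 郭宇琛 (enrolled 2023), 曾怡杰 (enrolled 2023), 蔡欣月 (enrolled 2023), 陈欣怡 (enrolled 2023), 郭子玉 (enrolled 2024), 叶小乐 (enrolled 2024), 王亦硕 (enrolled 2024), 叶子扬 (enrolled 2024), 杨程椿 (enrolled 2024)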
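Projects
NSFC Young Scientists Fund project, “Research on Perception and Decision-Making Methods for Indoor Robots in Unfamiliar Environments”, 2024-01 to 2026-12, CNY 300,000, principal investigator
Sichuan Natural Science Foundation youth project, “Research on Multimodal Perception and Reasoning-Based Decision-Making Methods for Indoor Robots in the Open World”, 2024-01 to 2025-12, CNY 100,000, principal investigator
China Postdoctoral Science Foundation general project, “Research on Continual Perception and Decision-Making Techniques for Indoor Robots Based on Multimodal Features”, 2023-01 to 2024-12, CNY 80,000, principal investigator
Fundamental Research Funds for the Central Universities project, “Research on Perception and Decision-Making Techniques for Indoor Robots in Complex Scenes”, 2023-01 to 2024-12, CNY 100,000, principal investigator
Collaboration project with 中国航天工业集团, “Data Collection and Annotation for Intelligent, Interpretable Image Object Attribute Recognition”, 2023-06 to 2023-07, CNY 400,000, principal investigator
Collaboration project with Beihang University, “Testing of Strategy Reasoning Models on a Simulation Platform”, 2024-03 to 2025-03, CNY 400,000, principal investigator
Collaboration project with Jiuzhou Group (九洲集团), “Safety Monitoring and Management System Based on Video Recognition”, 2024-06 to 2025-06, CNY 450,000, principal investigator
High-level education course teaching reform project, “Deep Learning”, 2024-06 to 2025-06, CNY 20,000, principal investigator
Sub-topic of a major undergraduate teaching research and reform project, “Entity and Relation Extraction Techniques for Educational Knowledge Graph Construction”, 2024-07 to 2025-07, CNY 25,000, principal investigator
NSFC general project, “Endoscopic Ultrasound Centered Collaborative Learning of Multimodal Medical Images for the Diagnosis and Treatment of Gastrointestinal Stromal Tumors”, 2024-01 to 2027-12, CNY 490,000, participant
Sichuan Key R&D Program project, “Key Technologies and Applications for Intelligent Integrated Supervision of Mines”, 2022-07 to 2025-07, CNY 1,000,000, participant
Army Equipment Department advance research program project, “Modeling Techniques for Complex Unstructured Environments”, 2017-06 to 2020-12, CNY 2,000,000, participant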
Publications:
Haonan Luo, Guosheng Lin, Yazhou Yao, Fayao Liu, Zichuan Liu, Zhenmin Tang, “Depth and Video Segmentation Based Visual Attention for Embodied Question Answering”, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(6): 6807-6819, 2023.
Haonan Luo, Guosheng Lin, Zichuan Liu, Fayao Liu, Zhenmin Tang, Yazhou Yao, “SegEQA: Video Segmentation based Visual Attention for Embodied Question Answering”, International Conference on Computer Vision (ICCV), 9667-9676, 2019.
Haonan Luo, Guosheng Lin, Fumin Shen, Hengtao Shen, “Robust-EQA: Robust Learning for Embodied Question Answering with Noisy Labels”, IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 35(9): 12083-12094, 2024.
Haonan Luo, Guosheng Lin, Yazhou Yao, Zhenmin Tang, Qingyao Wu, Xiansheng Hua, “Dense Semantics Assisted Networks for Video Action Recognition”, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 32(5): 3073-3084, 2022.
Haonan Luo, Ziyu Guo, Zhenyu Wu, Fei Teng, Tianrui Li, “Transformer-based vision-language alignment for robot navigation and question answering”, Information Fusion, 108: 102351, 2024.
Haonan Luo, Yijie Zeng, Kexun Chen, Zhixuan Shen, Yang Li, Fengmao Lv, “VLAI: Exploration and Exploitation based on Visual-Language Aligned Information for Robotic Object Goal Navigation”, Image and Vision Computing, 2024.
Haonan Luo, Sijia Li, Yijie Zeng, Zihang Wang, Botao Jiang, Xiruo Jiang, “Bidirectional Chain-of-Thought for Zero-shot Object Navigation”, Frontiers of Computing Science, 2025.
Zhixuan Shen, Haonan Luo*, Kexun Chen, Fengmao Lv, Tianrui Li, “Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration”, AAAI, 2025.
Zhixuan Shen, Haonan Luo*, Sijia Li, Tianrui Li, “Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering”, IEEE International Conference on Multimedia and Expo, 2024.
Xin Zeng, Haonan Luo*, Zihang Wang, Sijia Li, Tianrui Li, “A Continual Learning Approach for Embodied Question Answering with Generative Adversarial Imitation Learning”, International Conference on Acoustics, Speech and Signal Processing, 2025.
Xin Zeng, Haonan Luo*, Zihang Wang, Sijia Li, Leyu Zhang, Tianrui Li, “KLFormer: Karhunen-Loeve Transform for Robust 3D Human Pose Estimation”, International Conference on Acoustics, Speech and Signal Processing, 2025.
Sijia Li, Haonan Luo*, Xu Zhang, Xin Zeng, Zhixuan Shen, Tianrui Li, “Role-Specific Reward Design with Large Language Model for StarCraft II”, International Conference on Acoustics, Speech and Signal Processing, 2025.
Di Zhang, Haonan Luo*, Honglin Dong, Jianfeng Lu, “Safety-constrained Reinforcement Learning with Interaction-aware for Decision-making of Autonomous Driving”, IEEE International Conference on Multimedia and Expo, 2025.
Yijie Zeng, Xinyue Zhao, Kexun Chen, Zhixuan Shen, Tianrui Li, Haonan Luo*, “MoPE: Mixture of Policy Experts and Verification with Multimodal Information for Instance ImageGoal Navigation”, IEEE International Conference on Multimedia and Expo, 2025.
Enping Li, Tianrui Li, Tao Liang, Azhen Kang, Kexun Chen, Haonan Luo*, “Cross-lingual sentiment analysis empowered by emotional mutual reinforcement through emojis”, International Journal of Machine Learning and Cybernetics, 2025.
Xiang Wang, Haonan Luo, Zihang Wang, Xiao Bai, “Self-supervised multi-frame depth estimation with visual-inertial pose transformer and monocular guidance”, Information Fusion, 108: 102363, 2024.
Xiang Wang, Haonan Luo, Zihang Wang, Jing Zheng, Xin Ning, Xiao Bai, “Robust training for multi-view stereo networks with noisy labels”, Displays, 81: 102604, 2024.
Zihang Wang, Haonan Luo, Xiang Wang, Jing Zheng, Xiao Bai, “A contrastive learning based unsupervised multi-view stereo with multi-stage self-training strategy”, Displays, 102672, 2024.
Jing Bai, Haonan Luo, Feiwei Qin, “Design pattern modeling and extraction for CAD models”, Advances in Engineering Software, 93: 30-43, 2016.
Mengmeng Sheng, Zeren Sun, Gensheng Pei, Tao Chen, Haonan Luo, Yazhou Yao, “Enhancing Robustness in Learning with Noisy Labels: An Asymmetric Co-Training Approach”, ACM International Conference on Multimedia, 2024.
Peng Liu, Yanqi Ge, Lixin Duan, Wen Li, Haonan Luo, Fengmao Lv, “Transferring Multi-Modal Domain Knowledge to Uni-Modal Domain for Urban Scene Segmentation”, IEEE Transactions on Intelligent Transportation Systems, DOI: 10.1109/TITS.2024.3382880, 2024.
Jin He, Wei Wang, Fengmao Lv, Haonan Luo, Gexiang Zhang, Zhenghua Chen, “Multi-Scale CNN-Transformer Hybrid Network for Rail Fastener Defect Detection”, IEEE Transactions on Intelligent Transportation Systems, DOI: 10.1109/TITS.2025.3540846, 2025.
Xiaopo Zhang, Yuxin Zhou, Zhijie Lu, Donghai Zhai, Haonan Luo, Tianrui Li, Yang Li, “Multi-level Graph Neural Network with Sparsity Pooling for Recognizing Parkinson’s Disease”, IEEE Transactions on Neural Systems and Rehabilitation Engineering, DOI: 10.1109/TNSRE.2023.3330643, 2023.
Shide Zou, Jianfeng Lu, Haonan Luo, “AF-Net: All-scale Feature Fusion Network for Road Extraction from Remote Sensing Images”, Digital Image Computing: Techniques and Applications (DICTA), 1-8, 2021.
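罗皓楠, 李思佳, 杨燕, 杜圣东, 梁伟超, “Intelligent Dialogue Technology for Advancing Innovation in Engineering Education” (in Chinese), 信息技术时代, 3: 122-124, 2023.
白静, 罗皓楠, 秦飞巍, “Clustering of 3D CAD Models Oriented to Nonlinear Features” (in Chinese), 计算机辅助设计与图形学学报 (Journal of Computer-Aided Design & Computer Graphics), 27(8): 1578-1586, 2015.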
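Nanjing University of Science and Technology, PhD
Nanyang Technological University, jointly trained PhD student
Southwest Jiaotong University, School of Computing and Artificial Intelligence, Lecturer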