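Lihua Zhang (张立华)

Title/Position: Distinguished Professor, Vice Dean

Office: Building D2, Phase II, Wangu Science and Technology Park (湾谷科技园)

Phone:

Email: lihuazhang@fudan.edu.cn

Research interests: interdisciplinary innovation across artificial intelligence, the metaverse, and robotics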

Selected representative publications:

[1] Han, M., Qu, L., Yang, D., Zhang, X., Wang, X., & Zhang, L.* (2025). MSCPT: Few-shot whole slide image classification with multi-scale and context-focused prompt tuning. IEEE Transactions on Medical Imaging.

[2] Yang, D., Xiao, D., Wei, J., Li, M., Chen, Z., Li, K., & Zhang, L.* (2025). Improving factuality in large language models via decoding-time hallucinatory and truthful comparators. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 39, No. 24, pp. 25606-25614).

[3] Yang, D., Yang, K., Kuang, H., Chen, Z., Wang, Y., & Zhang, L.* (2024). Towards Context-Aware Emotion Recognition Debiasing from a Causal Demystification Perspective via De-confounded Training. IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Yang, D., Li, M., Qu, L., Yang, K., Zhai, P., Wang, S., & Zhang, L.* (2024). Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations. IEEE Transactions on Circuits and Systems for Video Technology.

[5] Yang, D., Wei, J., Xiao, D., Wang, S., Wu, T., Li, G., ... & Zhang, L.* (2024). PediatricsGPT: Large language models as Chinese medical assistants for pediatric applications. Advances in Neural Information Processing Systems, 37, 138632-138662.

[6] Zhao, X., Zhang, X., Chen, G., Wei, X., Li, Z., Xu, Z., Wang, Y., Zhu, J., Zhai, P., & Zhang, L.* (2025). DeNoising Transformer for BEV 3D Object Detection via Multiview Multiscale Cross-Attention. IEEE Transactions on Intelligent Transportation Systems, May 2025, doi: 10.1109/TITS.2025.3565310

[7] Zhao, X., Chen, B., Sun, M., Yang, D., Wang, Y., Zhang, X., Li, M., Kou, D., Wei, X., & Zhang, L.* HybridOcc: NeRF Enhanced Transformer-Based Multi-Camera 3D Occupancy Prediction. IEEE Robotics and Automation Letters, vol. 9, no. 9, pp. 7867-7874, Sep. 2024, doi: 10.1109/LRA.2024.3416798

[8] Zhao, X., Zhang, X., Yang, D., Sun, M., Li, M., Wang, S., & Zhang, L.* (2024). MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation. In Proceedings of the 32nd ACM International Conference on Multimedia (pp. 2652-2661).

[9] Liu, L., Yang, J., Lin, Y., Zhang, P., & Zhang, L.* (2024). 3D human pose estimation with single image and inertial measurement unit (IMU) sequence. Pattern Recognition, Volume 149, 110175, ISSN 0031-3203.

[10] Yang, D., Liu, Y., Huang, C., Li, M., Zhao, X., Wang, Y., ... & Zhang, L.* (2023). Target and source modality co-reinforcement for emotion understanding from asynchronous multimodal sequences. Knowledge-Based Systems, 265, 110370.

[11] Zhai, P., Zhang, L.*, Dong, Z., et al. Machine intuition (in Chinese). Sci Sin Inform, 2020, 50: 1475-1500, doi: 10.1360/SSI-2020-0075.

[12] Yang, D., Li, M., Xiao, D., Liu, Y., Yang, K., Chen, Z., ... & Zhang, L.* (2024). Towards multimodal sentiment analysis debiasing via bias purification. In European Conference on Computer Vision. Cham: Springer Nature Switzerland.

[13] Yang, D., Yang, K., Li, M., Wang, S., Wang, S., & Zhang, L.* (2024). Robust emotion recognition in context debiasing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 12447-12457).

[14] Wang, S., Wang, S., Yang, D., Li, M., Kuang, H., Zhao, X., ... & Zhang, L.* (2024). CPR-Coach: Recognizing composite error actions based on single-class training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 18782-18792).

[15] Li, M., Yang, D., Lei, Y., Wang, S., Wang, S., Su, L., ... & Zhang, L.* (2024). A Unified Self-Distillation Framework for Multimodal Sentiment Analysis with Uncertain Missing Modalities. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, No. 9, pp. 10074-10082).

[16] Yang, D., Yang, K., Wang, Y., Liu, J., Xu, Z., Yin, R., ... & Zhang, L.* (2024). How2comm: Communication-efficient and collaboration-pragmatic multi-agent perception. Advances in Neural Information Processing Systems, 36.

[17] Hou, T., Tu, J., Gao, X., Dong, Z., Zhai, P., & Zhang, L.* (2024). Multi-Task Learning of Active Fault-Tolerant Controller for Leg Failures in Quadruped Robots. IEEE International Conference on Robotics and Automation.

[18] Yang, D., Chen, Z., Wang, Y., Wang, S., Li, M., Liu, S., ... & Zhang, L.* (2023). Context de-confounded emotion recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 19005-19015).

[19] Zhang, X., Liu, Y., Ali, S., Zhao, X., Sun, M., ... & Zhang, L.* (2023). Anatomical-Aware Point-Voxel Network for Couinaud Segmentation in Liver CT. In: Greenspan, H., et al. Medical Image Computing and Computer Assisted Intervention. Lecture Notes in Computer Science, vol 14222. Springer, Cham.

[20] Zhai, P., Luo, J., Dong, Z., Zhang, L.*, Wang, S., & Yang, D. Robust Adversarial Reinforcement Learning with Dissipation Inequation Constraint. 36th AAAI Conference on Artificial Intelligence (AAAI 2022), Virtual Conference, 2022-2-2.

[21] Wang, S., Wang, S., Jiao, B., Yang, D., Su, L., Zhai, P., ... & Zhang, L.* (2022). CA-SpaceNet: Counterfactual Analysis for 6D Pose Estimation in Space. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (pp. 10627-10634). IEEE.

[22] P. Zhai, T. Hou, X. Ji, Z. Dong and L. Zhang*, Robust Adaptive Ensemble Adversary Reinforcement Learning, in IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 12562-12568, Oct. 2022, doi: 10.1109/LRA.2022.3220531

[23] Wang, S., Yang, D., Zhai, P., Chen, C., & Zhang, L.* (2021). TSA-Net: Tube Self-Attention Network for Action Quality Assessment. In Proceedings of ACM Conference 2021.

[24] Shi, X., Zuo, Y., Zhai, P., Shen, J., Yang, Y., Gao, Z., ... Zhang, L., ... & Peng, H. (2021). Large-area display textiles integrated with functional systems. Nature, 591(7849), 240-245.

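Selected granted invention patents:

1. Method, apparatus, device, and storage medium for organ image segmentation, Patent No.: ZL 202210577649.4, Grant date: 2025-03-11.

2. Weakly supervised learning-based liver vessel segmentation method, apparatus, device, and medium, Patent No.: ZL 202210349732.6, Grant date: 2025-03-07.

3. Augmented reality-based pediatric tracheal intubation training system and method, Patent No.: ZL 202211511923.4, Grant date: 2025-01-03.

4. Context-aware multimodal emotion recognition method and system, Patent No.: ZL 202111080047.X, Grant date: 2024-09-27.

5. Wrapping device and wrapping method for cable intermediate joints, Patent No.: ZL 202111497357.1, Grant date: 2024-08-20.

6. Medical human-computer interaction assistance system and computer-readable storage medium containing its program, Patent No.: ZL 202010691420.4, Grant date: 2024-08-02.

7. System for evaluating the operational compliance of medical behavior based on medical behavior data, Patent No.: ZL 202010711862.0, Grant date: 2024-07-02.

8. Statistical learning-based multimodal analysis and prediction system for patient behavior, Patent No.: ZL 202010740444.4, Grant date: 2023-08-08.

9. Method and system for annotating multimodal medical behavior data, Patent No.: ZL 202010713382.8, Grant date: 2023113.

10. Fine-grained medical behavior recognition apparatus and computer-readable storage medium, Patent No.: ZL 202010732191.6, Grant date: 2022-07-22.

11. Multimodal data annotation apparatus and computer-readable storage medium containing a program, Patent No.: ZL 202010739336.5, Grant date: 2022-05-13.

12. Deep learning-based multimodal perception and analysis system for patient behavior, Patent No.: ZL 202010740442.5, Grant date: 2022-03-29.

13. System for analyzing the correlation between behavioral data and medical outcomes, Patent No.: ZL 202010742790.6, Grant date: 2022-03-15.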
