Study

Title
Associate Professor, Doctoral Supervisor and Postgraduate Supervisor, School of Future Technology
huaidongz@scut.edu.cn
Honor
Member of the National Key Laboratory of Subtropical Architecture and Urban Science, and Member of the Visualization and Cognitive Computing Committee of the Chinese Society for Graphics and Image Technology
MEng: 1) Electronic Information
MS: 1) Intelligent Science and Technology
Ph.D.: 1) Intelligent Science and Technology
Zhang Huaidong is an associate professor at the School of Future Technology, South China University of Technology. He is a member of the National Key Laboratory of Subtropical Architecture and Urban Science and a member of the Visualization and Cognitive Computing Special Committee of the China Graphics Society. His research focuses on visual perception and decision-making and on the continuous optimization of visual models. He has led a Young Scientists project of the National Natural Science Foundation of China and a Guangdong Provincial Natural Science General Project, and he was selected for the "Set Sail" project for Young Doctors in Guangzhou City. The products he has developed implement high-precision intelligent cognitive algorithms for tasks such as human-vehicle detection, abnormal event detection, and traffic flow statistics. This work won the second prize of the Science and Technology Progress Award issued by the Chinese Society of Image and Graphics in 2022. He has achieved internationally cutting-edge research results in visual perception and decision-making and in the continual learning and lightweighting of visual models. He has published more than forty papers in the past five years, with representative work appearing in international conferences and journals such as CVPR, ECCV, AAAI, IJCAI, TPAMI, TIP, TMM, TVCG, and TCSVT.
Computer vision, embodied intelligence, continual learning, representation learning
Xie, X., Huang, Z., Xu, W., Xiao, P., Xu, X., & Zhang, H. (2025). Let's Chorus: Partner-aware Hybrid Song-Driven 3D Head Animation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Liu, S., Lv, J., Kang, J., Zhang, H., Liang, Z., & He, S. (2025). MODfinity: Unsupervised Domain Adaptation with Multimodal Information Flow Intertwining. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Zheng, Y., Jiang, Z., He, S., Sun, Y., Dong, J., Zhang, H., & Du, Y. (2025). NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
Li, X., Zhan, J., He, S., Xu, Y., Dong, J., Zhang, H., & Du, Y. (2025). PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium. Proceedings of the AAAI Conference on Artificial Intelligence.
Zhong, Y., Yan, Z., Xie, Y., Wu, S., Zhang, H., Shu, L., & Zhou, P. (2025). MSSDA: multi-sub-source adaptation for diabetic foot neuropathy recognition. Proceedings of the AAAI Conference on Artificial Intelligence.
Zhang, H., Xie, Y., Zhang, H., Xu, C., Luo, X., Chen, D., Xu, X., Zhang, H., Heng, P. A., & He, S. (2025). Unambiguous granularity distillation for asymmetric image retrieval. Neural Networks, 107303.
Zhou, Y., Ye, D., Zhang, H., Xu, X., Sun, H., Xu, Y., Liu, X., & Zhou, Y. (2025). Recurrent Diffusion for 3D Point Cloud Generation from a Single Image. IEEE Transactions on Image Processing.
Liu, B., Zheng, C., Xu, X., Xu, C., Zhang, H., & He, S. (2025). Rotation-Adaptive Point Cloud Domain Generalization Via Intricate Orientation Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence.
Huang, Z., Xu, X., Xu, C., Zhang, H., Zheng, C., Qin, J., & He, S. (2024). Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation. European Conference on Computer Vision.
Xiao, P., Xie, Y., Xu, X., Chen, W., & Zhang, H. (2024). Multi-person Pose Forecasting with Individual Interaction Perceptron and Prior Learning. European Conference on Computer Vision, 402–419.
Yang, Z., Jiang, Z., Li, X., Zhou, H., Dong, J., Zhang, H., & Du, Y. (2024). $\mathrm{D}^4$-VTON: Dynamic Semantics Disentangling for Differential Diffusion Based Virtual Try-On. European Conference on Computer Vision, 36–52.
Jiang, X., Zheng, C., Xu, X., Liu, B., Zheng, W., Zhang, H., & He, S. (2024). VrdONE: One-stage Video Visual Relation Detection. Proceedings of the 32nd ACM International Conference on Multimedia, 1437–1446.
Yu, Y., Liu, B., Zheng, C., Xu, X., Zhang, H., & He, S. (2024). Beyond textual constraints: Learning novel diffusion conditions with fewer examples. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7109–7118.
Zhang, H., Huang, R., Xie, Y., & Zhang, H. (2024). Mask4Align: Aligned Entity Prompting with Color Masks for Multi-Entity Localization Problems. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 13373–13383.
Xie, Y., Lin, Y., Cai, W., Xu, X., Zhang, H., Du, Y., & He, S. (2024). D3still: Decoupled differential distillation for asymmetric image retrieval. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 17181–17190.
Zhang, W., Xu, C., Xu, X., Zhang, H., Zhao, R., & Qin, J. (2024). Exploiting Multi-View Clues for Context-Aware Unified Lumbar MRI Identification and Diagnosis. 2024 International Joint Conference on Neural Networks (IJCNN), 1–9.
Cai, W., Xu, X., Xu, J., Zhang, H., Yang, H., Zhang, K., & He, S. (2024). Hierarchical damage correlations for old photo restoration. Information Fusion, 107, 102340.
Cai, W., Zhang, H., Xu, X., Xu, C., Zhang, K., & He, S. (2024). Delving into Important Samples of Semi-Supervised Old Photo Restoration: A New Dataset and Method. IEEE Transactions on Multimedia, 26, 9866–9879.
Xu, C., Xu, Y., Zhang, H., Xu, X., & He, S. (2024). DreamAnime: Learning Style-Identity Textual Disentanglement for Anime and Beyond. IEEE Transactions on Visualization and Computer Graphics, 1–12.
Yang, H., Xu, X., Xu, C., Zhang, H., Qin, J., Wang, Y., Heng, P.-A., & He, S. (2024). $\mathrm{G}^2$Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors. IEEE Transactions on Information Forensics and Security, 19, 8773–8785.
Zheng, C., Liu, B., Xu, X., Zhang, H., & He, S. (2024). Learning an interpretable stylized subspace for 3D-aware animatable artforms. IEEE Transactions on Visualization and Computer Graphics, 1–13.
Zhou, Y., Qian, J., Zhang, H., Xu, X., Sun, H., Zeng, F., & Zhou, Y. (2024). Adaptive multi-text union for stable text-to-image synthesis learning. Pattern Recognition, 152, 110438.
Zhou, Y., Sun, H., Zhang, H., Xu, X., Ye, D., Zhou, Y., Liu, X., et al. (2024). GaFL: Geometric-aware Feature Learning for universal 3D models recognition. Pattern Recognition, 149, 110214.
Xu, C., Xu, X., Zhao, N., Cai, W., Zhang, H., Li, C., & Liu, X. (2023). Panel-page-aware comic genre understanding. IEEE Transactions on Image Processing, 32, 2636–2648.
Cai, W., Zhang, H., Xu, X., He, S., Zhang, K., & Qin, J. (2023). Contextual-assisted scratched photo restoration. IEEE Transactions on Circuits and Systems for Video Technology, 33(10), 5458–5469.
Zhou, Y., Dang, Z., Zhang, H., Xu, X., Qin, J., Li, W., Zeng, F., & Liu, X. (2023). EFSCNN: Encoded feature sphere convolution neural network for fast non-rigid 3D models classification and retrieval. Computer Vision and Image Understanding, 233, 103724.
Huang, X., Zhou, N., Huang, J., Zhang, H., Pedrycz, W., & Choi, K.-S. (2023). Center transfer for supervised domain adaptation. Applied Intelligence, 53(15), 18277–18293.
Xie, Y., Zhang, H., Xu, X., Zhu, J., & He, S. (2023). Towards a smaller student: Capacity dynamic distillation for efficient image retrieval. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16006–16015.
Zheng, C., Liu, B., Zhang, H., Xu, X., & He, S. (2023). Where is my spot? Few-shot image generation via latent subspace optimization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 3272–3281.
Xiao, W., Xu, C., Zhang, H., & Xu, X. (2022). Spatial-Aware GAN for Instance-Guided Cross-Spectral Face Hallucination. CAAI International Conference on Artificial Intelligence, 93–105.