Zhixin Cheng

I am currently a Specially Appointed Associate Professor in the School of Computer Science and Information Engineering, Hefei University of Technology (HFUT). I received my Ph.D. degree from the Department of Automation, University of Science and Technology of China (USTC), under the supervision of Prof. Tianzhu Zhang and Prof. Baoqun Yin. I spent my M.Eng. studies at the Advanced Technology Research Institute, USTC. Huazhong University of Science and Technology (HUST) conferred upon me a B.Eng. degree in 2020 from the School of Electrical and Electronic Engineering. My research interests include computer vision, multimodal learning, and 3D vision.

🔥 News

2026.05: 🎉🎉 “FS-I2P: A Hierarchical Focus–Sweep Registration Network with Dynamically Allocated Depth” was accepted by ICML 2026, and I attend the conference in Korea🍲.

2026.04: 🎉🎉 “VCR: Variance-Driven Channel Recalibration for Robust Low-Light Enhancement” was accepted by IEEE Transactions on Circuits and Systems for Video Technology.

2026.03: 🎉🎉 “GLASS: Geometry-aware Local Alignment and Structure Synchronization Network for 2D-3D Registration” was accepted by IEEE Transactions on Circuits and Systems for Video Technology.

2026.02: 🎉🎉 “Rethinking 2D-3D Registration: A Novel Network for High-Value Zone Selection and Representation Consistency Alignment” and “GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation” was accepted by CVPR 2026.

2026.01: 🎉🎉 “Adversarial Attacks Already Tell the Answer: Directional Bias-Guided Test-time Defense for Vision-Language Models” and “RayI2P: Learning Rays for Image-to-Point Cloud Registration” was accepted by ICLR 2026.

2025.11: 🎉🎉 “Adaptive Agent Selection and Interaction Network for Image-to-point cloud Registration” was accepted by AAAI 2026, and I attended the conference in Singapore🦁.

2025.10: 🎉🎉 “EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting” (Spotlight) and “BeyondMix: Leveraging Structural Priors and Long-Range Dependencies for Domain-Invariant LiDAR Segmentation” were accepted by NeurIPS 2025.

2025.06: 🎉🎉 “CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection” was accepted by ICCV 2025, and I attended the conference in Hawaii🥥.

2025.02: 🎉🎉 “Implicit Correspondence Learning for Image-to-Point Cloud Registration” was accepted by CVPR 2025 as a Highlight.

2024.12: 🎉🎉 “Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment” and “DiffCorr: Conditional Diffusion Model with Reliable PseudoLabel Guidance for Unsupervised Point Cloud Shape Correspondence” was accepted by AAAI 2025.

2024.03: 🏀🏀 Won the championship in the USTC “Revitalization Cup” Basketball Tournament 2024.

2023.06: 🎤🎤 Participated in the Graduation Gala 2023.

📝 Publications

FS-I2P: A Hierarchical Focus–Sweep Registration Network with Dynamically Allocated Depth

Zhixin Cheng, Yujia Chen, Xujing Tao, Bohao Liao, Xiaotian Yin, Baoqun Yin, Tianzhu Zhang*

This paper revisits image-to-point cloud registration by addressing two key challenges: scale ambiguity caused by viewpoint changes and repetitive textures, and attention drift during deep cross-modal interaction. To tackle these, FS-I2P introduces a hierarchical Focus–Sweep interaction module, where Focus captures global point-cloud scale cues to guide image feature adaptation. It further applies a Sweep operation to perform region-wise fine-grained interaction between image and point cloud features, enabling more reliable cross-modal correspondence refinement. By combining Focus–Sweep interaction with dynamic layer allocation, FS-I2P effectively reduces mismatches and improves registration robustness and accuracy.

Presented at International Conference on Machine Learning 2026🌸

VCR: Variance-Driven Channel Recalibration for Robust Low-Light Enhancement

Zhixin Cheng, Fangwen Zhang, Xiaotian Yin, Baoqun Yin, Haodian Wang*

This paper revisits low-light image enhancement by addressing two key challenges: channel-level inconsistency between luminance and chrominance, and misaligned color distributions that lead to visual artifacts. To tackle these, VCR introduces a Channel Adaptive Adjustment module that leverages variance-aware filtering to enhance channel-wise consistency and focus on informative regions. It further applies a Color Distribution Alignment module to regularize chrominance distributions toward well-exposed references. By combining channel recalibration with distribution alignment, VCR effectively reduces color distortion and improves perceptual quality.

Presented at IEEE Transactions on Circuits and Systems for Video Technology🌸

GLASS: Geometry-aware Local Alignment and Structure Synchronization Network for 2D-3D Registration

Zhixin Cheng, Jiacheng Deng, Xinjun Li, Bohao Liao, Li Liu, Xiaotian Yin, Baoqun Yin, Tianzhu Zhang*

This paper revisits image-to-point cloud registration by addressing two key challenges: low-texture or repetitive regions causing mismatches, and inherent modality differences between images and point clouds. To tackle these, GLASS introduces a Local Geometry Enhancement module that injects surface normals to enhance structural awareness. It further applies a Graph Distribution Consistency module to regularize the similarity distributions of matched keypoints. By combining local geometric enhancement with structural consistency, GLASS reduces ambiguity and incorrect matches.

Presented at IEEE Transactions on Circuits and Systems for Video Technology🌸

Rethinking 2D-3D Registration: A Novel Network for High-Value Zone Selection and Representation Consistency Alignment

Zhixin Cheng, Bohao Liao, Jiacheng Deng, Xiaotian Yin, Xinjun Li, Yujia Chen, Baoqun Yin, Tianzhu Zhang*

This paper rethinks image-to-point cloud registration by addressing two key challenges: many correspondences arise from non-overlapping or low-quality regions, and the same scene areas appear differently across modalities since images capture texture while point clouds represent geometry. The method therefore emphasizes selecting informative, matchable regions first, and then enforcing more consistent cross-modal region representations, reducing ambiguity and mismatches.

Presented at Computer Vision and Pattern Recognition 2026🌸

Adaptive Agent Selection and Interaction Network for Image-to-Point Cloud Registration

Zhixin Cheng, Xiaotian Yin, Jiacheng Deng, Bohao Liao, Yujia Chen, Xu Zhou, Wenfei Yang*, Baoqun Yin

This paper targets the challenges of image-to-point-cloud registration under noise, where false correspondences are common and cross-modal information is difficult to filter effectively. It proposes a framework composed of Iterative Agent Selection and Reliable Agent Interaction: phase maps enhance structural perception, and reinforcement learning selects more reliable agents to guide cross-modal interaction, thereby reducing mismatches and improving robustness.

Presented at Association for the Advancement of Artificial Intelligence 2026🌸

CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection

Zhixin Cheng, Jiacheng Deng, Xinjun Li, Xiaotian Yin, Bohao Liao, Baoqun Yin, Wenfei Yang*, Tianzhu Zhang

This paper addresses detection-free image-to-point cloud registration, where cross-modal channel mismatches and redundant top-k correspondences reduce matching quality. CA-I2P uses a Channel Adaptive Adjustment module to align channels across modalities and a Global Optimal Selection module to produce globally consistent matches for robust registration. We attend the conference and discussed our ideas with Google AI scientist Martin Sundermeyer.

Presented at International Conference on Computer Vision 2025🌸

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment

Zhixin Cheng, Jiacheng Deng, Xinjun Li, Baoqun Yin, Tianzhu Zhang*

This paper proposes B2-3Dnet for detection-free image-to-point cloud registration, aiming to reduce distraction from noisy image patches and narrow the cross-modal domain gap. It introduces an uncertainty-aware hierarchical matching module that estimates patch reliability and performs multi-scale coarse-to-fine interactions, and an adversarial modal alignment module that aligns image and point-cloud features using a gradient reversal strategy and a domain classifier.

Presented at Association for the Advancement of Artificial Intelligence 2025🌸

GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation

Xujing Tao, Chuxin Wang, Yubo Ai, Zhixin Cheng, Zhuoyuan Li, Liangsheng Liu, Yujia Chen, Xinjun Li, Qiao Li, Wenfei Yang*, Tianzhu Zhang

RayI2P: Learning Rays for Image-to-Point Cloud Registration

Xinjun Li, Wenfei Yang*, Zhixin Cheng, Jiacheng Deng, Fei Wang, Chen Qian, Tianzhu Zhang

Adversarial Attacks Already Tell the Answer: Directional Bias-Guided Test-time Defense for Vision-Language Models

Liangsheng Liu, Si Chen, Jiamin Wu, Weiwei Feng, Zhixin Cheng, Xiaotian Yin, Wenfei Yang*, Tianzhu Zhang

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting (Spotlight)

Bohao Liao, Wei Zhai*, Zengyu Wan, Zhixin Cheng, Wenfei Yang, Yang Cao, Tianzhu Zhang, Zhengjun Zha

BeyondMix: Leveraging Structural Priors and Long-Range Dependencies for Domain-Invariant LiDAR Segmentation

Yujia Chen, Rui Sun, Wangkai Li, Huayu Mai, Si Chen, Zhuoyuan Li, Zhixin Cheng, Tianzhu Zhang*

Implicit Correspondence Learning for Image-to-Point Cloud Registration (Highlight)

Xinjun Li, Wenfei Yang*, Jiacheng Deng, Zhixin Cheng, Xu Zhou, Tianzhu Zhang

DiffCorr: Conditional Diffusion Model with Reliable PseudoLabel Guidance for Unsupervised Point Cloud Shape Correspondence

Jiacheng Deng, Jiahao Lu, Zhixin Cheng, Wenfei Yang*

🏅 Honors and Awards

Outstanding Graduate of Anhui Province Ordinary Higher Education Institutions, Class of 2026

Outstanding Graduate of the Class of 2026, USTC

First‑Class Scholarship, USTC Graduate School

Deep Space Exploration Scholarship

Excellent Minister, Graduate Student Union, Advanced Technology Research Institute, USTC

USTC “Revitalization Cup” Basketball Champion (2024) and Runner‑up (2023)

Outstanding Graduate, Huazhong University of Science and Technology

Level 10 Certification in Erhu Performance🎶

📖 Education

2022.09 – 2026.06: Ph.D. in Automation, USTC, Hefei, China

2020.09 – 2022.08: M.Eng., Advanced Technology Research Institute, USTC, Hefei, China

2016.09 – 2020.06: B.Eng., School of Electrical and Electronic Engineering, Huazhong University of Science and Technology, Wuhan, China

2013.09 – 2016.06: Senior High School, Hefei No.8 High School, Hefei, China

2010.09 – 2013.06: Junior High School, Hefei No.50 Middle School, Hefei, China

💻 Internships

2025.06 – 2025.10: Research Intern, COG1, Spark Large Model Research Institute, iFLYTEK, Hefei, China

2021.09 – 2022.08: Research Intern, Brain‑Inspired Intelligence Platform, Hefei Comprehensive National Science Center, Hefei, China

2020.09 – 2021.03: AI Algorithm Intern, Jiyuan–Huawei Ascend Joint Laboratory, NARI Jiyuan Electric Grid Technology Co., Ltd., Hefei, China

2018.07 – 2018.09: Visiting Student, The University of Manchester, Manchester, UK🎡

🛠 Skills

Programming: Python, PyTorch, CUDA

Research: Deep learning, Multimodal fusion, 3D vision task

Service: Reviewer for ICML, CVPR, ICCV, AAAI, NIPS, ICLR, ACM MM, TCSVT, TMM

Language: CET‑6, good English writing and communication

📬 Contact

Email: chengzhixin@mail.ustc.edu.cn