I am a master's student in Software Engineering at Tsinghua University, expected to graduate in 2027. I also received my B.Eng. in Software Engineering from Tsinghua University.
My interests center on LLM-driven representation learning: how to use the semantic understanding and world knowledge of LLMs/VLMs to learn transferable, compressible user/item embeddings for retrieval and recommendation.
I am particularly interested in representation alignment under weak or implicit supervision, and in jointly training embeddings with contrastive objectives and SFT/generative objectives so that they retain semantic knowledge while absorbing collaborative signals from retrieval and recommendation. My earlier work also studies disentangled representation learning and cross-domain generalization for graph and molecular tasks.
Chenghao Zhang, et al.
Manuscript under review 2026 (third author)
Proposes UBioRec, a unified LLM framework that uses structured User Biographies to consolidate multi-scenario user understanding and recommendation into a single model, with adaptive token compression for efficient serving and a 6.5x QPS speedup.
Chenghao Zhang, et al.
Manuscript under review 2026 (third author)
Proposes UBioRec, a unified LLM framework that uses structured User Biographies to consolidate multi-scenario user understanding and recommendation into a single model, with adaptive token compression for efficient serving and a 6.5x QPS speedup.
Chenghao Zhang, et al.
Manuscript under review 2026
Introduces Compressed Source Gradient Replay (CSGR), an annotation-free retriever training framework that compresses source memories and replays gradients exactly, scaling NTP-supervised training from an 8-candidate single-GPU pool to a distributed pool of 256 while matching or improving CoIR retrieval quality.
Chenghao Zhang, et al.
Manuscript under review 2026
Introduces Compressed Source Gradient Replay (CSGR), an annotation-free retriever training framework that compresses source memories and replays gradients exactly, scaling NTP-supervised training from an 8-candidate single-GPU pool to a distributed pool of 256 while matching or improving CoIR retrieval quality.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
AAAI Conference on Artificial Intelligence (AAAI) 2026
Introduces DGPA to mitigate performance degradation in cross-domain few-shot graph-level anomaly detection, improving average AUROC by 5.72pp over the strongest baseline.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
AAAI Conference on Artificial Intelligence (AAAI) 2026
Introduces DGPA to mitigate performance degradation in cross-domain few-shot graph-level anomaly detection, improving average AUROC by 5.72pp over the strongest baseline.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
Frontiers of Computer Science 2025
Studies factor-wise disentangled contrastive learning for cross-domain few-shot molecular property prediction and improves average ROC-AUC by 1.53pp.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
Frontiers of Computer Science 2025
Studies factor-wise disentangled contrastive learning for cross-domain few-shot molecular property prediction and improves average ROC-AUC by 1.53pp.
F. Yang, H. Chen, Y. He, S. Zhao, Chenghao Zhang, K. Ni, G. Ding
AAAI Conference on Artificial Intelligence (AAAI) 2024
Uses geometry priors to improve cross-domain generalization for monocular 3D object detection. I contributed to implementing and integrating the attention module.
F. Yang, H. Chen, Y. He, S. Zhao, Chenghao Zhang, K. Ni, G. Ding
AAAI Conference on Artificial Intelligence (AAAI) 2024
Uses geometry priors to improve cross-domain generalization for monocular 3D object detection. I contributed to implementing and integrating the attention module.
Learning transferable and compressible user/item embeddings that bring model semantics into retrieval, recommendation, and user understanding tasks.
Supervising embedding learning without explicit relevance labels using LLM/VLM evidence reading, answer utility, and behavior-derived signals.
Jointly training embeddings with contrastive objectives and SFT/generative objectives so they retain world knowledge while fitting collaborative distributions.
Studying factor-wise disentangled representations, domain transfer, and long-tail or cold-start generalization in graph, molecular, and multi-domain tasks.