我是清华大学软件学院软件工程硕士研究生,预计 2027 年毕业,本科同样就读于清华大学软件学院。
我的兴趣集中在大模型驱动的 representation learning: 如何利用 LLM/VLM 的语义理解与世界知识,学习可迁移、可压缩、可用于检索与推荐的 user/item embeddings。
我关注弱监督/无显式标注场景下的表征对齐,以及对比学习与 SFT/生成式目标的联合训练, 使 embedding 同时保留语义知识并吸收推荐/检索中的协同信号。 早期工作也涉及图与分子任务中的解纠缠表征学习和跨域泛化。
方向:面向短视频推荐的多类型 item 表征学习,覆盖商品、短视频、直播间等业务对象。
Chenghao Zhang, et al.
Manuscript under review 2026 (third author)
Proposes UBioRec, a unified LLM framework that uses structured User Biographies to consolidate multi-scenario user understanding and recommendation into a single model, with adaptive token compression for efficient serving and a 6.5x QPS speedup.
Chenghao Zhang, et al.
Manuscript under review 2026 (third author)
Proposes UBioRec, a unified LLM framework that uses structured User Biographies to consolidate multi-scenario user understanding and recommendation into a single model, with adaptive token compression for efficient serving and a 6.5x QPS speedup.
Chenghao Zhang, et al.
Manuscript under review 2026
Introduces Compressed Source Gradient Replay (CSGR), an annotation-free retriever training framework that compresses source memories and replays gradients exactly, scaling NTP-supervised training from an 8-candidate single-GPU pool to a distributed pool of 256 while matching or improving CoIR retrieval quality.
Chenghao Zhang, et al.
Manuscript under review 2026
Introduces Compressed Source Gradient Replay (CSGR), an annotation-free retriever training framework that compresses source memories and replays gradients exactly, scaling NTP-supervised training from an 8-candidate single-GPU pool to a distributed pool of 256 while matching or improving CoIR retrieval quality.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
AAAI Conference on Artificial Intelligence (AAAI) 2026
Introduces DGPA to mitigate performance degradation in cross-domain few-shot graph-level anomaly detection, improving average AUROC by 5.72pp over the strongest baseline.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
AAAI Conference on Artificial Intelligence (AAAI) 2026
Introduces DGPA to mitigate performance degradation in cross-domain few-shot graph-level anomaly detection, improving average AUROC by 5.72pp over the strongest baseline.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
Frontiers of Computer Science 2025
Studies factor-wise disentangled contrastive learning for cross-domain few-shot molecular property prediction and improves average ROC-AUC by 1.53pp.
Z. Ni, Chenghao Zhang, H. Wan, X. Zhao
Frontiers of Computer Science 2025
Studies factor-wise disentangled contrastive learning for cross-domain few-shot molecular property prediction and improves average ROC-AUC by 1.53pp.
F. Yang, H. Chen, Y. He, S. Zhao, Chenghao Zhang, K. Ni, G. Ding
AAAI Conference on Artificial Intelligence (AAAI) 2024
Uses geometry priors to improve cross-domain generalization for monocular 3D object detection. I contributed to implementing and integrating the attention module.
F. Yang, H. Chen, Y. He, S. Zhao, Chenghao Zhang, K. Ni, G. Ding
AAAI Conference on Artificial Intelligence (AAAI) 2024
Uses geometry priors to improve cross-domain generalization for monocular 3D object detection. I contributed to implementing and integrating the attention module.
学习可迁移、可压缩的 user/item embeddings,使大模型语义知识能进入检索、推荐和用户理解任务。
在缺少显式 relevance labels 的场景下,利用 LLM/VLM 证据阅读、问答效用和行为派生信号监督 embedding 学习。
通过对比学习与 SFT/生成式目标联合训练,使 embedding 既保留世界知识,也贴近推荐/检索中的协同分布。
研究图、分子和多域任务中的因素级解纠缠表示、跨域迁移和长尾/冷启动泛化能力。