I am currently working at LiblibAI on AI agents. Previously, I worked at Ant Group on GUI agents. I received my Ph.D. from the Department of Computer Science and Technology, Tsinghua University (THU) in June 2025.
My research interests lie in LLM-based agents, post-training of Large Language Models (LLMs), and content security of (Multimodal)LLMs. During my Ph.D., I conducted extensive research on the vulnerabilities of the safety alignment, and intellectual property protection in LLMs. Currently, I focus on building practical AI agent systems.
Email: gongyc18@gmail.com
📝 Publications

Changlong Gao, Zhangxuan Gu, Yulin Liu, …, Yichen Gong, …, Weiqiang Wang
- UI-Venus-1.5 is the next-generation GUI agent built at Ant Group, advancing the UI-Venus series with improved multi-turn interaction and cross-platform generalization capabilities.

UI-Venus Technical Report: Building High-Performance UI Agents with RFT
Zhangxuan Gu, Zhengwen Zeng, …, Yichen Gong, …, Weiqiang Wang
- UI-Venus is a screenshot-based UI agent built on Qwen2.5-VL and fine-tuned via reinforcement fine-tuning (RFT), achieving state-of-the-art performance on UI grounding and navigation benchmarks.

Figstep: Jailbreaking large vision-language models via typographic visual prompts
Yichen Gong, Delong Ran, Jinyuan Liu, Conglei Wang, Tianshuo Cong, Anyu Wang, Sisi Duan, Xiaoyun Wang
- This work exposes safety vulnerabilities in Large Vision-Language Models (LVLMs) and proposes FigStep, a black-box jailbreak algorithm that converts harmful text into visual modality to bypass safe guardrail, achieving an 82.50\% attack success rate. Our analysis highlights deficiencies in visual embedding safety alignment and underscores the need for robust cross-modality safety solutions. Our findings underscore the urgent need for robust cross-modality safety alignment techniques to secure LVLMs against such attacks.

Safety Misalignment against Large Language Models
Yichen Gong, Delong Ran, Xinlei He, Tianshuo Cong, Anyu Wang, and Xiaoyun Wang
- This work introduces a framework to evaluate various safety misalignment attacks against LLMs, revealing vulnerabilities to attacks like supervised fine-tuning and our novel SSRA. We propose SSRD to effectively re-align models, preserving safety after fine-tuning.

JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models
Delong Ran, Jinyuan Liu, Yichen Gong, Jingyi Zheng, Xinlei He, Tianshuo Cong, Anyu Wang
- This work analyzes jailbreak evaluation methods for LLMs. We propose a taxonomy of jailbreak evaluators and introduce JailbreakEval, a toolkit to streamline and standardize jailbreak evaluation, advancing the efficiency and fairness of evaluation.

Tianshuo Cong, Delong Ran, Zesen Liu, Xinlei He, Jinyuan Liu, Yichen Gong, Qi Li, Anyu Wang, Xiaoyun Wang
- This work studies the robustness of IP protection techniques for Large Language Models (LLMs) under model merging. While watermark fails in merged models, fingerprint remains effective, highlighting the need to address IP protection in model merging scenarios.
🎖 Honors and Awards
- NDSS Fellowship, awarded by Internet Society, Spring 2025
- China Software Cup, Grand Prize, awarded by Ministry of Industry and Information Technology (China), Fall 2017
- Science and Technology Competition Scholarship (First-Class), awarded by Beihang University, Winter 2017
- Beihang Star of Innovation: Annual Student Innovator of the Year, awarded by Beihang University, Winter 2017
- MIIT Innovation Scholarship (Third-Class), awarded by Ministry of Industry and Information Technology (China), Spring 2018
💼 Work Experience
- 2026.03 - Present, AI Agent Engineer, LiblibAI
- 2025.07 - 2026.03, GUI Agent Engineer, Ant Group
📖 Educations
- 2018.09 - 2025.06, Ph.D. in Computer Science and Technology, Tsinghua University (THU)
- 2014.09 - 2018.06, B.E. in Software Engineering, Beihang University