Curriculum Vitae

YuChong Wu

Contact

Phone: +86 15819499387
Email: wuyuchong0317@gmail.com
Address: Room 1307, Residence Hall 6, USTC High-tech Campus

Education

University of Science and Technology of China (USTC) B.S. in Computer Science and Technology, Hua Xia Talent Program (Honors) Sep 2022–Present

Weighted Average Score: 90.25/100 — Rank: 17/215

Honors & Awards

JAC NIO Scholarship (2023–2024) Awarded to top 5% of students for outstanding academic performance
Merit Student Scholarship (2022–2023) Granted for consistent academic excellence and active participation in extracurricular activities

Research & Project Experience

Undergraduate Graduation Thesis, USTC

LLM-based Agentic workflow for Time series anomaly detection Sep 2025–Dec 2025

Developed a human-inspired tool-augmented LLM framework for anomaly detection in time series data, integrating external tools for enhanced reasoning and analysis.
Achieved state-of-the-art performance, significantly outperforming strong baselines across major metrics.

Undergraduate Research Program, USTC

Exploring Reinforcement Learning Methods for Better Planning and Alignment in LLM Training Dec 2024–Sep 2025

Proposed a novel DPO variant leveraging final-layer embedding similarity to selectively identify informative tokens for optimization, mitigating likelihood displacement.

Project Experience

Kunpeng Ascend Special Training Camp, Huawei

July 2024

Participated in intensive training on deploying AI models for edge computing applications.
Designed and implemented a real-time object detection system using YOLOv5 on the MindSpore framework, deployed on Ascend GPU and Orange Pi.

Robogame Competition, USTC

Mar 2024–Sep 2024

Led development of circuit control system and vision system (focused on traditional CV algorithms) for autonomous robotic cars.

Technical Skills

Deep Learning: PyTorch, HuggingFace Transformers/Trl, Verl; supervised fine-tuning, RLHF and RLVR on LLMs
Agent Training: Agent-lightning framework; familiar with agentic reinforcement learning methodologies
Reinforcement Learning: Q-learning, policy gradient methods, actor-critic algorithms with focus on LLM reasoning and alignment
Systems: CPU/GPU architectures, CUDA programming (CUDA C++), parallel computation

Yuchong Wu