Curriculum Vitae

YuChong Wu

Contact

  • Phone: +86 15819499387
  • Email: wuyuchong0317@gmail.com
  • Address: Room 1307, Residence Hall 6, USTC High-tech Campus

Education

University of Science and Technology of China (USTC) B.S. in Computer Science and Technology, Hua Xia Talent Program (Honors) Sep 2022–Present

Weighted Average Score: 90.25/100 — Rank: 17/215


Honors & Awards

  • JAC NIO Scholarship (2023–2024) Awarded to top 5% of students for outstanding academic performance
  • Merit Student Scholarship (2022–2023) Granted for consistent academic excellence and active participation in extracurricular activities

Research & Project Experience

Undergraduate Graduation Thesis, USTC

LLM-based Agentic workflow for Time series anomaly detection Sep 2025–Dec 2025

  • Developed a human-inspired tool-augmented LLM framework for anomaly detection in time series data, integrating external tools for enhanced reasoning and analysis.
  • Achieved state-of-the-art performance, significantly outperforming strong baselines across major metrics.

Undergraduate Research Program, USTC

Exploring Reinforcement Learning Methods for Better Planning and Alignment in LLM Training Dec 2024–Sep 2025

  • Proposed a novel DPO variant leveraging final-layer embedding similarity to selectively identify informative tokens for optimization, mitigating likelihood displacement.

Project Experience

Kunpeng Ascend Special Training Camp, Huawei

July 2024

  • Participated in intensive training on deploying AI models for edge computing applications.
  • Designed and implemented a real-time object detection system using YOLOv5 on the MindSpore framework, deployed on Ascend GPU and Orange Pi.

Robogame Competition, USTC

Mar 2024–Sep 2024

  • Led development of circuit control system and vision system (focused on traditional CV algorithms) for autonomous robotic cars.

Technical Skills

  • Deep Learning: PyTorch, HuggingFace Transformers/Trl, Verl; supervised fine-tuning, RLHF and RLVR on LLMs
  • Agent Training: Agent-lightning framework; familiar with agentic reinforcement learning methodologies
  • Reinforcement Learning: Q-learning, policy gradient methods, actor-critic algorithms with focus on LLM reasoning and alignment
  • Systems: CPU/GPU architectures, CUDA programming (CUDA C++), parallel computation