I am an undergraduate student at Shanghai Jiao Tong University majoring in Artificial Intelligence, where I enrolled in Fall 2023. My research interests center on large language models and computer vision. I am currently focusing on multimodal understanding. I have gained practical experience through coursework and research projects. I am eager to further explore these areas and welcome opportunities to exchange ideas and collaborate with peers who share similar interests.

🔬 My Research

  • Research Interests: Multi-modal understanding, Diffusion Models, CV, AI for games
  • Current Focus: Multi-model understanding

🎖 Honors and Awards

  • 2023.12 致远荣誉奖学金
  • 2024.11 本科生C等优秀奖学金
  • 2024.12 致远荣誉奖学金

📖 Educations

2023.09 – now
人工智能卓越人才试点班 · 致远工科荣誉计划
2020.09 – 2023.06

💻 Internships

2024.07 – 2024.09
Summer Research Internship
SJTU – Artificial Intelligence Institute, DeepVision Lab
  • Explored Computer Vision fundamentals.
  • Learned to read research papers and reproduced basic CV algorithms.
2025.03 – Present
Research Internship
SJTU – Artificial Intelligence Institute, DeepVision Lab
  • Conducting research on MultiModel Large Language Model.

📂 Projects

GUI-Project

Data preparation for training a GUI recognition model for future GUI Agent.

Python Public

ViT-on-Image-Classification

ViT on image classification, esp. small-scale datasets (CIFAR-10).

Jupyter Notebook Public

Regression-and-Classification-Prediction-of-Travellers

Prediction of travellers based on historical travelling datasets (ml).

Jupyter Notebook Public

Voice-Based-Car-Controll

A voice-controlled car powered by an speech recognition model trained on Edge Impulse and an ESP32 microcontroller.

C Public