About me

Email GitHub Google Scholar

自我介绍 Biography

我叫翟王宇轩,是北京交通大学计算机科学与技术学院人工智能专业的本科生。目前,我的研究重心为音乐生成方向。我对 语音合成 (Text-To-Speech, TTS) 方向也有些许涉猎。

My name is Wangyuxuan Zhai, an undergraduate student majoring in Artificial Intelligence at the School of Computer Science and Technology, Beijing Jiaotong University. Currently, my research focus lies in music generation. I also have some exposure to Text-To-Speech (TTS).

研究兴趣 Research Interests

  • 目前,我的主要研究领域是 音乐生成和语音合成🎶;
  • Currently, my main research area is Music Generation and Text To Speech (TTS)🎶.

杂项 Misc

  • 除了研究之外,我还是一名音乐爱好者。我会一点手风琴🪗,中央音乐学院业余八级水平。
    我也会制作电子音乐🎵,自娱自乐足矣。
  • 我最喜欢的游戏类型是音乐游戏,我曾热衷于街机音游《舞萌 DX》(国服 w44 水平,打着玩的)。
  • Beyond my research, I am also a music lover. I play a little accordion 🪗 — at the amateur Grade 8 level of the Central Conservatory of Music (CCoM). I also produce electronic music 🎵, just for my own enjoyment.
  • My favorite game genre is rhythm games. I used to be obsessed with the rhythm game Maimai DX.
「不诱于誉,不恐于诽,率道而行,端然正己。」

CV

Education

  1. 北京交通大学 · Beijing Jiaotong University

    2023.9 — 2027.07

Experiences

  1. OpenBMB
    面壁智能

    2026.05 - Present

    Audio Large Model Algorithm Intern 音频大模型算法实习生

  2. Human-Computer Speech Interaction Lab at Tsinghua University (THU HCSI)
    清华大学深圳国际研究生院人机语音交互实验室

    2026.04 - Present

    Advisor: Zhiyong Wu

    Music Generation & Text-To-Speech 音乐生成 & 语音合成

  3. Institute for AI Industry Research, Tsinghua University (THU AIR)
    清华大学智能产业研究院

    2025.09 - 2026.04

    LLM + Reinforcement Learning 大模型 + 强化学习

  4. BJTU NLP Group
    北交自然语言处理团队

    2024.09 - 2025.05

    Advisor: kaiyu Huang

    LLM + Geography 大模型 + 地理信息

Publications

  1. Please refer to my Google Scholar.

    请参考我的 谷歌学术.

  2. X. Wang*, Z. Kang*, W. Zhai*, X. Lou, Y. Lai, Z. Wang, Y. Wang, K. Huang, Y. Wang, P. Li, Y. Liu. (2025). MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models. EMNLP 2025 Main Conference. [Paper]

  3. L. Yuan*, F. Mo*, K. Huang, W. Wang, W. Zhai, X. Zhu, Y. Li, J. Xu, J.-Y. Nie. (2025). OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence. arXiv Preprint.[Paper]

AWARDS

  1. 北京交通大学 2023-2024 年度 国家奖学金

    National Scholarship, Beijing Jiaotong University, 2023-2024 Academic Year

  2. 北京交通大学 2024-2025 年度 国家奖学金

    National Scholarship, Beijing Jiaotong University, 2024-2025 Academic Year

Others

If you'd like to engage in a discussion or collaborate, feel free to contact me via email at any time! 🥰🥰🥰

✉️ zhaiwangyuxuan [at] bjtu.edu.cn