Output
Open-Source & Resources
- RepoAwesome Any-to-Any Multimodal Generation — curated resources on any-to-any multimodal generation.
- RepoAwesome Scene Graph Generation & Application — papers and resources on scene graphs.
- RepoAwesome Multimodal Chain-of-Thought — companion to the MM-CoT survey.
- RepoAwesome Audio-Visual Intelligence — companion to the AVI survey.
- RepoNExT-GPT — the first any-to-any multimodal large language model.
- RepoVitron — a Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing.
Invited Talks
- 2025/03Towards Semantic Equivalence of Tokenization in Multimodal LLM. NICE.
- 2024/01The Path to AGI: Achieving Modality Unification with NExT-GPT. Qingyuan Talk.
- 2023/12NExT-GPT: Any-to-Any Multimodal LLM. AI New Youth.
- 2022/12Deep Learning based Natural Language Processing: A Survey and Outlook. Jianghan University, China.
- 2021/10Comparison of Aspect-based Sentiment Analysis based on span and transition models. Wuhan University, China.
- 2021/05Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extraction with Rich Syntactic Knowledge. CIPS Youth Working Committee.
Teaching Assistant
- 2024-2025 SpringMultimedia Analysis. School of Computing, NUS.
- 2024-2025 Autumn & SpringBig Data Systems for Data Science. School of Computing, NUS.
- 2023-2024 Autumn & SpringBig Data Systems for Data Science. School of Computing, NUS.
- 2021 Spring & AutumnPublic Opinion Analysis. School of CSE, WHU.
Mentoring