Recent Publications (2023 Onwards)

Position Paper: Towards Implicit Prompt For Text-To-Image Models
M $^3$ AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Self-Improvement of Non-autoregressive Model via Sequence-Level Distillation
Redundancy-Adaptive Multimodal Learning for Imperfect Data
A Comparative Study of Pre-trained Audio and Speech Models for Heart Sound Detection