Paper-Conference

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
EvolveBench: A Comprehensive Benchmark for Assessing Temporal Awareness in LLMs on Evolving Knowledge
Fine-tuning with Reserved Majority for Noise Reduction
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents
Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
HSDreport: Heart Sound Diagnosis with Echocardiography Reports
M $^3$ AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset