Retrieval-Augmented Generation (RAG) technology enhances natural language generation by incorporating information retrieved from a large database or documents, thus improving the relevance and accuracy of the generated content. Our research focuses on: cross-modal information retrieval, knowledge selection and knowledge-enhanced dialogue generation.