An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models

Publication
arXiv preprint arXiv:2309.02077