Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator

Publication
arXiv preprint arXiv:2403.08495