None defined yet.
Evaluate model responses for clinical accuracy and relevance
Evaluate model responses for clinical tasks