Blog
4 February, 2025
This paper addresses the growing importance of validating Large Language Models (LLMs) in the medical domain, focusing on prompt engineering. This work proposes a structured methodology using combinatorial testing to systematically evaluate LLM responses to medical queries. The approach generates test cases by combining sets of symptoms with various prompt components, utilizing pairwise combinatorial testing […]
Read more »