What Gets Evaluated
InfoBay evaluates responses for factual correctness, citation quality, domain fit, harmful bias, refusal behavior, and stability across prompt variations.
- Hallucination and unsupported claim detection
- Domain-specific accuracy scoring
- Bias, safety, and refusal calibration