Introduction:
Incorporating artificial intelligence (AI) into clinical decision support systems has been shown to improve diagnostic accuracy and workflow efficiency. However, AI hallucinations, defined as plausible-sounding but factually incorrect or verifiably false outputs, and algorithmic bias, which compromises fairness across demographic groups, remain significant barriers to safe clinical use. This study explores systematic measures to reduce AI hallucinations and bias and to improve equity in clinical AI deployment.
Methods:
A mixed-methods experimental design was employed using a benchmark dataset of 2,000 de-identified electronic health records stratified by gender, age, and socioeconomic status. Model-level interventions included (i) retrieval-augmented generation (RAG) using curated clinical knowledge bases; (ii) uncertainty calibration via probabilistic confidence scoring; and (iii) reinforcement learning from human feedback (RLHF) using clinician ratings. Data-level interventions comprised dataset diversification, subgroup-wise bias auditing, and stratified resampling to correct demographic imbalance. Governance-level measures included post hoc explainability using SHAP-based feature attribution and structured human-in-the-loop (HITL) validation by board-certified clinicians. Quantitative evaluation involved hallucination rate measurement, predictive parity and demographic disparity analysis, and clinical concordance scoring against expert-validated ground truth. Statistical significance was assessed using paired t-tests and chi-square tests, with p < 0.05 considered significant.
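The hallucination rate and demographic disparity metrics described above can be sketched as follows. This is a minimal illustrative implementation, not the study's actual evaluation pipeline; the subgroup labels, rates, and helper names are assumptions introduced for the example.

```python
# Illustrative sketch of two evaluation metrics from the Methods section.
# Subgroup names and rate values below are hypothetical, not study data.

def demographic_disparity(rates):
    """Largest gap in positive-prediction rate across demographic subgroups.

    `rates` maps subgroup label -> positive-prediction rate in [0, 1].
    """
    vals = list(rates.values())
    return max(vals) - min(vals)

def hallucination_rate(outputs, is_hallucination):
    """Fraction of model outputs flagged as factually unsupported.

    `is_hallucination` is a verification predicate, e.g. a check against
    a curated clinical knowledge base (hypothetical here).
    """
    flagged = sum(1 for out in outputs if is_hallucination(out))
    return flagged / len(outputs)

# Toy audit with illustrative per-subgroup positive-prediction rates.
rates = {"group_a": 0.62, "group_b": 0.55, "group_c": 0.48}
print(round(demographic_disparity(rates), 2))  # 0.14
```

In practice the significance of pre- versus post-intervention changes in these metrics would then be assessed with paired t-tests and chi-square tests, as stated above.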
Results:
Combining RAG with clinician-directed feedback reduced the occurrence of hallucinations by 41.7% relative to baseline models (p < 0.001). Bias mitigation methods improved predictive parity across demographic subgroups, reducing outcome disparity from 18.3% to 6.9% (p = 0.004). Explainable AI increased clinician trust scores by 32%, and human-in-the-loop validation raised clinical concordance from 76.4% to 89.2%.
Conclusion:
These results support the view that a multilayered approach, combining technical safeguards, diverse data management, and clinician oversight, significantly reduces AI hallucinations and enhances fairness in clinical practice. Such strategies are essential to ensuring the ethical, reliable, and equitable application of AI in healthcare settings.
