Sleep research article

Structured Visual Evidence Decomposition for Evidence-Grounded Multimodal Screening of Obstructive Sleep Apnea-Hypopnea Syndrome

2026-05-23 · arXiv: 2606.00087

Authors: Chen Zhan , Yingchen Wei , Xiaoyu Tan , Jingjing Huang , Xihe Qiu

One-line summary

A sleep science research article on Structured Visual Evidence Decomposition for Evidence-Grounded Multimodal Screening of Obstructive Sleep Apnea-Hypopnea Syndrome.

Sleep health notes

Sleep health notes will be added by the Sleepatch editorial team.

中文解读

中文解读待补充:本站会优先为失眠研究、睡眠质量改善、昼夜节律等高价值睡眠研究添加中文说明。

Original abstract

Effective pre-polysomnography screening for obstructive sleep apnea-hypopnea syndrome (OSAHS) requires combining clinical risk factors with visible craniofacial and neck cues. Directly prompting general-purpose multimodal foundation models for medical yes/no decisions can yield unstable, poorly calibrated outputs. We propose EviOSAHS, an evidence-grounded multimodal reasoning framework that separates image-only anatomical evidence acquisition from final clinical adjudication. Each frontal facial image is decomposed into seven fixed anatomical queries covering the neck, chin, mouth, face/neck fat, lower jaw, midface, and nose. Visual responses are converted into structured evidence cards recording target anatomy, visibility, risk direction, evidence strength, confidence, and a concise summary. These cards are combined with a cleaned clinical profile only in the final stage, where a large language model performs balanced binary screening adjudication. We evaluated EviOSAHS on a 642-subject cohort, mapping normal subjects to screening-negative and mild, moderate, or severe OSAHS subjects to screening-positive. EviOSAHS achieved 88.47% accuracy, 94.86% sensitivity, 93.74% F1-score, and a 5.14% false-negative rate, outperforming clinical-only prompting, direct multimodal prompting, and naive two-stage pipelines under a unified protocol. Ablations showed that seven-question visual decomposition and balanced final adjudication were critical to the high-sensitivity operating point. A question-level audit of 4,494 visual outputs showed a 100% structured parse rate and 93.88% high-visibility rate. EviOSAHS provides an auditable, high-sensitivity workflow for binary pre-polysomnography OSAHS screening, but should be viewed as a triage assistant rather than a diagnostic system. Prospective validation, external testing, and calibrated operating-point control are needed before clinical deployment.

5.0App value
7.0Research quality
4.0Wellness relevance

Links and sources

⚕️ Medical Disclaimer
This content is provided for informational and educational purposes only and does not constitute medical advice, diagnosis, or treatment. Sleep disorders, chronic insomnia, sleep apnea, and other conditions must be evaluated and treated by a qualified healthcare professional. If you experience persistent or severe sleep problems, consult a licensed physician or sleep specialist. Research cited refers to peer-reviewed studies; individual results may vary. Sleepatch does not endorse any specific medication, supplement, or therapy.

Want a personalized sleep improvement plan?

Sleepatch can prepare a customized sleep wellness program, insomnia relief guide, and evidence-based sleep coaching based on your needs.

Explore sleep services

Comments

No comments yet. Be the first to share your thoughts on this sleep research.
Login or register to leave a comment