Sleep research article
Structured Visual Evidence Decomposition for Evidence-Grounded Multimodal Screening of Obstructive Sleep Apnea-Hypopnea Syndrome
Authors: Chen Zhan , Yingchen Wei , Xiaoyu Tan , Jingjing Huang , Xihe Qiu
One-line summary
A sleep science research article on Structured Visual Evidence Decomposition for Evidence-Grounded Multimodal Screening of Obstructive Sleep Apnea-Hypopnea Syndrome.
Sleep health notes
Sleep health notes will be added by the Sleepatch editorial team.
中文解读
中文解读待补充:本站会优先为失眠研究、睡眠质量改善、昼夜节律等高价值睡眠研究添加中文说明。
Original abstract
Effective pre-polysomnography screening for obstructive sleep apnea-hypopnea syndrome (OSAHS) requires combining clinical risk factors with visible craniofacial and neck cues. Directly prompting general-purpose multimodal foundation models for medical yes/no decisions can yield unstable, poorly calibrated outputs. We propose EviOSAHS, an evidence-grounded multimodal reasoning framework that separates image-only anatomical evidence acquisition from final clinical adjudication. Each frontal facial image is decomposed into seven fixed anatomical queries covering the neck, chin, mouth, face/neck fat, lower jaw, midface, and nose. Visual responses are converted into structured evidence cards recording target anatomy, visibility, risk direction, evidence strength, confidence, and a concise summary. These cards are combined with a cleaned clinical profile only in the final stage, where a large language model performs balanced binary screening adjudication. We evaluated EviOSAHS on a 642-subject cohort, mapping normal subjects to screening-negative and mild, moderate, or severe OSAHS subjects to screening-positive. EviOSAHS achieved 88.47% accuracy, 94.86% sensitivity, 93.74% F1-score, and a 5.14% false-negative rate, outperforming clinical-only prompting, direct multimodal prompting, and naive two-stage pipelines under a unified protocol. Ablations showed that seven-question visual decomposition and balanced final adjudication were critical to the high-sensitivity operating point. A question-level audit of 4,494 visual outputs showed a 100% structured parse rate and 93.88% high-visibility rate. EviOSAHS provides an auditable, high-sensitivity workflow for binary pre-polysomnography OSAHS screening, but should be viewed as a triage assistant rather than a diagnostic system. Prospective validation, external testing, and calibrated operating-point control are needed before clinical deployment.
Links and sources
This content is provided for informational and educational purposes only and does not constitute medical advice, diagnosis, or treatment. Sleep disorders, chronic insomnia, sleep apnea, and other conditions must be evaluated and treated by a qualified healthcare professional. If you experience persistent or severe sleep problems, consult a licensed physician or sleep specialist. Research cited refers to peer-reviewed studies; individual results may vary. Sleepatch does not endorse any specific medication, supplement, or therapy.
Want a personalized sleep improvement plan?
Sleepatch can prepare a customized sleep wellness program, insomnia relief guide, and evidence-based sleep coaching based on your needs.
Explore sleep services
Comments