https://www.medrxiv.org/content/10.1101/2025.04.05.25325217v1?
40 selected studies, 20 000 pregnancies, heart defect:
- AI: 95% - 96% accuracy, traditional diagnostic: 88 – 90% accuracy. On selected study data set
Sounds good but:
https://www.medrxiv.org/content/10.1101/2025.07.04.25330464v1.full
- “95% accuracy” misleading:
+ Most studies only identify: “Normal” or “Abnormal” (Binary classification) without specific diagnoses (13/20), (20/20) is limited to single organ, 1 study is multi-class classification – this one is commercial model, details on architecture is not available.
+ 3/20 models evaluated based on expertise sonographer, others relied solely on statistical performance.
+ most used limited gestational-age ranges
+ high data set heterogeneity
+ small, non-blinded clinician comparison
-Difficulties in real-life deployment:
https://www.reuters.com/investigations/ai-enters-operating-room-reports-arise-botched-surgeries-misidentified-body-2026-02-09/?
+ AI mislabel body parts
'Others' 카테고리의 다른 글
| ProsMAE 논문 작성의 건 (0) | 2026.05.15 |
|---|---|
| [khkim] MICCAI26 Mathena논문 Rebuttal (0) | 2026.05.09 |
| [khkim] SEFLA - AMR 논문 확장 가능성 리서치 정리 (0) | 2026.04.27 |
| [smlee] 학생자율연구 기획 (0) | 2026.03.30 |
| [khkim] 학생자율연구 주제 피드백: Biosignal × LLM (0) | 2026.03.30 |