We present the design, implementation, and in-situ deployment of a smartphone-based voice-enabled AI system for generating electronic medical records (EMRs) and clinical risk alerts in maternal healthcare settings. Targeted at low-resource environments such as Pakistan, the system integrates a fine-tuned, multilingual automatic speech recognition (ASR) model and a prompt-engineered large language model (LLM) to enable healthcare workers to engage naturally in Urdu, their native language, regardless of literacy or technical background. Through speech-based input and localized understanding, the system generates structured EMRs and flags critical maternal health risks. Over a seven-month deployment in a not-for-profit hospital, the system supported the creation of over 500 EMRs and flagged over 300 potential clinical risks. We evaluate the system's performance across speech recognition accuracy, EMR field-level correctness, and clinical relevance of AI-generated red flags. Our results demonstrate that speech based AI interfaces, can be effectively adapted to real-world healthcare settings, especially in low-resource settings, when combined with structured input design, contextual medical dictionaries, and clinician-in-the-loop feedback loops. We discuss generalizable design principles for deploying voice-based mobile healthcare AI support systems in linguistically and infrastructurally constrained settings.
翻译:本文介绍了一种基于智能手机的语音人工智能系统的设计、实现与现场部署,该系统用于在孕产保健环境中生成电子病历(EMRs)并提供临床风险预警。该系统针对巴基斯坦等低资源环境,集成了一个经过微调的多语言自动语音识别(ASR)模型和一个经过提示工程优化的大语言模型(LLM),使医护人员能够自然地使用其母语乌尔都语进行交互,无论其文化水平或技术背景如何。通过语音输入和本地化理解,该系统生成结构化的电子病历并标记关键的孕产健康风险。在一家非营利医院为期七个月的部署中,该系统支持创建了超过500份电子病历,并标记了超过300项潜在的临床风险。我们评估了该系统在语音识别准确率、电子病历字段级正确性以及人工智能生成的风险标记的临床相关性方面的性能。我们的结果表明,当结合结构化输入设计、上下文医学词典和临床医生在环反馈机制时,基于语音的人工智能界面能够有效适应现实世界的医疗保健环境,特别是在低资源环境中。我们讨论了在语言和基础设施受限的环境中部署基于语音的移动医疗人工智能支持系统的可推广设计原则。