News
Designing Conversational AI to Provide Medical Assistance on the Battlefield
In battle, soldiers with no specialized medical knowledge often find themselves having to care for injured comrades for prolonged periods of time. Naturally, they need all the help they can get.
Researchers at the 秘密直播 Applied Physics Laboratory (APL) in Laurel, Maryland, are working on a proof of concept for a conversational artificial intelligence (AI) agent that will be able to provide medical guidance to untrained soldiers in plain English, by applying knowledge gleaned from established care procedures.
The project, known as Clinical Practice Guideline-driven AI (CPG-AI), is based on a type of AI known as a large language model (LLM) 鈥 the best-known example of which is the now-famous . (CPG-AI is not affiliated with ChatGPT in any way, nor is APL.)
The Power of Large Language Models
Methods of providing clinical support using AI tend to be highly structured, requiring precisely calibrated rules and meticulously labeled training data. That approach is well suited to providing alerts and reminders to experts in a relatively calm environment. But coaching untrained novices, or even trained medics, as they provide medical care in a chaotic environment is a different story.
鈥淭here might be 20 or 30 individual components running behind the scenes to enable a conversational agent to help soldiers assist their buddies on the battlefield 鈥 everything from search components, to deciding which information from the search is relevant, to managing the structure of the dialogue,鈥 said Sam Barham, a computer scientist in APL鈥檚 Research and Exploratory Development Department, who is leading the CPG-AI project, which also includes Arun Reddy, Michael Kelbaugh and Caitlyn Bishop. 鈥淚n the past, to enable a system like this, you鈥檇 have had to train a bespoke neural network on each very specific task.鈥
An LLM, on the other hand, is trained on vast amounts of unlabeled data 鈥 text, in this case 鈥 and not specialized for any particular task. That means it can theoretically adapt to any situation that can be described in words, using text prompts that provide the situational context and relevant information.
鈥淟LMs have this incredible ability to adapt to whatever task you set for them, virtually anything that鈥檚 in the realm of natural language,鈥 said Barham. 鈥淪o instead of training a neural network on all these different capabilities, you can train a single neural network to respond fluidly to the situation.鈥
Building Better Apps
Until recently, LLMs were far too slow and computing-power-intensive to be of any practical use in this operational context. However, recent advances in computing power and in LLMs themselves have made the prospect realistic. CPG-AI draws on a wider APL-developed software ecosystem for developing apps that take advantage of LLMs, known internally as RALF, or Reconfigurable APL Language model Framework.
RALF was developed by APL鈥檚 Intelligent Systems Center (ISC) as part of a strategic initiative centered on LLMs.
鈥淟LMs are having a transformative impact on the AI community, and that impact extends to the missions of APL鈥檚 sponsors,鈥 said ISC Chief Bart Paulhamus. 鈥淭he ISC needs to explore all aspects of LLMs 鈥 to become experts at creating, training and using them. RALF is an exciting new technology that accelerates adoption of LLMs for our scientists and engineers.鈥
RALF comprises two sets of tools: The first allows users to build apps using LLMs, and the second allows users to build conversational agents that can take advantage of those apps. CPG-AI integrates both.
From Care Algorithm to AI Tool
While using LLMs makes formal training unnecessary 鈥 in the sense of manually labeling data and tweaking and calibrating all kinds of interrelated variables and parameters 鈥 a lot of work goes into transforming a basic LLM into a capability like CPG-AI. When all you have to work with is text, choosing your words becomes very important. As anyone who鈥檚 used AI text-generation tools knows, they can produce some comically wrong results that, to say the least, would not be funny on the battlefield.
鈥淎n LLM is like a precocious 2-year-old that鈥檚 very good at some things and extremely bad at others, and you don鈥檛 know in advance which is which,鈥 Barham said. 鈥淪o there are two big pieces that go into creating a tool like this: first, we have to carefully, precisely engineer text prompts, and second, we鈥檝e injected some ground truth in the form of care algorithms.鈥
Specifically, Barham and his team applied a care algorithm 鈥 essentially, a protocol for how to respond to a medical event 鈥 taken from (TCCC). The TCCC is a set of guidelines and care algorithms developed by the U.S. Department of Defense Joint Trauma System to help relative novices provide trauma care on the battlefield. Conveniently, the TCCC care algorithms exist in the form of flowcharts that lend themselves to being translated into a machine-readable form.
In addition, the researchers found and converted more than 30 clinical practice guidelines from the Department of Defense Joint Trauma System to be ingested as text by their model, including guidelines for treating burns, blunt trauma and other common conditions encountered by warfighters on the battlefield.
In the project鈥檚 first phase, Barham and his team produced a prototype model that can infer a patient鈥檚 condition based on conversational input, answer questions accurately and without jargon, and guide the user through the care algorithms for tactical field care 鈥 a category of care that encompasses the most common injuries encountered on the battlefield, including breathing issues, burns and bleeding.
Thanks to the capabilities of RALF, CPG-AI can also switch smoothly between stepping through a care algorithm and answering any questions the user may have along the way.
In the next phase, the team plans to expand the range of conditions CPG-AI is capable of addressing. They also intend to improve CPG-AI by crafting more effective prompts, as well as improve the model鈥檚 ability to correctly categorize and retrieve information drawn from the practice guidelines.
鈥淚t鈥檚 not battle-ready by any means, but it鈥檚 a step in the right direction,鈥 Barham said.
Leveraging Conversational AI to Save Lives
, who oversees the Assured Care research portfolio at APL, said this work is timelier and more important than ever, given that it connects an exciting emerging technology with an urgent military need.
鈥淸Barham] and his team are applying models like ChatGPT to solve sponsor problems, which presents considerable challenges,鈥 said Galante. 鈥淗ow can we harness these powerful tools, while also ensuring accuracy, as well as transparency 鈥 both in terms of the reasoning underlying the AI鈥檚 responses, and the uncertainty of those responses? If we want to enable relative novices to provide complex medical care at scale, we鈥檒l need a capability like this that can provide the relevant knowledge in a usable manner.鈥