Leon Chen

The future of AI in radiology

January 12, 2018
AI, as a field, has undergone numerous periods of exuberance over the past several decades.

These waves of promise and excitement invariably make their way into medicine, but in the past they have been tempered by clinical reality once evidence of real-world performance is sought. Now there is a palpable sense that this time around things are different, that we are on the precipice of a revolution rather than a mere incremental evolution of previous technologies. The reason, of course, is deep learning. Broadly speaking, deep learning is not a single technological breakthrough, but rather a collection of accumulated mathematical principles, data structures and optimization algorithms which, when applied to the right data, produce results on certain tasks that far outperform previous methods. While it has seen broad application across almost all data types, visual data is where it has had its greatest tangible successes. Radiology is, therefore, one of its most obvious applications.

One of the attractions of deep learning is that it typically requires less intensive data preparation. There is a perception that one can just feed the neural network the raw pixels of, say, any chest X-ray. In practice, it is not quite this magical. Good data science and engineering practices are still paramount in building such systems. One such practice is ensuring the input data is of sufficient quality and quantity. Almost all practical applications of machine learning today are supervised, meaning accurate labels for your ultimate objective are required to train your models. Not only is obtaining these labels laborious, it is also expensive, given the human costs involved.
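To make the labeling requirement concrete, here is a minimal sketch of a supervised dataset in PyTorch. Every image must be paired with a human-verified label before training can begin; the CSV format and label scheme here are hypothetical.

```python
# Sketch of a supervised dataset: every chest X-ray is paired with an
# expert-assigned label. The CSV columns ("path", "label") are hypothetical.
import csv
import torch
from PIL import Image
from torch.utils.data import Dataset

class ChestXRayDataset(Dataset):
    def __init__(self, labels_csv, transform=None):
        with open(labels_csv) as f:
            # Each row pairs an image file with a radiologist-provided label
            self.samples = [(row["path"], int(row["label"]))
                            for row in csv.DictReader(f)]
        self.transform = transform

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, idx):
        path, label = self.samples[idx]
        image = Image.open(path).convert("RGB")  # pretrained nets expect 3 channels
        if self.transform is not None:
            image = self.transform(image)
        return image, torch.tensor(label)
```

The code itself is trivial; assembling the labels file--thousands of images, each read and annotated by an expert--is where the labor and expense lie.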

We are only in the very early phases of applying deep learning to medical imaging, though the pace at which abstracts and papers are being published on the topic is rapidly picking up. We are seeing applications of all types, from classification of normal versus pathology to higher-level tasks such as localization, segmentation and quantification. Most current applications are relatively simple and restricted to single-task problems. An article published by Lakhani and Sundaram in Radiology in 2017 demonstrated 96 percent accuracy in classifying tuberculosis on a holdout test set of 150 plain chest X-rays. The authors took off-the-shelf neural networks developed for general image recognition, trained them on this new task and obtained excellent results (a minimal sketch of this recipe follows below). One can imagine hundreds of such algorithms that could be trained today in this straightforward manner. And this is before we even think about building up complexity with higher-order reasoning, multi-modal models such as images plus text or images plus genomics, or composition of neural networks in a modular fashion. There are so many potential applications we can already create using simple off-the-shelf neural networks, so what are the bottlenecks?
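Before turning to the bottlenecks, it is worth seeing just how simple this off-the-shelf recipe is. Below is a minimal PyTorch sketch of transfer learning for a two-class chest X-ray task; Lakhani and Sundaram used AlexNet and GoogLeNet, so the ResNet-18, learning rate and other details here are illustrative stand-ins rather than their actual setup.

```python
# Minimal transfer-learning sketch: take a network pretrained on general
# photographs (ImageNet) and retrain it for a new two-class task
# (e.g., tuberculosis vs. normal). Architecture and hyperparameters
# are illustrative, not those of Lakhani and Sundaram.
import torch
import torch.nn as nn
from torchvision import models

model = models.resnet18(pretrained=True)       # off-the-shelf, pretrained weights
model.fc = nn.Linear(model.fc.in_features, 2)  # swap 1000-class head for a 2-class one

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9)

def train_one_epoch(loader):
    """One pass over a DataLoader yielding (image, label) batches."""
    model.train()
    for images, labels in loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```

The entire "algorithm" amounts to a pretrained feature extractor plus a new classification layer; the labeled data, not the code, is the hard part.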

In the tuberculosis example, that level of accuracy, if deployed immediately, would certainly benefit numerous patients in many parts of the world. Indeed, it is in areas where access is poor and specialists are scarce that AI will initially have the biggest impact. But deploying these algorithms is an ongoing challenge: data privacy and security are critically important, as are issues of data provenance and data transfer. Whether deployment is to the cloud, to on-premises data centers or to edge devices, each carries benefits and tradeoffs to consider.

How do we allow interactive probing of the model to intuit its inner workings? How do we deliver continuous updates or improvements à la websites or apps? How do we implement guard rails or heuristics to plan for the expected failure cases? What is clear is that we need the ability to rigorously validate on new patient data: a common issue in machine learning--particularly deep learning--is overfitting to the training data, such that performance on new, unseen data is poor. As AI model complexity is pushed to ever-higher levels, “evidence-based” becomes not just desirable, but fundamental.
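To illustrate what rigorous validation on new patient data might look like in code, here is a minimal sketch that measures accuracy on a held-out set of unseen patients and flags a large train/test gap as a symptom of overfitting. The model and loader names are assumed from the earlier sketches, and the gap threshold is purely illustrative.

```python
# Held-out validation sketch: a large gap between training and test
# accuracy is a classic symptom of overfitting. The names (model,
# train_loader, heldout_loader) and the 10-point threshold are assumptions.
import torch

@torch.no_grad()
def accuracy(model, loader):
    model.eval()
    correct = total = 0
    for images, labels in loader:
        preds = model(images).argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.numel()
    return correct / total

train_acc = accuracy(model, train_loader)
test_acc = accuracy(model, heldout_loader)  # patients never seen in training
if train_acc - test_acc > 0.10:
    print("Large generalization gap: model may be overfitting the training data")
```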

The state of AI in radiology has never looked more promising. We should try to rein in our natural tendency to overhype, lest we overinflate expectations and slide once again into a trough of disillusionment. We should continue to expand the realm of algorithmic possibilities, while remaining mindful that the processes and infrastructure supporting these algorithms are vitally important as well. Patient outcomes will no doubt improve, but it will require the efforts of an entire community and a thriving ecosystem--no single person, institution, or company can tackle it alone.

About the author: Leon Chen is the co-founder and CEO of MD.ai. He is an engineer and physician with an M.D. from Harvard Medical School and extensive background in machine learning.