{"id":33456,"date":"2025-09-28T01:41:29","date_gmt":"2025-09-28T01:41:29","guid":{"rendered":"https:\/\/agooka.com\/news\/technologies\/delphi-2m-ai-predicts-1000-diseases-using-over-400k-medical-records\/"},"modified":"2025-09-28T01:41:29","modified_gmt":"2025-09-28T01:41:29","slug":"delphi-2m-ai-predicts-1000-diseases-using-over-400k-medical-records","status":"publish","type":"post","link":"https:\/\/agooka.com\/news\/technologies\/delphi-2m-ai-predicts-1000-diseases-using-over-400k-medical-records\/","title":{"rendered":"Delphi-2M AI predicts 1000+ diseases using over 400k medical records"},"content":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/dataconomy.com\/wp-content\/uploads\/2025\/09\/1115607.jpg\" alt=\"Delphi-2M AI predicts 1000+ diseases using over 400k medical records\" title=\"Delphi-2M AI predicts 1000+ diseases using over 400k medical records\"\/><\/p>\n<p>Researchers at the German Cancer Research Center have developed an artificial intelligence model, Delphi-2M, that can predict an individual\u2019s risk for more than 1,000 diseases up to two decades into the future using medical records.<\/p>\n<p>This development aligns with a broader shift in healthcare from reactive treatment to proactive prevention. While algorithms have been created to predict the risk of single conditions, diseases are often interconnected. A comprehensive model that can account for this complexity could inform early treatment, improve targeted screening, and identify high-risk individuals who might otherwise be overlooked.<\/p>\n<h2>How Delphi-2M works<\/h2>\n<p>The Delphi-2M model is a large language model (LLM), similar to the technology behind text-generating chatbots. Instead of being trained on internet text, it was developed by processing over 400,000 comprehensive medical records from the UK Biobank. This clinical data was supplemented with lifestyle information, such as body mass index and smoking status.<\/p>\n<p>The model treats a patient\u2019s medical history as a sequence of \u201cdisease tokens,\u201d where each diagnostic code represents a step in a potential disease progression. By analyzing these sequences, the AI learns the statistical patterns of how different conditions connect and follow one another over time. A key feature is its ability to dynamically re-evaluate predictions. When new information, like a recent blood test result, is added, the model can update its risk calculations for that individual, allowing for continuous health monitoring.<\/p>\n<h2>Performance and validation<\/h2>\n<p>In performance evaluations, Delphi-2M matched or exceeded the accuracy of established clinical risk scores for the majority of the 1,258 diseases it was trained on. It also outperformed other specialized medical AI predictors designed to forecast single diseases. The model proved particularly effective in predicting the long-range risk of cardiovascular disease and dementia, showing greater accuracy than some biomarker-based models even when forecasting two decades into the future.<\/p>\n<p>However, the model struggled to accurately predict conditions with more variable trajectories heavily influenced by lifestyle changes, such as Type 2 diabetes. This indicates a limitation in its ability to account for factors not consistently captured in electronic health records.<\/p>\n<p>To test its robustness, the researchers applied the model to the Danish National Patient Registry, which contains records for nearly two million citizens. Despite differences in the populations and healthcare systems, the model\u2019s prediction accuracy remained high, suggesting it learned fundamental principles of human disease progression.<\/p>\n<h2>Ethical design and future applications<\/h2>\n<p>Delphi-2M was designed with practical and ethical considerations in mind. It can learn from synthetic medical records to protect patient privacy and is an \u201cexplainable\u201d AI, meaning it can provide a rationale for its predictions by clustering related conditions and symptoms. The researchers emphasize that the model identifies statistical associations, not causation.<\/p>\n<p>The model is built with a modular design to incorporate additional data types in the future, such as genomics, diagnostic imaging, and data from wearable devices. Currently, the tool is being tested in other countries with diverse populations. In its present form, it could be used in clinical settings to identify individuals who would benefit from early screening, even if they do not meet traditional criteria.<\/p>\n<h2>Expert reception<\/h2>\n<p>The model has been positively received by experts not involved in the study. Justin Stebbing, a professor at Anglia Ruskin University, called the tool \u201can achievement\u201d that sets \u201ca new standard for both predictive accuracy and interpretability.\u201d Gustavo Sudre, a researcher at King\u2019s College London, described the research as:<\/p>\n<blockquote>\n<p>\u201ca significant step towards scalable, interpretable, and\u2014most importantly\u2014ethically responsible form of predictive modeling in medicine.\u201d<\/p>\n<\/blockquote>\n<p><strong>Featured image credit<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Researchers at the German Cancer Research Center have developed an artificial intelligence model, Delphi-2M, that can predict an individual\u2019s risk for more than 1,000 diseases up to two decades into the future using medical records. This development aligns with a broader shift in healthcare from reactive treatment to proactive prevention. While algorithms have been created [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":33457,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[],"class_list":{"0":"post-33456","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technologies"},"_links":{"self":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/33456","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/comments?post=33456"}],"version-history":[{"count":0,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/33456\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media\/33457"}],"wp:attachment":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media?parent=33456"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/categories?post=33456"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/tags?post=33456"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}