Dr Ke Yuan - AI for Cancer Research
Introduction
Vision
Our lab is at the forefront of integrating artificial intelligence (AI) with cancer research. We aim to harness the power of AI to make biologically impactful predictions, drawing from the vast and rich datasets generated through cutting-edge cancer research. By focusing on building AI models across various biological data modalities, we seek to transform how cancer is diagnosed, understood, and treated, accelerating progress toward personalised medicine and therapeutic advancements.
Key Research Areas
- AI for Histology and Spatial Deep Phenotyping
We are developing AI algorithms to analyse histological slides and integrate data across spatial biology and genomics. This will allow us to map the histomorphological landscape of both mouse models and human samples, improving phenotypic assessment and accelerating discoveries in tumour characterisation and treatment response.
- Large Language Models (LLMs) for Biological Sequences
We develop LLMs tailored to biological sequences such as proteins and RNA. These models allow us to predict the effects of genomic mutations, including rare events, providing new insights into protein-protein interactions and RNA functions in cancer biology.
- Comprehensive In Silico Tumor Models
We are building digital twin models of mouse systems, which simulate genotype-phenotype relationships to predict treatment responses and intervention outcomes. This initiative holds the potential to revolutionise preclinical cancer research by enabling real-time, cost-effective simulations of therapeutic strategies.
Methodology
We develop machine learning and deep learning models tailored to make novel inferences in cancer. We utilise state-of-the-art local and national computing infrastructures, large-scale datasets, and advanced spatial biology tools to ensure the robust training and validation of our models.
Impact and Applications
The application of our AI models has the potential to significantly advance the fields of cancer diagnosis, prognosis, and treatment. By integrating histopathology, spatial biology, and genomic data, we aim to produce AI-driven diagnostic tools that will enhance routine clinical workflows, personalise cancer treatments, identify new biomarkers for drug development, and deliver breakthroughs in understanding cancer's molecular underpinnings.
Collaborations
We collaborate very closely with world-class pathologists, computational biologists, and cancer researchers from the University of Glasgow, CRUK Scotland Institute, NHS Greater Glasgow and Clyde, and international institutions. These partnerships provide us with the expertise and resources to bridge the gap between computational innovation and clinical application, ensuring that our AI tools make a tangible impact on cancer research and patient care.
Lab Report
Key Publications
*Corresponding author; # Joint first author
Claudio Quiros A, Coudray N, Yeaton A, Yang X, Liu B, Le H, Chiriboga L, Karimkhan A, Narula N, Moore DA, Park CY, Pass H, Moreira AL, Le Quesne J, Tsirigos A, Yuan K. Mapping the landscape of histomorphological cancer phenotypes using self-supervised learning on unannotated pathology slides. Nat Commun. 2024;15(1):4596.
Lamb, K., Luka, M., Saathoff, M., Orton, R., Phan, M., Cotten, M., Yuan, K.*, Robertson, D.L.*. Mutational signature dynamics indicate SARS-CoV-2’s evolutionary capacity is driven by host antiviral molecules. PLOS Computational Biology. 2024; 20(1), e1011795.
Ji, Y., Cutiongco, M., Jensen, B., Yuan K. (2023) CP2Image: Generating high-quality single-cell images using CellProfiler representations. Conference on Medical Imaging with Deep Learning (MIDL) PMLR 227:274-285. Top AI for medical image analysis conference.
Farndale, L., Insall, R., Yuan, K. (2023) More from Less: self-supervised knowledge distillation for routine histopathology data. Machine Learning in Medical Imaging: 14th International Workshop, MLMI 2023, Held in Conjunction with MICCAI 2023, Proceedings, Part I Oct 2023 Pages 454–463.
Claudio Quiros, A., Coudray, N., Yeaton, A., Sunhem, W., Murray-Smith, R., Tsirigos, A., Yuan, K.* (2021) Adversarial learning of cancer tissue representations. International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), pp. 602-612.
Dentro, S.C., Leshchiner, I., Hasse, K., Tarabichi, M., ..., Yuan, K., Gerstung, M., Spellman, P.T., Wang, W., Morris, Q.D., Wedge, D.C., Van Loo, P., PCAWG Evolution and Heterogeneity Working Group, PCAWG Consortium. Characterizing genetic intra- tumor heterogeneity across 2,658 human cancer genomes. Cell. 2021; 184(8), 2239-2254.e39. (Glasgow & Cambridge Working Group lead)
Claudio Quiros, A.C., Murray-Smith, R., Yuan, K.* (2020) Pathology GAN: Learning deep representations of cancer tissue. Conference on Medical Imaging with Deep Learning (MIDL), Montreal, Canada, 6-9 Jul 2020, pp. 669-695.
Gerstung, M., Jolly, C., Leshchiner, I., Dentro, S.C., ..., Yuan, K., Wang, W., Morris, Q.D., PCAWG Evolution and Heterogeneity Working Group, Spellman, P.T., Wedge, D.C., Van Loo, P., PCAWG Consortium. The evolutionary history of 2,658 cancers. Nature. 2020; 578(7793), pp. 122-128 (Glasgow & Cambridge Working Group lead)
Cmero, M.#, Yuan, K.#, Ong, C. S., Schröder, J., PCAWG Evolution and Heterogeneity Working Group, Corcoran, N. M., Papenfuss, T., Hovens, C. M., Markowetz, F., Macintyre, G., and PCAWG Consortium. Inferring structural variant cancer cell fraction. Nat Comms. 2020; 11, 730.
de Santiago, I., Liu, W., Yuan, K., O'Reilly, M. and Chilamakuri, C.S.R., Ponder, B.A.J., Meyer, K.B. and Markowetz, F. BaalChIP: Bayesian analysis of allele-specific transcription factor binding in cancer genomes. Genome Biology. 2017;18:39.
Yuan, K.#, Sakoparnig, T#., Markowetz, F. and Beerenwinkel, N. BitPhylogeny: a probabilistic framework for reconstructing intra-tumor phylogenies. Genome Biology. 2015; 16:36
Biography
Education and qualifications
2008-2012: PhD, Machine Learning, University of Southampton, UK
2007-2008: MSc, Radio Frequency Communication Systems, University of Southampton, UK
2003-2007: BEng, Telecommunication Engineering, Nanjing University of Posts and Telecommunications, China
Appointments
2022-present: Senior Lecturer in Machine Learning and Computational Biology, School of Computing Science, University of Glasgow, UK
2022-present: Secondment appointment, School of Cancer Sciences, University of Glasgow, UK
2022-present: Secondment appointment, CRUK Scotland Institute, University of Glasgow, UK
2016-present: Lecturer in Machine Learning and Computational Biology, School of Computing Science, University of Glasgow, UK
2012-2016: Postdoctoral Research Associate, CRUK Cambridge Institute, University of Cambridge, UK
Current committee membership
Intelligent Systems for Molecular Biology (ISMB) 2024 Conference program committee
Recent Publications
*Corresponding author; # Joint first author
2024
Claudio Quiros A, Coudray N, Yeaton A, Yang X, Liu B, Le H, Chiriboga L, Karimkhan A, Narula N, Moore DA, Park CY, Pass H, Moreira AL, Le Quesne J, Tsirigos A, Yuan K. Mapping the landscape of histomorphological cancer phenotypes using self-supervised learning on unannotated pathology slides. Nat Commun. 2024;15(1):4596.
Farndale, L., Walsh, C., Insall, R., Yuan K. Synthetic Privileged Information Enhances Medical Image Representation Learning. 2024; arXiv:2403.05220
Lamb, K., Luka, M., Saathoff, M., Orton, R., Phan, M., Cotten, M., Yuan, K.*, Robertson, D.L.*. Mutational signature dynamics indicate SARS-CoV-2’s evolutionary capacity is driven by host antiviral molecules. PLOS Computational Biology. 2024; 20(1), e1011795.
Lamb, K.D., Hughes, J., Lytras, S., Koci, O., Young, F., Grove, J. Yuan, K.* Robertson, D.L*. From a single sequence to evolutionary trajectories: protein language models capture the evolutionary potential of SARS-CoV-2 protein sequences. 2024; bioRxiv 2024.07.05.602129.
Liu D, Young F, Lamb KD, Claudio Quiros A, Pancheva A, Miller C, Macdonald C, Robertson DL, Yuan K. PLM-interact: extending protein language models to predict protein-protein interactions. bioRxiv. 2024:2024.2011.2005.622169.
Liu, B., Polack, M., Coudray, N., Claudio Quiros, A., Sakellaropoulos, T., Crobach, A., van Krieken, J., Yuan, K., Tollenaar, R., Mesker, W.E., Tsirigos, A. Self-supervised learning reveals clinically relevant histomorphological patterns for therapeutic strategies in colon cancer. bioRxiv 2024.02.26.582106
Seyedshahi F, Rakovic K, Poulain N, Quiros AC, Powley IR, Richards C, Uraiby H, Klebe S, Nakas A, Wilson C, Sereno M, Officer-Jones L, Ficken C, Teodosio A, Ballantyne F, Murphy D, Yuan K, Le Quesne J. A histomorphological atlas of resected mesothelioma from 3446 whole-slide images discovered by self-supervised learning. bioRxiv. 2024:2024.2011.2018.624103.
2023
Coudray N, Juarez MC, Criscito MC, Quiros AC, Wilken R, Cullison SRJ, Stevenson ML, Doudican NA, Yuan K, Aquino JD, Klufas DM, North JP, Yu SS, Murad F, Ruiz E, Schmults CD, Tsirigos A, Carucci JA. Self-supervised artificial intelligence predicts recurrence, metastasis and disease specific death from primary cutaneous squamous cell carcinoma at diagnosis. Res Sq [Preprint]. 2023 Dec 13:rs.3.rs-3607399.
Farndale, L., Insall, R., Yuan, K. TriDeNT: Triple Deep Network Training for Privileged Knowledge Distillation in Histopathology. 2023; arXiv:2312.02111.
Liu, D., Young, F., Robertson, D.*, Yuan, K.* (2023) Prediction of virus-host association using protein language models and multiple instance learning. bioRxiv 2023.04.07.536023.
Yang, X., Liu, W., Macintyre, G., Van Loo, P., Markowetz, F., Bailey, P.*, Yuan, K.* (2023) Pan-cancer evolution signatures link clonal expansion to dynamic changes in the tumour immune microenvironment. bioRxiv 2023.10.12.560630.
2021
Dentro, S.C., Leshchiner, I., Hasse, K., Tarabichi, M., ..., Yuan, K., Gerstung, M., Spellman, P.T., Wang, W., Morris, Q.D., Wedge, D.C., Van Loo, P., PCAWG Evolution and Heterogeneity Working Group, PCAWG Consortium. Characterizing genetic intra- tumor heterogeneity across 2,658 human cancer genomes. Cell. 2021; 184(8), 2239-2254.e39. (Glasgow & Cambridge Working Group lead)
Macintyre G, Piskorz AM, Berman A, Ross E, Morse DB, Yuan K, Ennis D, Pike JA, Goranova T, McNeish IA, Brenton JD, Markowetz F. FrenchFISH: Poisson Models for Quantifying DNA Copy Number From Fluorescence In Situ Hybridization of Tissue Sections. JCO Clin Cancer Inform. 2021 Feb;5:176-186.
Lab Members
Postdoctoral Scientists
Chris Walsh (UoG, Joint with David Chang)
Fran Young (UoG, Joint with David Robertson)
Kieran Lamb (UoG, Joint with David Robertson)
Seyed Mousavi (UoG)
Yilong Yang (UoG, PCUK)
PhD students
Dan Liu (UoG, Joint with David Robertson)
David Meltzer (UoG, Joint with David Chang)
Farzaneh Seyedshahi (CRUK SI, Joint with John Le Quesne)
Khanh Nguyen (UoG, Joint with David Chang)
Kai Rakovic (CRUK SI, Joint with John Le Quesne)
Lucas Farndale (CRUK SI, Joint with Robert Insall)
Rozeena Arif (UoG, Joint with Alfredo Castello & David Robertson)
Robert Strange (MRC, Joint with David Robertson and Joe Marsh)
Tommy Stevens (UoG, Joint with Campbell Roxburgh and Joanne Edwards)