Drug Delivery
Zilu Zhang (he/him/his)
PhD Candidate
Department of Biomedical Engineering, Duke University
Durham, North Carolina, United States
Bárbara B. Mendes
Postdoctoral Researcher
ToxOmics, NOVA Medical School, Faculdade de Ciências Médicas, NMS|FCM, Universidade NOVA de Lisboa, United States
João Conniot
Postdoctoral Researcher
ToxOmics, NOVA Medical School, Faculdade de Ciências Médicas, NMS|FCM, Universidade NOVA de Lisboa, United States
Diana P. Sousa
PhD Student
ToxOmics, NOVA Medical School, Faculdade de Ciências Médicas, NMS|FCM, Universidade NOVA de Lisboa, United States
João M. J. M. Ravasco
Research Scholar
ToxOmics, NOVA Medical School, Faculdade de Ciências Médicas, NMS|FCM, Universidade NOVA de Lisboa, United States
Andżelika Lorenc
Research Scholar
Instituto de Investigação do Medicamento (iMed), Faculdade de Farmácia, Universidade de Lisboa, Av. Prof. Gama Pinto; Department of Biopharmacy, Ludwik Rydygier Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, United States
Tiago Rodrigues
Assistant Professor
Instituto de Investigação do Medicamento (iMed), Faculdade de Farmácia, Universidade de Lisboa, Av. Prof. Gama Pinto, United States
João Conde
Assistant Professor
ToxOmics, NOVA Medical School, Faculdade de Ciências Médicas, NMS|FCM, Universidade NOVA de Lisboa, United States
Daniel Reker
Assistant Professor
Department of Biomedical Engineering, Duke University
Durham, North Carolina, United States
Inorganic nanoparticles have become an important tool as cancer therapeutics, diagnostics, and theranostics. However, notwithstanding the achievements that have been made, it remains challenging to objectively, comprehensively, and systematically define key design requirements to ensure the success of inorganic nanomedicines in cancer treatment. To discover such design rules, we have curated the world’s largest database of inorganic nanoparticles in preclinical cancer research and use statistics and machine learning to capture trends in nanoparticle design, correlate nanoparticle properties with treatment outcomes, and provide predictive tools that can guide the creation of safe and efficacious nanoparticles for cancer drug delivery (Figure 1a).
A database integrating data from 707 publications was established through a semi-automated search with manual data curation from the PubMed database, initiated using the search terms “in vivo” + “tumor” + “cancer” and “gold nanoparticles” or “silica nanoparticles” or “iron oxide nanoparticles”, covering the period from January 2007 to June 2020. Data was manually extracted from these studies except for biodistribution values that were digitized from manuscript figures using WebPlotDigitizer. For machine learning, we filtered features present in less than 60% of the database, and only studies were kept that had all the selected features annotated. Categorical features were transformed using either one-hot encoding or ordinal encoding depending on the inherent relationship between categories within each feature. Discriminative machine learning models were built and evaluated using repeated 10-fold cross validation. Model interpretation was investigated using SHAP (SHapley Additive exPlanations) analysis on the random forest classifier. Linear discriminant analysis was performed to reduce the dimensionality of the data to 2D for clustering analysis.