PKSmart
PKSmart: Predicting PK properties using Chemical Structures
Install / Use
/learn @srijitseal/PKSmartREADME
PKSmart Repository Structure
Overview
PKSmart is a repository dedicated to predicting pharmacokinetic (PK) properties, including parameters like clearance (CL), volume of distribution (VdSS), and the fraction unbound (fup), based on chemical structures. The project integrates machine learning and chemical informatics to model PK properties for various compounds, including human, dog, rat, and monkey data.
Using local implementation
Download essential files from https://doi.org/10.5281/zenodo.10611606 and run locally!
Run Online on Server v3.0.0
If you prefer to use the predictor online via Uppsala University SciLifeLab Serve:https://pk-predictor.serve.scilifelab.se/
Data Files
data/: Contains datasets used in training and testing PKSmart models. Includes curated datasets for human and animal pharmacokinetic (PK) parameters.
Jupyter Notebooks
These notebooks provide data processing, model training, evaluation, and analysis workflows.
Data Exploration and Preprocessing
00_check_data_pka.ipynb: Examines and preprocesses PK data, ensuring data quality and consistency.00b_animal_human_lmplot_CL.ipynb: Creates linear regression plots comparing clearance (CL) between human and animal datasets.00c_animal_human_lmplot_VdSS.ipynb: Generates regression plots comparing volume of distribution (VDss) between humans and animals.00d_animal_human_lmplot_fup.ipynb: Visualizes relationships for fraction unbound in plasma (fu) between human and animal datasets.
Human PK Prediction
01_Predict_human_data_Mordred.ipynb: Predicts human PK parameters using Mordred descriptors.01_Predict_human_data_Morgan.ipynb: Predicts human PK parameters using Morgan fingerprints.01_Predict_human_data_Morgan_Mordred.ipynb: Combines Morgan fingerprints and Mordred descriptors for human PK predictions.
Animal PK Prediction
02_Predict_rat_data.ipynb: Predicts PK parameters for rats.02_Predict_dog_data.ipynb: Predicts PK parameters for dogs.02_Predict_monkey_data.ipynb: Predicts PK parameters for monkeys.02b_Analyse_animal_models.ipynb: Analyzes and compares the performance of animal PK models.
Advanced Model Training
03_Predict_human_data_with_artificial_real_animal_data_only.ipynb: Uses real animal data where available and artificial animal data for human PK predictions.03_MedianMordredCalculator_artificial_animal_data_mfp_mrd.ipynb: Calculates median Mordred descriptors and integrates animal data for enhanced PK predictions.03_Predict_human_data_mean_predictor.ipynb: Implements a mean predictor model as a baseline for comparisons.03_Predict_human_data_with_artificial_animal_data_mfp.ipynb: Uses artificial animal data and Morgan fingerprints03_Predict_human_data_with_artificial_animal_data_mrd.ipynb: Uses artificial animal data and Mordred descriptors03_Predict_human_data_with_artificial_animal_data_mfp_mrd.ipynb: Uses artificial animal data with Morgan fingerprints and Mordred descriptors.03_Predict_human_data_artificial_real_animal_data_mfp_mrd.ipynb: Uses artificial and real animal data with Morgan fingerprints and Mordred descriptors
Chemical Space and Model Analysis
07_Analyse-Chemical_space_human_dataset_animal_dataset.ipynb: Explores and visualizes chemical space for human and animal datasets.07_Analyse-Chemical_space_nested_cross_val.ipynb: Evaluates chemical space coverage across nested cross-validation splits.08_Analyse_Performance_NCV_Wo_Outputs.ipynb: Assesses model performance in nested cross-validation (NCV)09_Distribution and Drug Space.ipynb: Analyzes the distribution of PK parameters and drug chemical space.09b_Figures_Cross_Val_Fold_errors.ipynb: Generates figures to evaluate cross-validation fold errors.10_Evaluate_with_fold_errors.ipynb: Final evaluation of model predictions with calculated fold errors.
Additional Files
External_Test_ALL_Lombardo_FDA_CL_fu_V2: External test dataset for evaluating model performance against independent data sources.
Installation
Using PyPI
You can install the DILI Predictor using pip: pip install pksmart. Please use Python <3.12, >=3.9
For details see https://pypi.org/project/pksmart/
Cite
If you use PKSmart in your work, please cite:
PKSmart: An Open-Source Computational Model to Predict in vivo Pharmacokinetics of Small Molecules Srijit Seal, Maria-Anna Trapotsi, Manas Mahale, Vigneshwari Subramanian, Ola Spjuth, Nigel Greene, Andreas Bender bioRxiv 2024.02.02.578658; doi: https://doi.org/10.1101/2024.02.02.578658
Acknowledgements
Developed and maintained by Srijit Seal and contributors.
Contact
For any questions or issues, please open an issue on the GitHub repository.
Related Skills
node-connect
337.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.2kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
337.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.2kCommit, push, and open a PR
