
Caco2ml

Main code chunks used for models in the publication "Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison to the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database"

"Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison to the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database"

American Chemical Society (ACS): Journal of Chemical Information and Modeling (JCIM)

Frank Filip Steinbauer, Thorsten Lehr, Andreas Reichel

http://pubs.acs.org/doi/abs/10.1021/acs.jcim.4c01083

This repository archives the main code chunks used for the local and global machine learning models in the publication "Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison to the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database", published in 2024 in the ACS Journal of Chemical Information and Modeling (JCIM) as the first publication of my doctoral studies at Bayer.

The five included files contain the main code chunks for:

  1. Data preparation (SMILES/molecule object standardization; PaDEL descriptor calculation)
  2. Global models (including other descriptor calculations and recursive feature elimination with cross-validation as well as external TDC benchmarking<sup>1</sup>)
  3. Local model (training data selection via fixed Tanimoto similarity criteria)
  4. Local model (training data selection via fixed amounts of most similar structures)
  5. Local model (training data selection via kNN<sup>2</sup> as control/proof of superiority of the chosen Tanimoto similarity approach)
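
The two similarity-based selection strategies for the local models (items 3 and 4) can be illustrated with a minimal sketch. This is not the code from the repository files: it assumes fingerprints are represented as plain Python sets of on-bit indices, and the function names (`tanimoto`, `select_by_threshold`, `select_top_n`), the compound IDs, and the threshold value 0.7 are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity of two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        return 0.0  # convention chosen here for two empty fingerprints
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def rank_by_similarity(
    query: Set[int], pool: Dict[str, Set[int]]
) -> List[Tuple[str, float]]:
    """Score every pool compound against the query, most similar first."""
    scored = [(cid, tanimoto(query, fp)) for cid, fp in pool.items()]
    # Sort by descending similarity; break ties by compound ID for determinism.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


def select_by_threshold(
    query: Set[int], pool: Dict[str, Set[int]], min_similarity: float
) -> List[str]:
    """Item 3: keep all compounds at or above a fixed similarity cutoff."""
    ranked = rank_by_similarity(query, pool)
    return [cid for cid, sim in ranked if sim >= min_similarity]


def select_top_n(query: Set[int], pool: Dict[str, Set[int]], n: int) -> List[str]:
    """Item 4: keep a fixed number of the most similar compounds."""
    return [cid for cid, _ in rank_by_similarity(query, pool)[:n]]


# Toy fingerprints (hypothetical data, not from the Caco-2 database).
pool = {
    "cpd_A": {1, 2, 3, 4},
    "cpd_B": {1, 2, 3, 9},
    "cpd_C": {7, 8, 9},
    "cpd_D": {1, 2, 3, 4, 5},
}
query = {1, 2, 3, 4}

threshold_set = select_by_threshold(query, pool, min_similarity=0.7)
top_three = select_top_n(query, pool, n=3)
print(threshold_set)  # ['cpd_A', 'cpd_D']
print(top_three)      # ['cpd_A', 'cpd_D', 'cpd_B']
```

With a fixed cutoff, the size of the local training set varies per query compound, whereas a fixed amount guarantees a constant training set size but may pull in dissimilar structures. A model would then be trained on the selected subset for each query.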

If you have further questions or need additional parts of the utilized code for your own studies, feel free to contact Filip.Steinbauer@bayer.com.
