Confounds

Conquering confounds and covariates: methods, library and guidance

Generate Convert Improve

Install / Use

/learn @raamana/Confounds

About this skill

Quality Score

0/100

README

Conquering confounds and covariates in machine learning

.. image:: https://github.com/raamana/confounds/blob/master/confounds_card.jpg

.. image:: https://zenodo.org/badge/197298208.svg :target: https://zenodo.org/badge/latestdoi/197298208

.. image:: https://img.shields.io/pypi/v/confounds.svg :target: https://pypi.python.org/pypi/confounds

.. image:: https://img.shields.io/travis/raamana/confounds.svg :target: https://travis-ci.org/raamana/confounds

News


- **Hackathon folks**: Those coming here from the hackathon, please go here to learn some ideas for contribution: https://github.com/ohbm/hackathon2021/issues/34

- The previous slides for the OHBM Hackathon and Open Science Room are here: https://crossinvalidation.com/2020/03/04/conquering-confounds-and-covariates-in-machine-learning/

Vision / Goals
~~~~~~~~~~~~~~~

The high-level goals of this package is to develop high-quality library to conquer confounds and covariates in ML applications. By conquering, we mean methods and tools to

1. visualize and establish the presence of confounds (e.g. quantifying confound-to-target relationships),
2. offer solutions to handle them appropriately via correction or removal etc, and
3. analyze the effect of the deconfounding methods in the processed data (e.g. ability to check if they worked at all, or if they introduced new or unwanted biases etc).


Documentation
~~~~~~~~~~~~~~

https://raamana.github.io/confounds


Methods
~~~~~~~~

Available:

- Residualize (e.g. via regression)
- Augment (include confounds as predictors)
- Some utils

To be added:

- Harmonize (correct batch effects via rescaling or normalization etc)
- Stratify (sub- or re-sampling procedures to minimize confounding)
- Full set of utilities (Goals 1 and 3)
- reweight (based on propensity scores as in IPW, or based on confounds)
- estimate propensity scores

In a more schematic way:

.. image:: docs/schematic_method_impl_status.png



Resources
~~~~~~~~~
any useful resources; papers, presentations, lectures related to the problems of confounding can be found here https://github.com/raamana/confounds/blob/master/docs/references_confounds.rst



Citation
~~~~~~~~~~~~~~

If you found any parts of ``confounds`` to be useful in your research, directly or indirectly, **I'd appreciate if you could cite** the following:

- Pradeep Reddy Raamana (2020), "Conquering confounds and covariates in machine learning with the python library confounds", Version 0.1.1, Zenodo. http://doi.org/10.5281/zenodo.3701528


Contributors are most welcome.

Your contributions of all kinds will be greatly appreciated. Learn how to contribute to this repo here <CONTRIBUTING.rst>_.

All contributors making non-trivial contributions will be

publicly and clearly acknowledged on the authors page <AUTHORS.rst>_
become an author on the [software] paper to be published when it's ready soon.

Related Skills

proje

Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

groundhog

400

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

raamana

View profile

View on GitHub

GitHub Stars40

CategoryEducation

Updated4mo ago

Forks14

raamana/confounds

Languages

Python

Security Score

92/100

Audited on Nov 13, 2025

No findings