12 skills found
OpenDCAI / DataFlex: DataFlex is a data-centric training framework that enhances model performance by either selecting the most influential samples, optimizing their weights, or adjusting their mixing ratios.
GENIE-MC / Generator: The GENIE Generator is a leading simulation tool for neutrino experiments, featuring a modular framework with advanced physics, BSM channels, and tuned models from global data analysis. It supports all neutrinos, targets, and energies (MeV–PeV), with tools for flux, geometry, event generation, and reweighting.
tummfm / Difftre: Learning neural network potentials from experimental data via Differentiable Trajectory Reweighting
vodp / Py Kmm: A Python implementation of the Kernel Mean Matching data reweighting algorithm
deaneckles / Multiway Bootstrap: Implementation of the multiway bootstrap (including the Pigeonhole bootstrap and the reweighting tensor bootstrap). Reweights each observation with the product of the weights for the units that observation belongs to (e.g., from crossed random effects). Owen, A.B., & Eckles, D. (2012). Bootstrapping data arrays of arbitrary order. Annals of Applied Statistics, 6(3), 895-927.
2003pro / ScaleBiO: This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
bfshi / ARML Auxiliary Task Reweighting: Code for our paper "Auxiliary Task Reweighting for Minimum-data Learning" (NeurIPS 2020)
ddehun / Coling2022 Reweighting Sts: Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"
ericmetodiev / OmniFold: Universally unfolding collider data with machine learning-based reweighting.
sjoerdvanalten / UKBWeightsFinal: All code necessary to reproduce the results of "The costs of non-representative data: reweighting the UK Biobank corrects for pervasive selection bias due to volunteering" by Sjoerd van Alten, Ben Domingue, Titus Galama and Andries Marees.
YunzeTong / Latent Score Based Reweighting: [ICML 2025] Latent Score-Based Reweighting for Robust Classification on Imbalanced Tabular Data
snudatalab / DoReMe: Domain-Aware Data Selection for Speech Classification via Meta-Reweighting (Interspeech'24)