Massivedatans
Big Data vs. complex physical models - a scalable nested sampling inference algorithm for many data sets
Install / Use
/learn @JohannesBuchner/MassivedatansREADME
========================================================================= Big Data vs. complex physical models - a scalable inference algorithm
A algorithm for fitting models against many data sets, giving parameter probability distributions. The key is that model evaluations are efficiently re-used between data sets, making the algorithm scale sub-linearly.
See paper for details: https://arxiv.org/abs/1707.04476
How to run
You need to install
- python-igraph
- numpy, scipy
- h5py
- progressbar
- gcc
Then run::
$ # build
$ make
$ # simulate data set
$ python gensimple_horns.py 10000
$ # analyse
$ OMP_NUM_THREADS=4 python sample.py data_widths_10000.hdf5 100
$ # simulate no-signal data set
$ python gennothing.py 10000 # simulate no-signal data set
$ # analyse
$ OMP_NUM_THREADS=4 python sample.py data_nothing_10000.hdf5 10000
See paper draft for details.
Improving Performance
See TODO.
Implementation notes and Code organisation
- sample.py sets up everything
- Set your problem definition (parameters, model, likelihood) in sample.py
- Integrator: multi_nested_integrator.py . Calls sampler repeatedly.
- Joint Sampler: multi_nested_sampler.py . This deals with managing the graph and the queues and which live points to use for a new draw. Calls draw_constrained
- The queues (paper) are called shelves in the code.
- RadFriends: hiermetriclearn.py: Suggests new samples from live points and filters with likelihood function to return a higher point.
- clustering/: Fast C implementations for checking if a point is in the neighbourhood and computing safe distances.
Related Skills
node-connect
353.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
claude-opus-4-5-migration
111.7kMigrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5
frontend-design
111.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
model-usage
353.3kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
