# Zernike3d for cryo-EM analysis
Computes the Zernike moments (complex coefficients of the Zernike expansion) and Zernike invariants (rotationally invariant norms) from voxel-based MRC density maps (cryo-EM).

## Requirements
### Hardware
The fastest (and most luxurious) option is a GPU with plenty of VRAM, e.g. an NVIDIA RTX A6000 with 48 GB, but this is an extreme case; most users will have 12-24 GB of VRAM. Alternatively, a large amount of system RAM is also a good option, and many HPC nodes now have 500 GB - 1 TB of RAM. Computing everything on the GPU and in GPU memory is fastest. Unfortunately, due to the large number of arrays and limited transfer bandwidth, computing on the GPU and passing arrays to CPU memory is quite slow; in that case it is usually faster to compute directly on the CPU. The optimal approach is somewhat system dependent...
Zernike3D should check your system and determine the fastest computation strategy for your hardware. It prioritises:
- GPU VRAM > GPU/CPU VRAM/RAM > CPU RAM > CPU SSD/HD I/O.
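The selection logic isn't documented in detail; a minimal sketch of what such a decision rule might look like, assuming a hypothetical `choose_strategy` helper and caller-supplied memory figures:

```python
def choose_strategy(gpu_vram_gb, system_ram_gb, required_gb):
    """Pick where to hold the Zernike basis arrays.

    Hypothetical decision rule mirroring the stated priority order:
    GPU VRAM > GPU compute with CPU RAM staging > CPU RAM > disk I/O.
    """
    if gpu_vram_gb >= required_gb:
        return "gpu"            # everything fits in VRAM: fastest
    if gpu_vram_gb > 0 and system_ram_gb >= required_gb:
        return "gpu+cpu_ram"    # compute on GPU, stage arrays in RAM
    if system_ram_gb >= required_gb:
        return "cpu"            # pure CPU, arrays in system RAM
    return "disk"               # fall back to out-of-core (zarr) I/O

print(choose_strategy(48, 256, 40))   # -> gpu
print(choose_strategy(12, 256, 40))   # -> gpu+cpu_ram
```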
### Memory
Computing moments to order n=50 represents an expansion of ~12,051 Zernike polynomials. Zernike3D minimises this by exploiting redundancy and recurrence relations; however, this still requires 675 radial polynomials and 1,326 spherical harmonics. Consequently, memory usage scales poorly with volume size. It is strongly recommended to crop the volume to remove as much unnecessary solvent as possible without clipping the signal of the macromolecule. Alternatively, a soft mask can be applied and the masked subvolume decomposed (once again, after cropping this region).
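The ~12,051 figure can be reproduced by counting the independent moment indices (n, l, m) with 0 ≤ l ≤ n ≤ 50 and n − l even, keeping only m ≥ 0 (storing only m ≥ 0 is an assumption here, justified because the m < 0 moments follow by conjugate symmetry); a quick check:

```python
def count_moments(order):
    """Count independent 3D Zernike moments up to a given order.

    Valid indices: 0 <= l <= n <= order with n - l even; only m >= 0
    is counted, since m < 0 moments follow by conjugate symmetry.
    """
    return sum(l + 1
               for n in range(order + 1)
               for l in range(n % 2, n + 1, 2))

def count_spherical_harmonics(order):
    # Y_lm with 0 <= l <= order and m >= 0
    return sum(l + 1 for l in range(order + 1))

print(count_moments(50))              # 12051
print(count_spherical_harmonics(50))  # 1326
```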

### Resolution
To further minimise memory requirements, consider whether the full pixel-size sampling is necessary (e.g. by binning). A factor of 2, e.g. 200 -> 100 px, requires 90 GB less memory! Here is an example of the resolutions obtained for a 64-pixel volume at 1.4 Å pix<sup>-1</sup>. Currently, this relationship depends on the pixel size and the box size (aiming to remove/resolve this).
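Binning can be done in any cryo-EM package, but as a rough sketch, a factor-2 block average in numpy (simple real-space binning; Fourier cropping is the more common choice in practice, and this helper is illustrative, not part of Zernike3D) might look like:

```python
import numpy as np

def bin_volume(vol, factor=2):
    """Block-average a volume by an integer factor.

    Crops each axis to a multiple of `factor` first, then averages
    each factor^3 block into one voxel. Pixel size grows by `factor`.
    """
    n = (np.array(vol.shape) // factor) * factor
    v = vol[:n[0], :n[1], :n[2]]
    return v.reshape(n[0] // factor, factor,
                     n[1] // factor, factor,
                     n[2] // factor, factor).mean(axis=(1, 3, 5))

vol = np.random.rand(200, 200, 200).astype(np.float32)
print(bin_volume(vol).shape)  # (100, 100, 100)
```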
## Dependencies
- python==3.10
- numpy
- numba
- cupy
- mrcfile
- joblib
- scipy
- tqdm
- zarr
- ray
- nvidia-ml-py3
The cupy install can be tricky: check your CUDA version and follow the cupy documentation. I recommend the `pip install cupy-cuda12x` approach. Note that `nvidia-smi` doesn't always report the CUDA version that is actually on your system path; you should check this.
## Install

```shell
mamba create -n zernike3d python==3.10
mamba activate zernike3d
mamba install -c conda-forge numpy numba mrcfile joblib scipy tqdm zarr cupy
python -m pip install ray nvidia-ml-py3 cupy-cuda12x
```
## Examples
Decompose a single volume in MRC format into the Zernike moments (saved as a .npz file).

```shell
python zernike_master.py decomposition \
--mode spharm --input meprin_64pix.mrc --order 50 --compute_mode 3 --output test.npz --gpu_id 0 --verbose
```
Reconstruct a single volume from a set of Zernike moments.

```shell
python zernike_master.py reconstruct \
--mode spharm --moments test.npz --order 50 --compute_mode 3 --output reconstructed.mrc --gpu_id 0 --verbose --dimensions 64
```
Decompose a 3DVA job from cryoSPARC in WIGGLE (.wgl/.npz) format into Zernike moments (saved as a .npz file).
To handle the 3DVA base volume and component maps, these must be bundled together with the scaling factors from cryoSPARC. WIGGLE provides a utility that takes the map.mrc, component_0.mrc, component_1.mrc, etc., and particles.cs files and outputs a single bundled_3dva.npz file. These files are pretty simple, so you could probably generate the file yourself with a few lines of python:
```python
>>> import numpy as np
>>> f = np.load('test_3dva.npz', allow_pickle=True)
>>> f.keys
<bound method Mapping.keys of NpzFile 'test_3dva.npz' with keys: latents, dim, apix, arr_0, arr_1...>
>>> f['latents'][0:10] #The scalar weights of the 3DVA latent space
array([[  7.4729023 ,   6.832714  , -33.30121   ,   5.044608  ,
         11.376951  ],
       [ 19.616875  , -25.294249  , -36.11458   , -18.165325  ,
         48.48419   ],
       [-36.886692  ,  14.499297  , -15.675574  , -10.1494    ,
         50.46876   ],
       [  9.554419  ,  38.54905   ,   0.38004974, -14.412842  ,
         -3.0512257 ],
       [ 31.750261  ,  -3.0132551 ,  16.870565  ,  45.401466  ,
          6.4584966 ],
       [ -2.8442464 ,  28.176533  ,  17.966751  , -59.378757  ,
          7.5261383 ],
       [ 34.960415  ,  22.40595   , -15.593164  , -41.62918   ,
         -2.1461964 ],
       [ 43.29809   ,  32.435684  , -22.64612   ,  -7.7810197 ,
         -8.789153  ],
       [ 10.623635  , -49.933887  , -48.178192  ,  33.493988  ,
         31.926722  ],
       [-82.7782    ,  -4.7471595 ,  -1.859404  , -23.770464  ,
        -13.955286  ]], dtype=float32)
>>> f['apix']
array(1)
>>> f['arr_0'][0:10] #The moments of the first component
array([-5.5713095e-05+0.0000000e+00j,  1.3842816e-05+0.0000000e+00j,
       -5.5967530e-06-2.8170200e-05j, -3.6291796e-05+0.0000000e+00j,
       -3.9011447e-06+0.0000000e+00j, -4.5352658e-06-2.6491825e-06j,
        2.0415033e-07-2.2599554e-05j, -4.5965538e-05+0.0000000e+00j,
        3.1379077e-05+1.0098607e-04j, -1.1113875e-06+0.0000000e+00j],
      dtype=complex64)
```
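As a sketch of rolling your own bundle (the key names `latents`, `dim`, `apix`, `arr_0`, `arr_1`, ... are taken from the dump above; whether arr_0 holds the base map and whether Zernike3D expects additional keys are assumptions you should verify against a WIGGLE-generated file):

```python
import numpy as np

# Stand-ins for data you would read with mrcfile / from particles.cs:
base_map = np.random.rand(64, 64, 64).astype(np.float32)
components = [np.random.rand(64, 64, 64).astype(np.float32)
              for _ in range(2)]
latents = np.random.randn(1000, 2).astype(np.float32)  # per-particle 3DVA weights

# Assumed layout: arr_0 is the base map, arr_1.. are the components --
# compare against a WIGGLE-generated bundle before relying on this.
arrays = {f'arr_{i}': a for i, a in enumerate([base_map] + components)}
np.savez('bundled_3dva.npz', latents=latents,
         dim=np.array(64), apix=np.array(1.4), **arrays)

f = np.load('bundled_3dva.npz')
print(sorted(f.keys()))
```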
Providing a file of this type is handled automatically, and the moments are calculated for the base map and each of the variability components.
```shell
python zernike_master.py decomposition \
--mode spharm --input bundle.npz --order 50 --compute_mode 3 --output test_3dva.npz --gpu_id 0 --verbose
```
Reconstruct a single volume from Zernike moments for a given latent coordinate in the 3DVA space.
Providing `--z_ind` will fetch the scale factors stored in the Moments class and pre-scale the moments accordingly.
```shell
python zernike_master.py reconstruct \
--mode spharm --moments test.npz --z_ind 3450 --order 50 --compute_mode 3 --output reconstructed.mrc --gpu_id 0 --verbose --dimensions 64
```
The user can also provide the scalar weights directly via `--scales` as a list, e.g. `--scales 0 3 4 1`. These must match the dimensionality of the latent space.
```shell
python zernike_master.py reconstruct \
--mode spharm --moments test.npz --scales -34 0 0 0 --order 50 --compute_mode 3 --output reconstructed.mrc --gpu_id 0 --verbose --dimensions 64
```
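Because the Zernike decomposition is linear, scaling and summing moments is equivalent to scaling and summing the volumes themselves, which is what makes pre-scaling the moments cheap. A toy illustration with hypothetical moment vectors (not Zernike3D's actual data layout):

```python
import numpy as np

# Toy complex moment vectors for a base map and two components.
base = np.array([1 + 1j, 2 + 0j, 0 + 3j], dtype=np.complex64)
comps = np.stack([np.array([0 + 1j, 1 + 0j, 1 + 1j], dtype=np.complex64),
                  np.array([2 + 0j, 0 + 2j, 1 - 1j], dtype=np.complex64)])
scales = np.array([0.5, -1.0], dtype=np.float32)

# Pre-scale the component moments and add them to the base moments;
# reconstructing from `combined` equals reconstructing each volume
# separately and forming the same linear combination in real space.
combined = base + (scales[:, None] * comps).sum(axis=0)
print(combined)
```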
Providing neither `--scales` nor `--z_ind` defaults to reconstructing the base map and the component maps independently.
```shell
python zernike_master.py reconstruct \
--mode spharm --moments test.npz --order 50 --compute_mode 3 --output reconstructed.mrc --gpu_id 0 --verbose --dimensions 64
```
## Reducing memory requirements
The Zernike class can perform decomposition and reconstruction in a lower-precision 8-bit mode, `--bit8` (with some computational cost incurred), to reduce memory requirements. This is basically a lossy compression of the np.complex64 array to a np.uint8 format. The compression has very little effect on the volume quality.
```shell
python zernike_master.py decomposition \
--mode spharm --input bundle.npz --order 50 --compute_mode 3 --output test_3dva.npz --gpu_id 0 --verbose --bit8
```
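The exact 8-bit scheme isn't documented here; one plausible form of such lossy compression, sketched below (an illustration, not Zernike3D's implementation), is a per-array linear quantisation of the real and imaginary parts to uint8:

```python
import numpy as np

def quantise(moments):
    """Lossily pack complex64 moments into two uint8 planes."""
    planes, meta = [], []
    for part in (moments.real, moments.imag):
        lo, hi = float(part.min()), float(part.max())
        scale = (hi - lo) / 255.0 or 1.0  # avoid /0 for constant arrays
        planes.append(np.round((part - lo) / scale).astype(np.uint8))
        meta.append((lo, scale))
    return planes, meta

def dequantise(planes, meta):
    re = planes[0].astype(np.float32) * meta[0][1] + meta[0][0]
    im = planes[1].astype(np.float32) * meta[1][1] + meta[1][0]
    return (re + 1j * im).astype(np.complex64)

m = (np.random.randn(1000) + 1j * np.random.randn(1000)).astype(np.complex64)
planes, meta = quantise(m)
# Round-trip error is bounded by half a quantisation step per part.
print(np.abs(dequantise(planes, meta) - m).max())
```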
The Zernike class preallocates GPU memory via cupy's default memory pool, which often claims larger chunks of memory than strictly necessary. Disable pre-allocation with `--preallocate_memory_off`; this is slower, but uses less memory.
```shell
python zernike_master.py decomposition \
--mode spharm --input bundle.npz --order 50 --compute_mode 3 --output test_3dva.npz --gpu_id 0 --verbose --preallocate_memory_off
```
The Zernike class can handle cropping and binning if you provide a UCSF Chimera .cmm file with the centre position.
```shell
python zernike_master.py decomposition \
--mode spharm --input bundle.npz --order 50 --compute_mode 3 --output test_3dva.npz --gpu_id 0 --verbose --com center.cmm --bin 2
```

```shell
(zernike3d) charles $ head center.cmm
<marker_set name="J21_map.mrc center">
<marker id="1" x="118.09" y="119.05" z="111.43" r="0.7" g="0.7" b="0.7" radius="0.82"/>
</marker_set>
```
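The .cmm format is plain XML, so the centre can be read with the standard library (a sketch using the marker from the example above; Zernike3D's own parser may differ):

```python
import xml.etree.ElementTree as ET

cmm = '''<marker_set name="J21_map.mrc center">
<marker id="1" x="118.09" y="119.05" z="111.43" r="0.7" g="0.7" b="0.7" radius="0.82"/>
</marker_set>'''

# The centre of mass is stored as x/y/z attributes of the first marker.
marker = ET.fromstring(cmm).find('marker')
center = tuple(float(marker.get(k)) for k in ('x', 'y', 'z'))
print(center)  # (118.09, 119.05, 111.43)
```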
Note that while it is possible to run `--bit8` and `--preallocate_memory_off` simultaneously, it might simply be faster to perform the computation on the CPU in system RAM instead of on the GPU. You should test these options on your system...
The Moments class has utilities for loading, handling, and viewing these moments.
```python
>>> from zernike_master import Moments
>>> m = Moments()
>>> m.load('test.npz')
Loading test.npz into Moments object
Detected full precision moments (single).
>>> m.
m.add_dimension()         m.display_invariants()  m.get_order()  m.load(         m.order        m.stats()
m.apix                    m.display_moments()     m.get_size()   m.moments       m.save(        m.voxdim
m.calculate_invariants()  m.get_moment(           m.invariants   m.ndinvariants  m.set_latents(
m.dimensions              m.get_moment_size()     m.latents      m.ndmoments     m.set_moment(
>>> m.stats()
+-------------------------------------------------------+
|Moment statistics                                      |
|    Dimensions: 2                                      |
|    Moments: 12051                                     |
|    Order: 50                                          |
|    nbytes: 2918032                                    |
|    type: complex64                                    |
|                                                       |
|    Invar
```
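The rotation invariants mentioned at the top (cf. `m.calculate_invariants()`) are conventionally the norms F_nl = sqrt(Σ_m |Ω_nlm|²) over each (n, l) subspace, as in Novotni & Klein's 3D Zernike descriptors; a sketch assuming moments stored in a dict keyed by (n, l, m) with only m ≥ 0 (both assumptions, not Zernike3D's actual layout):

```python
import numpy as np

def invariants(moments):
    """Rotation-invariant norms F_nl = sqrt(sum_m |omega_nlm|^2).

    `moments` maps (n, l, m) -> complex moment with m >= 0 only;
    the m < 0 contributions are added via conjugate symmetry.
    """
    acc = {}
    for (n, l, m), w in moments.items():
        acc[(n, l)] = acc.get((n, l), 0.0) + abs(w) ** 2
        if m > 0:  # account for the conjugate-symmetric -m moment
            acc[(n, l)] += abs(w) ** 2
    return {k: np.sqrt(v) for k, v in acc.items()}

toy = {(2, 2, 0): 1 + 0j, (2, 2, 1): 0 + 1j, (2, 2, 2): 1 + 1j}
print(invariants(toy)[(2, 2)])  # sqrt(1 + 2*1 + 2*2) = sqrt(7)
```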