Swscnn
Demo source code for the paper "Spin-Weighted Spherical CNNs".
Install / Use
/learn @daniilidis-group/SwscnnREADME
#+TITLE: Spin-Weighted Spherical CNNs
- Abstract Learning equivariant representations is a promising way to reduce sample and model complexity and improve the generalization performance of deep neural networks. The spherical CNNs are successful examples, producing SO(3)-equivariant representations of spherical inputs. There are two main types of spherical CNNs. The first type lifts the inputs to functions on the rotation group SO(3) and applies convolutions on the group, which are computationally expensive since SO(3) has one extra dimension. The second type applies convolutions directly on the sphere, which are limited to zonal (isotropic) filters, and thus have limited expressivity. In this paper, we present a new type of spherical CNN that allows anisotropic filters in an efficient way, without ever leaving the spherical domain. The key idea is to consider spin-weighted spherical functions, which were introduced in physics in the study of gravitational waves. These are complex-valued functions on the sphere whose phases change upon rotation. We define a convolution between spin-weighted functions and build a CNN based on it. Experiments show that our method outperforms the isotropic spherical CNNs while still being much more efficient than using SO(3) convolutions. The spin-weighted functions can also be interpreted as spherical vector fields, allowing applications to tasks where the inputs or outputs are vector fields.
** Spherical image from vector field [[file:animations/dense_scalar.gif]]
** Spherical vector field from image [[file:animations/dense_vector.gif]]
-
Datasets The data used for the spherical MNIST experiments is available [[https://drive.google.com/file/d/1h7JwdjWalXZFoXCU8Ez1rLscWih8PcZ7][here]]. It was created using [[https://github.com/jonkhler/s2cnn/blob/f79f09bcd11704ca10d65f2020b9830c4d254bd4/examples/mnist/gendata.py][this file]] from [[https://github.com/jonkhler/s2cnn][jonkhler/s2cnn]].
-
Demo
A JAX implementation that reproduces the results of this paper can be found at [[https://github.com/google-research/google-research/tree/master/spin_spherical_cnns][this Google Research repo]].
- Reference Carlos Esteves, Ameesh Makadia, Kostas Daniilidis. "Spin-Weighted Spherical CNNs", arXiv'20. http://arxiv.org/abs/2006.10731v1
#+BEGIN_SRC bibtex @article{esteves20_spin_weigh_spher_cnns, author = {Esteves, Carlos and Makadia, Ameesh and Daniilidis, Kostas}, title = {Spin-Weighted Spherical Cnns}, journal = {CoRR}, year = 2020, url = {http://arxiv.org/abs/2006.10731v1}, archivePrefix ={arXiv}, eprint = {2006.10731}, primaryClass = {cs.CV}, } #+END_SRC
- Authors [[http://machc.github.io][Carlos Esteves]] [1], [[http://www.ameeshmakadia.com][Ameesh Makadia]] [2], [[http://www.cis.upenn.edu/~kostas/][Kostas Daniilidis]] [1]
[1] [[http://grasp.upenn.edu][GRASP Laboratory]], [[http://www.upenn.edu][University of Pennsylvania]]
[2] [[http://research.google.com][Google Research]]
Related Skills
node-connect
345.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
104.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
345.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
345.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Security Score
Audited on Jul 29, 2024
