
<div align="center"> <p align="center">OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation</p> <p align="center">IEEE Transactions on Artificial Intelligence, 2024 <br></p> <div align="center"> Fei Teng <b>·</b> <a href="https://www.researchgate.net/profile/Jiaming-Zhang-10" target="_blank">Jiaming Zhang</a> <b>·</b> <a href="https://www.researchgate.net/profile/Kunyu-Peng" target="_blank">Kunyu Peng</a> <b>·</b> <a href="https://www.researchgate.net/profile/Yaonan-Wang" target="_blank">Yaonan Wang</a> <b>·</b> <a href="https://www.researchgate.net/profile/Rainer-Stiefelhagen" target="_blank">Rainer Stiefelhagen</a> <b>·</b> <a href="https://www.researchgate.net/profile/Kailun-Yang" target="_blank">Kailun Yang</a> <br>

<a href="https://arxiv.org/abs/2307.15588" target="_blank">Paper</a>

</div> <p align="center">:hammer_and_wrench: :construction_worker: :rocket:</p> <p align="center">:fire: This repository is an integration of OAFuser and LFTracy. :fire:</p> </div> <div align=center><img src="assets/Figone.jpg" width="820" height="400" /></div>

Update

2024.08.04 The OAFuser repository is released.
2023.09.25 The code is being prepared.
2023.07.31 The arXiv version is released.
2023.07.29 The repository is initialized.

TODO List

[x] Release the arXiv version.
[x] Release the code for OAFuser.
[ ] Release the integration of OAFuser and LFTracy.
[ ] Release the training and evaluation strategy.
[ ] Release the checkpoints.

Abstract

Light field cameras can provide rich angular and spatial information to enhance image semantic segmentation for scene understanding in autonomous driving. However, the extensive angular information of light field cameras contains a large amount of redundant data, which is overwhelming for the limited hardware resources of intelligent vehicles. Moreover, inappropriate compression leads to information corruption and data loss. To extract representative information, we propose an Omni-Aperture Fusion model (OAFuser), which leverages dense context from the central view and discovers the angular information from sub-aperture images to generate a semantically consistent result. To avoid feature loss during network propagation while streamlining the redundant information from the light field camera, we present a simple yet effective Sub-Aperture Fusion Module (SAFM) that embeds sub-aperture images into angular features without any additional memory cost. Furthermore, to address the mismatched spatial information across viewpoints, we present a Center Angular Rectification Module (CARM) that performs feature resorting and prevents feature occlusion caused by asymmetric information.
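The abstract distinguishes between the central view and the surrounding sub-aperture images. The sketch below only illustrates that input organization for a light field stored as a grid of views. It is not the authors' code and it does not implement SAFM or CARM. The function name `split_light_field`, the nested-list layout, and the odd-sized grid requirement are assumptions made for illustration.

```python
from typing import List, Tuple, TypeVar

View = TypeVar("View")  # placeholder for an image; any object works here


def split_light_field(views: List[List[View]]) -> Tuple[View, List[View]]:
    """Split a U x V grid of views into (central view, other sub-aperture views).

    Hypothetical helper: assumes an odd-sized angular grid so that a unique
    central view exists.
    """
    if not views or not views[0]:
        raise ValueError("expected a non-empty grid of views")
    rows, cols = len(views), len(views[0])
    if rows % 2 == 0 or cols % 2 == 0:
        raise ValueError("expected an odd-sized angular grid")

    cu, cv = rows // 2, cols // 2  # angular coordinates of the central view
    central = views[cu][cv]
    # Every view except the central one, in row-major order.
    sub_aperture = [
        views[u][v]
        for u in range(rows)
        for v in range(cols)
        if (u, v) != (cu, cv)
    ]
    return central, sub_aperture


# Usage with string stand-ins for images in a 3 x 3 angular grid.
grid = [[f"view_{u}_{v}" for v in range(3)] for u in range(3)]
central, others = split_light_field(grid)
print(central)      # view_1_1
print(len(others))  # 8
```

In a real pipeline each entry would be an image array or tensor rather than a string. How the sub-aperture views are then fused with the central view is defined by the paper and the released code, not by this sketch.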

Method

<p align="center"> (Overview) </p> <p align="center"> <div align=center><img src="assets/Figtwo.jpg" width="850" height="330" /></div> <br><br>

🤝 Publication:

Please consider referencing this paper if you use the code or data from our work. Thanks a lot :)

@article{teng2024oafuser,
  title={OAFuser: Towards omni-aperture fusion for light field semantic segmentation of road scenes},
  author={Teng, Fei and Zhang, Jiaming and Peng, Kunyu and Wang, Yaonan and Stiefelhagen, Rainer and Yang, Kailun},
  journal={IEEE Transactions on Artificial Intelligence},
  year={2024}
}
