EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Install / Use
/learn @xetdata/EfficientSAMREADME
EfficientSAM
XetHub hosted fork of EfficientSAM using the GitHub plugin.
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
News
[Dec.5 2023] We release the torchscript version of EfficientSAM and share the Google colab notebook.
Online Demo & Examples
Online demo and examples can be found in the project page.
EfficientSAM Instance Segmentation Examples
<img width="848" alt="Screenshot 2023-12-06 at 1 16 13 PM" src="https://github.com/xetdata/EfficientSAM/assets/801507/6a4586e6-94cc-4eea-94e7-e78a6f57dcfe">Using the models
The model files live in this repo itself:
- EfficientSAM-S:models/efficientsam_s_gpu.jit
- EfficientSAM-Ti: models/efficientsam_ti_gpu.jit
Instructions
-
Install the git-xet extension.
-
Clone the repo:
# Using SSH
git clone git@github.com:xetdata/EfficientSAM.git
cd EfficientSAM/
- Activate your Python virtual environment and install the requirements:
pip install -r requirements.txt
- Start Jupyter Notebook:
# jupyter lab
jupyter lab
# jupyter notebook
jupyter notebook
- You can directly use EfficientSAM:
import torch
efficientsam = torch.jit.load(models/efficientsam_s_gpu.jit)
Google Colab
You can also use this Google Colab notebook
Acknowledgement
If you're using EfficientSAM in your research or applications, please cite using this BibTeX:
@article{xiong2023efficientsam,
title={EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything},
author={Yunyang Xiong, Bala Varadarajan, Lemeng Wu, Xiaoyu Xiang, Fanyi Xiao, Chenchen Zhu, Xiaoliang Dai, Dilin Wang, Fei Sun, Forrest Iandola, Raghuraman Krishnamoorthi, Vikas Chandra},
journal={arXiv:2312.00863},
year={2023}
}
Related Skills
node-connect
347.6kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
108.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
347.6kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
347.6kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
