134 skills found · Page 1 of 5
antgroup / Echomimic[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
AIFengheshu / Plug Play Modules2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频目标分割、图像抠图、图像编辑、单目标跟踪、多目标跟踪、行人重识别、RGBT、图像去噪、去雨、去雾、去阴影、去模糊、超分辨率、去反光、去摩尔纹、图像恢复、图像修复、高光谱图像恢复、图像融合、图像上色、高动态范围成像、视频与图像压缩、3D点云、3D目标检测、3D语义分割、3D姿态识别等各类计算机视觉和图像处理任务,以及自然语言处理、大语言模型、多模态等其他各类人工智能相关任务。持续更新中......
design-edit / DesignEdit[AAAI2025] DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
DulyHao / AlphaForgeOfficial implementation for AAAI2025: AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
filaPro / Unidet3d[AAAI2025] UniDet3D: Multi-dataset Indoor 3D Object Detection
tsinghua-fib-lab / AAAI2025 MIA Tuner[AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector".
bytedance / DreamFit[AAAI2025] DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
chenxin-dlut / SUTrack[AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking
TROUBADOUR000 / AMDPyTorch Implementation of "Adaptive Multi-Scale Decomposition Framework for Time Series Forecasting" (AAAI2025)
JHLew / MoMoImplementation of "Disentangled Motion Modeling for Video Frame Interpolation", AAAI 2025
cure-lab / MotionCraft[AAAI 2025] Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"
yeungchenwa / HDR[AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents
tljxyys / GaussianSR[AAAI2025] This repo holds the code for work "GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution"
scu-zjz / SparseViTOfficial repository for the AAAI2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer)
924973292 / MambaPro【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
924973292 / DeMo【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification
ylwhxht / L4DRAAAI2025 Oral - L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection
qcf-568 / OSTF[AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI
whwjdqls / DEEPTalkOfficial code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]
sunsmarterjie / ChatterBox[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues