21 skills found
modelscope / FunClip: Open-source, accurate and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.
GURPREETKAURJETHRA / END TO END GENERATIVE AI PROJECTS: End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects
danilop / Multimodal Chat: A multimodal chat interface with many tools.
Ravi-Teja-konda / Surveillance Video Summarizer: VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
ksm26 / Open Source Models With Hugging Face: "Open Source Models with Hugging Face" course empowers you with the skills to leverage open-source models from the Hugging Face Hub for various tasks in NLP, audio, image, and multimodal domains.
Foxify52 / RVG Tts: A retrieval-based voice generation text-to-speech tool.
DAMO-NLP-SG / Multipurpose Chatbot: A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)
controlecidadao / Samantha Ia: Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and Gradio.
buildclubai / Bitesize Notebook Financial Advisor: 💵 AI-powered financial advisor that analyzes personal transaction data, generates insights, and provides personalized financial advice.
Dartvauder / NeuroTrainerWebUI: (Windows/Linux) Local WebUI for finetuning, evaluation and generation of neural network models (LLM and StableDiffusion) in Python (with a Gradio interface). Translated into 3 languages.
AlokTheDataGuy / Multi Agent Customer Service Chatbot: This project is a multi-agent customer service chatbot designed for an e-commerce platform. The chatbot employs specialized agents that handle distinct tasks to ensure efficient and accurate interactions. It aims to enhance user experience by streamlining order processing, answering FAQs, and providing personalized recommendations.
berhanu-tarekegn / Book Recommender: Semantic Book Recommender (Python, LLM, OpenAI, LangChain, Gradio)
PRITHIVSAKTHIUR / Orpheus TTS Edge: Play with Orpheus TTS, a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been fine-tuned to deliver human-level speech synthesis 🔥🗣️
PRITHIVSAKTHIUR / AI Art Generator SDXL: AUTOMATIC1111: Software for tensor operations, saving tensor data in .safetensors format. ComfyUI: UI library, possibly managing tensor data safely with *.safetensors. InvokeAI: ML platform using *.safetensors for secure tensor storage.
ehristoforu / TensorLM Webui: Simple and modern web UI for LLMs based on LLaMA.
TheAwaken1 / AIraoke Pinokio: Transform lyric transcriptions into karaoke-style MP4 videos. Built on Python-Lyric-Transcriber, this Gradio UI uses Whisper for transcription, an LLM for lyric edits, and Demucs for vocal separation. A fun tool for karaoke fans, though outputs may vary.
biodatlab / Gradio Meeting Summariser: Gradio app using Gemini to transcribe audio and summarize it in Thai governmental format
AstraBert / Supabase AI Chat: A journalist that knows lots of news about AI!📰💻
bdim404 / Abstracts Index Dify Plugin: A Dify plugin for semantic search across 110 million academic publications, powered by abstracts-search.
Lahdhirim / GENAI Company Brochure Huggingfacespaces: Modular AI tool to convert web content into markdown brochures using OpenRouter LLMs. Fully customizable and deployable via Gradio on Hugging Face Spaces.