5,415 skills found · Page 1 of 181
CompVis / Stable Diffusion: A latent text-to-image diffusion model
ShareX / ShareX: ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.
openai / CLIP: CLIP (Contrastive Language-Image Pretraining) predicts the most relevant text snippet given an image
HumanSignal / LabelImg: LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively developed, but you can check out Label Studio, the open-source data labeling tool for images, text, hypertext, audio, video, and time-series data.
karakeep-app / Karakeep: A self-hostable bookmark-everything app (links, notes, and images) with AI-based automatic tagging and full-text search
borisdayma / Dalle Mini: DALL·E Mini - Generate images from a text prompt
zai-org / CogVideo: Text-to-video and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
openai / Shap E: Generate 3D objects conditioned on text or images
lucidrains / DALLE2 Pytorch: Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
google / Skia: Skia is a complete 2D graphics library for drawing text, geometries, and images. See documentation for contribution instructions.
easydiffusion / Easydiffusion: An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt and see the generated image.
Acly / Krita AI Diffusion: Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
ashawkey / Stable Dreamfusion: Text-to-3D, image-to-3D, and mesh export with NeRF + Diffusion.
lucidrains / Imagen Pytorch: Implementation of Imagen, Google's text-to-image neural network, in PyTorch
vietnh1009 / ASCII Generator: ASCII generator (image to text, image to image, video to video)
QwenLM / Qwen Image: Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
microsoft / Presidio: An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
open-mmlab / Mmagic: OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, and diffusion models for text-to-image generation, image/video restoration/enhancement, etc.
kreuzberg-dev / Kreuzberg: A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use via CLI, REST API, or MCP server.
enricoros / Big AGI: AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.