Results for "image-text"

Updated 13m ago

ShareX / ShareX

36.0k

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

capture, color-picker, csharp, +17

openai / CLIP

33.0k

CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image.

deep-learning, machine-learning

Updated 54m ago

HumanSignal / LabelImg

24.9k

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

annotations, deep-learning, detection, +6

karakeep-app / Karakeep

24.4k

A self-hostable bookmark-everything app (links, notes, and images) with AI-based automatic tagging and full-text search.

bookmark-manager, bookmarks, bookmarks-manager, +4

Updated 5m ago

borisdayma / Dalle Mini

14.8k

DALL·E Mini - Generate images from a text prompt

Updated 1d ago

zai-org / CogVideo

12.6k

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

cogvideox, image-to-video, llm, +3

openai / Shap E

12.2k

Generate 3D objects conditioned on text or images

Updated 5h ago

lucidrains / DALLE2 Pytorch

11.3k

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

artificial-intelligence, deep-learning, text-to-image

Updated 6h ago

google / Skia

10.6k

Skia is a complete 2D graphics library for drawing text, geometries, and images. See the documentation for contribution instructions.

Updated 19m ago

easydiffusion / Easydiffusion

10.3k

An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

art, diffusion, generative-art, +2

Acly / Krita AI Diffusion

9.9k

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with an optional text prompt, no tweaking required.

generative-ai, krita-plugin, stable-diffusion

Updated 5h ago

ashawkey / Stable Dreamfusion

8.8k

Text-to-3D, image-to-3D, and mesh export with NeRF + diffusion.

dreamfusion, gui, image-to-3d, +3

Updated 4h ago

lucidrains / Imagen Pytorch

8.4k

Implementation of Imagen, Google's text-to-image neural network, in PyTorch

artificial-intelligence, deep-learning, imagination-machine, +2

Updated 7h ago

vietnh1009 / ASCII Generator

8.2k

ASCII generator (image to text, image to image, video to video)

ascii, ascii-art, ascii-generator, +6

QwenLM / Qwen Image

7.7k

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

microsoft / Presidio

7.5k

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

anonymization, data-anonymization, data-masking, +17

open-mmlab / Mmagic

7.4k

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: generative AI (AIGC), easy-to-use APIs, an awesome model zoo, and diffusion models for text-to-image generation, image/video restoration and enhancement, and more.

aigc, computer-vision, deep-learning, +16

Updated 1d ago

kreuzberg-dev / Kreuzberg

7.2k

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use it via the CLI, REST API, or MCP server.

claude code, cursor

bun, csharp, document-intelligence, +17

Updated 30m ago

enricoros / Big AGI

6.9k

AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. Includes AI personas, AGI functions, world-class Beam multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.

claude code, claude desktop, +1

agi, ai-agents, ai-suite, +15