<h1 align="center" style="border-bottom: none"> <div> <a href="https://embedditor.ai"> <img src="https://embedditor.ingestai.co/images/logo.jpg" width="80" /> <br> Embedditor </a> </div> </h1> <p align="center">Embedditor is the open-source MS Word equivalent for embedding that helps you get the most out of your vector search.</p> <div align="center">

</div> <p align="center"> <a href="https://embedditor.ai"><b>Website</b></a> • <a href="https://discord.gg/7gF8dVv86E"><b>Discord</b></a> • <a href="https://twitter.com/embedditor"><b>Twitter</b></a> • <a href="https://github.com/embedditor/embedditor/wiki/Embedditor-Docs"><b>Documentation</b></a> • <a href="https://ingestai.io"><b>Try demo on IngestAI</b></a> </p>

Get the most out of your vector search

Embedditor is an open source embedding pre-reprocessing editor, that helps you edit GPT / LLM embeddings just as if it's a Microsoft Word document, so you can get the most out of your vector search, while significanty reducing costs of embedding and vector storage.

Join Our Community

Features

Rich editor Interface

⚡ Join and split one or multiple chunks with a few clicks
⚡ Edit embedding metadata and tokens
⚡ Exclude words, sentences, or even parts of chunks from embedding
⚡ Select the parts of chunk you want to be embedded
⚡ Add additional information to your mebeddings, like url links or images
⚡ Get a nice looking HTML-markup for your AI search results
⚡ Save your pre-processed embedding files in .veml or .jason formats

Pre-processing automation

⚡ Filteer our from vectorization most of the 'noise', like punctuations or stop-words
⚡ Remove from embedidng unsignificant, requently used words with TF-IDF algorithm
⚡ Normalize your embedding tokens before vectorization

Benefits

Rich Spreadsheet Interface

⚡ Optimized relevance of the content retrieved from a vector database
⚡ Improved efficiency and accuracy in your AI / LLM-related applications
⚡ Visually better looking search results with images, url links, etc
⚡ Increased cost-efficiency with up to 30% cost-reduction on embedding and vector storage
⚡ Full control over your data, effortlessly deploying Embedditor locally on your PC or dedicated envirement
⚡ Save your pre-processed or ready embeddings in .json or .veml format to use it in LangChain, Chromat or any other Vector DB

Quick try

Sign up for free and try it in IngestAI.

GUI

Access Dashboard using: http://localhost:8080/

Screenshots

Installation

Copy .env.example into .env
Set the following settings in the .env

OPENAI_API_KEY=
Setup the project

php artisan migrate
php artisan db:seed
php artisan storage:link

Embedditor

Install / Use

README