437 skills found · Page 1 of 15
Unstructured-IO / UnstructuredConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
purocean / YnA highly extensible Markdown editor. Version control, AI Copilot, mind map, documents encryption, code snippet running, integrated terminal, chart embedding, HTML applets, Reveal.js, plug-in, and macro replacement.
guangzhengli / ChatFilesDocument Chatbot — multiple files. Powered by GPT / Embedding.
ddangelov / Top2VecTop2Vec learns jointly embedded topic, document and word vectors.
pipwerks / PDFObjectA lightweight JavaScript utility for dynamically embedding PDFs in HTML documents.
Cysharp / MasterMemorySource Generator based Embedded Typed Readonly In-Memory Document Database for .NET and Unity.
msgi / Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc.
tjmlabs / ColiVaraColivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval performance on both text and visual documents. using vision models instead of chunking and text-processing for documents. No OCR, no text extraction, no broken tables, or missing images.
mobfarm / FastPdfKitA Static Library to be embedded on iOS applications to display pdf documents derived from Fast PDF
PoloDB / PoloDBPoloDB is an embedded document database.
nitrite / Nitrite JavaNoSQL embedded document store for Java
ruoccofabrizio / Azure Open AI Embeddings QnaA simple web application for a OpenAI-enabled document search. This repo uses Azure OpenAI Service for creating embeddings vectors from documents. For answering the question of a user, it retrieves the most relevant document and then uses GPT-3, GPT-3.5 or GPT-4 to extract the matching answer for the question.
curiosity-ai / Catalyst🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
Restream / ReindexerEmbeddable, in-memory, document-oriented database with a high-level Query builder interface.
jamesmartin / Inline SvgEmbed SVG documents in your Rails views and style them with CSS
embeddedartistry / Embedded ResourcesEmbedded Artistry Templates, Documents, and Source Code
0xdeadbeefJERKY / Office DDE PayloadsCollection of scripts and templates to generate Office documents embedded with the DDE, macro-less command execution technique.
mkusner / WmdWord Mover's Distance from Matthew J Kusner's paper "From Word Embeddings to Document Distances"
coatless / Quarto WebrCommunity developed Quarto Extension to Embed webR for HTML Documents, RevealJS, Websites, Blogs, and Books.
Tixierae / Deep Learning NLPKeras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP