3 skills found
pd3f / Pd3f 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
pd3f / Dehyphen 📜 Dehyphenation of broken text (mainly German), e.g., text extracted from a PDF
pd3f / Pd3f Core 📑 Python package to reconstruct the original continuous text from PDFs with language models