7,255 skills found · Page 2 of 242
OpenRefine / OpenRefineOpenRefine is a free, open source power tool for working with messy data and improving it
opendataloader-project / Opendataloader PdfPDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
keplergl / Kepler.glKepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
cleanlab / CleanlabCleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
simonw / DatasetteAn open source multi-tool for exploring and publishing data
r-spacex / SpaceX API:rocket: Open Source REST API for SpaceX launch, rocket, core, capsule, starlink, launchpad, and landing pad data.
rerun-io / RerunAn open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data
unopim / UnopimA free and open-source Laravel-based Product Information Management (PIM) system that helps businesses organize, manage, and enrich their product data from a single, central platform. Learn how UnoPIM scales to handle over 10 million products: https://unopim.com/scaling-unopim-for-10-million-products/
ricklamers / GridstudioGrid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
calesthio / CrucixYour personal intelligence agent. Watches the world from multiple data sources and pings you when something changes.
mark3labs / MCP GoA Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.
PrivateBin / PrivateBinA minimalist, open source online pastebin where the server has zero knowledge of pasted data. Data is encrypted/decrypted in the browser using 256 bits AES.
zilliztech / Deep SearcherOpen Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
microsoft / PresidioAn open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
evidentlyai / EvidentlyEvidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
authzed / SpicedbOpen Source, Google Zanzibar-inspired database for scalably storing and querying fine-grained authorization data
ChartsCSS / Charts.cssOpen source CSS framework for data visualization.
cloudquery / CloudqueryData pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
openblocks-dev / Openblocks🔥 🔥 🔥 The Open Source Retool Alternative
nickscamara / Open Deep ResearchAn open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl