apache / Incubator Devlake: Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
jaquadro / NBTExplorer: A graphical NBT editor for all Minecraft NBT data sources
obi1kenobi / Trustfall: A query engine for any combination of data sources. Query your files and APIs as if they were databases!
whylabs / Whylogs: An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
PhoebusSi / Alpaca CoT: We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts to open any meaningful PR on this repo and to integrate as many LLM-related technologies as possible. We built a fine-tuning platform that makes it easy for researchers to get started with and use large models, and we welcome open-source enthusiasts to submit any meaningful PR!
komoot / Photon: An open source geocoder for OpenStreetMap data
stochasticai / XTuring: Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJEk6
Open-Web-Analytics / Open Web Analytics: Official repository for Open Web Analytics, an open source alternative to commercial tools such as Google Analytics. Stay in control of the data you collect about the use of your website or app. Please consider sponsoring this project.
Netflix-Skunkworks / Scumblr: Web framework for running periodic syncs of data sources and analyzing the identified results
microsoft / Bond: Bond was a cross-platform framework for working with schematized data. The open-source project ended on March 31, 2025.
griddb / Griddb: GridDB is a next-generation open source database that makes time series IoT and big data fast and easy.
toolleeo / Awesome Cli Apps In A Csv: The largest curated Awesome list of command-line programs (CLI/TUI), with source data organized into CSV files
skishore / Makemeahanzi: Free, open-source Chinese character data
malloydata / Malloy: Malloy is a modern open source language for describing data relationships and transformations.
google-parfait / Tensorflow Federated: An open-source framework for machine learning and other computations on decentralized data.
CannerCMS / Cannercms: ⚡️ Content Management Framework that creates custom CMSs quickly and easily. Supports data sources such as Firebase/Firestore, GraphQL, and RESTful APIs.
Hedgehog-Computing / Hedgehog Lab: Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, with matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.
ballerine-io / Ballerine: Open-source infrastructure and data orchestration platform for risk decisioning
influxdata / Kapacitor: Open source framework for processing, monitoring, and alerting on time series data
graphieros / Vue Data Ui: An open source user-empowering data visualization Vue 3 components library for eloquent data storytelling