Results for "benchmark-dataset"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

1,299 skills found · Page 1 of 44

CLUEbenchmark / CLUE

4.2k

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

universal

albertbenchmarkbert+12

Updated 1d ago

FreedomIntelligence / Awesome AI4Med

2.6k

A curated list of medical LLMs, multimodal systems, datasets, benchmarks, and more. 🏥

universal

awesome-listscollectiondatasets+5

Updated 1d ago

github / CodeSearchNet

2.4k

Datasets, tools, and benchmarks for representation learning of code.

universal

bertcnndata+17

Updated 18h ago

beir-cellar / Beir

2.1k

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

universal

benchmarkbertcolbert+16

Updated 6h ago

snap-stanford / Ogb

2.1k

Benchmark datasets, data loaders, and evaluators for graph machine learning

universal

datasetsdeep-learninggraph-machine-learning+1

Updated 1d ago

Thinklab-SJTU / Bench2Drive

1.8k

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

universal

Updated 13h ago

ChineseGLUE / ChineseGLUE

1.8k

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

universal

albertbertchinese-corpus+5

Updated 1d ago

RoboVerseOrg / RoboVerse

1.7k

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

universal

imitation-learningreinforcement-learningrobotics+1

Updated 9h ago

doc-analysis / TableBank

1.1k

TableBank: A Benchmark Dataset for Table Detection and Recognition

universal

Updated 9d ago

RuihengZhang / IFSOD Dataset

1.0k

Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection

universal

Updated 1mo ago

YerevaNN / Mimic3 Benchmarks

879

Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.

universal

benchmarkclinical-datadeep-learning+1

Updated 4d ago

EpistasisLab / Pmlb

860

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.

universal

Updated 5d ago

LAMDA-Tabular / TALENT

834

A comprehensive toolkit and benchmark for tabular data learning, featuring 35+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.

universal

tabulartabular-datatabular-data-benchmark+4

Updated 19h ago

pangeo-data / WeatherBench

819

A benchmark dataset for data-driven weather forecasting

universal

benchmarkdatasetdeep-learning+1

Updated 2d ago

bcmi / Image Harmonization Dataset IHarmony4

804

[CVPR 2020] The first large-scale public benchmark dataset for image harmonization. The code used in our paper "DoveNet: Deep Image Harmonization via Domain Verification", CVPR2020. Useful for image harmonization, image composition, etc.

universal

composite-imagesdeep-image-compositiondeep-image-harmonization+5

Updated 12d ago

RobustBench / Robustbench

773

RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track]

zed

adversarial-machine-learningadversarial-robustnessbenchmark+1

Updated 5d ago

syncora-ai / Syncora Benchmarks

733

A lightweight, plug‑and‑play benchmark kit for synthetic data. Compare Syncora against other generators (e.g., Gretel, MostlyAI) by dropping in CSVs, then auto‑compute fidelity and similarity metrics. Works with any dataset via simple file naming no heavy setup needed.

universal

Updated 7h ago

google-research / Nasbench

717

NASBench: A Neural Architecture Search Dataset and Benchmark

universal

Updated 1d ago

DataScienceUIBK / Rankify

667

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.

universal

agentaichatgpt+9

Updated 1d ago

OpenDriveLab / OpenLane V2

663

[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving

universal

3d-lane-detectiontopology-reasoningtraffic-element-recognition

Updated 5d ago