182 skills found · Page 2 of 7
ltroin / Llm Attack Defense Arena - No description available
OSU-NLP-Group / AmpleGCG - AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLMs
Junjie-Chu / CJA Comprehensive Jailbreak Assessment - The public code repository for the paper "Comprehensive Assessment of Jailbreak Attacks Against LLMs"
ezztahoun / Attack Flow Detector - Find incidents, logs, events, and alerts relevant to all of your incidents. [Attack Flows, Attack Chains, & Root Cause Discovery - NO LLMs, NO Queries, Just Explainable Machine Learning] >> Use it for free here: https://app.cypienta.io
LiuYuancheng / Threats 2 MITRE AI Mapper - This program leverages AI-LLM technology to process human-language CTI documents and succinctly summarize the attack flow path they describe, mapping attack behaviors to MITRE ATT&CK and matching vulnerabilities to MITRE CWE.
Buyun-Liang / SECA - [NeurIPS 2025] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations
requie / LLMSecurityGuide - A comprehensive reference for securing Large Language Models (LLMs). Covers OWASP GenAI Top-10 risks, prompt injection, adversarial attacks, real-world incidents, and practical defenses. Includes catalogs of red-teaming tools, guardrails, and mitigation strategies to help developers, researchers, and security teams deploy AI responsibly.
Beijing-AISI / Panda Guard - Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).
xirui-li / DrAttack - Official implementation of the paper "DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers"
XHMY / AutoDefense - AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
facebookresearch / Meta SecAlign - Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".
datasec-lab / CodeBreaker - [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection
eth-sri / Llm Quantization Attack - [NeurIPS 2024 / ICML 2025] LLM Quantization Attacks
UseAI-pro / Openclaw Skills Security - Curated, security-first OpenClaw skills (Markdown-based). Security audit skills - detect prompt injection, supply chain attacks, credential leaks. Works with Codex CLI, Claude Code, any LLM.
wearetyomsmnv / Awesome LLM Agent Security - All about LLM-agent security: attacks, vulnerabilities, and how to carry them out for cybersecurity purposes.
jiayingwu19 / SheepDog - Data and code for "Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks" (KDD 2024)
kk12-30 / LLMs PromptAttacks - A prompt-attack tool for large AI models
AI45Lab / MAGIC - Code for the paper "MAGIC: A Co-Evolving Attacker-Defender Adversarial Game for Robust LLM Safety"
HKU-TASR / Imperio - [IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.
facebookresearch / Rl Injector - Official release of code for the paper "RL Is a Hammer and LLMs Are Nails: A Simple RL Approach to Stronger Prompt Injection Attacks"