SkillAgentSearch skills...

Humanizer X

4-pass AI text humanizer + voice agent humanization engine. 30 severity-ranked patterns, statistical fingerprint manipulation, SSML disfluency patterns, retail API enrichment for personalized calls, 6 voice modes. Free Claude Code skill — replaces $10-20/mo paid humanizers.

Install / Use

/learn @itsjwill/Humanizer X

README

HUMANIZER X — AI Text Humanizer That Actually Beats Detectors

Free alternative to Undetectable AI, WriteHuman, StealthWriter, Humanize AI, BypassGPT, HIX Bypass, Phrasly, and Netus AI

<div align="center">

Stars License Updated Claude Code

</div>

4-pass humanization engine that strips 30 AI writing patterns AND manipulates the statistical fingerprints (perplexity, burstiness) detectors actually measure. Runs free inside Claude Code.


Why HUMANIZER X exists

Every "AI humanizer" on the market does the same thing: swap words with synonyms and hope for the best. They charge $10-20/month for glorified find-and-replace.

HUMANIZER X works differently. We studied how AI detectors actually work — not just the vocabulary tells, but the statistical fingerprints they measure:

  • Perplexity (word predictability) — AI text scores below ~85. Humans are surprising.
  • Burstiness (sentence length variance) — AI writes uniform ~18-word sentences. Humans don't.
  • Entropy (structural predictability) — AI paragraphs follow invisible templates.

Most humanizers only address the vocabulary layer. HUMANIZER X addresses all three.


How it works: The 4-Pass Architecture

Pass 1: PATTERN REMOVAL     → Strip 30 AI tells (severity-ranked — worst first)
Pass 2: VOICE INJECTION     → Personality, opinions, cognitive artifacts, sensory anchoring
Pass 3: STATISTICAL TUNING  → Manipulate perplexity, burstiness, entropy signatures
Pass 4: VERIFICATION        → 8-point automated checklist with confidence score

Pass 1 — Pattern Removal (30 patterns, severity-ranked)

Not all AI patterns matter equally. HUMANIZER X ranks them by how aggressively detectors flag them:

| Severity | Patterns | Detector Impact | |----------|----------|-----------------| | CRITICAL | AI vocabulary ("delve", "crucial", "landscape"), uniform sentence length, copula avoidance ("serves as"), sycophantic tone | Triggers every detector | | HIGH | Em dash overuse, rule of three, significance inflation, -ing analyses, negative parallelisms, inline-header lists, generic conclusions | Caught by most detectors | | MEDIUM | Promotional language, vague attributions, synonym cycling, false ranges, filler phrases, excessive hedging, repetition at distance, perfect transitions | Sometimes caught | | LOW | Boldface overuse, title case, emojis, curly quotes, chatbot artifacts, cutoff disclaimers | Cosmetic |

CRITICAL patterns get fixed first. No point polishing curly quotes when the vocabulary screams AI.

Pass 2 — Voice Injection

Removing AI patterns is half the job. Sterile, voiceless writing is equally detectable. Modern deep learning classifiers detect the absence of human signals, not just the presence of AI signals.

HUMANIZER X injects:

  • Cognitive artifacts — Self-corrections, mid-thought pivots, callbacks to earlier points ("like I mentioned"), uncertainty markers
  • Sensory anchoring — "The dashboard lit up like a Christmas tree" vs "usage increased significantly"
  • Confidence gradient — Some claims stated firmly, others hedged, others openly uncertain
  • Self-reference — Text that refers back to its own earlier points (AI never does this)

Pass 3 — Statistical Tuning (the part nobody else does)

This is what separates HUMANIZER X from every other humanizer.

Burstiness injection:

  • Measures sentence length standard deviation across the text
  • AI text: σ < 5 (eerily uniform). Human text: σ > 8 (varied).
  • Injects fragments, questions, and long compound sentences to spike variance

Perplexity boost:

  • Finds the most "predictable" phrasing in each paragraph
  • Replaces with unexpected-but-valid alternatives
  • Calibrated to voice mode (casual slang vs academic jargon vs professional precision)

Entropy manipulation:

  • Checks sentence openers for repetitive patterns
  • Breaks structural templates between paragraphs
  • Adds parenthetical asides and rhetorical questions

Pass 4 — Verification

Every humanization ends with an automated quality check:

HUMANIZATION CONFIDENCE: HIGH

Checks: 8/8 PASS, 0 MARGINAL, 0 FAIL

4.1 Sentence length σ: 11.3 ✓
4.2 AI vocabulary: 0 remaining ✓
4.3 Sentence openers: all varied ✓
4.4 Cognitive artifacts: 4 found ✓
4.5 Burstiness range: 27 words ✓
4.6 Confidence gradient: mixed ✓
4.7 Em dashes: within limit ✓
4.8 Structural templates: all different ✓

If the score is LOW, HUMANIZER X automatically re-runs Passes 2-3 on flagged sections.


Voice Modes

HUMANIZER X adapts its output to match your context:

| Mode | Best For | Style | |------|----------|-------| | casual | Blog posts, social media, emails | Short, punchy, contractions, opinions, humor | | professional | Reports, proposals, business comms | Measured, precise, restrained personality | | academic | Papers, research, analysis | Complex clauses, field-specific terms, citations | | creative | Essays, narratives, opinion pieces | Wildly varied rhythm, vivid language, voice IS the content | | voice | Voice agent scripts, call scripts, TTS | Ultra-short, spoken cadence, filler words, verbal tics | | sdr | Cold emails, LinkedIn DMs, follow-ups | 3-5 sentences, "you"-focused, feels hand-typed |

/humanizer-x --mode casual

[paste your text]

Voice mode (for AI voice agents)

Makes scripts sound like a real person on the phone — not a chatbot reading a prompt. Adds spoken contractions ("gonna", "kinda"), natural fillers ("So," "Here's the thing"), and ultra-short sentences that work with TTS engines.

Before:

Our AI-powered content platform generates professional food photography for restaurants, improving their digital presence and increasing customer engagement across social media channels.

After (voice mode):

So basically we take your food photos and make them look incredible. Like, restaurant-magazine level. You post them on Instagram, people start saving them, sharing them... and honestly most of our clients see way more engagement within the first week. It's kinda wild.

SDR mode (for cold outreach)

3-5 sentences max. "You" > "We". Zero buzzwords. Feels hand-typed, not mass-blasted. Subject lines in lowercase.

Before:

Dear Restaurant Owner, I hope this email finds you well. My name is Jamison and I represent CraveMode AI, a cutting-edge platform that leverages artificial intelligence to transform restaurant marketing...

After (sdr mode):

saw your pad thai on instagram — looks great but the lighting's killing it

we shoot AI food photos that look like you hired a $2k photographer. takes 5 minutes, not 5 hours

worth a quick look? i can send a free sample with one of your dishes

- jamison


Voice Agent Humanization Engine (the real differentiator)

Beyond text — a complete framework for making AI voice agents indistinguishable from humans on the phone. Works with Retell AI, Vapi, Bland AI, Synthflow, and any platform with LLM + TTS.

The 3-layer stack nobody else has:

Layer 1: PLATFORM INTEGRATION   → Use Retell AI / Vapi / Bland AI native humanization features
Layer 2: SCRIPT HUMANIZATION    → SSML disfluency patterns, prosody control, anti-robotic speech
Layer 3: LIVE RESPONSE TUNING   → Real-time LLM output humanization before TTS

Layer 1 — Platform-native humanization

Every platform has built-in humanization features that most builders never configure. HUMANIZER X tells you which levers to pull:

| Platform | Key Features | Edge | |----------|-------------|------| | Retell AI | Backchannel ("mhm," "yeah"), voice cloning, custom pronunciation (IPA/CMU), spaced dashes for pauses, knowledge base, MCP nodes | Most configurable humanization | | Synthflow | Filler words toggle, voice intonation free-text field, breathing patterns, emotional nuances | Zero-config — toggle on and it works | | Vapi | Sub-600ms latency, background sound injection, custom endpointing | Fastest response time | | Bland AI | Pathway engine (visual conversation flow), dynamic data, warm transfer rules | Most structured | | Fish.audio | Sub-300ms latency, endpointing models for turn-taking | Most natural conversation feel |

Plus pre-call enrichment via Google Places, Yelp, Instagram, and CRM data to personalize every call.

Layer 2 — SSML disfluency patterns

Filler words WITHOUT proper timing sound worse than no filler words. HUMANIZER X provides platform-specific patterns:

SSML (LiveKit, ElevenLabs, Cartesia):

Yeah, um <break time="300ms"/> so <break time="300ms"/>, I can do that

Retell AI (no SSML needed):

Yeah, um --- so --- I can do that

Synthflow: Just enable the Filler Words Toggle — it handles disfluencies automatically.

Plus: anti-robotic grammar rules, false starts, self-corrections, reactive listening signals ("Oh nice", "Right right", "Totally"), and personality modeling defined as audible behaviors instead of adjectives.

Layer 3 — Live response tuning

For voice agents generating real-time responses, HUMANIZER X provides:

  • System prompt templates with speaking rules, enrichment slots, and banned phrases
  • Response length caps per call phase (opening: 25-35 words, discovery: 15-20, close: 15-20)
  • Real-time adaptation rules (prospect goes quiet → don't fill silence immediately)
  • The 8-second rule: if the agent talks for 8+ seconds without the prospect responding, cut it
  • Platform-specific latency optimization (Retell ~800ms, Vapi <600ms, Fish.audio <300ms)

Before vs after

Without HUMANIZER X:

Hi, I'm calling from CraveMode. We help restaurants with their food photography. Would you be int

View on GitHub
GitHub Stars2
CategoryDevelopment
Updated12h ago
Forks0

Security Score

90/100

Audited on Mar 31, 2026

No findings