Charamel

🌏 Truly Universal Encoding Detection in Python 🌎

Generate Convert Improve

Install / Use

/learn @chomechome/Charamel

About this skill

Quality Score

0/100

README

Truly Universal Encoding Detection in Python

Charamel is a pure Python universal character encoding library that supports all of Python character encodings. The library is based on machine learning and trained to handle more than 60 languages. All that with no external dependencies. Ain't it sweet? 🍭

Installation

$ pip install charamel

Features

🌈 Powered by machine learning
📦 No dependencies
⚡ Faster than other pure Python libraries
🐍 Supports all 98 Python encodings
🌍 Works on 60+ languages
🔎 97% accuracy

Usage

API is centered around Detector class, with detect method being responsible for basic encoding detection:

>>> from charamel import Detector
>>> detector = Detector()
>>> content = b'El espa\xf1ol o castellano del lat\xedn hablado'
>>> detector.detect(content)
<Encoding.ISO_8859_14: 'iso8859_14'>

This returns the most likely encoding that can decode the byte string. Let's try it out:

>>> from charamel import Encoding
>>> content.decode(Encoding.ISO_8859_14)
'El español o castellano del latín hablado'

To get multiple likely encodings along with confidences in range [0, 1], use probe method:

>>> detector.probe(content, top=3)
[(<Encoding.ISO_8859_14: 'iso8859_14'>, 0.9964286725192874),
 (<Encoding.CP_1258: 'cp1258'>, 0.9919203166700203),
 (<Encoding.ISO_8859_3: 'iso8859_3'>, 0.9915028923264849)]

Detector can be configured to use a subset of encodings. Less possible encodings lead to faster detection:

>>> detector = Detector(encodings=[Encoding.UTF_8, Encoding.BIG_5])

Another useful Detector parameter is min_confidence. Basically, this parameter regulates how conservative the Detector will be. Confidence for encodings that are returned by detect and probe methods must be greater that min_confidence:

>>> detector = Detector(min_confidence=0.5)

If no encoding confidences exceed min_confidence, detect will return None and probe will return an empty list.

Benchmark

Below is the comparison between Charamel and other available Python encoding detection libraries:

| Detector | Supported Encodings | Sec / File (Mean) | Sec / File (99%) | Sec / File (Max) | KB / Sec | Accuracy | Accuracy on Supported | |---------------------------------------------------------------------------|-----------------------|---------------------|--------------------|--------------------|------------|------------|-------------------------| | Chardet v3.0.4 | 26 | 0.029259 | 0.416156 | 3.115 | 220 | 61% | 97% | | Cchardet v2.1.6 | 40 | 0.000383 | 0.003913 | 0.062855 | 16811 | 67% | 79% | | Charset-Normalizer v1.3.4 | 89 | 0.126674 | 0.502882 | 1.41848 | 51 | 77% | 78% | | Charamel v1.0.0 | 98 | 0.009053 | 0.04277 | 0.120667 | 712 | 97% | 97% |

How to run this benchmark (requires Python 3.6+):

$ git clone git@github.com:chomechome/charamel.git
$ cd charamel
$ pip install poetry>=1.0.5
$ make benchmark

It also produces a detailed breakdown for all represented encodings:

* - not officially support for detector

| Encoding |-----------------| | ascii | big5 | big5hkscs | cp037 | cp1006 | cp1026 | cp1125 | cp1140 | cp1250 | cp1251 | cp1252 | cp1253 | cp1254 | cp1255 | cp1256 | cp1257 | cp1258 | cp273 | cp424 | cp437 | cp500 | cp720 | cp737 | cp775 | cp850 | cp852 | cp855 | cp856 | cp857 | cp858 | cp860 | cp861 | cp862 | cp863 | cp864 | cp865 | cp866 | cp869 | cp874 | cp875 | cp932 | cp949 | cp950 | Total | Chardet v3.0.4 | Cchardet v2.1.6 | Charset-Normalizer v1.3.4 | Charamel v1.0.0 | ---------|------------------|-------------------|-----------------------------|-------------------| | 8 | 7 (88%) | 8 (100%) | 7 (88%) | 8 (100%) | | 33 | 33 (100%) | 33 (100%) | 32 (97%) | 31 (94%) | | 9 | 6 (67%) * | 6 (67%) * | 8 (89%) | 9 (100%) | | 14 | 0 (0%) * | 0 (0%) * | 12 (86%) | 14 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) * | 4 (100%) | | 14 | 0 (0%) * | 0 (0%) * | 10 (71%) | 14 (100%) | | 5 | 4 (80%) * | 4 (80%) * | 5 (100%) | 5 (100%) | | 14 | 0 (0%) * | 0 (0%) * | 12 (86%) | 14 (100%) | | 23 | 7 (30%) * | 22 (96%) | 11 (48%) | 23 (100%) | | 45 | 44 (98%) | 45 (100%) | 45 (100%) | 45 (100%) | | 36 | 36 (100%) | 30 (83%) | 18 (50%) | 36 (100%) | | 6 | 4 (67%) | 6 (100%) | 6 (100%) | 6 (100%) | | 16 | 15 (94%) * | 13 (81%) * | 12 (75%) | 16 (100%) | | 29 | 29 (100%) | 29 (100%) | 29 (100%) | 29 (100%) | | 8 | 6 (75%) * | 7 (88%) | 8 (100%) | 8 (100%) | | 13 | 7 (54%) * | 10 (77%) | 6 (46%) | 13 (100%) | | 15 | 14 (93%) * | 12 (80%) * | 12 (80%) | 15 (100%) | | 14 | 0 (0%) * | 0 (0%) * | 7 (50%) | 14 (100%) | | 4 | 0 (0%) * | 0 (0%) * | 4 (100%) | 4 (100%) | | 11 | 4 (36%) * | 4 (36%) * | 9 (82%) | 11 (100%) | | 14 | 0 (0%) * | 0 (0%) * | 7 (50%) | 14 (100%) | | 6 | 4 (67%) * | 4 (67%) * | 6 (100%) * | 6 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) * | 4 (100%) | | 11 | 4 (36%) * | 4 (36%) * | 8 (73%) | 11 (100%) | | 14 | 4 (29%) * | 4 (29%) * | 11 (79%) | 14 (100%) | | 14 | 4 (29%) * | 12 (86%) | 6 (43%) | 14 (100%) | | 26 | 26 (100%) | 26 (100%) | 26 (100%) | 26 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) * | 4 (100%) | | 14 | 4 (29%) * | 4 (29%) * | 11 (79%) | 14 (100%) | | 14 | 4 (29%) * | 4 (29%) * | 11 (79%) | 14 (100%) | | 7 | 4 (57%) * | 4 (57%) * | 6 (86%) | 7 (100%) | | 9 | 4 (44%) * | 4 (44%) * | 8 (89%) | 9 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) | 4 (100%) | | 7 | 4 (57%) * | 4 (57%) * | 6 (86%) | 7 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) | 4 (100%) | | 12 | 4 (33%) * | 4 (33%) * | 10 (83%) | 12 (100%) | | 23 | 23 (100%) | 23 (100%) | 23 (100%) | 23 (100%) | | 4 | 4 (100%) * | 4 (100%) * | 4 (100%) | 4 (100%) | | 8 | 6 (75%) * | 7 (88%) * | 8 (100%) * | 8 (100%) | | 4 | 0 (0%) * | 0 (0%) * | 3 (75%) * | 4 (100%) | | 11 | 11 (100%) | 8 (73%) * | 11 (100%) | 9 (82%) | | 6 | 6 (100%) * | 6 (100%) | 6 (100%) | 6 (100%) | | 6 | 6 (100%) * | 6 (100%) * | 6 (

Related Skills

node-connect

343.1k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

claude-opus-4-5-migration

90.0k

Migrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5

frontend-design

90.0k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

model-usage

343.1k

Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.

chomechome

View profile

View on GitHub

GitHub Stars57

CategoryDevelopment

Updated2mo ago

Forks3

chomechome/charamel

Languages

Python

Security Score

100/100

Audited on Jan 9, 2026

No findings