Unilib
Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combining marks stripping.
Install / Use
/learn @ufal/UnilibREADME
UniLib: C++17 Embeddable Unicode Library
UniLib is an embeddable C++17 Unicode library under MPL 2.0 license. It is versioned using Semantic Versioning.
If currently offers:
- utf-8 and utf-16 encodings
- Unicode General Category info
- simple and full lower/title/uppercasing
- Unicode normalization forms
- efficient stripping of combining marks
It is tested on the following compilers and platforms:
- Linux using gcc and clang
- Windows using Visual C++ 2019 and newer
- macOS using clang
Copyright 2014-2024 Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic.
UniLib repository http://github.com/ufal/unilib is hosted on GitHub.
