Enwik8
An attempt to compress the enwik8 file
Install / Use
/learn @aashrafh/Enwik8README
About
An attempt to compress the first 100 MB of Wikipedia which is called enwik8 using LZW(Lempel–Ziv–Welch) and BZip2-Like algorithms with variable length encoding.
Results
- <strong>LZW</strong>:
- Compression ratio: <strong>2.905</strong>
- Compressed file size: <strong>32 MB</strong>
- <strong>BZip2-Like</strong>:
- Compression ratio: <strong>3.855</strong>
- Compressed file size: <strong>24 MB</strong>
How to run
- <strong>Compression</strong>
- Open a terminal on the directory containing the code
- Generate the binary file using command:
g++ -o encoder.exe encoder.cpp - Run the binary file:
./encoder.exe
- <strong>Decompression</strong>
- Open a terminal on the directory containing the code
- Generate the binary file using command:
g++ -o decoder.exe decoder.cpp - Run the binary file:
./decoder.exe
To Do
- [ ] A Decoder for the BZip2-Like algorithm
