Intelexia


Intelexia is a simple Python application that runs large language model (LLM) inference with OpenVINO on the NPU of Intel Core Ultra processors such as Meteor Lake and Lunar Lake. Running your own language model locally gives you low power consumption and privacy, with no usage restrictions.

Tested only on Ubuntu 24.04, but it should work on other distributions and also on Windows.

Installation:

GPU Support:

sudo apt update

sudo apt install -y gpg-agent wget

wget -qO - https://repositories.intel.com/gpu/intel-graphics.key | sudo gpg --yes --dearmor --output /usr/share/keyrings/intel-graphics.gpg

echo "deb [arch=amd64,i386 signed-by=/usr/share/keyrings/intel-graphics.gpg] https://repositories.intel.com/gpu/ubuntu noble client" | sudo tee /etc/apt/sources.list.d/intel-gpu-noble.list

sudo apt update

sudo apt install -y intel-opencl-icd intel-level-zero-gpu level-zero intel-media-va-driver-non-free libmfx1 libmfxgen1 libvpl2 libegl-mesa0 libegl1-mesa-dev libgbm1 libgl1-mesa-dev libgl1-mesa-dri libglapi-mesa libgles2-mesa-dev libglx-mesa0 libigdgmm12 libxatracker2 mesa-va-drivers mesa-vdpau-drivers mesa-vulkan-drivers va-driver-all vainfo hwinfo clinfo

sudo reboot

NPU Support:

sudo apt install libtbb12

wget https://github.com/intel/linux-npu-driver/releases/download/v1.13.0/intel-driver-compiler-npu_1.13.0.20250131-13074932693_ubuntu24.04_amd64.deb

wget https://github.com/intel/linux-npu-driver/releases/download/v1.13.0/intel-fw-npu_1.13.0.20250131-13074932693_ubuntu24.04_amd64.deb

wget https://github.com/intel/linux-npu-driver/releases/download/v1.13.0/intel-level-zero-npu_1.13.0.20250131-13074932693_ubuntu24.04_amd64.deb

wget https://github.com/oneapi-src/level-zero/releases/download/v1.18.5/level-zero_1.18.5+u24.04_amd64.deb

sudo dpkg -i *.deb

sudo bash -c "echo 'SUBSYSTEM==\"accel\", KERNEL==\"accel*\", GROUP=\"render\", MODE=\"0660\"' > /etc/udev/rules.d/10-intel-vpu.rules"

sudo usermod -a -G render $USER

sudo reboot
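After the reboot, it can be useful to confirm that the driver created a device node and that your user can open it. The following is a minimal sketch, not part of Intelexia. It assumes the NPU appears under /dev/accel (for example /dev/accel/accel0), which is where the udev rule above applies; the node index may differ on your machine.

```python
import os
from pathlib import Path


def check_accel_nodes(dev_dir="/dev/accel"):
    """Return {node_path: can_read_write} for every accel* node in dev_dir."""
    root = Path(dev_dir)
    if not root.is_dir():
        return {}
    return {
        str(node): os.access(node, os.R_OK | os.W_OK)
        for node in sorted(root.glob("accel*"))
    }


def describe(nodes):
    """Turn the result of check_accel_nodes into a short human-readable report."""
    if not nodes:
        return "no accel nodes found: check that the NPU driver packages are installed"
    lines = []
    for path, ok in nodes.items():
        status = "accessible" if ok else "no access: is your user in the 'render' group?"
        lines.append(f"{path}: {status}")
    return "\n".join(lines)


# Assumption: /dev/accel is the directory that holds the NPU node.
print(describe(check_accel_nodes()))
```

If a node is listed without access, log out and back in (or reboot) so the new render group membership takes effect.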

Clone the repository:

git clone https://github.com/JoseMariaZ/Intelexia.git

Install Dependencies:

pip install nncf==2.12 onnx==1.16.1 optimum-intel==1.19.0

pip install --pre openvino openvino-tokenizers openvino-genai --extra-index-url https://storage.openvinotoolkit.org/simple/wheels/nightly

Download and install the Models:

cd Intelexia/Models

optimum-cli export openvino -m meta-llama/Llama-3.1-8B-Instruct --weight-format int4 --sym --ratio 1.0 --group-size -1 Llama-3.1-8B-Instruct

optimum-cli export openvino --model dreamlike-art/dreamlike-anime-1.0 --task stable-diffusion --weight-format fp16 dreamlike_anime_1_0_ov/FP16

Run:

python3 Intellexia-Free.py

By default, Llama-3.1-8B-Instruct with function calling runs on the NPU and dreamlike-anime-1.0 runs on the GPU.
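To illustrate what running the exported model on the NPU can look like with OpenVINO GenAI, here is a minimal sketch. It is not necessarily how Intellexia-Free.py loads the model: the helper names pick_device and generate_on_npu are hypothetical, and the fallback to CPU is an assumption added for machines where no NPU is reported.

```python
def pick_device(available, preferred="NPU", fallback="CPU"):
    """Return `preferred` if OpenVINO reports it, otherwise `fallback`.

    Device names may carry an index suffix such as "GPU.0", so only the
    part before the first dot is compared.
    """
    for name in available:
        if name.split(".")[0] == preferred:
            return preferred
    return fallback


def generate_on_npu(model_dir, prompt, max_new_tokens=128):
    """Load an exported OpenVINO model directory and generate a reply."""
    # Imported lazily so pick_device can be used without OpenVINO installed.
    import openvino as ov
    import openvino_genai as ov_genai

    device = pick_device(ov.Core().available_devices)
    pipe = ov_genai.LLMPipeline(model_dir, device)
    return pipe.generate(prompt, max_new_tokens=max_new_tokens)
```

With the model exported as described above, a call such as generate_on_npu("Models/Llama-3.1-8B-Instruct", "Hello") would be one way to try it; the first load on the NPU can take a while because the model is compiled for the device.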

You can change the settings in config.json.
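If you prefer to change a setting from a script rather than by hand, something like the sketch below could work. It assumes only that config.json holds a JSON object at the top level; the key names inside it are not documented here, so update_setting is a hypothetical helper and any key you pass must be one that already exists in your file.

```python
import json
from pathlib import Path


def update_setting(config_path, key, value):
    """Set one existing top-level key in a JSON config file; return the old value."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    if key not in config:
        # Refuse unknown keys so a typo does not silently add a new setting.
        raise KeyError(f"{key!r} not in {path.name}; known keys: {sorted(config)}")
    old = config[key]
    config[key] = value
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return old
```

Rejecting keys that are not already present is a deliberate choice: the error message lists the keys the file actually contains, which doubles as a quick way to see what can be configured.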

References:

https://github.com/openvinotoolkit/openvino/blob/master/docs/articles_en/learn-openvino/llm_inference_guide/genai-guide-npu.rst

For NPU monitoring on Linux:

https://github.com/DMontgomery40/intel-npu-top
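For a rough idea of how such monitoring can work, the sketch below estimates NPU utilization from a busy-time counter. It is an illustration under assumptions, not the code of the tool linked above: it assumes the kernel driver exposes a npu_busy_time_us attribute in sysfs, and the PCI path in BUSY_ATTR is a guess that may differ on your machine.

```python
import time
from pathlib import Path

# Assumed sysfs location of the busy-time counter (microseconds);
# the PCI address can differ between machines and kernel versions.
BUSY_ATTR = "/sys/devices/pci0000:00/0000:00:0b.0/npu_busy_time_us"


def utilization_percent(busy_start_us, busy_end_us, elapsed_s):
    """Share of the interval the NPU was busy, clamped to the range 0..100."""
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    busy_s = (busy_end_us - busy_start_us) / 1_000_000
    return max(0.0, min(100.0, 100.0 * busy_s / elapsed_s))


def sample(attr_path=BUSY_ATTR, interval_s=1.0):
    """Read the counter twice and return the utilization over the interval."""
    path = Path(attr_path)
    start = int(path.read_text())
    t0 = time.monotonic()
    time.sleep(interval_s)
    end = int(path.read_text())
    return utilization_percent(start, end, time.monotonic() - t0)
```

Calling sample() while a model is generating should give a value well above zero if the NPU is actually in use; a FileNotFoundError means the assumed path does not match your system.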
