Model details

Tiny LLM (~0.5B) (Experimental)

Small chat model that runs CPU-only on embedded boards (experimental).

CPU · INT4 · KV260 · KR260

I/O and task profile

Tasks: Chat, Q&A

Input/output: Text prompt → text output

Acceleration path: CPU

Precision: INT4
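
As a rough sanity check on what INT4 precision means for a ~0.5B-parameter model, the weight storage can be estimated with simple arithmetic. The sketch below is illustrative only: the parameter count is taken as exactly 0.5e9 (an assumption, since the card only says "~0.5B"), and the helper name `weight_footprint_bytes` is hypothetical. It counts weights alone and ignores quantization scales, any layers kept at higher precision, the KV cache, and activations, so real memory use will be higher.

```python
def weight_footprint_bytes(num_params: float, bits_per_weight: int) -> float:
    """Approximate storage for the weights alone, in bytes."""
    return num_params * bits_per_weight / 8


# "~0.5B" from the model card; the exact parameter count is an assumption.
params = 0.5e9

int4_mb = weight_footprint_bytes(params, 4) / 1e6
fp16_mb = weight_footprint_bytes(params, 16) / 1e6  # for comparison only

print(f"INT4: ~{int4_mb:.0f} MB, FP16: ~{fp16_mb:.0f} MB")
```

Under these assumptions the INT4 weights occupy roughly a quarter of what FP16 weights would, which is the main reason a model of this size is plausible on an embedded CPU at all.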

Shown for experimentation. Expect low tokens/sec on embedded CPUs.
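
To put a number on "low tokens/sec" for a particular board, throughput can be measured by timing a single generation call. The following is a minimal sketch, not tied to any specific runtime: `tokens_per_second` and `fake_generate` are hypothetical names, and `fake_generate` is a stand-in that should be replaced with whatever inference call is actually in use.

```python
import time
from typing import Callable, List


def tokens_per_second(generate: Callable[[str], List[str]], prompt: str) -> float:
    """Time one generation call and return output tokens per second."""
    start = time.perf_counter()
    tokens = generate(prompt)
    elapsed = time.perf_counter() - start
    return len(tokens) / elapsed if elapsed > 0 else float("inf")


def fake_generate(prompt: str) -> List[str]:
    # Hypothetical stand-in for a real model call: sleeps briefly,
    # then returns the prompt's whitespace-separated words as "tokens".
    time.sleep(0.01)
    return prompt.split()


rate = tokens_per_second(fake_generate, "a b c d")
print(f"{rate:.1f} tokens/sec")
```

A single run includes one-time costs such as model warm-up, so averaging over several prompts gives a more representative figure.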

Compatible targets

Supported FPGA boards

KV260

KR260