Model details

Tiny LLM (~0.5B) (Experimental)

Small chat model — CPU-only on embedded boards (experimental).

CPUINT4KV260KR260
I/O and task profile
Tasks: Chat, Q&A
Input/output: Text prompt → text output
Acceleration path: CPU
Precision: INT4
Shown for experimentation. Expect low tokens/sec on embedded CPUs.
Compatible targets
Supported FPGA boards
KV260
KR260