Model details
Tiny LLM (~0.5B) (Experimental)
Small chat model — CPU-only on embedded boards (experimental).
CPUINT4KV260KR260
I/O and task profile
Tasks: Chat, Q&A
Input/output: Text prompt → text output
Acceleration path: CPU
Precision: INT4
Shown for experimentation. Expect low tokens/sec on embedded CPUs.