Spacemit

Benchmarking llama.cpp on SpacemiT K3: RISC-V AI Cores vs Standard RVV (Part 4)

Benchmarking llama.cpp on SpacemiT K3: RISC-V AI Cores vs Standard RVV (Part 4)

TL;DR SpacemiT's K3 has two core types: X100 (general-purpose, vlen 256) and A100 ("AI cores", vlen 1024). Standard llama.cpp runs 2.3x …

Building a Python Wheel Factory for RISC-V

Building a Python Wheel Factory for RISC-V

Photo by Homa Appliances on Unsplash You know that feeling when you type pip install tokenizers and it finishes in three seconds? That satisfying …

First Words: LLM Inference on RISC-V

First Words: LLM Inference on RISC-V

Photo by Pixabay on Pexels This is part three of the RISC-V wheel factory series. Part one: link:{% post_url …

The Dependency Rabbit Hole: Why 25 RISC-V Python Wheels Weren't Enough

The Dependency Rabbit Hole: Why 25 RISC-V Python Wheels Weren't Enough

You know that feeling when you’ve just shipped something, everything’s green, and you lean back in your chair thinking "nailed it"? …