homehome Home chatchat Notifications


This Engineer Put an AI Language Model on a USB Stick and It Actually Works

A Raspberry Pi-powered USB stick runs a lightweight AI model, making LLMs portable and plug-and-play.

Tibi Puiu
February 17, 2025 @ 9:45 pm

share Share

In a world where artificial intelligence often seems to demand the computing power of a small city, one engineer has managed to shrink it down to the size of a USB stick. Meet the pocket-sized language model, a feat of ingenuity that proves big ideas don’t always need big hardware.

Large language models (LLMs) like GPT and LLaMA have become the rock stars of the AI world, capable of generating human-like text, answering questions, and even writing code. But these models typically rely on billions of parameters and require massive data centers to function. Enter YouTuber Binh, a tinkerer who decided to challenge the status quo by cramming an LLM onto a USB stick.

This isn’t your average flash drive. Inside its custom 3D-printed case lies a Raspberry Pi Zero W, a tiny computer no bigger than a stick of gum. Running on this modest hardware is llama.cpp, a lightweight version of the LLaMA model from Meta. But getting the software to work on the Pi wasn’t easy. The latest version of llama.cpp is designed for ARMv8 processors, while the Raspberry Pi Zero W runs on the older ARMv6 architecture. So he had to painstakingly remove the ARMv8 optimizations.

His persistence paid off, and he successfully adapted the model to run on the older hardware. The result is a portable AI that fits in your pocket — no cloud computing required.

Plug-and-Play AI

The real magic of this project lies in its simplicity. Binh designed the USB stick to be a composite device, meaning it can interact with any computer without requiring special drivers. To use the LLM, all you need to do is plug in the USB stick, create an empty text file, and give it a name. The model automatically generates text and saves it to the file.

While it’s not as fast as its cloud-based counterparts, the USB-based LLM is a groundbreaking proof of concept, as first seen on Hackaday. “I believe this is the first plug-and-play USB-based LLM,” Binh said. And he’s probably right.

This project isn’t just a clever hack; it’s a glimpse into the future of AI accessibility. By making language models portable and easy to use, Binh has opened the door to new possibilities. Imagine students in remote areas using USB-based LLMs for homework help, or journalists in the field generating drafts without an internet connection.

It also raises questions about the environmental impact of AI. Large models require vast amounts of energy, contributing to carbon emissions. Smaller, more efficient models like this one could help reduce that footprint.

Of course, there are limitations. The Raspberry Pi Zero W has only 512MB of RAM, which restricts the size and complexity of the model it can run. But as hardware improves, so too will the capabilities of these pocket-sized AIs.

For now, Binh’s USB stick is a reminder that innovation doesn’t always mean building bigger and faster. Sometimes, it’s about thinking smaller. And in this case, small is mighty.

share Share

Could This Saliva Test Catch Deadly Prostate Cancer Early?

Researchers say new genetic test detects aggressive cancers that PSA and MRIs often miss

This Tree Survives Lightning Strikes—and Uses Them to Kill Its Rivals

This rainforest giant thrives when its rivals burn

Engineers Made a Hologram You Can Actually Touch and It Feels Unreal

Users can grasp and manipulate 3D graphics in mid-air.

Musk's DOGE Fires Federal Office That Regulates Tesla's Self-Driving Cars

Mass firings hit regulators overseeing self-driving cars. How convenient.

A Rare 'Micromoon' Is Rising This Weekend and Most People Won’t Notice

Watch out for this weekend's full moon that's a little dimmer, a little smaller — and steeped in seasonal lore.

Climate Change Could Slash Personal Wealth by 40%, New Research Warns

Global warming’s economic toll may be nearly four times worse than once believed

Kawasaki Unveils a Rideable Robot Horse That Runs on Hydrogen and Moves Like an Animal

Four-legged robot rides into the hydrogen-powered future, one gallop at a time.

Evolution just keeps creating the same deep-ocean mutation

Creatures at the bottom of the ocean evolve the same mutation — and carry the scars of human pollution

Scientists Found a 380-Million-Year-Old Trick in Velvet Worm Slime That Could Lead To Recyclable Bioplastic

Velvet worm slime could offer a solution to our plastic waste problem.

Beetles Conquered Earth by Evolving a Tiny Chemical Factory

There are around 66,000 species of rove beetles and one researcher proposes it's because of one special gland.