Raspberry Pi AI HAT+ 2:為 Raspberry Pi 5 帶來生成式 AI

Raspberry Pi AI HAT+ 2:為 Raspberry Pi 5 帶來生成式 AI

Hacker News·

Raspberry Pi 推出了 AI HAT+ 2,這是一款搭載 Hailo-10H 加速器的 Raspberry Pi 5 擴充板。這款新的 HAT 能夠在設備上運行生成式 AI 工作負載,提供 40 TOPS 的推論效能和專用的 8GB RAM,實現無需雲端、低延遲的 AI 處理。

We use some essential cookies to make our website work.

We use optional cookies, as detailed in our cookie policy, to remember your
settings and understand how you use our website.

News

Introducing the Raspberry Pi AI HAT+ 2: Generative AI on Raspberry Pi 5

Image

A little over a year ago, we introduced the Raspberry Pi AI HAT+, an add-on board for Raspberry Pi 5 featuring the Hailo-8 (26-TOPS variant) and Hailo-8L (13-TOPS variant) neural network accelerators. With all AI processing happening directly on the device, the AI HAT+ delivered true edge AI capabilities to our users, giving them data privacy and security while eliminating the need to subscribe to expensive cloud-based AI services.

While the AI HAT+ provides best-in-class acceleration for vision-based neural network models, including object detection, pose estimation, and scene segmentation (see it in action here), it lacks the capability to run the increasingly popular generative AI (GenAI) models. Today, we are excited to announce the Raspberry Pi AI HAT+ 2, our first AI product designed to fill the generative AI gap.

Image

Unlock generative AI on your Raspberry Pi 5

Featuring the new Hailo-10H neural network accelerator, the Raspberry Pi AI HAT+ 2 delivers 40 TOPS (INT4) of inferencing performance, ensuring generative AI workloads run smoothly on Raspberry Pi 5. Performing all AI processing locally and without a network connection, the AI HAT+ 2 operates reliably and with low latency, maintaining the privacy, security, and cost-efficiency of cloud-free AI computing that we introduced with the original AI HAT+.

Unlike its predecessor, the AI HAT+ 2 features 8GB of dedicated on-board RAM, enabling the accelerator to efficiently handle much larger models than previously possible. This, along with an updated hardware architecture, allows the Hailo-10H chip to accelerate large language models (LLMs), vision-language models (VLMs), and other generative AI applications.

For vision-based models — such as Yolo-based object recognition, pose estimation, and scene segmentation — the AI HAT+ 2’s computer vision performance is broadly equivalent to that of its 26-TOPS predecessor, thanks to the on-board RAM. It also benefits from the same tight integration with our camera software stack (libcamera, rpicam-apps, and Picamera2) as the original AI HAT+. For users already working with the AI HAT+ software, transitioning to the AI HAT+ 2 is mostly seamless and transparent.

Image

Some example applications

The following LLMs will be available to install at launch:

More (and larger) models are being readied for updates, and should be available to install soon after launch.

Let’s take a quick look at some of these models in action. The following examples use the hailo-ollama LLM backend (available in Hailo’s Developer Zone) and the Open WebUI frontend, providing a familiar chat interface via a browser. All of these examples are running entirely locally on a Raspberry Pi AI HAT+ 2 connected to a Raspberry Pi 5.

The first example uses the Qwen2 model to answer a few simple questions:

The next example uses the Qwen2.5-Coder model to perform a coding task:

This example does some simple French-to-English translation using Qwen2:

The final example shows a VLM describing the scene from a camera stream:

Fine-tune your AI models

By far the most popular examples of generative AI models are LLMs like ChatGPT and Claude, text-to-image/video models like Stable Diffusion and DALL-E, and, more recently, VLMs that combine the capabilities of vision models and LLMs. Although the examples above showcase the capabilities of the available AI models, one must keep their limitations in mind: cloud-based LLMs from OpenAI, Meta, and Anthropic range from 500 billion to 2 trillion parameters; the edge-based LLMs running on the Raspberry Pi AI HAT+ 2, which are sized to fit into the available on-board RAM, typically run at 1–7 billion parameters. Smaller LLMs like these are not designed to match the knowledge set available to the larger models, but rather to operate within a constrained dataset.

This limitation can be overcome by fine-tuning the AI models for your specific use case. On the original Raspberry Pi AI HAT+, visual models (such as Yolo) can be retrained using image datasets suited to the HAT’s intended application — this is also the case for the Raspberry Pi AI HAT+ 2, and can be done using the Hailo Dataflow Compiler.

Similarly, the AI HAT+ 2 supports Low-Rank Adaptation (LoRA)–based fine-tuning of the language models, enabling efficient, task-specific customisation of pre-trained LLMs while keeping most of the base model parameters frozen. Users can compile adapters for their particular tasks using the Hailo Dataflow Compiler and run the adapted models on the Raspberry Pi AI HAT+ 2.

Available to buy now

The Raspberry Pi AI HAT+ 2 is available now at $130. For help setting yours up, check out our AI HAT guide.

Hailo’s GitHub repo provides plenty of examples, demos, and frameworks for vision- and GenAI-based applications, such as VLMs, voice assistants, and speech recognition. You can also find documentation, tutorials, and downloads for the Dataflow Compiler and the hailo-ollama server in Hailo’s Developer Zone.

Image

Share this post

Related posts

Image

Introducing the Raspberry Pi AI HAT+ with up to 26 TOPS

Image

Raspberry Pi AI Camera on sale now at $70

Previous Post

Image

Flokk herd management with Raspberry Pi Zero W

Share this post

No comments

                Jump to the comment form

Leave a CommentCancel reply?

Δdocument.getElementById( "ak_js_1" ).setAttribute( "value", ( new Date() ).getTime() );

Follow us

About Raspberry Pi

For home

For industry

Hardware

Software

Documentation

Community

Raspberry Pi Store

Raspberry Pi Press

Hacker News

相關文章

  1. Raspberry Pi 新款 AI HAT+ 搭載 8GB RAM,支援本地 LLM 推論

    4 個月前

  2. 在搭載 26 TOPS M.2 Hailo AI Hat+ 的 Raspberry Pi 5 上運行 YOLOv8 圖像識別

    7 個月前

  3. NVIDIA Cosmos Reason 2 為實體 AI 帶來進階推理能力

    Huggingface · 4 個月前

  4. Apple Intelligence 與 AI 極大主義

    Benedict Evans · 將近 2 年前

  5. 使用 Hailuo2 簡化 AI 影片製作流程

    4 個月前