Skip to main content

Deploy Dia on reComputer Jetson

Introduction

The rapid advancement of AI-powered speech synthesis has enabled high-quality, real-time text-to-speech (TTS) applications across various domains. Among these, Dia stands out as an efficient and expressive neural speech generation model capable of producing natural-sounding audio with minimal computational overhead. This makes it particularly suitable for deployment on edge devices, such as the NVIDIA Jetson series, which are widely used in embedded AI applications due to their balance of performance and power efficiency.
In this article, we explore the process of deploying the Dia TTS model on a Jetson device and demonstrate its performance during inference.

Prerequisites

  • Jetson device with more than 8GB of memory.
  • The jetson device needs to be pre-flashed with the jetpack 6.1 operating system or later.
note

In this wiki, we will accomplish the following tasks using the reComputer J4012 - Edge AI Computer with NVIDIA® Jetson™ Orin™ NX 16GB, but you can also try using other Jetson devices.

Getting Started

Hardware Connection

  • Connect the Jetson device to the network, mouse, keyboard, and monitor.
note

Of course, you can also remotely access the Jetson device via SSH over the local network.

Install Dependencies

  1. Please download and unzip the appropriate dependencies for your Jetson device from here.
  1. On your Jetson device, execute the following command in the terminal to install:
pip install torch-2.7.0-cp310-cp310-linux_aarch64.whl
pip install torchaudio-2.7.0-cp310-cp310-linux_aarch64.whl
pip install triton-3.3.0-cp310-cp310-linux_aarch64.whl

Download and Install Dia

  1. Clone Dia's source code on your Jetson device using this terminal command:
git clone https://github.com/nari-labs/dia.git
  1. Edit the installation file.
Comment out the torch, torchaudio, and triton-related settings. Using Vim, open pyproject.toml and disable lines 19–22.
cd dia
vim pyproject.toml
note

Remember to save the changes before exiting.

  1. install the running env for dia.
pip install -e .
pip install numpy==1.26.4
  1. launch Dia
export GRADIO_SERVER_NAME=0.0.0.0
python app.py
info

To facilitate remote access to the Gradio WebUI, I reconfigured the GRADIO_SERVER_NAME environment variable.

Demonstration

In the demonstration video, I used DeepSeek to generate a dialogue introducing Seeed Studio, and then directly input the text to generate audio with DIA. Even though my prompt didn’t employ any special techniques, the quality of the generated audio was still incredibly impressive.

[S1] Hey, have you heard of Seeed Studio?
[S2] Of course! It's a company focused on open-source hardware right?
[S1] Exactly! They offer a wide range of development boards, sensor modules, and edge computing devices, perfect for makers, engineers, and developers to quickly bring their ideas to life.
[S2] Yeah, and their Grove ecosystem is really famous—its modular design makes hardware connections super easy, no messy soldering or wiring needed.
[S1] True! They also run Seeed Fusion, providing small-batch PCB manufacturing and assembly services, which is great for startups and hardware entrepreneurs.
[S2] Plus, their community and documentation are well-developed, and many of their projects are open-source, making them beginner-friendly!
[S1] In short, if you're into DIY smart hardware or IoT projects, Seeed Studio is an awesome choice!
[S2] Couldn’t agree more!

References

Tech Support & Product Discussion

Thank you for choosing our products! We are here to provide you with different support to ensure that your experience with our products is as smooth as possible. We offer several communication channels to cater to different preferences and needs.

Loading Comments...