AI Application Introduction

AI Fusion X

AI Fusion X is a demo application that integrates multiple AI capabilities including

Computer vision
Large language models (LLMs)
Real-time video processing
Speech recognition

AI Fusion X

Computer Vision

Built on onnxruntime, this module supports various image processing algorithms. The app provides sample images, and users can also upload their own images for processing.

Computer Vision

Large Language Models (LLM)

The application comes with two built-in LLMs: Qwen and DeepSeek. Users can interact with these models via a chat interface to perform natural language understanding and generation.

Large Language Models

Real-time Video Processing

Three modes are supported for real-time video processing as follows:

Object detection
Pose estimation
Face recognition

Users can freely switch between these modes to explore different scenarios.

Real-time Video Processing

Speech Recognition

Includes a voice-to-text feature. After recording, the system automatically transcribes the spoken content into text.

Speech Recognition

Ollama

The system comes with the Ollama command-line tool pre-installed which is used to manage and run local large language models. Run the ollama command to view usage details and supported models.

Ollama CLI Tool

AI Application Introduction

AI Fusion X​

Computer Vision​

Large Language Models (LLM)​

Real-time Video Processing​

Speech Recognition​

Ollama​

AI Fusion X

Computer Vision

Large Language Models (LLM)

Real-time Video Processing

Speech Recognition

Ollama