AI Application Introduction
1. AI Fusion X
AI Fusion X is a demo application that integrates multiple AI capabilities, including computer vision, large language models, real-time video processing, and speech recognition.
1.1 Computer Vision
Built on ONNX Runtime (onnxruntime), this module runs a range of image processing algorithms. The app ships with sample images, and users can also upload their own images for processing.
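As a rough illustration of what running an image through an onnxruntime model can look like, here is a minimal Python sketch. It is not the app's actual code: the function names, the model path, the NCHW input layout, and the [0, 1] pixel scaling are all assumptions made for this example, and a real model may expect a different size, layout, or normalization.

```python
import numpy as np


def to_nchw_float(image_hwc: np.ndarray) -> np.ndarray:
    """Convert an HxWx3 uint8 image into a 1x3xHxW float32 tensor in [0, 1].

    Assumption: the target model expects NCHW input scaled to [0, 1].
    """
    if image_hwc.ndim != 3 or image_hwc.shape[2] != 3:
        raise ValueError("expected an HxWx3 image")
    chw = image_hwc.astype(np.float32).transpose(2, 0, 1) / 255.0
    return chw[np.newaxis, ...]  # add the batch dimension


def run_model(model_path: str, tensor: np.ndarray) -> list:
    """Run one inference pass on the CPU and return the raw model outputs."""
    # Imported lazily so the preprocessing helper works without onnxruntime.
    import onnxruntime as ort

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    input_name = session.get_inputs()[0].name
    # Passing None as the first argument requests all model outputs.
    return session.run(None, {input_name: tensor})


# Example usage (hypothetical model file, not shipped with this document):
# outputs = run_model("model.onnx", to_nchw_float(image))
```

Keeping preprocessing separate from the session call makes it easier to swap in whatever normalization a specific model needs.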
1.2 Large Language Models (LLM)
The application comes with two built-in LLMs: Qwen and DeepSeek. Users can interact with these models via a chat interface to perform natural language understanding and generation.
1.3 Real-time Video Processing
Three modes are supported for real-time video processing: object detection, pose estimation, and face recognition. Users can freely switch between these modes to explore different scenarios.
1.4 Speech Recognition
This module provides a voice-to-text feature: once a recording is finished, the system automatically transcribes the spoken content into text.
2. Ollama
The system comes pre-installed with the Ollama command-line tool, which is used to manage and run local large language models. Run the ollama command to view usage details and supported models.
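Besides the command line, a running Ollama service can also be queried over its local HTTP API. The Python sketch below shows one way to do that, assuming the service listens on its default address (http://localhost:11434) and that a model has already been pulled. The helper names are made up for this example, and the model name in the usage comment is a placeholder, not a statement about which models this system ships with.

```python
import json
import urllib.request

# Assumption: Ollama's default local address; change it if the service is configured differently.
OLLAMA_URL = "http://localhost:11434"


def build_generate_request(model: str, prompt: str,
                           base_url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the local Ollama service and return the generated text."""
    request = build_generate_request(model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["response"]


# Example usage (requires a running Ollama service; the model name is a placeholder):
# print(generate("your-model-name", "Say hello in one sentence."))
```

Setting "stream" to False asks for a single JSON reply rather than a stream of partial chunks, which keeps the parsing to one json.loads call.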