Installation Guide

This guide will help you install and set up VoiceTransor on your system.

中文安装指南 (Chinese Installation Guide)

Download
System Requirements
Install VoiceTransor
Install FFmpeg (Required)
Install Ollama (Optional)
Verify Installation
Troubleshooting

Download

Download the latest release for your platform:

Windows:

VoiceTransor-v0.9.0-Windows-x64.zip (~4GB) - Universal build for all systems
OR VoiceTransor-v0.9.0-Windows-x64-Setup.exe (~450MB) - Installer version

💡 Important: Single Universal Build

VoiceTransor uses a single build that works for everyone:

✅ Have NVIDIA GPU? Automatically uses CUDA acceleration
✅ No GPU? Automatically uses CPU (works perfectly, just slower)
✅ No need to choose between CPU/GPU versions
✅ Same installer works on all Windows 10+ systems

This is similar to how Ollama works - one download, automatic hardware detection.

System Requirements

Minimum Requirements

Operating System:
- Windows 10 or later (64-bit)
- macOS 10.15 (Catalina) or later
- Linux (Ubuntu 20.04+, Debian 11+, or equivalent)
Hardware:
- 8GB RAM
- 5GB free disk space (for application and models)
- Dual-core processor

Recommended Requirements

Hardware:
- 16GB RAM
- 10GB free disk space
- Quad-core processor or better
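
On Linux or macOS, the hardware figures above can be compared against what the machine reports. The following is a minimal sketch, not part of VoiceTransor: `free_gb` and `total_ram_gb` are hypothetical helper names, and the script assumes the install location is under `$HOME`. Reported RAM is usually slightly below the nominal size, so the RAM value is rounded to the nearest gigabyte.

```shell
# free_gb DIR: whole gigabytes available on the filesystem that holds DIR
free_gb() {
    df -Pk "$1" | awk 'NR == 2 { print int($4 / 1024 / 1024) }'
}

# total_ram_gb: installed RAM rounded to the nearest gigabyte (Linux or macOS)
total_ram_gb() {
    if [ -r /proc/meminfo ]; then
        awk '/^MemTotal:/ { print int($2 / 1024 / 1024 + 0.5) }' /proc/meminfo
    else
        sysctl -n hw.memsize | awk '{ print int($1 / 1024 / 1024 / 1024 + 0.5) }'
    fi
}

echo "Free disk under \$HOME: $(free_gb "$HOME") GB (minimum 5, recommended 10)"
echo "RAM: $(total_ram_gb) GB (minimum 8, recommended 16)"
echo "Logical CPUs: $(getconf _NPROCESSORS_ONLN) (minimum 2, recommended 4)"
```

The script only prints values next to the thresholds from this guide; it does not decide pass or fail, because logical CPU counts and reported RAM do not map exactly onto "dual-core" or "8GB".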

For GPU Acceleration (Optional)

NVIDIA GPU:
- GTX 900 series or newer (GTX 1050, RTX 20/30/40 series, etc.)
- 4GB+ VRAM
- Driver version >= 525.60.13
Apple Silicon:
- M1, M2, or M3 chip
- Automatically detected and used

Note: GPU acceleration is optional. Every feature works on CPU; transcription is just slower.

Install VoiceTransor

Windows

Extract the archive:
- Right-click the downloaded archive (e.g., VoiceTransor-v0.9.0-Windows-x64.zip)
- Select "Extract All..."
- Choose a destination folder (e.g., C:\Program Files\VoiceTransor)
Launch the application:
- Navigate to the extracted folder
- Double-click VoiceTransor.exe
Windows Security Warning:
- If you see "Windows protected your PC", click "More info"
- Then click "Run anyway"
- This is normal for unsigned applications

macOS

Extract the archive:
- Double-click VoiceTransor-macOS.zip
- Move VoiceTransor.app to Applications folder
First launch:
- Right-click VoiceTransor.app and select "Open"
- Click "Open" in the security dialog
Grant permissions:
- Allow access to files when prompted

Linux

Extract the archive:

```
unzip VoiceTransor-Linux.zip -d ~/Applications/VoiceTransor
cd ~/Applications/VoiceTransor
```

Make executable:
```
chmod +x VoiceTransor
```
Launch:
```
./VoiceTransor
```

Install FFmpeg (Required)

VoiceTransor requires FFmpeg to process audio files. You must install it separately.

Windows

Option 1: Manual Install (Recommended)

Open https://www.gyan.dev/ffmpeg/builds/
Download "ffmpeg-release-essentials.zip"
Extract it and move the extracted folder to C:\ffmpeg, so that C:\ffmpeg\bin\ffmpeg.exe exists
Add to PATH:
- Press Win + R, type sysdm.cpl, press Enter
- Go to "Advanced" tab
- Click "Environment Variables"
- Under "System variables", find "Path" and click "Edit"
- Click "New"
- Add: C:\ffmpeg\bin
- Click "OK" on all windows
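Open a new Command Prompt and restart VoiceTransor so the updated PATH takes effect (log out and back in, or restart your computer, if it is still not picked up)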

Option 2: Using Package Manager

If you have Chocolatey:
```
choco install ffmpeg
```

If you have Scoop:
```
scoop install ffmpeg
```

Verify installation:
```
ffmpeg -version
```

macOS

Using Homebrew (Recommended):
```
brew install ffmpeg
```

Verify installation:
```
ffmpeg -version
```

Linux

Ubuntu/Debian:
```
sudo apt update
sudo apt install ffmpeg
```

Fedora:
```
sudo dnf install ffmpeg
```

Arch Linux:
```
sudo pacman -S ffmpeg
```

Verify installation:
```
ffmpeg -version
```
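
VoiceTransor's error messages refer to `ffprobe` as well as `ffmpeg` (see Troubleshooting), so it can help to confirm that both are on PATH. The sketch below works in a macOS or Linux shell. It assumes your FFmpeg package ships `ffprobe` alongside `ffmpeg`, which is the usual case; `missing_tools` is a hypothetical helper name.

```shell
# missing_tools NAME...: print each NAME that cannot be found on PATH, one per line
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

missing=$(missing_tools ffmpeg ffprobe)
if [ -z "$missing" ]; then
    echo "ffmpeg and ffprobe are both on PATH"
else
    # $missing is left unquoted on purpose so the names print on one line
    echo "Not found on PATH:" $missing
fi
```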

Install Ollama (Optional)

Ollama enables AI-powered text processing (summarize, translate, extract key points, etc.). This is optional but recommended.

What is Ollama?

Local AI models (no cloud, your data stays private)
Works on both CPU and GPU
Required for text processing features in VoiceTransor

Installation

Windows:

Download installer from: https://ollama.com/download
Run the installer
Open Command Prompt and verify:
```
ollama --version
```

macOS:

Download from: https://ollama.com/download
Install the .dmg file
Verify in Terminal:
```
ollama --version
```

Linux:

```
curl -fsSL https://ollama.com/install.sh | sh
```

Download a Model

After installing Ollama:

Start Ollama service (if not auto-started):
```
ollama serve
```

Pull a model (in a new terminal):

```
# For English
ollama pull llama3.1:8b

# For Chinese/English
ollama pull qwen2.5:7b
```

Model sizes:
- llama3.1:8b - ~4.7GB
- qwen2.5:7b - ~4.4GB

Note: Models are downloaded to:

Windows: %USERPROFILE%\.ollama\models
macOS/Linux: ~/.ollama/models
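
To confirm that a pull finished, you can look for the model name in the output of `ollama list`. The sketch below assumes that output has one header row followed by one row per model, with the model name in the first column; `has_model` is a hypothetical helper name.

```shell
# has_model NAME: read `ollama list` output on stdin and
# succeed if NAME appears in the first column of a non-header row
has_model() {
    awk -v want="$1" 'NR > 1 && $1 == want { found = 1 } END { exit !found }'
}

if command -v ollama >/dev/null 2>&1; then
    for model in llama3.1:8b qwen2.5:7b; do
        if ollama list 2>/dev/null | has_model "$model"; then
            echo "$model is installed"
        else
            echo "$model not found; run: ollama pull $model"
        fi
    done
else
    echo "ollama not found on PATH"
fi
```

You only need one of the two models, so a "not found" line for the other is expected.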

Verify Installation

Test VoiceTransor

Launch VoiceTransor
Try importing a small audio file
If FFmpeg is working, you should see audio information
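
If you have no small audio file at hand, FFmpeg can generate one, and `ffprobe` can show the same kind of information outside the app. This is a sketch for a macOS or Linux shell. It assumes your FFmpeg build includes the `lavfi` input device and the `sine` source (common in standard builds), and `audio_info` is a hypothetical helper name.

```shell
# audio_info FILE: print the container format and duration (in seconds) of FILE
audio_info() {
    if [ ! -f "$1" ]; then
        echo "No such file: $1" >&2
        return 1
    fi
    ffprobe -v error -show_entries format=format_name,duration \
        -of default=noprint_wrappers=1 "$1"
}

if command -v ffmpeg >/dev/null 2>&1 && command -v ffprobe >/dev/null 2>&1; then
    tmpdir=$(mktemp -d)
    # Generate a one-second 440 Hz test tone
    ffmpeg -v error -y -f lavfi -i "sine=frequency=440:duration=1" "$tmpdir/tone.wav"
    audio_info "$tmpdir/tone.wav"
    rm -rf "$tmpdir"
else
    echo "ffmpeg or ffprobe not found on PATH"
fi
```

To test the import in VoiceTransor itself, write the tone to a permanent location instead of the temporary directory.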

Test GPU Detection (if you have NVIDIA GPU)

Go to transcription settings
Select Device: "auto" or "cuda"
Start a transcription
Check the logs; they should mention that CUDA is in use

If GPU is not detected, the app will automatically use CPU.

Test Ollama (if installed)

In VoiceTransor, try "Run Text Operation"
Select a preset (e.g., "Summarize")
If Ollama is running and has a model, it should work

Troubleshooting

"ffprobe not found" Error

Cause: FFmpeg is not installed or not in PATH.

Solution:

Verify that both tools are installed: ffmpeg -version and ffprobe -version
If not found, reinstall FFmpeg
Make sure FFmpeg is in your PATH
Restart VoiceTransor (or your computer)

"Ollama is not running" Error

Cause: Ollama service is not started.

Solution:

Open a terminal
Run: ollama serve
Keep this terminal open
Try again in VoiceTransor
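
Before retrying in the app, you can check from a terminal whether an Ollama server is answering. The sketch below assumes Ollama is listening on its default address, http://localhost:11434, and that VoiceTransor connects to the same address (this guide does not say). `ollama_up` is a hypothetical helper name.

```shell
# ollama_up [URL]: succeed if an Ollama server answers at URL
# (defaults to Ollama's standard local address)
ollama_up() {
    curl -fsS --max-time 3 "${1:-http://localhost:11434}/api/version" >/dev/null 2>&1
}

if ollama_up; then
    echo "Ollama is running"
else
    echo "Ollama is not reachable; start it with: ollama serve"
fi
```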

Auto-start Ollama (Optional):

Windows: Create a scheduled task
macOS: Add to Login Items
Linux: Enable the systemd service, if the install script registered one (sudo systemctl enable --now ollama)

Application Won't Start

Windows:

Try running as Administrator
Check that Windows Defender hasn't blocked or quarantined it

macOS:

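Go to System Preferences → Security & Privacy (System Settings → Privacy & Security on macOS 13 and later)
Click "Open Anyway" next to the message about VoiceTransor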

Linux:

Check file permissions: chmod +x VoiceTransor
Install required libraries: sudo apt install libxcb-cursor0

GPU Not Detected

Check your GPU:

```
# NVIDIA
nvidia-smi
```

Update drivers:

NVIDIA: Download from https://www.nvidia.com/drivers
Minimum version: 525.60.13 for CUDA 12.1
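
The driver version that `nvidia-smi` reports can be compared against this minimum field by field. A plain string comparison would misorder versions such as 525.9 and 525.60. This is a minimal sketch for a Linux shell, or Git Bash on Windows; `version_ge` is a hypothetical helper name.

```shell
# version_ge A B: succeed if dotted version A >= B, comparing each field numerically
version_ge() {
    awk -v a="$1" -v b="$2" 'BEGIN {
        na = split(a, x, "."); nb = split(b, y, ".")
        n = (na > nb) ? na : nb
        for (i = 1; i <= n; i++) {
            xi = (i <= na) ? x[i] + 0 : 0   # missing fields count as 0
            yi = (i <= nb) ? y[i] + 0 : 0
            if (xi > yi) exit 0
            if (xi < yi) exit 1
        }
        exit 0
    }'
}

MIN_DRIVER="525.60.13"
if command -v nvidia-smi >/dev/null 2>&1; then
    driver=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader 2>/dev/null | head -n 1)
    if [ -z "$driver" ]; then
        echo "nvidia-smi is installed but reported no driver version"
    elif version_ge "$driver" "$MIN_DRIVER"; then
        echo "Driver $driver meets the $MIN_DRIVER minimum"
    else
        echo "Driver $driver is older than $MIN_DRIVER; consider updating"
    fi
else
    echo "nvidia-smi not found; VoiceTransor will use the CPU"
fi
```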

If the GPU still isn't detected:

The app automatically falls back to the CPU
Every feature still works; transcription is just slower

Next Steps

Once installation is complete:

Read the User Guide for detailed usage instructions
Try your first transcription
Explore AI text processing with Ollama

Getting Help

If you encounter issues not covered here:

Check USER_GUIDE.md - Troubleshooting
Email: voicetransor@gmail.com

Happy transcribing! 🎉
