NOVA is a sophisticated voice-activated AI assistant designed to enhance your desktop experience through intelligent automation and natural interaction. Leveraging advanced speech recognition and natural language processing, NOVA seamlessly manages your computer tasks, schedule, and daily activities through intuitive voice commands.
-
Voice Activation: Activate NOVA with "Hey Nova"
-
Intelligent Commands:
- System Management (volume, brightness, power controls)
- Application Control (launch/close programs)
- Media Playback
- Integrated Web Search
- Screen Capture & Recording
- Weather Information
- Calendar & Event Management
- Notes & Voice Memos
- Reminders & Timers
-
Smart Capabilities:
- Natural Language Understanding
- Robust Error Handling
- Customizable Settings
- Comprehensive Logging
- Voice Feedback System
- Python 3.10+
- Vosk for Speech Recognition
- TTS (Text-to-Speech)
- Natural Language Processing
- PyAudio
- System Control Libraries
- Python 3.10 or higher
- System Requirements:
- PortAudio
- CUDA (optional for enhanced performance)
- Clone the repository:
git clone https://github.com/yourusername/nova-ai-assistant.git
cd nova-ai-assistant- Set up virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate- Install dependencies:
pip install -r requirements.txt- Download required models:
python download_model.py- Configure environment:
cp .env.example .env
# Add your API keys and preferences to .env- Launch NOVA:
python main.py-
Wait for the "Listening for wake word 'Nova'" prompt
-
Start with "Hey Nova" followed by your command
- "Hey Nova, how's the weather today?"
- "Hey Nova, set a timer for 5 minutes"
- "Hey Nova, capture screen"
- "Hey Nova, launch Chrome"
- "Hey Nova, set volume to 70%"
Customize NOVA through config.yaml in the config directory:
- Audio parameters
- Wake word sensitivity
- Voice settings
- API configurations
- System paths
- Feature enablement
nova/
├── audio/ - Audio processing
├── chatbot/ - Conversation management
├── commands/ - Command implementations
├── config/ - Configuration handling
├── text_to_speech/ - Voice synthesis
├── transcription/ - Speech recognition
├── utils/ - Helper functions
└── voice_activation/ - Wake word detection
We welcome contributions! Please feel free to submit pull requests.
This project is licensed under the MIT License - see the LICENSE file for details.
- Vosk for speech recognition
- TTS for voice synthesis
- All contributing open-source projects
Common solutions:
-
Audio Input Issues:
- Verify microphone connection
- Check system permissions
- Update audio drivers
-
Wake Word Detection:
- Check microphone volume
- Reduce background noise
- Adjust sensitivity settings
-
Command Recognition:
- Speak clearly
- Monitor input levels
- Review command format
For support or questions, please create an issue in the GitHub repository.