Interview Coach

AI-powered interview coaching platform. Users select an interviewer, choose feedback timing, listen to a spoken question, record their answer, and receive structured feedback on delivery, tone, and answer quality.

How it works

User selects an interviewer voice and feedback timing, then presses Start Interview
Interviewer introduces themselves via TTS (POST /speech/tts)
Interviewer reads the question aloud; recording starts automatically
User answers and presses I'm Done Answering
Audio → Groq Whisper → timestamped transcript segments (POST /speech/transcribe)
Each segment → local wav2vec2 model → arousal / dominance / valence scores
Scores + transcript displayed per segment on the interview screen
Transcript + scores + question → Groq LLM → structured feedback (POST /feedback/generate)
Scorecard renders per-question feedback (strengths, improvements, delivery notes, model answer, transcript scores) or full-session summary at end of interview (POST /feedback/generate-session)

Prerequisites

Install these once at the OS level before running setup:

Python 3.9+
Node.js 18+ — download from https://nodejs.org (LTS release)
pnpm — npm install -g pnpm
ffmpeg — required to decode audio files:

Platform	Command
Windows	`choco install ffmpeg` (requires admin terminal)
macOS	`brew install ffmpeg`
Linux	`sudo apt install ffmpeg`

Verify with ffmpeg -version after installing. On Windows, restart the terminal first.

Setup

Run once from the project root after cloning.

1. Python environment

python -m venv venv
source venv/Scripts/activate   # Windows (Git Bash)
# source venv/bin/activate     # macOS / Linux
pip install -r backend/requirements.txt

(venv) appears in the prompt when active. Re-activate in every new terminal — packages stay installed.

2. Environment variables

cp backend/.env.example backend/.env

Open backend/.env and fill in your keys:

GROQ_API_KEY=your_key_here       # https://console.groq.com  (TTS + speech-to-text + LLM feedback)
ANTHROPIC_API_KEY=your_key_here  # https://console.anthropic.com  (reserved)
OPENAI_API_KEY=your_key_here     # Reserved

backend/.env is gitignored — never commit it.

3. Accept Groq Orpheus TTS terms ⚠️ Required

The text-to-speech feature uses Groq's Orpheus model. Each Groq account must accept the model terms once before the TTS endpoint will work. Without this step the backend starts fine but every call to POST /speech/tts returns a 400 error and the interviewer will be silent.

Log in to console.groq.com

Visit this URL directly:

https://console.groq.com/playground?model=canopylabs%2Forpheus-v1-english

Click Accept on the terms banner
Done — acceptance is permanent for your account, no need to repeat it

If you skip this step and try to run the project, the interviewer will not speak and the frontend will show "Could not load audio — check the backend is running." The backend itself will still start correctly.

4. Install frontend dependencies

cd frontend
pnpm install

Installs dependencies and git hooks (lefthook). Pre-commit auto-formats staged files; pre-push runs ESLint, Prettier, and TypeScript checks.

Running the project

Open two terminals from the project root.

Terminal 1 — Backend:

source venv/Scripts/activate   # Windows; venv/bin/activate on macOS/Linux
cd backend
uvicorn app:app --reload

Starts at http://localhost:8000. API docs at http://localhost:8000/docs. First run downloads ~1 GB of emotion model weights — this is normal.

Terminal 2 — Frontend:

cd frontend
pnpm dev

App at http://localhost:3000.

Repo structure

backend/
  app.py                        — FastAPI entry point; registers routers, loads emotion model
  requirements.txt              — Python dependencies
  services/
    tone_delivery_analyzer/     — local wav2vec2 emotion model (arousal/dominance/valence)
    speech_to_text/             — Groq Whisper transcription + Orpheus TTS
    llm/                        — Groq LLM feedback generation (/feedback/generate, /feedback/generate-session)
    text_analysis/              — transcript scoring (reserved)

frontend/
  app/
    page.tsx                    — interview UI (home page)
    InterviewClient.tsx         — full interview state machine (recording, VAD, TTS, feedback)
    layout.tsx / globals.css    — root layout and styles
    components/
      MicWaveform/              — live mic waveform visualizer
    scorecard/
      ScorecardPanel.tsx        — per-question scorecard (calls /feedback/generate, renders result)
      SessionScorecardPanel.tsx — full-session scorecard (calls /feedback/generate-session)
      components/
        Scorecard/              — top-level scorecard layout
        DeliveryScores/         — arousal/dominance/valence score display
        TranscriptFeedbackScores/ — clarity/structure/relevance/conciseness display
        QualitativeFeedback/    — strengths, improvements, delivery notes
        ModelAnswer/            — example answer display
    dev/
      flow/                     — pipeline visualization dev page
      transcribe/               — transcription dev/debug page
  lib/
    prompts/
      questions.ts              — question bank (20 questions + intro)
      interviewers.ts           — interviewer voices and personas
    interview-coach/
      types.ts                  — shared TypeScript types
      mocks.ts                  — mock data for UI development
      pipelineStages.ts         — pipeline stage definitions
      sessionAdapter.ts         — builds ReviewContextPayload from question + segments
      feedbackSpeech.ts         — converts feedback responses to TTS scripts
      aggregateReviewPayload.ts — aggregates per-question answers into session payload
    speech/
      tts.ts                    — frontend helper for POST /speech/tts

docs/
  architecture.md               — system architecture and data flow
  agent-workflows/              — AI agent workflow guides

Reference

Architecture and data flow: docs/architecture.md
Backend endpoints: backend/README.md
Agent conventions: AGENTS.md
Tone/delivery model: backend/services/tone_delivery_analyzer/README.md

Stack

Layer	Technology
Frontend	Next.js 16, React 19, TypeScript, Tailwind CSS 4
Component docs	Storybook 10
Backend	FastAPI (Python)
Text-to-speech	Groq Orpheus (`canopylabs/orpheus-v1-english`)
Speech-to-text	Groq Whisper (`whisper-large-v3-turbo`)
Tone/delivery	`audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim` (local)
LLM feedback	Groq (`openai/gpt-oss-20b`)
Package manager	pnpm
Frontend deploy	TBD
Backend deploy	TBD

Backend endpoints

Method	Path	Status	Purpose
GET	`/health`	done	Liveness check
GET	`/emotion/health`	done	Confirms emotion model is loaded
POST	`/speech/tts`	done	Text → WAV audio (Groq Orpheus TTS)
POST	`/speech/transcribe`	done	Audio → per-segment transcript + emotion scores
POST	`/emotion/analyze`	done	Audio → single arousal / dominance / valence score
POST	`/feedback/generate`	done	Transcript + scores + question → per-answer LLM feedback
POST	`/feedback/generate-session`	done	Multiple Q&A pairs → holistic session feedback + per-question reviews
POST	`/analysis/*`	reserved	Transcript text analysis (reserved)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
backend		backend
docs		docs
frontend		frontend
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
lefthook.yml		lefthook.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Interview Coach

How it works

Prerequisites

Setup

1. Python environment

2. Environment variables

3. Accept Groq Orpheus TTS terms ⚠️ Required

4. Install frontend dependencies

Running the project

Repo structure

Reference

Stack

Backend endpoints

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Interview Coach

How it works

Prerequisites

Setup

1. Python environment

2. Environment variables

3. Accept Groq Orpheus TTS terms ⚠️ Required

4. Install frontend dependencies

Running the project

Repo structure

Reference

Stack

Backend endpoints

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages