feat(anyhunt/video-transcript): implement dual-mode transcript pipeline by dvlin-dev · Pull Request #80 · dvlin-dev/moryflow

dvlin-dev · 2026-02-09T18:59:54Z

add LOCAL+CLOUD_FALLBACK workers, queue orchestration, runtime switch, and budget guard

add console/admin pages, prisma migration, local deployment script, and deployment checklist docs

add LOCAL+CLOUD_FALLBACK workers, queue orchestration, runtime switch, and budget guard add console/admin pages, prisma migration, local deployment script, and deployment checklist docs

dvlin-dev · 2026-02-10T02:51:40Z

@codex

ensure activeTasks is decremented and task is finalized when createWorkspace fails

use domain-boundary hostname checks to prevent suffix-matching bypass

cloud fallback completion uses terminal-state guard and conditional update

keep default 5m timeout config for existing queues; use a dedicated Bull configKey for long-running video transcript jobs

dvlin-dev · 2026-02-10T05:02:17Z

已按最佳实践补齐并修复 review threads（已全部 resolve）：

VPS2/Mac local worker 使用独立 worker app（apps/anyhunt/server/src/video-transcript/worker.ts + apps/anyhunt/server/src/video-transcript/video-transcript-worker-app.module.ts），不加载全量 AppModule，避免误消费其他队列/定时任务；worker 进程改为 createApplicationContext（不提供 HTTP）。
Docker 入口新增 ANYHUNT_RUN_MODE=api|video-transcript-worker 与 ANYHUNT_RUN_MIGRATIONS=true|false，worker 建议 ANYHUNT_RUN_MIGRATIONS=false。
URL 入参强制 http(s) 协议校验（DTO + normalize 双层兜底）；VIDEO_TRANSCRIPT_ENABLE_LOCAL_WORKER / VIDEO_TRANSCRIPT_ENABLE_CLOUD_FALLBACK_WORKER 默认值调整为 false，避免误启。
Admin/Console 文案与时间展示对齐：Queues 页用户可见文案统一英文；时间展示统一 formatRelativeTime。
部署流程 + 上线前 checklist 已写入：docs/architecture/anyhunt-video-transcript-pipeline.md（第 16/17 节）。

dvlin-dev · 2026-02-10T06:32:26Z

已按 PR review 修复并推送到分支 video2text：

cancelTask 改为 updateMany + terminal status guard，并且仅在取消写入成功后才写 preempt signal，避免并发完成时把 COMPLETED 覆盖成 CANCELLED。
Budget Lua EVAL 入参显式 String() 化，避免浮点/number 作为 ARGV 的隐式转换边界。
补齐回归单测覆盖上述竞态分支。

对应变更：

apps/anyhunt/server/src/video-transcript/video-transcript.service.ts
apps/anyhunt/server/src/video-transcript/video-transcript-budget.service.ts
apps/anyhunt/server/src/video-transcript/__tests__/video-transcript.service.spec.ts

devin-ai-integration

Devin Review found 1 new potential issue.

View 28 additional findings in Devin Review.

devin-ai-integration · 2026-03-06T10:05:11Z

apps/anyhunt/server/src/video-transcript/video-transcript-cloud-fallback.processor.ts

+      if (reason === 'timeout' && !preemptSignaled) {
+        cloudOwnershipAcquired = await this.acquireCloudOwnership(
+          taskId,
+          'DOWNLOADING',
+          startedAt,
+        );
+        if (!cloudOwnershipAcquired) {
+          return;
+        }


🔴 Budget reservation leak when non-probe ownership acquisition fails

In the cloud fallback processor's handleCloudRun, when probedDurationSec === 0 (probe failed) and reason === 'timeout', budget is reserved using actual audio duration at video-transcript-cloud-fallback.processor.ts:196-197. Then at lines 208-216, if acquireCloudOwnership fails (e.g., local already completed or task was cancelled), the code returns without releasing the reserved budget. This is a budget leak that inflates the daily usage counter in Redis.

The PR explicitly fixed the analogous leak on the probe path (lines 163-167 correctly call releaseCloudBudgetReservation before returning), but missed the non-probe fallback path. The CLAUDE.md constraint at apps/anyhunt/server/src/video-transcript/CLAUDE.md states: "timeout 路径若在 probe 预算后丢失 CLOUD_FALLBACK 执行权，必须回滚这次 budget reservation" — this same principle applies to the non-probe reservation path.

Suggested change

if (reason === 'timeout' && !preemptSignaled) {

cloudOwnershipAcquired = await this.acquireCloudOwnership(

taskId,

'DOWNLOADING',

startedAt,

);

if (!cloudOwnershipAcquired) {

return;

}

if (reason === 'timeout' && !preemptSignaled) {

cloudOwnershipAcquired = await this.acquireCloudOwnership(

taskId,

'DOWNLOADING',

startedAt,

);

if (!cloudOwnershipAcquired) {

if (budgetReservation) {

await this.budgetService.releaseCloudBudgetReservation(

budgetReservation,

);

}

return;

}

Was this helpful? React with 👍 or 👎 to provide feedback.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 28d91c109d

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-06T10:06:07Z

apps/anyhunt/server/src/video-transcript/video-transcript-executor.service.ts

+    const dir = path.join(os.tmpdir(), 'anyhunt-video-transcript', taskId);
+    await mkdir(dir, { recursive: true });


Isolate workspace directory per executor

createWorkspace derives the path only from taskId, and both LOCAL and CLOUD_FALLBACK processors call it with the same ID. In timeout fallback races (where cloud starts while local may still be running), both workers read/write video.*, audio.wav, and transcript files in the same directory, and either worker's cleanup can delete files the other still needs, causing spurious failures or corrupted outputs.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-06T10:06:07Z

apps/anyhunt/server/src/video-transcript/video-transcript-cloud-fallback.processor.ts

+          startedAt,
+        );
+        if (!cloudOwnershipAcquired) {
+          return;


Release reserved budget when timeout takeover loses race

In the probeVideoDurationSeconds() === 0 path, budget is reserved later from measured audio duration, but if cloud ownership acquisition then fails, execution returns immediately without rolling back that reservation. This leaks daily cloud budget for tasks that never actually entered cloud execution (e.g., local completed/cancelled during the pre-takeover work), which can incorrectly exhaust the budget gate for subsequent tasks.

Useful? React with 👍 / 👎.

dvlindev and others added 2 commits February 10, 2026 02:58

feat(anyhunt/video-transcript): implement dual-mode transcript pipeline

0933ae7

add LOCAL+CLOUD_FALLBACK workers, queue orchestration, runtime switch, and budget guard add console/admin pages, prisma migration, local deployment script, and deployment checklist docs

Merge branch 'main' into video2text

db8545f

This comment was marked as resolved.

Sign in to view

dvlindev added 9 commits February 10, 2026 11:28

fix(anyhunt/video-transcript): guard local workspace init

3734efe

ensure activeTasks is decremented and task is finalized when createWorkspace fails

fix(anyhunt/video-transcript): harden platform allowlist

7f1f48e

use domain-boundary hostname checks to prevent suffix-matching bypass

fix(anyhunt/video-transcript): avoid overwriting terminal state

a1e6b7a

cloud fallback completion uses terminal-state guard and conditional update

fix(queue): isolate video transcript queues

2ca0231

keep default 5m timeout config for existing queues; use a dedicated Bull configKey for long-running video transcript jobs

Merge remote-tracking branch 'origin/video2text' into video2text

8296e60

fix(anyhunt/video-transcript): guard status transitions

5d4143e

feat(anyhunt/video-transcript): add dedicated worker mode

9246062

chore(anyhunt/video-transcript): align admin/console UI copy

0efdedb

docs(video-transcript): update deployment docs

5b7dd50

This comment was marked as resolved.

Sign in to view

dvlindev added 2 commits February 10, 2026 14:26

fix(anyhunt/video-transcript): harden cancel race and budget eval args

796afe2

docs(architecture): sync video transcript pipeline progress

dc8bff0

This comment was marked as resolved.

Sign in to view

Merge origin/main into video2text

3d25c5c

This comment was marked as resolved.

Sign in to view

dvlin-dev added 2 commits March 6, 2026 16:37

fix(anyhunt/video-transcript): finalize cloud preflight failures

065541d

fix(anyhunt/console): align video transcript page ui imports

6ee2dd8

This comment was marked as resolved.

Sign in to view

fix(anyhunt/video-transcript): close queue and ownership races

2dc6653

This comment was marked as resolved.

Sign in to view

fix(anyhunt/video-transcript): preserve retries and toggle audit state

aa6aa70

This comment was marked as resolved.

Sign in to view

fix(anyhunt/video-transcript): rollback timeout budget reservations

28d91c1

devin-ai-integration bot reviewed Mar 6, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Mar 6, 2026

View reviewed changes

		const dir = path.join(os.tmpdir(), 'anyhunt-video-transcript', taskId);
		await mkdir(dir, { recursive: true });

Conversation

dvlin-dev commented Feb 9, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dvlin-dev commented Feb 10, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

dvlin-dev commented Feb 10, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

dvlin-dev commented Feb 10, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dvlin-dev commented Feb 9, 2026 •

edited by devin-ai-integration bot

Loading