Fix server model roles, RAG, image tasks, and GPU options by sheikhti1205 · Pull Request #126 · Siddhesh2377/ToolNeuron

sheikhti1205 · 2026-06-14T13:24:21Z

Hi, this PR includes the ToolNeuron fixes/improvements I mentioned in discussion #125.

What changed

Server model selection

Fixed remote server chat model selection so it no longer silently auto-picks the wrong model.
Added stricter model handling for server requests.
Added Android Server screen chat model picker.
Updated the bundled Web UI settings so it uses a real chat model dropdown from /v1/models.
Prevented embedding/upscaler models from being selected as chat models.

Manual model categories

Added manual model category assignment for installed models.
Categories include Chat, Embedding, Image Generation, Image Upscaler, TTS, and STT.
These categories are used across Store, Model Manager, Server, and Image Task screens.

Image task improvements

Added better handling for image generation, inpaint, and upscale tasks.
Added progress/metrics UI.
Added output options like keeping result in session, replacing input image, saving to Photos, and Save As.
Added GPU/OpenCL toggle for image tasks.

RAG improvements

Allowed selecting all file types for RAG.
Added fallback text extraction for more document-like formats.
Improved document summary behavior so summaries use broader document excerpts instead of only small top-k retrieval chunks.
Added better default RAG/embedding model repos.

GPU option for chat/server

Added GGUF GPU offload option in model loading settings.
Applied the same model config to normal chat and server/VLM loading paths.

Web UI

Redesigned the bundled server Web UI to visually match the Android app more closely.

Tested

./gradlew :app:compileDebugKotlin --console=plain
./gradlew :app:assembleDebug --console=plain

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: bec32eeacc

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

sheikhti1205 · 2026-06-14T14:02:59Z

Should I convert this to draft? And do the fixes it recommends?
I noticed another issue on the "Server model roles"

@Siddhesh2377

Siddhesh2377 · 2026-06-15T08:34:20Z

Ya Bro Fix what ever you can, I am currently out so can't code much
cc: @sheikhti1205

sheikhti1205 · 2026-06-15T15:53:08Z

For the time being, I think the app is in a good state. I tested it as much as I could, and so far I haven’t noticed any major issues—only some thoughts for future improvements.

Since you mentioned some new things in your last release note, I think you can start working on those whenever you get time.

Also, about this PR, would you prefer to merge the current changes first and continue the remaining improvements in follow-up PRs, or should I keep this PR open and add more fixes here? I’m okay with both approaches.
@Siddhesh2377

Siddhesh2377 · 2026-06-17T03:16:25Z

it's ok you can add more stuff in this pr
@sheikhti1205

sheikhti1205 · 2026-06-21T14:08:45Z

@Siddhesh2377

Note: my exams are starting soon, so I may not be able to work on this for a month or more. If you have suggestions or want changes in a specific direction, please let me know here and I’ll try to address them when I’m available.

Recent update summary:

Improved model selection behavior:
- chat now accepts manually assigned chat-capable models, not only VLM-style models
- image input still requires a vision-capable model
- clearer warnings when no model is installed, selected, still downloading, or incompatible with the current input
Improved model store organization:
- added shared model taxonomy/grouping
- changed the store into a clearer family -> task -> model structure
- kept DeepSeek separated so DeepSeek/Qwen-style model names do not crowd the Qwen section
- added a default filter that hides models larger than 2GB, with an option to show them manually
- added LFM 1.2B instruct/thinking and LFM2.5-VL 1.6B entries
- added small Gemma, SmolLM, Qwen, and DeepSeek reasoning entries
- reduced the crowded filter/category area
Improved backup/import:
- added setup restore entry point
- added export/import progress with ETA
- added import preview with per-model selection
- added overwrite/conflict handling
- added checksum verification
- added support for exporting content-URI models
- added notifications for backup completion/failure
Improved RAG behavior:
- better document-summary prompt
- avoids meta answers like “the question is asking...”
- uses full extracted document text when it fits in context
- increased usable RAG context budget where the model allows it
- changed “Possible sources” to “Sources” when document chunks are attached
Improved web search behavior:
- better handling for direct links, especially Play Store / Google Play requests
- better query targeting for app download links
- carries context for short follow-up requests like “exact link”
Added app-side notification when an AI response completes while the app is not foregrounded.

Important note:

I attempted to move the bundled server Web UI toward an Open WebUI-style structure, but the current result is not good enough. The web interface got messy and should probably be treated as needing a full rewrite rather than small patching.

Other notes:

Web search is improved, but still not perfect.
RAG is also improved, but I think more effective changes will need better real-world examples and articles/cases to test against. Until I stumble across better references for what works well here, this is a reasonable stopping point.

Tested with:

./gradlew :app:compileDebugKotlin --console=plain
./gradlew :app:assembleDebug --console=plain

Siddhesh2377 · 2026-06-22T03:50:57Z

Hey @sheikhti1205
You can Focus on your exams bro, will look into this after a month, best of luck man !

sheikhti1205 · 2026-06-22T16:42:24Z

@Siddhesh2377 I don't know but since I'm putting effort here. I really hope this turns into a great project! I'm adding a bit more stuffs to make things right + extra bits 🌟🌟
Question: Should I open a new PR? or keep this one?
edit:
I should add that the initial setup ui needs more fixing like loopholes. I'm not fixing those this time. Sorry. I'll leave that to you.

sheikhti1205 · 2026-06-23T01:46:55Z

Major changes so far:
Added server model roles and multi-engine remote server catalog handling.
Improved Remote Server WebUI with responsive layout, separate CSS asset, settings, history, markdown export, read aloud, attachments, and mobile behavior.
Added /webui.css native server route and public auth allowance.
Improved server chat routing, VLM routing, setup state, and model role fallback behavior.
Added model backup/import/export support. (needs fixing/improvement)
Added Downloads screen, download history, labels, active-download tracking, and now retry/clear handling. (untested)
Added automatic download retry for transient network/HTTP failures with preserved partial .hxd_tmp resume. (untested)
Removed visible Tool/Search model UX; legacy TOOL_SEARCH installs migrate to normal GGUF.
Simplified Vision Store browsing to provider/family sections instead of nested VLM group cards.
Kept VLM base + projector auto-download behavior internally.
Added adaptive image upscale workflow: 2x/4x/8x/custom. (this needs more fixing)
Removed temporary ONNX image-operation UI/runtime direction from active scope.
Added Storage maintenance: Quick clean, Detailed check, Deep model test, report summaries. (untested)
Restructured Settings into grouped areas: Models, Storage, Downloads, Remote Server, Web Search, Privacy & Security, Appearance, Advanced, and About.
Improved web search workflow, query modes, page fetching/extraction, search cards, and result state handling. (partially tested before more improvements were added so, it's untested 😕)
Improved RAG/search-related flows and model setup packs.
Added system TTS/STT fallback plumbing and voice handling polish.

I know I said I would stop but wanted to do a few fixes. But there are still defects, flaws and others. And neeeds testing too - which I didn't :-)
For example: Server webui is designed to work for desktop but not mobile (basically you can view it if you use desktop mode for now.

Fix server model roles, RAG, image tasks, and GPU options

bec32ee

chatgpt-codex-connector Bot reviewed Jun 14, 2026

View reviewed changes

Comment thread app/src/main/assets/server_webui.html Outdated

Comment thread app/src/main/java/com/dark/tool_neuron/viewmodel/ImageTaskViewModel.kt Outdated

Comment thread app/src/main/java/com/dark/tool_neuron/viewmodel/ModelStoreViewModel.kt

sheikhti1205 added 2 commits June 14, 2026 20:41

Refine model identity settings and web UI refresh

399aa68

Improve server web chat usability

178fd95

sheikhti1205 added 4 commits June 21, 2026 14:28

Add model roles, idle unload, and backup support

a7b09e2

Polish model store, backups, and web UI

6ca2ebe

Refine setup restore, RAG search, and web UI

d530912

Refine model store tree, RAG, and web setup

1ba26f2

Fix server chat routing and setup state

4730ac5

feat: refine store downloads and image workflows

6086e82

sheikhti1205 added 2 commits June 23, 2026 17:01

fix: stabilize settings section rendering

c199f80

fix: make remote web ui fit mobile

ee515d6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix server model roles, RAG, image tasks, and GPU options#126

Fix server model roles, RAG, image tasks, and GPU options#126
sheikhti1205 wants to merge 11 commits into
Siddhesh2377:re-writefrom
sheikhti1205:codex/server-model-roles-rag-gpu

sheikhti1205 commented Jun 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sheikhti1205 commented Jun 14, 2026

Uh oh!

Siddhesh2377 commented Jun 15, 2026

Uh oh!

sheikhti1205 commented Jun 15, 2026 •

edited

Loading

Uh oh!

Siddhesh2377 commented Jun 17, 2026

Uh oh!

sheikhti1205 commented Jun 21, 2026

Uh oh!

Siddhesh2377 commented Jun 22, 2026

Uh oh!

sheikhti1205 commented Jun 22, 2026 •

edited

Loading

Uh oh!

sheikhti1205 commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sheikhti1205 commented Jun 14, 2026

What changed

Server model selection

Manual model categories

Image task improvements

RAG improvements

GPU option for chat/server

Web UI

Tested

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sheikhti1205 commented Jun 14, 2026

Uh oh!

Siddhesh2377 commented Jun 15, 2026

Uh oh!

sheikhti1205 commented Jun 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Siddhesh2377 commented Jun 17, 2026

Uh oh!

sheikhti1205 commented Jun 21, 2026

Uh oh!

Siddhesh2377 commented Jun 22, 2026

Uh oh!

sheikhti1205 commented Jun 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sheikhti1205 commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sheikhti1205 commented Jun 15, 2026 •

edited

Loading

sheikhti1205 commented Jun 22, 2026 •

edited

Loading