A lightweight, all-local pipeline that scrapes PubMed, fine-tunes a compact LLM, and layers Retrieval-Augmented Generation (RAG) to answer microbiome ↔ brain questions.
1 · Data Acquisition & Curation
| Decision point | Selected choice & rationale |
|---|---|
| Access method | NCBI E-utilities (ESearch → ESummary/EFetch): the official API, with structured output and ToS-compliant access. |
| Search syntax | `(microbiome OR "gut microbiota") AND ("gut-brain axis" OR "brain-gut axis")`: high recall with good precision. |
| Date / citation filters | No date cutoff at search time; afterwards, papers more than 3 years old are kept only if they have ≥ 5 citations. |
| Batching & rate limit | `retmax=10000` per request; set `ENTREZ_API_KEY` and an email to allow 10 req/s (without a key, fall back to a 0.35 s sleep between requests). |
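
The choices above could be wired together roughly as follows. This is a minimal sketch, not the project's actual code: the helper names (`esearch_url`, `request_delay`, `keep_paper`) and the `ENTREZ_EMAIL` environment variable are hypothetical, paper age is approximated by the difference in calendar years, and citation counts are assumed to come from a separate source. It only builds the request URL and makes no network call.

```python
import os
from datetime import date
from urllib.parse import urlencode

# Public ESearch endpoint of NCBI E-utilities.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"
QUERY = '(microbiome OR "gut microbiota") AND ("gut-brain axis" OR "brain-gut axis")'


def esearch_url(term=QUERY, retstart=0, retmax=10000, api_key=None, email=None):
    """Build a PubMed ESearch URL; api_key and email are added only when set."""
    params = {
        "db": "pubmed",
        "term": term,
        "retstart": retstart,
        "retmax": retmax,
        "retmode": "json",
    }
    if api_key:
        params["api_key"] = api_key
    if email:
        params["email"] = email
    return f"{ESEARCH}?{urlencode(params)}"


def request_delay(api_key=None):
    """Seconds to sleep between requests: 0.1 s (10 req/s) with a key, else 0.35 s."""
    return 0.1 if api_key else 0.35


def keep_paper(pub_year, citations, today=None, max_age_years=3, min_citations=5):
    """Keep recent papers unconditionally; older ones need at least min_citations.

    Assumption: age is measured as a difference in calendar years.
    """
    current_year = (today or date.today()).year
    if current_year - pub_year <= max_age_years:
        return True
    return citations >= min_citations


# ENTREZ_EMAIL is a hypothetical variable name; ENTREZ_API_KEY comes from the table.
api_key = os.environ.get("ENTREZ_API_KEY")
url = esearch_url(api_key=api_key, email=os.environ.get("ENTREZ_EMAIL"))
delay = request_delay(api_key)
```

The resulting `url` can be fetched with any HTTP client, sleeping `delay` seconds between calls and advancing `retstart` in steps of `retmax` to page through results.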