🍡 feedmeAI
← All topics
RAG 15 items

Everything RAG

📑 arXiv 2d ago

MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation

MARCH emulates the professional hierarchy of radiology departments using a multi-agent framework with specialized roles: a Resident Agent for initial drafting, Fellow Agents for retrieval-augmented revision, and an Attending Agent orchestrating iterative consensus. The approach addresses clinical hallucinations and lack of verification in automated 3D CT report generation by mimicking collaborative clinical workflows.

📑 arXiv 2d ago

RAGognizer: Hallucination-Aware Fine-Tuning via Detection Head Integration

RAGognizer uses token-level hallucination annotations from real RAG outputs as a direct training signal, integrating a detection head during fine-tuning rather than treating hallucination detection as post-hoc. The approach trains models to recognize when generated content is unsupported by retrieved context, addressing closed-domain hallucinations in retrieval-augmented generation.

📑 arXiv 3d ago

IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning

IG-Search introduces step-level information gain rewards for search-augmented reasoning, measuring how retrieved documents improve model confidence in answers relative to random baselines. This addresses the gradient collapse problem in trajectory-level RL when all sampled trajectories fail and enables distinguishing precise queries from vague ones within rollout groups.

📑 arXiv 3d ago

Blinded Multi-Rater Comparative Evaluation of a Large Language Model and Clinician-Authored Responses in CGM-Informed Diabetes Counseling

Blinded multi-rater study with 6 senior diabetes clinicians evaluated retrieval-grounded LLM conversational agent for CGM data interpretation and patient counseling support across 12 cases. System generated plain-language explanations while avoiding individualized therapeutic advice, addressing time-intensive nature of CGM pattern explanation. Evidence development for RAG-based clinical decision support in diabetes care.

🤗 Hugging Face 4d ago

Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG

Corpus2Skill distills document corpora into hierarchical skill directories that LLM agents navigate rather than passively retrieve, addressing RAG's limitation of treating models as passive consumers. The system clusters documents offline into a navigable tree with LLM-written summaries at each level, giving agents a bird's-eye corpus view for better evidence synthesis.

🤗 Hugging Face 4d ago

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

DR³-Eval provides a reproducible benchmark for deep research agents using static research sandbox corpora paired with authentic user tasks, measuring multimodal report generation across dimensions including information recall, factual accuracy, and citation coverage. It addresses the challenge of evaluating long-horizon research tasks by simulating open-web complexity while remaining fully verifiable.