🍡 feedmeAI
Data-attribution 1 item


📑 arXiv 2d ago

Sketching the Readout of Large Language Models for Scalable Data Attribution and Valuation

RISE (Readout Influence Sketching Estimator) makes data attribution for LLMs scalable by concentrating on influence hotspots at the output layer instead of computing gradients across the entire model. It applies CountSketch projections to a dual-channel representation (a lexical residual channel and a semantic projected-error channel), which makes gradient-based attribution tractable for large models.
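To make the CountSketch step concrete, here is a minimal sketch of a generic CountSketch projection in NumPy. It is not the RISE implementation: the function names (`make_countsketch`, `countsketch`), the dimensions, and the random stand-in vectors are all assumptions for illustration, and the dual-channel representation from the summary is not reproduced. It only shows how a long gradient-like vector can be compressed so that inner products, the quantity gradient-based attribution scores are typically built from, are approximately preserved.

```python
import numpy as np

def make_countsketch(dim, sketch_dim, seed=0):
    """Draw one random bucket and one random sign per input coordinate."""
    rng = np.random.default_rng(seed)
    buckets = rng.integers(0, sketch_dim, size=dim)       # hash h: [dim] -> [sketch_dim]
    signs = rng.choice(np.array([-1.0, 1.0]), size=dim)   # sign  s: [dim] -> {-1, +1}
    return buckets, signs

def countsketch(x, buckets, signs, sketch_dim):
    """Project x to sketch_dim by summing signed coordinates per bucket."""
    out = np.zeros(sketch_dim)
    np.add.at(out, buckets, signs * x)  # unbuffered add handles bucket collisions
    return out

# Hypothetical stand-ins for per-example gradient vectors.
dim, sketch_dim = 10_000, 512
buckets, signs = make_countsketch(dim, sketch_dim)
rng = np.random.default_rng(1)
g_train = rng.standard_normal(dim)
g_test = g_train + 0.5 * rng.standard_normal(dim)

# Both vectors must be sketched with the same buckets and signs.
s_train = countsketch(g_train, buckets, signs, sketch_dim)
s_test = countsketch(g_test, buckets, signs, sketch_dim)

exact = g_train @ g_test
approx = s_train @ s_test
print(f"exact inner product:    {exact:.1f}")
print(f"sketched inner product: {approx:.1f}")
```

The sketched inner product is an unbiased estimate of the exact one, with variance that shrinks as `sketch_dim` grows. Because the projection is linear, sketches of individual examples can be stored once and combined later without revisiting the full-size vectors.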