Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval
by Alexander Most, Joseph Winjum, Ayan Biswas, Shawn M. Jones, Nishath Rajiv Ranasinghe, Dan O'Malley, Manish Bhattarai
Retrieval-Augmented Generation (RAG) has become a popular technique for enhancing the reliability and utility of Large Language Models (LLMs) by grounding responses in external documents. Traditional RAG systems rely on Optical Character Recognition (OCR) to first process scan...