Process complex documents (PDFs, Word documents) that contain: - Running text paragraphs. - Tables (extract to structured format). - Figures and charts (extract images, generate captions). - Headers and structural elements. - For each extracted element, maintain: - The content (text or image). - The