Build a web-based user interface using Streamlit, Gradio, or React. - The UI must support: - Text input (chat-style). - Image upload (drag-and-drop or file picker). - Document upload (PDF, Word). - Display of multimodal responses: text with inline images, tables, and citations. - Conversation histor