Publications

MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents

EMNLP 2025 venue badge

Aug 2025 — EMNLP 2025 (Main)

Top Conferences

Authors

Joongmin Shin*, Chanjun Park, Jeongbae Park, Jaehyung Seo, Heuiseok Lim

Abstract

MultiDocFusion combines vision parsing, OCR, and hierarchy reconstruction to preserve document structure during chunking. It improves retrieval quality for long and noisy inputs and provides stronger evidence composition for downstream QA.

Key Contribution

A hierarchical multimodal chunking pipeline that preserves layout and improves evidence composition in industrial RAG.

Architecture

MultiDocFusion architecture
MultiDocFusion architecture

Links