Multimodal Document Parsing
Structure recovery from noisy, heterogeneous PDFs using LVLMs and dependency-based chunking.
Senior Researcher, Human-Inspired AI Research — Korea University AI Institute Founder & Lead, KUDoc — Korea University Document AI Research Group
From unstructured inputs to structured evidence,
and from document structure to dynamic world structure.
I research how AI systems recover, organize, and audit evidence from heterogeneous unstructured observations, with a current focus on structure-aware evidence systems for documents and long-context reasoning.
I completed my B.S. in Computer Science and Engineering (advised by Prof. Jeongu Kim) and M.S. in Artificial Intelligence (advised by Prof. Hyuk-Chul Kwon) at Pusan National University, then joined the Human-Inspired AI Research Group at Korea University, co-advised by Prof. Jaehyung Seo and Prof. Heuiseok Lim.
There, I founded KUDoc — the Document AI research group at Korea University — on my own initiative, and this leadership earned me a promotion to Senior Researcher. I now lead and advise KUDoc, and together we have produced 12 top-tier conference papers — including 4 as first author at ACL, CVPR, and EMNLP — along with 5 patents, 4 major industry projects with 3 technology transfers, and 7 awards.
Looking ahead, I aim to extend this foundation toward structured memory, world models, and planning for future agents.
I am currently preparing for a Ph.D. program and am open to new opportunities. If my research directions resonate with yours, please feel free to reach out via email or LinkedIn — I would be glad to connect.
HiKEY was accepted at ACL 2026 Main, Oral (First Author).
M3DocDep was accepted at CVPR 2026 Main (First Author).
MultiDocFusion was accepted at EMNLP 2025 Main (First Author).
Appointed to the National Representative K-AI Research Team (with NC AI).
StyleDFS was accepted at EMNLP 2024 Industry (Co-First Author).
Structure recovery from noisy, heterogeneous PDFs using LVLMs and dependency-based chunking.
Structure-aware evidence retrieval and assembly for reliable long-context document QA.
Audit mechanisms for AI reasoning processes and error propagation diagnostics in document pipelines.
Across ACL, CVPR, and EMNLP(2).
Translated research ideas into protected and deployable AI methods.
Led major collaborations and converted research into real-world outcomes.
Recognized across research, industry collaboration, and applied AI competitions.
Structure-aware retrieval, multimodal QA, and production-facing workflows.
Korean foundation-model adaptation, data pipelines, post-training, and evaluation.
Sentence-level parsing, table/text QA, and speech-oriented language modeling.