Publications

My publications focus on transforming heterogeneous unstructured inputs into structured evidence, reliable long-context reasoning, and robustness diagnostics. Accepted and published work appears first; anonymous manuscripts under review and future-facing research directions are separated below.

Selected Publications

Journals

Domestic Conferences & Theses

Manuscripts Under Review

9 first-author and 4 co-author manuscripts currently under peer review at top-tier venues (anonymized titles per double-blind submission policy).

First-Author Manuscripts

Unified Evaluation Framework for RAG Chunking

Under Review

EMNLP 2026First AuthorUnder Review

Anonymous (Under review)

Joint measurement of retrieval relevance, evidence breadth, faithfulness, latency, memory, and cost for fair chunker comparison.

RAG evaluation chunking benchmark

Delivered-Pack Sensitivity Diagnostics for Document-Agent Memory Commits

Under Review

EMNLP 2026First AuthorUnder Review

Anonymous (Under review)

Demonstrates that final-answer correctness alone is not evidence of grounding; proposes diagnostics on what an agent commits to memory.

agent memory grounding diagnostics document agents

Memory-Write as a New Safety Surface for Agentic AI

Under Review

EMNLP 2026First AuthorUnder Review

Anonymous (Under review)

Quantitative framing, boundary measurement, and practical mitigation tools for agentic AI memory-write vulnerabilities.

agent safety memory-write mitigation

Timestamp-Grounded Evidence Consumption Auditing for Long-Video QA/RAG

Under Review

EMNLP 2026First AuthorUnder Review

Anonymous (Under review)

Shifts video QA/RAG evaluation beyond final-answer accuracy by auditing which timestamped segments were actually consumed.

long-video QA evidence auditing temporal grounding

Evidence-State Control for Repairing, Recalibrating, and Materializing Retrieved Candidates

Under Review

ML ConferenceFirst AuthorUnder Review

Anonymous (Under review)

Controllable repair, recalibration, and materialization of retrieved candidates before reader-context packing.

evidence control RAG candidate repair

Answer-Side Attribution Analysis of OCR, Evidence Placement, Answer Policy, and Reader Family

Under Review

ML ConferenceFirst AuthorUnder Review

Anonymous (Under review)

Attribution analysis of how OCR quality, evidence placement, answer policy, and reader family interact to affect answer-quality gains.

attribution analysis OCR reader analysis

Co-Author Manuscripts

Multimodal, Multi-Document, Page-Annotated Benchmark Dataset for Open-Domain Document QA

Under Review

EMNLP 2026Co-AuthorUnder Review

Anonymous (Under review)

A multimodal multi-document benchmark with page-level annotations for open-domain document QA.

Comprehensive Survey of Visual Question Answering: Methods, Benchmarks, and Evaluation Paradigms

Under Review

EMNLP 2026Co-AuthorUnder Review

Anonymous (Under review)

Comprehensive survey of VQA methods, benchmarks, and evaluation paradigms.

Survey and Audit Framework for Reliability and Safety of Multimodal Agent Systems

Under Review

EMNLP 2026Co-AuthorUnder Review

Anonymous (Under review)

Survey and unified audit framework for the reliability and safety of multimodal agent systems.

Guiding Retrieval and Reasoning for Reasoning-Efficient Agentic RAG Systems

Under Review

EMNLP 2026Co-AuthorUnder Review

Anonymous (Under review)

Retrieval- and reasoning-guidance methods for reasoning-efficient agentic RAG systems.

Patents

Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering Related to HiKEY · ACL 2026
Multi-modal, Multi-page, Multi-document Dependency Chunking with Large Vision-Language Models Related to M3DocDep · CVPR 2026
Device and Method for Document Chunking, and Question-Answering Device and Method Using the Same DP-2025-0093 · EMNLP 2024
Rule Filtering Techniques and Methods for Knowledge Inference Systems Based on Deep Learning P2022-0340-KR00 · Filed 2023
An AI-Based System and Method for Recommending Problems Tailored to the Learner's Level 10-2022-0068075 · Registered 2022

Current Research Directions

Current research directions include actionable ontologies for multimodal agent memory, video-structured memory for long-horizon reasoning, and planning over structured world representations.