Extract thesis metadata from title page images
Align extracted person names with IdRef identifiers
Streamlit template space
Nanonets / olmOCR / Qwen3-VL / LightOnOCR-2-1B / NuExtract
Search documents using BM25, dense retrieval, and fusion
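
For the search entry above, a minimal sketch of how BM25 scoring and rank fusion can fit together. The entry does not say which fusion method is used; reciprocal rank fusion (RRF) is assumed here as one common choice. The names `bm25_rank` and `rrf` and the parameter defaults (`k1=1.5`, `b=0.75`, `k=60`) are illustrative, not taken from the project, and the dense ranking is expected to come from a separate embedding model not shown here.

```python
import math
from collections import Counter


def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Return document indices ordered by BM25 score, best first.

    Whitespace tokenization and the k1/b defaults are assumptions
    made for this sketch.
    """
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avgdl = sum(len(t) for t in tokenized) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return sorted(range(n), key=lambda i: scores[i], reverse=True)


def rrf(rankings, k=60):
    """Fuse several ranked lists of document ids with reciprocal rank fusion.

    Each document receives 1 / (k + rank) from every list it appears in;
    documents are returned by descending fused score.
    """
    fused = Counter()
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=lambda d: fused[d], reverse=True)
```

A caller would pass the output of `bm25_rank` together with a dense-retrieval ranking of the same document ids to `rrf`. Because RRF works on ranks rather than raw scores, the BM25 and dense scores do not need to be on a comparable scale.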