What is wrong with scene text recognition model comparisons? dataset and model analysis J Baek, G Kim, J Lee, S Park, D Han, S Yun, SJ Oh, H Lee Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 606 | 2019 |
OCR-Free Document Understanding Transformer G Kim, T Hong, M Yim, JY Nam, J Park, J Yim, W Hwang, S Yun, D Han, ... Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 230* | 2022 |
Cost-effective End-to-end Information Extraction for Semi-structured Document Images W Hwang, H Lee, J Yim, G Kim, M Seo Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 26 | 2021 |
Graph embedding with shifted inner product similarity and its improved approximation capability A Okuno, G Kim, H Shimodaira The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 10 | 2019 |
Representation learning with weighted inner product for universal approximation of general similarities G Kim, A Okuno, K Fukui, H Shimodaira Proceedings of the Twenty-Eighth International Joint Conference on …, 2019 | 9 | 2019 |
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models G Kim, H Lee, D Kim, H Jung, S Park, Y Kim, S Yun, T Kil, B Lee, S Park Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 6* | 2023 |
Word-like character n-gram embedding G Kim, K Fukui, H Shimodaira Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User …, 2018 | 6 | 2018 |
On text localization in end-to-end OCR-Free document understanding transformer without text localization supervision G Kim, S Yokoo, S Seo, A Osanai, Y Okamoto, Y Baek International Conference on Document Analysis and Recognition, 215-232, 2023 | 4 | 2023 |
Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings M Naito, S Yokoi, G Kim, H Shimodaira Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021 | 4* | 2021 |
Segmentation-free Compositional -gram Embedding G Kim, K Fukui, H Shimodaira Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 4 | 2019 |
Prometheusvision: Vision-language model as a judge for fine-grained evaluation S Lee, S Kim, SH Park, G Kim, M Seo arXiv preprint arXiv:2401.06591, 2024 | 2 | 2024 |
SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap D Kim, Y Kim, DH Kim, Y Lim, G Kim, T Kil 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023 | 1 | 2023 |
On Web-based Visual Corpus Construction for Visual Document Understanding D Kim, T Hong, M Yim, Y Kim, G Kim Proceedings of the International Conference on Document Analysis and …, 2023 | 1 | 2023 |
Scale down transformer by grouping features for a lightweight character-level language model S Park, G Kim, J Lee, J Cha, JH Kim, H Lee Proceedings of the 28th International Conference on Computational …, 2020 | 1 | 2020 |
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning G Kim, M Seo arXiv preprint arXiv:2406.11823, 2024 | | 2024 |
CREPE: Coordinate-Aware End-to-End Document Parser Y Okamoto, Y Baek, G Kim, R Nakao, DH Kim, MB Yim, S Park, B Lee arXiv preprint arXiv:2405.00260, 2024 | | 2024 |
HyperCLOVA X Technical Report HyperCLOVA arXiv preprint arXiv:2404.01954, 2024 | | 2024 |
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching G Kim, W Hwang, M Seo, S Park Proceedings of the AAAI-22 Workshop on Knowledge Discovery from Unstructured …, 2022 | | 2022 |
Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization M Mizutani, A Okuno, G Kim, H Shimodaira arXiv preprint arXiv:2005.00670, 2020 | | 2020 |