Distilling step-by-step! Outperforming larger language models with less training data and smaller model sizes CY Hsieh, CL Li, CK Yeh, H Nakhost, Y Fujii, A Ratner, R Krishna, CY Lee, ... arXiv preprint arXiv:2305.02301, 2023 | 242 | 2023 |
Towards unconstrained end-to-end text spotting S Qin, A Bissacco, M Raptis, Y Fujii, Y Xiao Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 136 | 2019 |
A scalable handwritten text recognition system RR Ingle, Y Fujii, T Deselaers, J Baccash, AC Popat 2019 International conference on document analysis and recognition (ICDAR …, 2019 | 116 | 2019 |
Towards end-to-end unified scene text detection and layout analysis S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 75 | 2022 |
FormNet: Structural encoding beyond sequential modeling in form document information extraction CY Lee, CL Li, T Dozat, V Perot, G Su, N Hua, J Ainslie, R Wang, Y Fujii, ... arXiv preprint arXiv:2203.08411, 2022 | 71 | 2022 |
Sequence-to-label script identification for multilingual OCR Y Fujii, K Driesen, J Baccash, A Hurst, AC Popat 2017 14th IAPR international conference on document analysis and recognition …, 2017 | 43 | 2017 |
A web-based OCR service for documents J Walker, Y Fujii, AC Popat Proceedings of the 13th IAPR international workshop on document analysis …, 2018 | 40 | 2018 |
Rethinking text line recognition models DH Diaz, S Qin, R Ingle, Y Fujii, A Bissacco arXiv preprint arXiv:2104.07787, 2021 | 39 | 2021 |
Tool documentation enables zero-shot tool-usage with large language models CY Hsieh, SA Chen, CL Li, Y Fujii, A Ratner, CY Lee, R Krishna, T Pfister arXiv preprint arXiv:2308.00675, 2023 | 33 | 2023 |
Publication date estimation for printed historical documents using convolutional neural networks Y Li, D Genzel, Y Fujii, AC Popat Proceedings of the 3rd international workshop on historical document imaging …, 2015 | 32 | 2015 |
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric S Nakagawa, K Iwami, Y Fujii, K Yamamoto Speech Communication 55 (3), 470-485, 2013 | 32 | 2013 |
Large vocabulary speech recognition system: SPOJUS++ Y Fujii, K Yamamoto, S Nakagawa Proc. International Conference MUSP, 110-118, 2011 | 31 | 2011 |
Class lecture summarization taking into account consecutiveness of important sentences. Y Fujii, K Yamamoto, N Kitaoka, S Nakagawa INTERSPEECH, 2438-2441, 2008 | 29 | 2008 |
Post-OCR paragraph recognition by graph convolutional networks R Wang, Y Fujii, AC Popat Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 26 | 2022 |
Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results K Iwami, Y Fujii, K Yamamoto, S Nakagawa 2010 IEEE Spoken Language Technology Workshop, 212-217, 2010 | 26 | 2010 |
ROPE: Reading order equivariant positional encoding for graph-based document information extraction CY Lee, CL Li, C Wang, R Wang, Y Fujii, S Qin, A Popat, T Pfister arXiv preprint arXiv:2106.10786, 2021 | 24 | 2021 |
Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization. Y Fujii, N Kitaoka, S Nakagawa INTERSPEECH, 2801-2804, 2007 | 21 | 2007 |
Automatic speech recognition using hidden conditional neural fields Y Fujii, K Yamamoto, S Nakagawa 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 17 | 2011 |
Efficient out-of-vocabulary term detection by n-gram array indices with distance from a syllable lattice K Iwami, Y Fujii, K Yamamoto, S Nakagawa 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 15 | 2011 |
HMM-based Script Identification for OCR D Genzel, AC Popat, R Teunen, Y Fujii Proceedings of the 4th International Workshop on Multilingual OCR, 1-5, 2013 | 14 | 2013 |