TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content A Anand, R Jaiswal, P Bhuyan, M Gupta, S Bangar, MM Imam, RR Shah, ... Proceedings of the 1st International Workshop on Deep Multimodal Learning …, 2023 | 5 | 2023 |
GeoVQA: A comprehensive multimodal geometry dataset for secondary education A Anand, R Jaiswal, A Dharmadhikari, A Marathe, H Popat, H Mital, ... 2024 IEEE 7th International Conference on Multimedia Information Processing …, 2024 | 4 | 2024 |
Unveiling Learner Dynamics: The ECLIPSE Dataset and NeuralGaze Framework for Prolonged Engagement Assessment in Online Learning A Anand, A Mittal, L Dhawan, M Ramesh, J Krishnamurthy, N Lal, ... collections 25, 34, 2024 | 1 | 2024 |
Improving Multimodal LLMs Ability In Geometry Problem Solving, Reasoning, And Multistep Scoring A Anand, R Jaiswal, A Dharmadhikari, A Marathe, HP Popat, H Mital, ... arXiv preprint arXiv:2412.00846, 2024 | | 2024 |
Improving Physics Reasoning in Large Language Models Using Mixture of Refinement Agents R Jaiswal, D Jain, HP Popat, A Anand, A Dharmadhikari, A Marathe, ... arXiv preprint arXiv:2412.00821, 2024 | | 2024 |
RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization A Anand, R Jaiswal, M Gupta, SS Bangar, P Bhuyan, N Lal, R Singh, ... Proceedings of the 5th ACM International Conference on Multimedia in Asia, 1-6, 2023 | | 2023 |