Dreamstruct: Understanding slides and user interfaces via synthetic data generation

YH Peng, F Huq, Y Jiang, J Wu, XY Li… - … on Computer Vision, 2025 - Springer
Enabling machines to understand structured visuals like slides and user interfaces is
essential for making them accessible to people with disabilities. However, achieving such …

Surch: Enabling structural search and comparison for surgical videos

J Kim, D Choi, N Lee, M Beane, J Kim - … of the 2023 CHI Conference on …, 2023 - dl.acm.org
Video is an effective medium for learning procedural knowledge, such as surgical
techniques. However, learning procedural knowledge through videos remains difficult due to …

Semantic navigation of powerpoint-based lecture video for autonote generation

C Xu, W Jia, R Wang, X He, B Zhao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the increasing popularity of open educational resources in the past few decades, more
and more users watch online videos to gain knowledge. However, most educational videos …

Document image layout analysis via explicit edge embedding network

X Wu, Y Zheng, T Ma, H Ye, L He - Information Sciences, 2021 - Elsevier
Layout analysis from a document image plays an important role in document content
understanding and information extraction systems. While many existing methods focus on …

Line graphics digitization: A step towards full automation

O Moured, J Zhang, A Roitberg, T Schwarz… - … on Document Analysis …, 2023 - Springer
The digitization of documents allows for wider accessibility and reproducibility. While
automatic digitization of document layout and text content has been a long-standing focus of …

FitVid: Responsive and flexible video content adaptation

J Kim, Y Choi, M Kahng, J Kim - … of the 2022 CHI Conference on Human …, 2022 - dl.acm.org
Mobile video-based learning attracts many learners with its mobility and ease of access.
However, most lectures are designed for desktops. Our formative study reveals mobile …

Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides

KV Jobin, A Mishra… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Massive Open Online Courses (MOOCs) enable easy access to many educational
materials, particularly lecture slides, on the web. Searching through them based on user …

Wise—slide segmentation in the wild

M Haurilet, A Roitberg, M Martinez… - 2019 International …, 2019 - ieeexplore.ieee.org
We address the task of segmenting presentation slides, where the examined page was
captured as a live photo during lectures. Slides are important document types used as visual …

An online presentation slide assessment system using visual and semantic segmentation features

S Yi, J Matsugami, H Yumoto, T Yamasaki - Proceedings of the AAAI …, 2023 - ojs.aaai.org
In this study, we present a new presentation slide assessment system that can extract the
structural features from any slide file format. Our previous work used a neural network to …

Enhancing Speaking and Slide Design Skills with Deep Learning: An Online Presentation Assessment System

S Yi, J Matsugami, T Yamamoto… - Proceedings of the 32nd …, 2024 - dl.acm.org
Presentation skills, which involve the effective use of verbal and nonverbal cues, enable
audiences to better understand the content being presented. We develop a deep learning …