MAD: A scalable dataset for language grounding in videos from movie audio descriptions

M Soldan, A Pardo, JL Alcázar… - Proceedings of the …, 2022 - openaccess.thecvf.com
The recent and increasing interest in video-language research has driven the development
of large-scale datasets that enable data-intensive machine learning techniques. In …

Long-range multimodal pretraining for movie understanding

DM Argaw, JY Lee, M Woodson… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning computer vision models from (and for) movies has a long-standing history. While
great progress has been attained, there is still a need for a pretrained multimodal model that …

The anatomy of video editing: A dataset and benchmark suite for AI-assisted video editing

DM Argaw, FC Heilbron, JY Lee, M Woodson… - … on Computer Vision, 2022 - Springer
Machine learning is transforming the video editing industry. Recent advances in
computer vision have leveled-up video editing tasks such as intelligent reframing …

MovieCuts: A new dataset and benchmark for cut type recognition

A Pardo, FC Heilbron, JL Alcázar, A Thabet… - … on Computer Vision, 2022 - Springer
Understanding movies and their structural patterns is a crucial task in decoding the craft of
video editing. While previous works have developed tools for general analysis, such as …

AutoTransition: Learning to recommend video transition effects

Y Shen, L Zhang, K Xu, X Jin - European Conference on Computer Vision, 2022 - Springer
Video transition effects are widely used in video editing to connect shots for creating
cohesive and visually appealing videos. However, it is challenging for non-professionals to …

Dynamic storyboard generation in an engine-based virtual environment for video production

A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Born in 1930, storyboarding technique helps video creators design each shot, figure out
potential problems, and communicate ideas to save time and resources raised in practical …

Match cutting: Finding cuts with smooth visual transitions

B Chen, A Ziai, RS Tucker… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
A match cut is a transition between a pair of shots that uses similar framing, composition, or
action to fluidly bring the viewer from one scene to the next. Match cuts are frequently used …

Cinematographic camera diffusion model

H Jiang, X Wang, M Christie, L Liu… - Computer Graphics …, 2024 - Wiley Online Library
Designing effective camera trajectories in virtual 3D environments is a challenging task even
for experienced animators. Despite an elaborate film grammar, forged through years of …

SegTAD: Precise temporal action detection via semantic segmentation

C Zhao, M Ramazanova, M Xu, B Ghanem - European Conference on …, 2022 - Springer
Temporal action detection (TAD) is an important yet challenging task in video analysis. Most
existing works draw inspiration from image object detection and tend to reformulate it as a …

Temporal and contextual transformer for multi-camera editing of TV shows

A Rao, X Jiang, S Wang, Y Guo, Z Liu, B Dai… - arXiv preprint arXiv …, 2022 - arxiv.org
The ability to choose an appropriate camera view among multiple cameras plays a vital role
in TV shows delivery. But it is hard to figure out the statistical pattern and apply intelligent …