Deep learning for video game genre classification

Y Jiang, L Zheng - Multimedia Tools and Applications, 2023 - Springer
In this paper, we propose a new multi-modal deep learning framework with a visual modality
and a textual modality for video game genre classification. The proposed framework consists …

Document image analysis using deep multi-modular features

KV Jobin, A Mondal, CV Jawahar - SN Computer Science, 2022 - Springer
Texture or repeating patterns, discriminative patches, and shapes are the salient features for
various document image analysis problems. This article proposes a deep network …

Cover-based multiple book genre recognition using an improved multimodal network

A Rasheed, AI Umar, SH Shirazi, Z Khan… - International Journal on …, 2023 - Springer
Despite the idiom not to prejudge something by its outward appearance, we consider deep
learning to learn whether we can judge a book by its cover or, more precisely, by its text and …

What Text Design Characterizes Book Genres?

D Haraguchi, BK Iwana, S Uchida - International Workshop on Document …, 2024 - Springer
This study analyzes the relationship between non-verbal information (eg, genres) and text
design (eg, font style, character color, etc.) through the classification of book genres using …

Towards book cover design via layout graphs

W Zhang, Y Zheng, T Miyazono, S Uchida… - Document Analysis and …, 2021 - Springer
Book covers are intentionally designed and provide an introduction to a book. However, they
typically require professional skills to design and produce the cover images. Thus, we …

Self-augmented multi-modal feature embedding

S Matsuo, S Uchida, BK Iwana - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Oftentimes, patterns can be represented through different modalities. For example, leaf data
can be in the form of images or contours. Handwritten characters can also be either online or …

First Describe, Then Depict: Generating Covers for Music and Books via Extracting Keywords: This paper presents two methods to generate high resolution …

V Efimova, V Shalamov, A Filchenkov - Proceedings of the 2022 5th …, 2022 - dl.acm.org
In this paper, we consider the two algorithms of generating artwork covers based on texts or
audio file features. The resulting image is combined from existing images labelled with …

Assessing and Efficiently Leveraging the Generalisation Abilities of Multimodal Models

R Bielawski - 2022 - theses.hal.science
As larger multimodal datasets are becoming available on the web, the possibility for better,
more human-like multimodal models grows. My research goal is to evaluate what …

Klasifikasi Judul Dan Tataletak Buku Dengan Algoritma Cnn Di Perpustakaan Unida Gontor

AR Adawiyah, D Muriyatmoko… - Prosiding …, 2024 - publikasi-adpiindonesia.id
Penelitian ini bertujuan untuk mengembangkan sebuah model klasifikasi menggunakan
arsitektur ResNet34 dengan algoritma Convolutional Neural Network (CNN) untuk …

What Text Design Characterizes Book

D Haraguchi, BK Iwana - … : 16th IAPR International Workshop, DAS 2024 … - books.google.com
This study analyzes the relationship between non-verbal information (eg, genres) and text
design (eg, font style, character color, etc.) through the classification of book genres using …