Benchmarking deep learning models for classification of book covers

Y Jiang, L Zheng - Multimedia Tools and Applications, 2023 - Springer

In this paper, we propose a new multi-modal deep learning framework with a visual modality
and a textual modality for video game genre classification. The proposed framework consists …

Cited by 12 · Related articles · All 7 versions

[PDF] iiit.ac.in

Document image analysis using deep multi-modular features

KV Jobin, A Mondal, CV Jawahar - SN Computer Science, 2022 - Springer

Texture or repeating patterns, discriminative patches, and shapes are the salient features for
various document image analysis problems. This article proposes a deep network …

Cited by 5 · Related articles · All 5 versions

[PDF] researchgate.net

Cover-based multiple book genre recognition using an improved multimodal network

A Rasheed, AI Umar, SH Shirazi, Z Khan… - International Journal on …, 2023 - Springer

Despite the idiom not to prejudge something by its outward appearance, we consider deep
learning to learn whether we can judge a book by its cover or, more precisely, by its text and …

Cited by 3 · Related articles · All 4 versions

[PDF] arxiv.org

What Text Design Characterizes Book Genres?

D Haraguchi, BK Iwana, S Uchida - International Workshop on Document …, 2024 - Springer

This study analyzes the relationship between non-verbal information (e.g., genres) and text
design (e.g., font style, character color, etc.) through the classification of book genres using …

Towards book cover design via layout graphs

W Zhang, Y Zheng, T Miyazono, S Uchida… - Document Analysis and …, 2021 - Springer

Book covers are intentionally designed and provide an introduction to a book. However, they
typically require professional skills to design and produce the cover images. Thus, we …

Cited by 3 · Related articles · All 6 versions

[PDF] arxiv.org

Self-augmented multi-modal feature embedding

S Matsuo, S Uchida, BK Iwana - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Oftentimes, patterns can be represented through different modalities. For example, leaf data
can be in the form of images or contours. Handwritten characters can also be either online or …

Cited by 3 · Related articles · All 6 versions

First Describe, Then Depict: Generating Covers for Music and Books via Extracting Keywords: This paper presents two methods to generate high resolution …

V Efimova, V Shalamov, A Filchenkov - Proceedings of the 2022 5th …, 2022 - dl.acm.org

In this paper, we consider the two algorithms of generating artwork covers based on texts or
audio file features. The resulting image is combined from existing images labelled with …

[PDF] hal.science

Assessing and Efficiently Leveraging the Generalisation Abilities of Multimodal Models

R Bielawski - 2022 - theses.hal.science

As larger multimodal datasets are becoming available on the web, the possibility for better,
more human-like multimodal models grows. My research goal is to evaluate what …

[PDF] publikasi-adpiindonesia.id

Klasifikasi Judul Dan Tataletak Buku Dengan Algoritma Cnn Di Perpustakaan Unida Gontor

AR Adawiyah, D Muriyatmoko… - Prosiding …, 2024 - publikasi-adpiindonesia.id

This study aims to develop a classification model using the ResNet34 architecture
with the Convolutional Neural Network (CNN) algorithm for …

What Text Design Characterizes Book

D Haraguchi, BK Iwana - … : 16th IAPR International Workshop, DAS 2024 … - books.google.com

This study analyzes the relationship between non-verbal information (e.g., genres) and text
design (e.g., font style, character color, etc.) through the classification of book genres using …
