Large-scale text-to-image generation models for visual artists' creative works

HK Ko, G Park, H Jeon, J Jo, J Kim, J Seo - Proceedings of the 28th …, 2023 - dl.acm.org
Large-scale Text-to-image Generation Models (LTGMs)(eg, DALL-E), self-supervised deep
learning models trained on a huge dataset, have demonstrated the capacity for generating …

Where are you heading? dynamic trajectory prediction with expert goal examples

H Zhao, RP Wildes - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Goal-conditioned approaches recently have been found very useful to human trajectory
prediction, when adequate goal estimates are provided. Yet, goal inference is difficult in …

FORTE: Few samples for recognizing hand gestures with a smartphone-attached radar

S Chioccarello, A Sluÿters, A Testolin… - Proceedings of the …, 2023 - dl.acm.org
Radar sensing technologies offer several advantages over other gesture input modalities,
such as the ability to reliably sense human movements, a reasonable deployment cost …

Wordgesture-GAN: modeling word-gesture movement with generative adversarial network

J Chu, D An, Y Ma, W Cui, S Zhai, XD Gu… - Proceedings of the 2023 …, 2023 - dl.acm.org
Word-gesture production models that can synthesize word-gestures are critical to the
training and evaluation of word-gesture keyboard decoders. We propose WordGesture …

Neural latent aligner: cross-trial alignment for learning representations of complex, naturalistic neural data

CJ Cho, E Chang… - … Conference on Machine …, 2023 - proceedings.mlr.press
Understanding the neural implementation of complex human behaviors is one of the major
goals in neuroscience. To this end, it is crucial to find a true representation of the neural …

EPIC: emotion perception by spatio-temporal interaction context of gait

H Lu, S Xu, S Zhao, X Hu, R Ma… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Recently, psychophysiological computing has received considerable attention. Due to easy
acquisition at a distance and less conscious initiation, gait-based emotion recognition is …

Soft dynamic time warping for multi-pitch estimation and beyond

M Krause, C Weiß, M Müller - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Many tasks in music information retrieval (MIR) involve weakly aligned data, where exact
temporal correspondences are unknown. The connectionist temporal classification (CTC) …

Differentiable simulation of a liquid argon time projection chamber

S Gasiorowski, Y Chen, Y Nashed… - Machine Learning …, 2024 - iopscience.iop.org
Liquid argon time projection chambers (LArTPCs) are widely used in particle detection for
their tracking and calorimetric capabilities. The particle physics community actively builds …

hvEEGNet: a novel deep learning model for high-fidelity EEG reconstruction

G Cisotto, A Zancanaro, IF Zoppis… - Frontiers in …, 2024 - frontiersin.org
Introduction Modeling multi-channel electroencephalographic (EEG) time-series is a
challenging tasks, even for the most recent deep learning approaches. Particularly, in this …

Effective 2D Stroke-based Gesture Augmentation for RNNs

M Maslych, EM Taranta, M Aldilati… - Proceedings of the 2023 …, 2023 - dl.acm.org
Recurrent neural networks (RNN) require large training datasets from which they learn new
class models. This limitation prohibits their use in custom gesture applications where only …