To-Read List

MT3: Multi-Task Multitrack Music Transcription

MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling

Masked Autoencoders that Listen

FiLM: Visual Reasoning with a General Conditioning Layer

Attentional feature fusion

Arbitrary style transfer in real-time with adaptive instance normalization

Guided Image-to-Image Translation with Bi-Directional Feature Transformation

Hierarchical question-image co-attention for visual question answering

Robust Bayesian pitch tracking based on the harmonic model

Phoneme-to-audio alignment with recurrent neural networks for speaking and singing voice

The “Overdrive” mode in the “Complete Vocal Technique”: a preliminary study

Production Strategies of Vocal Attitudes

Investigating style evolution of Western classical music: A computational approach

P4KxSpotify: A Dataset of Pitchfork Music Reviews and Spotify Musical Features

Evolution of the Informational Complexity of Contemporary Western Music

Score-informed analysis of tuning, intonation, pitch modulation, and dynamics in jazz solos

Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

Interactive Multi-Class Tiny-Object Detection

W-CTC: A Connectionist Temporal Classification Loss with Wild Cards

Weighted Training for Cross-Task Learning

Deep Hough-Transform Line Priors

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

Analysis of Acoustic Features Affecting “Singing-ness” and Its Application to Singing-Voice Synthesis from Speaking-Voice

The Singing Tutor: Expression Categorization and Segmentation of the Singing Voice

Automatic Characterization of Dynamics and Articulation of Expressive Monophonic Recordings

Relationships Between Lyrics and Melody in Popular Music