読みたいリスト
MT3: Multi-Task Multitrack Music Transcription
MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling
Masked Autoencoders that Listen
Film: Visual reasoning with a general conditioning layer
Attentional feature fusion
Arbitrary style transfer in real-time with adaptive instance normalization
Guided Image-to-Image Translation with Bi-Directional Feature Transformation
Hierarchical question-image co-attention for visual question answering
Robust Bayesian pitch tracking based on the harmonic model
Phoneme-to-audio alignment with recurrent neural networks for speaking and singing voice
The “Overdrive” mode in the “Complete Vocal Technique”: a preliminary study
Production Strategies of Vocal Attitudes
Investigating style evolution of Western classical music: A computational approach
P4KxSpotify: A Dataset of Pitchfork Music Reviews and Spotify Musical Features
EVOLUTION OF THE INFORMATIONAL COMPLEXITY OF CONTEMPORARY WESTERN MUSIC
Score-informed analysis of tuning, intonation, pitch modulation, and dynamics in jazz solos
Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss
Interactive Multi-Class Tiny-Object Detection
W-CTC: A CONNECTIONIST TEMPORAL CLASSIFICATION LOSS WITH WILD CARDS
Weighted Training for Cross-Task Learning
Deep Hough-Transform Line Priors
S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation
Analysis of Acoustic Features Affecting “Singing-ness” and Its Application to Singing-Voice Synthesis from Speaking-Voice
The Singing Tutor: Expression Categorization and Segmentation of the Singing Voice
Automatic Characterization of Dynamics and Articulation of Expressive Monophonic Recordings
Relationships Between Lyrics and Melody in Popular Music