ICASSP 2026
The 2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2026) is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. Its technical program presents the latest developments in research and technology in the industry and attracts thousands of professionals annually.
Sony is proud to be a Platinum sponsor of ICASSP 2026. We look forward to this year's exciting exhibition opportunities, featuring a variety of ways to connect with participants in person. Sony will exhibit and participate.
Contents
- Sony's Technical Programs at ICASSP 2026
- Exhibition Booth
- Follow us on LinkedIn!
- Sony Women in Technology Award with Nature
- Career Information
Sony's Technical Programs at ICASSP 2026
Oral
Automatic Music Mixing Using a Generative Model of Effect Embeddings
- Date/Time: May 7th (Thu), 16:30 - 16:50 (CEST)
- Place: Room 127+128
< Link >
- [Oral] https://www.cmsworkshops.com/ICASSP2026/PaperNum=16357
- [Paper] https://arxiv.org/abs/2511.08040
Automatic Music Sample Identification with Multi-Track Contrastive Learning
- Date/Time: May 7th (Thu), 16:50 - 17:10 (CEST)
- Place: Room 127+128
< Link >
- [Oral] https://www.cmsworkshops.com/ICASSP2026/PaperNum=14887
- [Paper] https://arxiv.org/abs/2510.11507
FoleyBench: A Benchmark For Video-to-Audio Models
- Date/Time: May 6th (Wed), 17:30 - 17:50 (CEST)
- Place: Room 127+128
< Link >
- [Oral] https://www.cmsworkshops.com/ICASSP2026/PaperNum=17851
- [Paper] https://arxiv.org/abs/2511.13219
Noise-to-Notes: Diffusion-Based Generation and Refinement for Automatic Drum Transcription
- Date/Time: May 8th (Fri), 09:40 - 10:00 (CEST)
- Place: Room 127+128
< Link >
- [Oral] https://www.cmsworkshops.com/ICASSP2026/PaperNum=10215
- [Paper] https://arxiv.org/abs/2509.21739
S-PRESSO: Ultra Low Bitrate Sound Effect Compression With Diffusion Autoencoders And Offline Quantization
- Date/Time: May 6th (Wed), 09:00 - 09:20 (CEST)
- Place: Room 127+128
< Link >
- [Oral] https://www.cmsworkshops.com/ICASSP2026/PaperNum=150808
- [Paper] https://arxiv.org/abs/2602.15082
Posters
Can Hierarchical Cross-Modal Fusion Predict Human Perception of AI Dubbed Content?
- Date/Time: May 7th (Thu), 09:00 - 11:00 (CEST)
- Place: Poster Area 25
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=16934
- [Paper] https://arxiv.org/abs/2603.28717
FlashFoley: Fast Interactive Sketch2Audio Generation
- Date/Time: May 7th (Thu), 14:00 - 16:00 (CEST)
- Place: Poster Area 25
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=14368
- [Paper] https://openreview.net/pdf?id=dxwsVO0W47
Leveraging Whisper Embeddings for Audio-based Lyrics Matching
- Date/Time: May 7th (Thu), 16:30 - 18:30 (CEST)
- Place: Poster Area 25
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=13868
- [Paper] https://arxiv.org/abs/2510.08176
SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation
- Date/Time: May 5th (Tue), 14:00 - 16:00 (CEST)
- Place: Poster Area 21
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=10607
- [Paper] https://arxiv.org/abs/2412.13462
Summary of The Inaugural Music Source Restoration Challenge
- Date/Time: May 7th (Thu), 16:30 - 18:30 (CEST)
- Place: Poster Area 43
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=19113
- [Paper] https://arxiv.org/abs/2601.04343
Towards Blind Data Cleaning: A Case Study in Music Source Separation
- Date/Time: May 5th (Tue), 14:00 - 16:00 (CEST)
- Place: Poster Area 27
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=11809
- [Paper] https://arxiv.org/abs/2510.15409
Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-Resource Speech Recognition
- Date/Time: May 7th (Thu), 09:00 - 11:00 (CEST)
- Place: Poster Area 27
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=12306
- [Paper] https://arxiv.org/abs/2602.09043
Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis
- Date/Time: May 6th (Wed), 14:00 - 16:00 (CEST)
- Place: Poster Area 25
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=17036
- [Paper] https://ai.sony/blog/Advancing-AI-Highlights-from-March-2026
Do Foundational Audio Encoders Understand Music Structure?
- Date/Time: May 6th (Wed), 16:30 - 18:30 (CEST)
- Place: Poster Area 26
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=10100
- [Paper] https://arxiv.org/abs/2512.17209
MMAudioSep: Taming Video-to-Audio Generative Model Towards Video/Text-Queried Sound Separation
- Date/Time: May 7th (Thu), 16:30 - 18:30 (CEST)
- Place: Poster Area 24
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=12520
- [Paper] https://arxiv.org/abs/2510.09065
Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means
- Date/Time: May 6th (Wed), 14:00 - 16:00 (CEST)
- Place: Poster Area 31
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=16033
- [Paper] https://arxiv.org/abs/2601.19781
Diffusion Timbre Transfer Via Mutual Information Guided Inpainting
- Date/Time: May 5th (Tue), 16:30 - 18:30 (CEST)
- Place: Poster Area 26
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=14117
- [Paper] https://arxiv.org/abs/2601.01294
VioPTT: Violin Technique-Aware Transcription from Synthetic Data Augmentation
- Date/Time: May 7th (Thu), 16:30 - 18:30 (CEST)
- Place: Poster Area 25
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=14491
- [Paper] https://arxiv.org/abs/2509.23759
Deployment Strategy for Indoor Distributed MIMO System
- Date/Time: May 5th (Tue), 14:00 - 16:00 (CEST)
- Place: Poster Area 3
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=19174
- [Paper] https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11355794
AHAI: Adaptive Hybrid-Attention Inference for Diffusion-Based Arbitrary Style Transfer
- Date/Time: May 8th (Fri), 14:00 - 16:00 (CEST)
- Place: Poster Area 16
< Link >
- [Poster] https://www.cmsworkshops.com/ICASSP2026/PaperNum=12581
- [Paper] https://repository.lboro.ac.uk/articles/
Workshop
WS-P5: Speech, Music and Mind 2026 (SMM26)
- Place: Room 121
- Speaker: Yuki Mitsufuji (Sony AI/Sony Group Corporation)
Exhibition Booth
Open Hours
Visit our booth at the exhibition to explore our latest technology firsthand and engage with our team.
Location: P1
Sony Booth Open Hours:
- May 5th (Tue) 09:00 – 17:00 (CEST)
- May 6th (Wed) 09:00 – 17:00 (CEST)
- May 7th (Thu) 09:00 – 17:00 (CEST)
- May 8th (Fri) 09:00 – 17:00 (CEST)
Follow us on LinkedIn!
Sony Women in Technology Award with Nature
The Sony Women in Technology Award with Nature recognizes exceptional early- to mid-career researchers who are advancing technology. We invite eligible ICASSP attendees working in academia, at a research institution, or at a university spinout to apply. Three winners will receive:
- US $250,000 to advance their research endeavors
- Global visibility through Nature Portfolio’s platforms
- Networking opportunities with distinguished technologists and visionary researchers
The application deadline is rapidly approaching!
Learn more and apply by Friday, June 5th, 2026: https://womenintechnology.sony.com/
Career Information
We look forward to working with highly motivated individuals to fill the world with emotion and to pioneer future innovation through dreams and curiosity. If you are interested, please visit our career site or stop by the Sony booth (Booth P1) in the ICASSP exhibition area to learn more about Sony Group.
Career Site Link: https://www.sony.com/en/SonyInfo/Careers/
Sony Group Technology Portal
You can explore our technology on the Sony Group Technology Portal.

