Hybrid Time and Frequency Domain Audio Pitch Shifting

This is the companion page to the paper "A Hybrid Time and Frequency Domain Audio Pitch Shifting Algorithm" to be presented at the 125th Audio Engineering Society Convention. It contains various musical excerpts illustrating the discussed algorithms, including those used in the evaluation (audio metric and listening test).

Frequency domain phase vocoder versus hybrid (time and frequency domain) phase vocoder
These music excerpts have been used for the evaluation. Time stretching (using resampling and pitch shifting) was used for the audio metric, and pitch shifting for the listening test.
For a comparison including the "legacy" phase vocoder and the "rollers" implementation, look at this page.
Original music Pitch shifted (200%), improved phase vocoder (phase-locked) Pitch shifted (200%), proposed new hybrid implementation Time stretched (200%), improved phase vocoder (phase-locked) Time stretched (200%), proposed new hybrid implementation
Listen Listen Listen Listen Listen
Listen Listen Listen Listen Listen
Listen Listen Listen Listen Listen
Listen Listen Listen Listen Listen

Sources
The excerpts are taken from the following musics:
Music 1: The Chariots of Fire, by Vangelis, in Odyssey: The Definitive Collection
Music 2: La Danse Macabre, by Saint Saens, in Synthétiseurs, les plus grand thèmes classiques, vol. 5, adaptation: Ed. Starink
Music 3: Oxygene part IV, by Jean-Michel Jarre in Synthétiseurs, les plus grands thèmes, vol. 1, adaptation: Ed. Starink
Music 4: The Caves of Skops (Track 41), by Ubisoft, in Rayman 1

Further improvements
Illustrated with pitch shifting by one octave
Original Music
New Hybrid Implementation Multiresolution Version A previously proposed transient handling scheme (Transient are just not pitch shifted) Proposed transient handling, beta (Transients are pitch shifted, but with higher time resolution)
Listen Listen Listen Listen Listen
Listen Listen Listen Listen Listen

Sources
The excerpts are taken from the following musics:
Music 1: Cleanin' Out My Closet, by Eminem, in The Eminem Show
Music 2: Quiero Contarte, by Los Calchakis, in Prestige de la musique Latino-Américaine

Poster
Official poster presented at the convention: Poster.pdf.

Did you say "Audio Metrics"?
Unofficial leaflet presented at the convention: AudioMetrics.pdf.
Corresponding music excerpts:
Original music Algorithm 1 (Time-domain WSOLA) Algorithm 2 (Hybrid, multiresolution phase vocoder)
Listen Listen Listen

(Non) Related work
To clear up some confusion, I want to mention a paper from other authors that looks similar to this work:
[1] Audio Time-Scale Modification Using a Hybrid Time-Frequency Domain Approach
However, the two papers are unrelated. They present two different improvements of the phase vocoder.