OFAI-TR-2000-25

Extraction of Musical Performance Parameters from Audio Data

Simon Dixon

We present a system for the automatic extraction of musical content from audio signals containing polyphonic music. The system works off-line, taking data from audio files and producing MIDI output that represents the pitch, timing and volume of the musical notes. The initial signal processing stage is based on a short-time Fourier transform (STFT) enhanced by a tracking phase vocoder, which interprets stable frequency components as partials of musical notes. Heuristic methods combine these partials, using a generic instrument model, to produce note estimates. The system is tested on a large corpus of professionally performed music from the standard classical piano repertoire.
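
As a rough illustration of the first step only, the sketch below computes the magnitude spectrum of a single Hann-windowed frame (one column of an STFT), picks local spectral peaks as candidate partials, and maps a frequency to the nearest MIDI note number. It is not the system described here: the phase-vocoder tracking across frames, the instrument model and the note-combining heuristics are all omitted, and the function names, the naive DFT and the fixed peak threshold are assumptions chosen for brevity.

```python
import cmath
import math


def frame_spectrum(frame):
    """Magnitude spectrum of one Hann-windowed frame (a single STFT column).

    Uses a naive O(n^2) DFT so the sketch needs only the standard library;
    a real system would use an FFT.
    """
    n = len(frame)
    # Periodic Hann window to reduce spectral leakage.
    windowed = [x * 0.5 * (1.0 - math.cos(2.0 * math.pi * i / n))
                for i, x in enumerate(frame)]
    # Keep only the bins below the Nyquist frequency.
    return [abs(sum(w * cmath.exp(-2j * math.pi * k * i / n)
                    for i, w in enumerate(windowed)))
            for k in range(n // 2)]


def spectral_peaks(mags, sample_rate, frame_len, threshold):
    """Frequencies (Hz) of local maxima above a threshold (hypothetical value).

    Each peak is a candidate partial; a tracking stage (not shown) would
    keep only the peaks that stay stable over successive frames.
    """
    return [k * sample_rate / frame_len
            for k in range(1, len(mags) - 1)
            if mags[k] > threshold
            and mags[k] > mags[k - 1]
            and mags[k] >= mags[k + 1]]


def freq_to_midi(freq_hz):
    """Nearest MIDI note number, with A4 = 440 Hz = note 69 (equal temperament)."""
    return round(69 + 12 * math.log2(freq_hz / 440.0))
```

For a 400-sample frame of a 440 Hz sine sampled at 8000 Hz, with a threshold of 10.0, `spectral_peaks` reports a single peak at 440.0 Hz, which `freq_to_midi` maps to note 69.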

Keywords: Automatic transcription, Audio content analysis

Citation: Proceedings of the First IEEE Pacific-Rim Conference on Multimedia (PCM 2000), Sydney, Australia, December 2000