Speech and Noise Corpora for Pitch Estimation of Human Speech


There is a newer version of the record available.

Published June 29, 2020 | Version 1.0.0

Dataset

Open


Authors/Creators

Bastian Bechtold (Jade Hochschule)

Description

This dataset packages common speech and noise corpora for evaluating fundamental frequency estimation algorithms as convenient JBOF dataframes. Each corpus is freely available on its own and permits redistribution:

CMU-ARCTIC (BSD license) [1]

FDA (free to download) [2]

KEELE (free for noncommercial use) [3]

MOCHA-TIMIT (free for noncommercial use) [4]

PTDB-TUG (ODBL license) [5]

NOISEX (free to download) [7]

QUT-NOISE (CC-BY-SA license) [8]

These files are published as part of my dissertation, "Pitch of Voiced Speech in the Short-Time Fourier Transform: Algorithms, Ground Truths, and Evaluation Methods", and in support of the Replication Dataset for Fundamental Frequency Estimation.

References:

[1] John Kominek and Alan W Black. CMU ARCTIC database for speech synthesis, 2003.

[2] Paul C Bagshaw, Steven Hiller, and Mervyn A Jack. Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching. In EUROSPEECH, 1993.

[3] F Plante, Georg F Meyer, and William A Ainsworth. A Pitch Extraction Reference Database. In Fourth European Conference on Speech Communication and Technology, pages 837–840, Madrid, Spain, 1995.

[4] Alan Wrench. MOCHA MultiCHannel Articulatory database: English, November 1999.

[5] Gregor Pirker, Michael Wohlmayr, Stefan Petrik, and Franz Pernkopf. A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. page 4, 2011.

[6] John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993.

[7] Andrew Varga and Herman J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3):247–251, July 1993.

[8] David B. Dean, Sridha Sridharan, Robert J. Vogt, and Michael W. Mason. The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. Proceedings of Interspeech 2010, 2010.

Files (15.5 GB)

Name, size, MD5 checksum:

CMU_Arctic.zip, 1.4 GB, md5:9d394e2d698b4e010f91baf7e72bf527

KEELE.zip, 22.8 MB, md5:f5a87014bad14744660b90187de7d43f

KEELE_mod.zip, 23.1 MB, md5:a84f097187a7051dacef2eb2bfbf8462

MOCHA_TIMIT.zip, 1.4 GB, md5:854df9067051f5906fd6c761969e6415

NOISEX92.zip, 125.0 MB, md5:b5b707b8ac7217713e3123360c21043c

PTDB_TUG.zip, 4.4 GB, md5:9bf8f0bc5b1f928fcefdc9c85d7ec74d

QUT_NOISE.zip, 8.3 GB, md5:3b963721d8ad7b8d231170b06e4ffb1e
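Because several of these archives are multiple gigabytes, it can be worth checking a download against the MD5 checksum listed for it. The following is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `verify_downloads`, and the idea that all archives sit in a single local directory, are assumptions for illustration and not part of the record.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "CMU_Arctic.zip": "9d394e2d698b4e010f91baf7e72bf527",
    "KEELE.zip": "f5a87014bad14744660b90187de7d43f",
    "KEELE_mod.zip": "a84f097187a7051dacef2eb2bfbf8462",
    "MOCHA_TIMIT.zip": "854df9067051f5906fd6c761969e6415",
    "NOISEX92.zip": "b5b707b8ac7217713e3123360c21043c",
    "PTDB_TUG.zip": "9bf8f0bc5b1f928fcefdc9c85d7ec74d",
    "QUT_NOISE.zip": "3b963721d8ad7b8d231170b06e4ffb1e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory):
    """Check each listed archive in `directory` (hypothetical local folder).

    Returns a dict mapping file name to True (checksum matches),
    False (checksum differs), or None (file not present).
    """
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = (md5_of_file(path) == expected)
        else:
            results[name] = None
    return results
```

For example, `verify_downloads("downloads")` would report the status of every archive found in a local folder named `downloads` (a placeholder name). A result of `False` suggests a corrupted or incomplete download.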

Statistics (all versions / this version)

Views: 2,692 / 1,397

Downloads: 1,226 / 685

Data volume: 3.3 TB / 1.6 TB

Indexed in: OpenAIRE

Keywords: speech, noise

Details

DOI: 10.5281/zenodo.3920591 (https://doi.org/10.5281/zenodo.3920591)

Resource type: Dataset

Publisher: Zenodo

Languages: English

License: Other (Non-Commercial). No further description.


Technical metadata

Created: June 29, 2020

Modified: June 30, 2020
