Speech and Noise Corpora for Pitch Estimation of Human Speech | Zenodo
There is a newer version of the record available.
Published June 29, 2020 | Version 1.0.0
Dataset
Open
Speech and Noise Corpora for Pitch Estimation of Human Speech
Authors/Creators
Bastian Bechtold (Jade Hochschule)
Description
This dataset packages common speech and noise corpora for evaluating fundamental frequency estimation algorithms as convenient JBOF dataframes. Each corpus is freely available on its own and permits redistribution:
CMU-ARCTIC (BSD license) [1]
FDA (free to download) [2]
KEELE (free for noncommercial use) [3]
MOCHA-TIMIT (free for noncommercial use) [4]
PTDB-TUG (ODBL license) [5]
NOISEX (free to download) [7]
QUT-NOISE (CC-BY-SA license) [8]
These files are published as part of my dissertation, "Pitch of Voiced Speech in the Short-Time Fourier Transform: Algorithms, Ground Truths, and Evaluation Methods", and in support of the Replication Dataset for Fundamental Frequency Estimation.
References:
[1] John Kominek and Alan W Black. CMU ARCTIC database for speech synthesis, 2003.
[2] Paul C Bagshaw, Steven Hiller, and Mervyn A Jack. Enhanced Pitch Tracking and the Processing of F0 Contours for Computer Aided Intonation Teaching. In EUROSPEECH, 1993.
[3] F Plante, Georg F Meyer, and William A Ainsworth. A Pitch Extraction Reference Database. In Fourth European Conference on Speech Communication and Technology, pages 837–840, Madrid, Spain, 1995.
[4] Alan Wrench. MOCHA MultiCHannel Articulatory database: English, November 1999.
[5] Gregor Pirker, Michael Wohlmayr, Stefan Petrik, and Franz Pernkopf. A Pitch Tracking Corpus with Evaluation on Multipitch Tracking Scenario. page 4, 2011.
[6] John S. Garofolo, Lori F. Lamel, William M. Fisher, Jonathan G. Fiscus, David S. Pallett, Nancy L. Dahlgren, and Victor Zue. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993.
[7] Andrew Varga and Herman J.M. Steeneken. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, 12(3):247–251, July 1993.
[8] David B. Dean, Sridha Sridharan, Robert J. Vogt, and Michael W. Mason. The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms. Proceedings of Interspeech 2010, 2010.
Files (15.5 GB)
Name | Checksum | Size
CMU_Arctic.zip | md5:9d394e2d698b4e010f91baf7e72bf527 | 1.4 GB
KEELE.zip | md5:f5a87014bad14744660b90187de7d43f | 22.8 MB
KEELE_mod.zip | md5:a84f097187a7051dacef2eb2bfbf8462 | 23.1 MB
MOCHA_TIMIT.zip | md5:854df9067051f5906fd6c761969e6415 | 1.4 GB
NOISEX92.zip | md5:b5b707b8ac7217713e3123360c21043c | 125.0 MB
PTDB_TUG.zip | md5:9bf8f0bc5b1f928fcefdc9c85d7ec74d | 4.4 GB
QUT_NOISE.zip | md5:3b963721d8ad7b8d231170b06e4ffb1e | 8.3 GB
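Since every archive is listed with an MD5 digest, a download can be checked against it before unpacking. The following is a minimal sketch of such a check, not part of the record itself: the function names `md5_of_file` and `verify_download` and the `EXPECTED_MD5` table are hypothetical helpers, and the sketch assumes the archives keep their original file names on disk.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing of this record.
EXPECTED_MD5 = {
    "CMU_Arctic.zip": "9d394e2d698b4e010f91baf7e72bf527",
    "KEELE.zip": "f5a87014bad14744660b90187de7d43f",
    "KEELE_mod.zip": "a84f097187a7051dacef2eb2bfbf8462",
    "MOCHA_TIMIT.zip": "854df9067051f5906fd6c761969e6415",
    "NOISEX92.zip": "b5b707b8ac7217713e3123360c21043c",
    "PTDB_TUG.zip": "9bf8f0bc5b1f928fcefdc9c85d7ec74d",
    "QUT_NOISE.zip": "3b963721d8ad7b8d231170b06e4ffb1e",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks.

    Chunked reading keeps memory use flat for multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Return True if the file's MD5 matches the digest listed for its name.

    Raises KeyError if the file name is not one of the listed archives.
    """
    path = Path(path)
    expected = EXPECTED_MD5.get(path.name)
    if expected is None:
        raise KeyError(f"no known checksum for {path.name}")
    return md5_of_file(path) == expected
```

A call such as `verify_download("KEELE.zip")` would return `True` only for an intact copy of that archive. MD5 is adequate here for detecting truncated or corrupted downloads, although it is not a defense against deliberate tampering.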
Statistics (all versions | this version)
Total views: 2,692 | 1,397
Total downloads: 1,226 | 685
Total data volume: 3.3 TB | 1.6 TB
Indexed in: OpenAIRE
Keywords: speech, noise
Details
DOI: 10.5281/zenodo.3920591 (https://doi.org/10.5281/zenodo.3920591)
Resource type: Dataset
Publisher: Zenodo
Languages: English
License: Other (Non-Commercial). No further description.
Technical metadata
Created: June 29, 2020
Modified: June 30, 2020