|
Greg
Kochanski |
|
New Tools and Methods for Very-Large-Scale Phonetics
Research
These papers were presented at the "New
Tools and Methods for Very-Large-Scale Phonetics Research"
workshop at the University of Pennsylvania, January 28-31, 2011.
Archival copies may be found (via searching) at http://ora.ouls.ox.ac.uk .
- This
paper, by Greg P. Kochanski, Chilin Shih, Ryan Shosted,
"Should Corpora be Big, Rich, or Dense?", discusses how one can
compute whether or not an ASR system has done a good job of
aligning your text to your audio.
- This
paper, by Ladan Baghai-Ravary, Sergio Grau, and Greg
Kochanski, "Detecting gross alignment errors in the Spoken
British National Corpus", discusses how one can compute whether
or not an ASR system has done a good job of aligning your text
to your audio. It is also available at http://arxiv.org . It was presented at
the workshop as a
poster .
- This
paper, by John Coleman, Mark Liberman, Greg Kochanski, Lou
Burnard and Jaahong Yuan, "Mining a Year of Speech", describes
the Spoken British National Corpus, its uses, techniques used,
and lessons learned in a project to align a large speech corpus
with its transcription.
Our papers at this workshop were funded by: