CORP-ORAL Spontaneous Speech Corpus

CLARIN to IMDI, DATE: 2009-08-13

The aim of the CORP-ORAL project is to build a corpus of spontaneous European Portuguese speech that can be used for training speech synthesis and recognition systems and for phonetic, phonological, lexical, morphological and syntactic studies. The corpus contains 60 hours of recorded conversations, each between two speakers of European Portuguese. The entire corpus will be provided with orthographic transcription and prosodic marking of speech breaks/boundaries, and a selection of chunks will also be transcribed phonetically. CORP-ORAL is built from scratch with the explicit goal of becoming entirely available on the internet to the scientific community and the general public.

ISO639-3:por Portuguese
Unknown Unknown Unknown
Portugal
Spoken Corpus
audio/x-wav, text/x-eaf+xml

<Id/>
<Contact/>
</Project>
<Publisher>Instituto de Linguística Teórica e Computacional</Publisher>
<Author/>
<Size/>
<DistributionForm/>
<Access>
<Availability>available on the internet</Availability>
<Date/>
<Owner/>
<Publisher/>
<Contact/>
</Access>
<Pricing/>
<ContactPerson>Fabíola Santos</ContactPerson>
<ReferenceLink>http://corpus1.mpi.nl/ds/imdi_browser/?openpath=MPI556279%23</ReferenceLink>
<MetadataLink>http://corpus1.mpi.nl/ds/imdi_browser/?openpath=MPI556279%23</MetadataLink>
<Keys>
<Key Name="NodeId">838</Key>
<Key Name="VersionId">1719</Key>
</Keys>
</Catalogue>
</METATRANSCRIPT>