Phone recognition for spoken web search
dc.contributor.author | Barnard, Etienne | |
dc.contributor.author | van Heerden, Charl | |
dc.contributor.author | Kleynhans, Neil | |
dc.contributor.author | Bali, Kalika | |
dc.contributor.author | Davel, Marelie H. | |
dc.date.accessioned | 2018-03-07T07:28:55Z | |
dc.date.available | 2018-03-07T07:28:55Z | |
dc.date.issued | 2011 | |
dc.description.abstract | Aiming at both speaker independence and robustness with respect to recognition errors in the spoken queries, we have implemented a two-pass system for spoken web search. In the first pass, unconstrained phone recognition of both the query terms and the content audio is employed to represent these recordings as phone strings. A dynamic-programming approach then finds regions in the content phone strings that correspond closely to one or more query strings. In the second pass, each of these regions is again processed with a phone recognizer, but now a lattice is extracted; this lattice is compared against similar lattices extracted for each of the queries. We find our approach to be somewhat successful in identifying the query terms in both the development and evaluation sets, but not to generalize well between these sets. | en_US |
dc.description.sponsorship | Multilingual Speech Technologies, North-West University, Vanderbijlpark, South Africa; HLT Research Group, CSIR Meraka Institute, Pretoria, South Africa; Microsoft Research Lab India, Bangalore, India | en_US |
dc.identifier.citation | Etienne Barnard, Marelie Davel, Charl van Heerden, Neil Kleynhans and Kalika Bali, “Phone recognition for spoken web search”, in MediaEval Workshop, Pisa, Italy, 2011. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications] | en_US |
dc.identifier.uri | http://ceur-ws.org/Vol-807/Barnard_MUST_SWS_me11wn.pdf | |
dc.identifier.uri | http://hdl.handle.net/10394/26538 | |
dc.language.iso | en | en_US |
dc.publisher | MediaEval Workshop, Pisa, Italy | en_US |
dc.subject | Phone recognition | en_US |
dc.subject | Spoken Web Search | en_US |
dc.subject | Speaker independence | en_US |
dc.subject | Unconstrained phone recognition | en_US |
dc.subject | Natural Language Processing: Speech recognition and synthesis | en_US |
dc.subject | Spoken term detection | en_US |
dc.subject | Under-resourced languages | en_US |
dc.subject | Confidence measures | en_US |
dc.title | Phone recognition for spoken web search | en_US |
dc.type | Presentation | en_US |
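The first-pass search described in the abstract (finding regions of a content phone string that correspond closely to a query phone string) can be illustrated with a standard approximate substring match. The following is a minimal sketch, not the authors' implementation: the function name `best_match`, the unit insertion/deletion/substitution costs, and the example phone labels are all assumptions made for illustration, since the abstract does not specify the cost scheme used.

```python
from typing import List, Tuple


def best_match(query: List[str], content: List[str]) -> Tuple[int, int, int]:
    """Return (cost, start, end) such that content[start:end] is a region
    with minimal edit distance to query.

    Assumption: unit costs for substitution, insertion, and deletion.
    """
    n = len(content)
    # prev[j] = (cost, start) of the best alignment of query[:i]
    # against a content region that ends at position j.
    # An empty query matches at any position with zero cost.
    prev = [(0, j) for j in range(n + 1)]
    for i in range(1, len(query) + 1):
        # Aligning query[:i] against an empty region costs i deletions.
        curr = [(i, 0)]
        for j in range(1, n + 1):
            sub = 0 if query[i - 1] == content[j - 1] else 1
            curr.append(min(
                (prev[j - 1][0] + sub, prev[j - 1][1]),  # match or substitution
                (prev[j][0] + 1, prev[j][1]),            # query phone absent from content
                (curr[j - 1][0] + 1, curr[j - 1][1]),    # extra phone in content
            ))
        prev = curr
    # The match may end anywhere in the content; take the cheapest end point.
    end = min(range(n + 1), key=lambda j: prev[j][0])
    cost, start = prev[end]
    return cost, start, end


# Hypothetical phone strings, for illustration only.
content = "dh ax k ae t s ae t".split()
query = "k ae t".split()
print(best_match(query, content))
```

Because the first row of the table is initialised to zero cost at every position, the match is free to start anywhere in the content. In a two-pass design of the kind the abstract describes, regions whose cost falls below some threshold would be passed on to the more expensive lattice comparison.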
Files
Original bundle
- Name:
- barnard-2011-phone-recognition.pdf
- Size:
- 58.66 KB
- Format:
- Adobe Portable Document Format
- Description:
- barnard-2011-phone-recognition
License bundle
- Name:
- license.txt
- Size:
- 1.61 KB
- Format:
- Item-specific license agreed upon to submission
- Description: