25-10-2012 дата публикации
Номер: US20120271617A1
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for initializing language models for automatic speech recognition. In one aspect, a method includes receiving logged speech recognition results from an existing corpus that is specific to a given language and a target context, generating a target corpus by machine-translating the logged speech recognition results from the given language to a different, target language, and estimating a language model that is specific to the different, target language and the same, target context, using the target corpus. 1. A computer-implemented method performed by at least one processor , the method comprising:receiving logged speech recognition results from an existing corpus that is specific to a given language and a target context; machine-translating the logged speech recognition results from the given language to a different, target language; and', 'augmenting an existing, partial target corpus specific for the different, target language and the target context with the machine-translated logged speech recognition results; and, 'generating a target corpus byestimating a language model that is specific to the different, target language and the same, target context, using the target corpus.2. The method of claim 1 , wherein estimating the language model comprises counting each occurrence of each distinctive word or phrase in the target corpus.3. The method of claim 2 , wherein estimating the language model comprises determining a relative frequency of occurrence of each distinctive word or phrase in the target corpus claim 2 , from among all distinctive words or phrases in the target corpus.4. The method of claim 1 , wherein the target context is associated with a particular application or application state claim 1 , operating system claim 1 , geographic location or region claim 1 , or environmental or ambient characteristic.5. The method of claim 1 , wherein the target context is ...
Подробнее