PnG Speech Recognition
The Pick Up & Go (PnG) recognizer is enabled when the Asr_Engine_EngineName parameter is set to PnG.
- This is speaker-independent speech recognition.
- Pick Up & Go recognition requires no user training, so users can start right away.
If a user has recognition problems with Pick Up & Go, they can train or retrain the affected words, or consider additional ways to Improve Speech Recognition.
PnG FEATURE LICENSING
An Enterprise Voice or VoiceConsole license is required to use Pick Up & Go in languages other than American English. If the device is not properly licensed, VoiceCatalyst uses the BlueStreak recognizer instead. Contact your customer service representative about licensing.
- If the A700x has Internet network access and a Cloud Licensing Service configuration file, the Cloud Licensing Server is used to license the device for Pick Up & Go.
- If the A700x has Internet network access but does not have a Cloud Licensing Service configuration file, VoiceConsole must be used to license the device for Pick Up & Go.
- If the A700x device does not have Internet network access, VoiceConsole must be used to license the device for Pick Up & Go.
The license indicates what features are being licensed, such as All Languages, Honeywell Voice for Manhattan Active, Pick Up & Go, etc.
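The three licensing cases above reduce to a single decision: the Cloud Licensing Server is used only when the device has both Internet access and a Cloud Licensing Service configuration file, and VoiceConsole is used otherwise. The following is a minimal sketch of that rule in Python. The function and enum names are hypothetical and are not part of VoiceCatalyst or VoiceConsole.

```python
from enum import Enum


class LicenseSource(Enum):
    """Where the A700x obtains its Pick Up & Go license (illustrative names)."""
    CLOUD_LICENSING_SERVER = "Cloud Licensing Server"
    VOICECONSOLE = "VoiceConsole"


def png_license_source(has_internet: bool, has_cloud_config: bool) -> LicenseSource:
    """Return the licensing path described by the three cases above.

    has_internet     -- the A700x has Internet network access
    has_cloud_config -- a Cloud Licensing Service configuration file is present
    """
    # Only the combination of Internet access AND a configuration file
    # selects the Cloud Licensing Server.
    if has_internet and has_cloud_config:
        return LicenseSource.CLOUD_LICENSING_SERVER
    # Every other combination falls back to VoiceConsole.
    return LicenseSource.VOICECONSOLE


if __name__ == "__main__":
    for internet in (True, False):
        for config in (True, False):
            source = png_license_source(internet, config)
            print(f"internet={internet}, config={config}: {source.value}")
```

The documentation does not describe a device with a configuration file but no Internet access. The sketch assumes this case follows the "no Internet access" rule and uses VoiceConsole.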
Availability
The Pick Up & Go recognizer is available only on the A700x and requires VoiceCatalyst 4.2 or greater for American English and 4.3 or greater for other supported languages.
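As an illustration of the availability rule, the sketch below checks a device model, a VoiceCatalyst version, and a language tag against the stated minimums. The function name and the representation of the version as a (major, minor) tuple are assumptions made for this example. The sketch does not verify that the language tag appears in the supported-languages table.

```python
from typing import Tuple


def png_available(device: str, voicecatalyst_version: Tuple[int, int], language_tag: str) -> bool:
    """Return True if the stated Pick Up & Go availability requirements are met.

    device                -- device model, e.g. "A700x"
    voicecatalyst_version -- (major, minor), e.g. (4, 3)
    language_tag          -- PnG language tag, e.g. "en_US"
    """
    # Pick Up & Go is available only on the A700x.
    if device != "A700x":
        return False
    # American English needs 4.2 or greater; other languages need 4.3 or greater.
    minimum = (4, 2) if language_tag == "en_US" else (4, 3)
    # Tuples compare element by element, so (4, 10) >= (4, 3) is True.
    return voicecatalyst_version >= minimum


if __name__ == "__main__":
    print(png_available("A700x", (4, 2), "en_US"))
    print(png_available("A700x", (4, 2), "fr_FR"))
```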
Languages Supported for Pick Up & Go
Language | PnG <languageTag> |
---|---|
American English | en_US |
Arabic (World Wide) | ar_WW |
Arabic (Gulf) | ar_AE |
Australian English | en_AU |
Belgium Dutch | nl_BE |
Brazilian Portuguese | pt_BR |
British English | en_GB |
Bulgarian | bg_BG |
Canadian French | fr_CA |
Chinese - Cantonese (Hong Kong) | zh_HK |
Chinese - Mandarin (China) | zh_CN |
Chinese - Mandarin (Taiwan) | zh_TW |
Chinese - Sichuanese (China) | zh_CN |
Czech | cs_CZ |
Danish | da_DK |
Dutch | nl_BE |
Finnish | fi_FI |
French | fr_FR |
German | de_DE |
Greek | el_GR |
Hebrew | he_IL |
Hungarian | hu_HU |
Indian English | en_IN |
Indonesian | id_ID |
Italian | it_IT |
Japanese | ja_JP |
Korean | ko_KR |
Latin American Spanish | es_MX |
Malay | ms_MY |
Netherlands Dutch | nl_NL |
Norwegian | no_NO |
Persian (Iran) | fa_IR |
Polish | pl_PL |
Portuguese | pt_PT |
Russian | ru_RU |
Slovak | sk_SK |
Spanish | es_ES |
Swedish | sv_SE |
Thai | th_TH |
Turkish | tr_TR |
Language Selection
PnG uses the same language as the text-to-speech voice that is loaded onto the device. It also uses the task or VAD’s phonetic substitutions for the loaded language.
Speaker-Independent Pronunciations
PnG expects the user to pronounce text the way a native speaker of the loaded language would pronounce it. For example, it would expect a different pronunciation for “male” when English is loaded than when Spanish is loaded.
When a phonetic substitution specifies a display string, the recognizer generates the pronunciation it listens for from that display string instead of from the actual vocabulary word. This supports language translations and applications such as VIO that map generic command tokens to real words. (The display string is used for both display and speech-in/recognition, while the “substitution” or pronunciation string is used for speech-out/text-to-speech.)
The recognizer generates pronunciations automatically. If a pronunciation does not work well enough for one or more users, it can be replaced or supplemented with additional pronunciations using the PnG_Recog_PronunciationRespellings_<languageTag>_<word> parameter.
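The parameter name is built from a fixed prefix, a language tag from the table above, and the word whose pronunciation is being adjusted. The sketch below shows only how the name is assembled. The helper name is hypothetical, and the word is inserted verbatim because this section does not state any case or escaping rules. The format of the parameter's value is also not described here, so it is not shown.

```python
# Fixed prefix taken from the parameter pattern
# PnG_Recog_PronunciationRespellings_<languageTag>_<word>.
RESPELLING_PREFIX = "PnG_Recog_PronunciationRespellings"


def respelling_parameter_name(language_tag: str, word: str) -> str:
    """Build the respelling parameter name for one word in one language.

    Assumption: language_tag and word are substituted as-is; any
    normalization the product applies is outside this sketch.
    """
    if not language_tag or not word:
        raise ValueError("language_tag and word must be non-empty")
    return f"{RESPELLING_PREFIX}_{language_tag}_{word}"


if __name__ == "__main__":
    # The word "male" is expected to be pronounced differently in
    # English and Spanish, so each language gets its own parameter.
    print(respelling_parameter_name("en_US", "male"))
    print(respelling_parameter_name("es_ES", "male"))
```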
Trainability/Speaker-dependent Pronunciations
VoiceCatalyst 4.5 or greater enables a user to train speaker-dependent templates to help resolve poor recognition. PnG does not require training; however, configuration parameter settings can force enrollment training. VoiceCatalyst 4.5 or greater also enables the user to initiate retraining of a word via the button menu (as was always available with BlueStreak).