PnG Speech Recognition

The Pick Up & Go (PnG) recognizer is enabled when the Asr_Engine_EngineName parameter is set to PnG.

  • This is speaker-independent speech recognition.
  • Pick Up & Go recognition requires no user training, so users can start right away.

If a user has recognition problems with Pick Up & Go, they should retrain the affected words or see Improve Speech Recognition for additional options.

PnG Feature Licensing

An Enterprise Voice or VoiceConsole license is required to use Pick Up & Go in languages other than American English. If Pick Up & Go is not properly licensed, VoiceCatalyst uses the BlueStreak recognizer instead. Contact your customer service representative about licensing.

  • If the A700x has Internet network access and a Cloud Licensing Service configuration file, the Cloud Licensing Server is used to license the device for Pick Up & Go.
  • If the A700x has Internet network access but does not have a Cloud Licensing Service configuration file, VoiceConsole must be used to license the device for Pick Up & Go.
  • If the A700x device does not have Internet network access, VoiceConsole must be used to license the device for Pick Up & Go.
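The three cases above reduce to a single decision: the Cloud Licensing Server is used only when the device has both Internet access and a Cloud Licensing Service configuration file. The following Python sketch only illustrates that rule. The function and argument names are hypothetical and are not part of VoiceCatalyst or VoiceConsole.

```python
def licensing_source(has_internet: bool, has_cloud_config: bool) -> str:
    """Return which service licenses Pick Up & Go on an A700x.

    Hypothetical helper that restates the three documented cases;
    it does not call any real licensing API.
    """
    # Cloud licensing needs both Internet access and the
    # Cloud Licensing Service configuration file.
    if has_internet and has_cloud_config:
        return "Cloud Licensing Server"
    # Every other combination falls back to VoiceConsole.
    return "VoiceConsole"
```

For example, `licensing_source(True, False)` returns `"VoiceConsole"`, which matches the second case in the list.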

The license indicates what features are being licensed, such as All Languages, Honeywell Voice for Manhattan Active, Pick Up & Go, etc.

Availability

The Pick Up & Go recognizer is available only on the A700x and requires VoiceCatalyst 4.2 or greater for American English and 4.3 or greater for other supported languages.
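As a rough illustration of the availability rule, the sketch below checks a device and VoiceCatalyst version against the minimums stated above. The function names are hypothetical, version strings are assumed to be dot-separated integers such as "4.3" or "4.3.1", and the check does not verify that the language is in the supported list or that the device is licensed.

```python
def parse_version(text: str) -> tuple:
    """Turn a dotted version string such as '4.3.1' into (4, 3, 1)."""
    return tuple(int(part) for part in text.split("."))


def png_available(is_a700x: bool, voicecatalyst_version: str, language_tag: str) -> bool:
    """Hypothetical check of the documented availability rule.

    Assumption: only the major and minor version numbers matter.
    """
    if not is_a700x:
        return False  # Pick Up & Go is available only on the A700x
    # American English needs 4.2 or greater; other languages need 4.3 or greater.
    minimum = (4, 2) if language_tag == "en_US" else (4, 3)
    return parse_version(voicecatalyst_version)[:2] >= minimum
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as treating "4.10" as older than "4.3".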

Languages Supported for Pick Up & Go

Language                          PnG <languageTag>
American English                  en_US
Arabic (World Wide)               ar_WW
Arabic (Gulf)                     ar_AE
Australian English                en_AU
Belgium Dutch                     nl_BE
Brazilian Portuguese              pt_BR
British English                   en_GB
Bulgarian                         bg_BG
Canadian French                   fr_CA
Chinese - Cantonese (Hong Kong)   zh_HK
Chinese - Mandarin (China)        zh_CN
Chinese - Mandarin (Taiwan)       zh_TW
Chinese - Sichuanese (China)      zh_CN
Czech                             cs_CZ
Danish                            da_DK
Dutch                             nl_BE
Finnish                           fi_FI
French                            fr_FR
German                            de_DE
Greek                             el_GR
Hebrew                            he_IL
Hungarian                         hu_HU
Indian English                    en_IN
Indonesian                        id_ID
Italian                           it_IT
Japanese                          ja_JP
Korean                            ko_KR
Latin American Spanish            es_MX
Malay                             ms_MY
Netherlands Dutch                 nl_NL
Norwegian                         no_NO
Persian (Iran)                    fa_IR
Polish                            pl_PL
Portuguese                        pt_PT
Russian                           ru_RU
Slovak                            sk_SK
Spanish                           es_ES
Swedish                           sv_SE
Thai                              th_TH
Turkish                           tr_TR

Language Selection

PnG uses the same language as the text-to-speech voice that is loaded onto the device. It also uses the task or VAD’s phonetic substitutions for the loaded language.

Speaker-Independent Pronunciations

PnG expects the user to pronounce text the way a native speaker of the loaded language would pronounce it. For example, it would expect a different pronunciation for “male” when English is loaded than when Spanish is loaded.

When a phonetic substitution is specified, the recognizer generates the pronunciation it listens for from the substitution's display string instead of from the actual vocabulary word. This supports language translations and applications such as VIO that map generic command tokens to real words. (The display string is used for both display and speech input/recognition, while the “substitution” or pronunciation string is used for speech output/text-to-speech.)

The recognizer generates pronunciations automatically. If a pronunciation does not work well enough for one or more users, it can be replaced or supplemented with additional pronunciations using the PnG_Recog_PronunciationRespellings_<languageTag>_<word> parameter.
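The parameter name embeds both the language tag and the word, so each respelling applies to one word in one language. The Python sketch below only assembles that name from the documented pattern. The helper name is hypothetical, the tag check assumes the two-letter language and two-letter region form used in the language table, and the format of the parameter's value is not covered here.

```python
import re

# Documented prefix of the respelling parameter.
PREFIX = "PnG_Recog_PronunciationRespellings"


def respelling_parameter_name(language_tag: str, word: str) -> str:
    """Build PnG_Recog_PronunciationRespellings_<languageTag>_<word>.

    Hypothetical helper; the tag pattern below is an assumption based on
    the tags listed for supported languages (for example en_US, zh_HK).
    """
    if not re.fullmatch(r"[a-z]{2}_[A-Z]{2}", language_tag):
        raise ValueError(f"unexpected language tag: {language_tag!r}")
    if not word:
        raise ValueError("word must not be empty")
    return f"{PREFIX}_{language_tag}_{word}"
```

For example, `respelling_parameter_name("en_US", "male")` yields `PnG_Recog_PronunciationRespellings_en_US_male`.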

Trainability/Speaker-dependent Pronunciations

VoiceCatalyst 4.5 or greater enables a user to train speaker-dependent templates to help resolve poor recognition. PnG does not require training; however, configuration parameter settings can force enrollment training. VoiceCatalyst 4.5 or greater also enables the user to initiate retraining of a word via the button menu (as was always available with BlueStreak).