Spoken Language Support
This section documents Dragonfly’s support for spoken languages.
Languages with speech recognition engine support
Speech recognition engines supported by Dragonfly have a set spoken
language. This language can be checked via the engine.language
property, which returns an ISO 639-1 code (e.g. “en”):
from dragonfly import get_engine
engine = get_engine()
# Print the engine language.
print("Engine language: {}".format(engine.language))
Each speech recognition engine supported by Dragonfly supports many languages. These are listed below with citations.
It is worth noting that Dragonfly’s use of ISO 639-1 language codes means
that no distinction is made between variants of languages. For example,
U.S. English and U.K. English will both yield "en"
and be treated as
the same language, even though there are some differences.
Languages supported by Dragon
The following languages are supported by Dragon Professional Individual version 15 [1]:
English (multiple variants)
Dutch
French
German
Italian
Spanish
Please check the linked Nuance knowledgebase page for the languages supported by other versions and editions of Dragon.
Languages supported by Windows Speech Recognition
The following languages are supported by Windows Speech Recognition (WSR) as of 2016 [2]:
English (U.S.) (*)
English (U.K.)
Chinese (Simplified) (*)
Chinese (Traditional)
French (France)
German (Germany)
Japanese
Spanish (Spain)
* Successfully tested.
Microsoft does not appear to be documenting the languages available for WSR any more, which is why the provided citation for this section is an archive.org link. Currently, the best way to find out if your language is supported is to look for available speech models in the Windows language settings: Settings > Time & Language > Language.
Languages supported by CMU Pocket Sphinx
The CMU Pocket Sphinx engine documentation page has a section on spoken language support. There are CMU Pocket Sphinx models and dictionaries available from Source Forge for the following languages [3]:
English (U.S.) (*)
English (Indian)
Catalan
Chinese (Mandarin) (*)
Dutch
French
German
Greek
Hindi
Italian
Kazakh
Portuguese
Russian (*)
Spanish
* Successfully tested.
English (U.S.) is the default language used by the CMU Pocket Sphinx engine.
Languages supported by Kaldi
The following languages are supported by the Kaldi engine back-end:
English (U.S.)
It is possible for Kaldi to support other languages in the future. This requires finding decent models for other languages and making minor modifications to enable their use by the Kaldi Active Grammar library.
You can request to have your language supported by opening a new issue or by contacting David Zurow (@daanzu) directly.
Languages with built-in grammar support
Dragonfly’s Integer
, IntegerRef
and Digits
classes have support for multiple spoken languages. Each supported language
has a sub-package under dragonfly.language
. The current engine
language will be used to load the language-specific content classes in these
sub-packages.
This functionality is optional. Languages other than those listed below can still be used if the speech recognition supports them.
The following languages are supported:
Arabic - “ar”
Dutch - “nl”
English - “en”
German - “de”
Indonesian - “id”
Malaysian - “ms”
English has additional time, date and character related classes.
Language classes reference
ShortIntegerRef
ShortIntegerRef
is a modified version of IntegerRef
which
allows for greater flexibility in the way that numbers may be pronounced,
allowing for words like “hundred” to be dropped. This may be particularly
useful when navigating files by line or page number.
Some examples of allowed pronunciations:
Pronunciation |
Result |
---|---|
one |
1 |
ten |
10 |
twenty three |
23 |
two three |
23 |
seventy |
70 |
seven zero |
70 |
hundred |
100 |
one oh three |
103 |
hundred three |
103 |
one twenty seven |
127 |
one two seven |
127 |
one hundred twenty seven |
127 |
seven hundred |
700 |
thousand |
1000 |
seventeen hundred |
1700 |
seventeen hundred fifty three |
1753 |
seventeen fifty three |
1753 |
one seven five three |
1753 |
seventeen five three |
1753 |
four thousand |
4000 |
The class works in the same way as IntegerRef
, by adding the
following as an extra:
ShortIntegerRef("name", 0, 1000),
References