Languages

The Teneo Platform currently offer Natural Language Processing (NLP) Capabilities and development of conversational AI solutions in 86 languages. This means that the Platform works with these languages, understands their alphabets and characters, understands what a word and a sentence are in each of those languages, and is able to normalize and pre-process text (e.g. tolerate misspellings and abbreviations).

More specifically it also means that the Teneo Platform supports creating Flows, Language Objects, Entities, and Classes; using the Teneo Teneo Linguistic Modelling Language for condition writing, creating Integrations and other global elements, and also the use of revisitability, flow stacks, dialogue history, and so on.

In the Teneo Platform, the NLP Capabilities are provided through the use of Input Processors (IPs) which pre-process inputs for the Teneo Engine to be able to perform different processes on them, such as normalization, tokenization, and spelling correction, or even number recognition and language detection.

Each language supported by the Teneo Platform has a chain of Input Processors that know how to process that particular language, most of the available languages share the Standard Input Processors Chain with different configuration schemes, while separate Input Processors Chains are available for Chinese, Japanese, Korean, and Turkish.

On top of the NLP Capabilities, further Natural Language (NL) Analyzers are available for a number of the languages. The NL Analyzers group together additional resources available in the Teneo Platform: these resources provide additional information about evaluated inputs, but they are not required for a language to function in the Teneo Platform. The Teneo NL Analyzers, among others, include support for understanding and interpretation of date and time expressions, Part-of-Speech (POS) and Morphological annotation, or recognition of Named Entities.

Last but not least, The Teneo Platform also offers Pre-built Knowledge in the formats of Teneo NLU Ontology and Semantic Networks together with Conversational Modules in a number of the supported languages; Pre-built Knowledge is considered relevant for any project domain and any type of solution because it enables developers to get up and running with a new solution quicker as it allows the project team to focus only on project or domain specific needs.

The below table provides an overview of the supported languages in the Teneo Platform.

Language NLP Capabilities NL Analyzers Pre-built Knowledge
Afrikaans
Albanian
Amharic
Armenian
Azerbaijani
Basque
Belarusian
Bengali (Bangla)
Bosnian
Bulgarian
Catalan
Chinese (Mandarin) POS and Morphology ✔
Date&Time ✔
Sentiment ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Croatian
Czech Ontology and Semantic Network ✔
Danish POS and Morphology ✔
Date&Time ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Dutch POS and Morphology ✔
Date&Time ✔
Sentiment ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
English POS and Morphology ✔
NERs ✔
Date&Time ✔
Sentiment ✔
Offensive ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Esperanto
Estonian
Ewe
Faroese
Finnish
French POS and Morphology ✔
NERs ✔
Date&Time ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Frisian
Galician
Georgian
German POS and Morphology ✔
NERs ✔
Date&Time ✔
Sentiment ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Greek
Gujarati
Hindi
Hungarian
Icelandic
Igbo
Indonesian Ontology and Semantic Network ✔
Conversational Modules ✔
Irish
Italian POS and Morphology ✔
NERs ✔
Date&Time ✔
Sentiment ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Japanese POS and Morphology ✔
NERs ✔
Date&Time ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Kazakh
Kinyarwanda
Kirundi (Rundi)
Korean POS and Morphology ✔
Kyrgyz
Latvian
Lithuanian
Luxembourgish
Macedonian
Malagasy
Malay Ontology and Semantic Network ✔
Conversational Modules ✔
Maltese
Marathi
Mongolian
Nepali
Norwegian Date&Time ✔ Ontology and Semantic Network ✔
Conversational Modules ✔
Odia
Oromo
Papiamento
Polish
Portuguese (European/Brazilian) Date&Time ✔ Ontology and Semantic Network ✔
Conversational Modules ✔
Quechuan (Quechua)
Romanian
Romansh
Russian Ontology and Semantic Network ✔
Conversational Modules ✔
Sámi
Sango
Scottish Gaelic
Serbian
Shona
Sinhala
Slovak
Slovene
Somali
Spanish POS and Morphology ✔
NERs ✔
Date&Time ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Swahili (Kiswahili)
Swazi
Swedish POS and Morphology ✔
NERs ✔
Date&Time ✔
Sentiment ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Tigrinya
Tsonga
Tswana (Setswana)
Turkish POS and Morphology ✔
NERs ✔
Ontology and Semantic Network ✔
Conversational Modules ✔
Turkmen
Ukrainian
Uzbek
Vietnamese
Welsh
Yoruba
Zulu (isiZulu)


Pre-built Knowledge

The Teneo Platform offers Pre-built Knowledge in the formats of Teneo NLU Ontology and Semantic Networks together with Conversational Modules in a number of the supported languages; the Pre-built Knowledge is considered relevant for any project domain and any type of solution because it enables developers to get up and running with a new solution quicker as it allows the project team to focus only on project or domain specific needs.

The Teneo NLU Ontology and Semantic Networks are Lexical Resources containing off-the-shelf building blocks to be used for building Natural Language Interaction solutions in Teneo Studio and are structured and designed to facilitate the understanding of a specific language. For some of the languages the Lexical Resources also contain Additional Resources in the form of objects which support the functionalities of the NL Analyzers, this may be, for example, Morphological ANNOT Language Objects, Named Entity Objects or even objects to support the understanding and interpretation of date and time expressions.

The Conversational Modules are Dialogue Resources or template solutions which provide more than simple smalltalk or chitchatting Flows; they provide the generic dialogue capabilities needed for the conversational AI application to behave human-like, show social skills and personalized behavior, and even for it to simulate intelligence and in this way, engage with the end-users such that they trust the capabilities of the application.

The Teneo NLU Ontology and Semantic Networks are distributed as Lexical Resources which can be assigned to solutions in Teneo Studio, while the Conversational Modules are distributed as template solutions available upon creation of new solutions in Studio.

The below table lists languages currently available with Pre-built Knowledge, and - where applicable - any Additional Resources available (providing support to the NL Analyzers).

Language Teneo NLU Ontology and Semantic Network Conversational Module Additional Resources
Included as objects in the Lexical Resource
Chinese (Mandarin) Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
Czech Lexical Resource ✔
Danish Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Date&Time Objects ✔
Dutch Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
English Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
Offensive Language Detector Objects ✔
French Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
German Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
Indonesian Lexical Resource ✔ Dialogue Resource ✔
Italian Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
Japanese Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
Malay Lexical Resource ✔ Dialogue Resource ✔
Norwegian Lexical Resource ✔ Dialogue Resource ✔ Date&Time Objects ✔
Portuguese (European and Brazilian) Lexical Resource ✔ Dialogue Resource ✔ Date&Time Objects ✔
Russian Lexical Resource ✔ Dialogue Resource ✔
Spanish Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
Swedish Lexical Resource ✔ Dialogue Resource ✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔
Date&Time Objects ✔
Sentiment Analysis Objects ✔
Turkish Lexical Resource ✔ Dialogue Resource✔ Morphology ANNOT objects ✔
Named Entity ANNOT objects ✔

Was this page helpful?