ArangoSearch Analyzers

To simplify query syntax, ArangoSearch provides a concept of named analyzers, which are merely aliases for the type and configuration of IResearch analyzers. In the future, users will be able to specify their own named analyzers. For now, ArangoDB comes with the following analyzers:

  • identity: treat the value as an atom

  • text_de: tokenize the value into case-insensitive word stems as per the German locale, do not discard any stopwords

  • text_en: tokenize the value into case-insensitive word stems as per the English locale, do not discard any stopwords

  • text_es: tokenize the value into case-insensitive word stems as per the Spanish locale, do not discard any stopwords

  • text_fi: tokenize the value into case-insensitive word stems as per the Finnish locale, do not discard any stopwords

  • text_fr: tokenize the value into case-insensitive word stems as per the French locale, do not discard any stopwords

  • text_it: tokenize the value into case-insensitive word stems as per the Italian locale, do not discard any stopwords

  • text_nl: tokenize the value into case-insensitive word stems as per the Dutch locale, do not discard any stopwords

  • text_no: tokenize the value into case-insensitive word stems as per the Norwegian locale, do not discard any stopwords

  • text_pt: tokenize the value into case-insensitive word stems as per the Portuguese locale, do not discard any stopwords

  • text_ru: tokenize the value into case-insensitive word stems as per the Russian locale, do not discard any stopwords

  • text_sv: tokenize the value into case-insensitive word stems as per the Swedish locale, do not discard any stopwords

  • text_zh: tokenize the value into word stems as per the Chinese locale
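
As an illustration of how these names might be used from client code, the following Python sketch maps a locale code to one of the analyzer names listed above and builds the text of an AQL query that passes the name to the ANALYZER() function. The view name articlesView, the field name body, and both helper functions are hypothetical and exist only for this example; the query shape assumes the SEARCH syntax of ArangoSearch views and should be checked against the documentation of the ArangoDB version in use.

```python
# Built-in text analyzers from the list above, keyed by locale code.
BUILTIN_TEXT_ANALYZERS = {
    "de": "text_de", "en": "text_en", "es": "text_es", "fi": "text_fi",
    "fr": "text_fr", "it": "text_it", "nl": "text_nl", "no": "text_no",
    "pt": "text_pt", "ru": "text_ru", "sv": "text_sv", "zh": "text_zh",
}


def analyzer_for(locale=None):
    """Return the analyzer name for a locale (hypothetical helper).

    With no locale, fall back to "identity", which treats the value as an atom.
    """
    if locale is None:
        return "identity"
    if locale not in BUILTIN_TEXT_ANALYZERS:
        raise ValueError("no built-in analyzer for locale %r" % locale)
    return BUILTIN_TEXT_ANALYZERS[locale]


def build_search_query(view, field, term, locale=None):
    """Build AQL text and bind variables for a single-term search (hypothetical helper).

    The view and field names are interpolated as-is, so they must come from
    trusted code, not from user input. The search term is passed as a bind
    variable.
    """
    analyzer = analyzer_for(locale)
    query = (
        'FOR doc IN {view} '
        'SEARCH ANALYZER(doc.{field} == @term, "{analyzer}") '
        'RETURN doc'
    ).format(view=view, field=field, analyzer=analyzer)
    return query, {"term": term}


# Example call with assumed view and field names.
query, bind_vars = build_search_query("articlesView", "body", "quick", "en")
```

The resulting query string and bind variables can then be handed to whichever ArangoDB driver is in use. Keeping the analyzer names in one table makes it harder to mistype a name in an individual query.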