Robuta

Sponsor of the Day: Jerkmate
https://www.elastic.co/docs/reference/text-analysis/analysis-truncate-tokenfilter Truncate token filter | Elasticsearch Reference Truncates tokens that exceed a specified character limit. This limit defaults to 10 but can be customized using the length parameter. For example, you... token filter elasticsearchtruncatereference https://www.elastic.co/docs/reference/text-analysis/analysis-ngram-tokenfilter N-gram token filter | Elasticsearch Reference Forms n-grams of specified lengths from a token. For example, you can use the ngram token filter to change fox to [ f, fo, o, ox, x ]. This filter uses... token filter elasticsearchgramreference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-icu-folding ICU folding token filter | Elasticsearch Reference Case folding of Unicode characters based on UTR#30, like the ASCII-folding token filter on steroids. It registers itself as the icu_folding token filter... token filter elasticsearchicufoldingreference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-phonetic-token-filter phonetic token filter plugin | Elasticsearch Reference The phonetic token filter takes the following settings: It is important to note that token filterplugin elasticsearchphoneticreference https://www.elastic.co/docs/reference/text-analysis/analysis-kstem-tokenfilter KStem token filter | Elasticsearch Reference Provides KStem-based stemming for the English language. The kstem filter combines algorithmic stemming with a built-in dictionary. The kstem filter tends... token filter elasticsearchreference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-icu-normalization ICU normalization token filter | Elasticsearch Reference Normalizes characters as explained here. It registers itself as the icu_normalizer token filter, which is available to all indices without any further... token filter elasticsearchicunormalizationreference https://www.elastic.co/docs/reference/text-analysis/analysis-phonetic-tokenfilter Phonetic token filter for text analysis | Elasticsearch Reference The phonetic token filter is provided as the analysis-phonetic plugin. token filtertext analysiselasticsearch referencephonetic https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-icu-collation ICU collation token filter | Elasticsearch Reference token filter elasticsearchicucollationreference https://www.elastic.co/docs/reference/text-analysis/analysis-keep-words-tokenfilter Keep words token filter | Elasticsearch Reference Keeps only tokens contained in a specified word list. This filter uses Lucene’s KeepWordFilter. The following analyze API request uses the keep filter... token filter elasticsearchkeepwordsreference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-nori-readingform nori_readingform token filter | Elasticsearch Reference The nori_readingform token filter rewrites tokens written in Hanja to their Hangul form. Which responds with: token filter elasticsearchnorireference https://www.elastic.co/docs/reference/text-analysis/analysis-lowercase-tokenfilter Lowercase token filter | Elasticsearch Reference Changes token text to lowercase. For example, you can use the lowercase filter to change THE Lazy DoG to the lazy dog. In addition to a default filter,... token filter elasticsearchlowercasereference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-nori-speech nori_part_of_speech token filter | Elasticsearch Reference The nori_part_of_speech token filter removes tokens that match a set of part-of-speech tags. The list of supported tags and their meanings can be found... token filter elasticsearchnori partspeechreference https://www.elastic.co/docs/reference/text-analysis/analysis-stop-tokenfilter Stop token filter | Elasticsearch Reference Removes stop words from a token stream. When not customized, the filter removes the following English stop words by default: a, an, and, are, as, at,... token filter elasticsearchstopreference https://www.elastic.co/docs/reference/text-analysis/analysis-fingerprint-tokenfilter Fingerprint token filter | Elasticsearch Reference Sorts and removes duplicate tokens from a token stream, then concatenates the stream into a single output token. For example, this filter changes the... token filter elasticsearchfingerprintreference https://www.elastic.co/docs/reference/text-analysis/analysis-snowball-tokenfilter Snowball token filter | Elasticsearch Reference A filter that stems words using a Snowball-generated stemmer. The language parameter controls the stemmer with the following available values: Arabic,... token filter elasticsearchsnowballreference https://www.elastic.co/docs/reference/elasticsearch/plugins/analysis-polish-stop polish_stop token filter | Elasticsearch Reference The polish_stop token filter filters out Polish stopwords (_polish_), and any other custom stopwords specified by the user. This filter only supports... token filter elasticsearchpolish stopreference https://www.elastic.co/docs/reference/text-analysis/analysis-uppercase-tokenfilter Uppercase token filter | Elasticsearch Reference Changes token text to uppercase. For example, you can use the uppercase filter to change the Lazy DoG to THE LAZY DOG. This filter uses Lucene’s... token filter elasticsearchuppercasereference https://www.elastic.co/docs/reference/text-analysis/analysis-delimited-payload-tokenfilter Delimited payload token filter | Elasticsearch Reference Separates a token stream into tokens and payloads based on a specified delimiter. For example, you can use the delimited_payload filter with a | delimiter... token filter elasticsearchdelimitedpayloadreference https://www.elastic.co/docs/reference/text-analysis/analysis-keyword-marker-tokenfilter Keyword marker token filter | Elasticsearch Reference Marks specified tokens as keywords, which are not stemmed. The keyword_marker filter assigns specified tokens a keyword attribute of true. Stemmer token... token filter elasticsearchkeywordmarkerreference https://www.elastic.co/docs/reference/text-analysis/analysis-stemmer-tokenfilter Stemmer token filter | Elasticsearch Reference Provides algorithmic stemming for several languages, some with additional variants. For a list of supported languages, see the language parameter. When... token filter elasticsearchstemmerreference