standard analyzer

The standard analyzer is used by default for any full-text analyzed string field. If we were to "reimplement" the standard analyzer as a custom analyzer, it would be defined as follows:

{
    "type":      "custom",
    "tokenizer": "standard",
    "filter":  [ "lowercase", "stop" ]
}
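To make the order of operations in this definition concrete, here is a rough Python sketch of the same three-stage pipeline: tokenize, then lowercase, then remove stopwords. It is only an illustration under stated assumptions. The regular expression is a crude stand-in for the standard tokenizer, the stopword list is a small sample chosen for the example, and the function names are hypothetical rather than part of any Elasticsearch API.

```python
import re

# Assumption: a small illustrative stopword list, not the analyzer's real default list.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "is", "in"}


def tokenize(text):
    # Crude stand-in for the standard tokenizer: keep runs of word characters.
    return re.findall(r"\w+", text)


def lowercase(tokens):
    # Mirrors the role of the lowercase token filter.
    return [token.lower() for token in tokens]


def stop(tokens):
    # Mirrors the role of the stop token filter; runs after lowercasing
    # so that "The" and "the" are treated alike.
    return [token for token in tokens if token not in STOPWORDS]


def analyze(text):
    # Tokenizer first, then the filters in the order they are listed.
    return stop(lowercase(tokenize(text)))


print(analyze("The Quick Brown Fox jumps over the lazy dog"))
# ['quick', 'brown', 'fox', 'jumps', 'over', 'lazy', 'dog']
```

The filter order matters in this sketch: because the sample stopword list is lowercase, running the stop step before the lowercase step would let "The" slip through.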

In [token-normalization] and [stopwords] we will talk about the lowercase and stop token filters, but for the moment, let’s focus on the standard tokenizer.