Skip to content

Commit

Permalink
fix typos in README.md
Browse files Browse the repository at this point in the history
Closes #23
  • Loading branch information
johtani authored and dadoonet committed Mar 3, 2014
1 parent 207420c commit 15cfa67
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ This analyzer is the following tokenizer and tokenfilter combination.
* `kuromoji_part_of_speech` : Kuromoji Part of Speech Stop Filter (TokenFilter)
* `cjk_width` : CJK Width Filter (TokenFilter)
* `stop` : Stop Filter (TokenFilter)
* `kuromoji_stemmer` : Kuromiji Katakana Stemmer Filter(TokenFilter)
* `kuromoji_stemmer` : Kuromoji Katakana Stemmer Filter(TokenFilter)
* `lowercase` : LowerCase Filter (TokenFilter)

## CharFilter : kuromoji_iteration_mark
Expand Down Expand Up @@ -79,7 +79,7 @@ The mode is three types.

* `normal` : Ordinary segmentation: no decomposition for compounds

* `search` : Segmentation geared towards search: this includes a decompounding process for long nouns, also includeing the full compound token as a synonym.
* `search` : Segmentation geared towards search: this includes a decompounding process for long nouns, also including the full compound token as a synonym.

* `extended` : Extended mode outputs unigrams for unknown words.

Expand Down Expand Up @@ -159,7 +159,7 @@ curl -XPOST 'http://localhost:9200/kuromoji_sample/_analyze?analyzer=my_analyzer

## TokenFilter : kuromoji_baseform

A token filter of type `kuromoji_baseform` that replcaes term text with BaseFormAttribute.
A token filter of type `kuromoji_baseform` that replaces term text with BaseFormAttribute.
This acts as a lemmatizer for verbs and adjectives.

### example
Expand Down Expand Up @@ -202,7 +202,7 @@ The following are settings that can be set for a stop token filter type:
|:------------|:-----------------------------------------------------|
| stoptags | A list of part-of-speech tags that should be removed |

Note that default setting is stoptags.txt include lucene-analyzer-kuromji.jar.
Note that default setting is stoptags.txt include lucene-analyzer-kuromoji.jar.

### example

Expand Down Expand Up @@ -260,7 +260,7 @@ The following are settings that can be set for a `kuromoji_readingform` token fi
|:------------|:----------------------------------------------------------|:------------------|
| use_romaji | `true` if romaji reading form output instead of katakana. | `false` |

Note that elasticsearch-analysis-kuromoji built-in `kuromoji_readingform` set default `ture` to `use_romaji` attribute.
Note that elasticsearch-analysis-kuromoji built-in `kuromoji_readingform` set default `true` to `use_romaji` attribute.

### example

Expand Down

0 comments on commit 15cfa67

Please sign in to comment.