EnglishSubReader (Lucene-jam 0.9.3 API)

概要

パッケージ

クラス

使用

階層ツリー

非推奨 API

索引

ヘルプ

前のクラス 次のクラス

フレームあり フレームなし

概要: 入れ子 | フィールド | コンストラクタ | メソッド

詳細: フィールド | コンストラクタ | メソッド

org.aminds.lucene.analysis
クラス EnglishSubReader

java.lang.Object
  java.io.Reader
      org.apache.lucene.analysis.CharStream
          org.apache.lucene.analysis.CharFilter
              org.aminds.lucene.analysis.SubReader
                  org.aminds.lucene.analysis.CodePointBasedSubReader
                      org.aminds.lucene.analysis.EnglishSubReader

すべての実装されたインタフェース:: Closeable, Readable, ReusableCharFilter

public class EnglishSubReader
extends CodePointBasedSubReader
extends CodePointBasedSubReader

SubReader that ignores whitespaces between hyphen and alphabet character. This behavior realizes fine tokenization of multiline/multipage English text with hyphenation. Current implementation finds '-' + pattern, replacing it with '-' . Hyphen character is not removed. To remove a hyphen appropriately, we need a dictionary.

作成者:: Masashi Nakanishi

フィールドの概要

クラス org.apache.lucene.analysis.CharFilter から継承されたフィールド
`input`

クラス java.io.Reader から継承されたフィールド
`lock`

インタフェース org.aminds.lucene.analysis.ReusableCharFilter から継承されたフィールド
`EMPTY_CHAR_STREAM`

コンストラクタの概要
`EnglishSubReader()`

メソッドの概要
`protected boolean`	`accept(int codePoint)`
`static boolean`	`isPrintable(int codePoint)`

クラス org.aminds.lucene.analysis.CodePointBasedSubReader から継承されたメソッド
`canStart, isWhitespace, read`

クラス org.aminds.lucene.analysis.SubReader から継承されたメソッド
`close, currentOffset, end, mark, markSupported, reset, reset`

クラス org.apache.lucene.analysis.CharFilter から継承されたメソッド
`correct, correctOffset`

クラス java.io.Reader から継承されたメソッド
`read, read, read, ready, skip`

クラス java.lang.Object から継承されたメソッド
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait`

コンストラクタの詳細

EnglishSubReader

public EnglishSubReader()

メソッドの詳細

accept

protected boolean accept(int codePoint)

定義:: クラス CodePointBasedSubReader 内の accept

isPrintable

public static boolean isPrintable(int codePoint)

概要