: Class StandardTokenizer

Overview

Package

Class

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: INNER | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

com.lucene.analysis.standard
Class StandardTokenizer

java.lang.Object
  |
  +--com.lucene.analysis.TokenStream
        |
        +--com.lucene.analysis.Tokenizer
              |
              +--com.lucene.analysis.standard.StandardTokenizer

All Implemented Interfaces:: StandardTokenizerConstants

public class StandardTokenizer
extends Tokenizer
implements StandardTokenizerConstants

A grammar-based tokenizer constructed with JavaCC.

This should be a good tokenizer for most European-language documents.

Many applications have specific tokenizer needs. If this tokenizer does not suit your application, please consider copying this source code directory to your project and maintaining your own grammar-based tokenizer.

Field Summary

Token jj_nt


Token token


StandardTokenizerTokenManager token_source


Fields inherited from class com.lucene.analysis.Tokenizer

input

Fields inherited from interface com.lucene.analysis.standard.StandardTokenizerConstants

ACRONYM, ALPHA, ALPHANUM, APOSTROPHE, COMPANY, DEFAULT, DIGIT, EMAIL, EOF, HAS_DIGIT, HOST, LETTER, NOISE, NUM, P, tokenImage

Constructor Summary

StandardTokenizer(CharStream stream)


StandardTokenizer(Reader reader)
          Constructs a tokenizer for this Reader.

StandardTokenizer(StandardTokenizerTokenManager tm)


Method Summary

void disable_tracing()


void enable_tracing()


ParseException generateParseException()


Token getNextToken()


Token getToken(int index)


Token next()
          Returns the next token in the stream, or null at EOS.

void ReInit(CharStream stream)


void ReInit(StandardTokenizerTokenManager tm)


Methods inherited from class com.lucene.analysis.Tokenizer

close

Methods inherited from class java.lang.Object

clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail

token_source

public StandardTokenizerTokenManager token_source

token

public Token token

jj_nt

public Token jj_nt

Constructor Detail

StandardTokenizer

public StandardTokenizer(Reader reader)

Constructs a tokenizer for this Reader.

StandardTokenizer

public StandardTokenizer(CharStream stream)

StandardTokenizer

public StandardTokenizer(StandardTokenizerTokenManager tm)

Method Detail

public final Token next()
                 throws ParseException,
                        IOException

Returns the next token in the stream, or null at EOS.

The returned token's type is set to an element of StandardTokenizerConstants.tokenImage.

Overrides:: next in class TokenStream

ReInit

public void ReInit(CharStream stream)

ReInit

public void ReInit(StandardTokenizerTokenManager tm)

getNextToken

public final Token getNextToken()

getToken

public final Token getToken(int index)

generateParseException

public final ParseException generateParseException()

enable_tracing

public final void enable_tracing()

disable_tracing

public final void disable_tracing()

Overview

Package

Class

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: INNER | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

Field Summary
`Token`	`jj_nt`
`Token`	`token`
`StandardTokenizerTokenManager`	`token_source`

Constructor Summary
`StandardTokenizer(CharStream stream)`
`StandardTokenizer(Reader reader)` Constructs a tokenizer for this Reader.
`StandardTokenizer(StandardTokenizerTokenManager tm)`

Method Summary
`void`	`disable_tracing()`
`void`	`enable_tracing()`
`ParseException`	`generateParseException()`
`Token`	`getNextToken()`
`Token`	`getToken(int index)`
`Token`	`next()` Returns the next token in the stream, or null at EOS.
`void`	`ReInit(CharStream stream)`
`void`	`ReInit(StandardTokenizerTokenManager tm)`

com.lucene.analysis.standard Class StandardTokenizer

token_source

token

jj_nt

StandardTokenizer

StandardTokenizer

StandardTokenizer

next

ReInit

ReInit

getNextToken

getToken

generateParseException

enable_tracing

disable_tracing

com.lucene.analysis.standard
Class StandardTokenizer