java.lang.Object
  org.apache.stanbol.commons.opennlp.KeywordTokenizer
public class KeywordTokenizer
Tokenizes text on whitespace characters. Punctuation at the end of a word is emitted as a separate token. Intended for extracting alphanumeric keywords from text.
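The behavior described above can be illustrated with a small standalone sketch. This is not the Stanbol source: the class name `KeywordTokenizerSketch` is hypothetical, and the set of characters treated as trailing punctuation (`.,!?;:`) is an assumption, since this page does not list them. It uses only the Java standard library, so it runs without OpenNLP or Stanbol on the classpath.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration only; not the actual KeywordTokenizer.
class KeywordTokenizerSketch {

    // Assumption: which characters count as punctuation at the end of a word.
    private static final String TRAILING_PUNCTUATION = ".,!?;:";

    static String[] tokenize(String s) {
        List<String> tokens = new ArrayList<>();
        int start = -1; // start offset of the current token, -1 if none is open
        for (int i = 0; i <= s.length(); i++) {
            // The end of the input is treated like a whitespace boundary.
            boolean boundary = i == s.length() || Character.isWhitespace(s.charAt(i));
            if (!boundary) {
                if (start < 0) {
                    start = i; // a new token begins here
                }
                continue;
            }
            if (start < 0) {
                continue; // consecutive whitespace, nothing to close
            }
            char last = s.charAt(i - 1);
            if (i - start > 1 && TRAILING_PUNCTUATION.indexOf(last) >= 0) {
                // Split the trailing punctuation mark into its own token.
                tokens.add(s.substring(start, i - 1));
                tokens.add(s.substring(i - 1, i));
            } else {
                tokens.add(s.substring(start, i));
            }
            start = -1;
        }
        return tokens.toArray(new String[0]);
    }

    public static void main(String[] args) {
        // Prints: Apache|Stanbol|,|a|toolkit|.
        System.out.println(String.join("|", tokenize("Apache Stanbol, a toolkit.")));
    }
}
```

With the real class, the equivalent call would presumably be `KeywordTokenizer.INSTANCE.tokenize(text)`, using the shared instance listed in the field summary; the exact tokens it returns may differ from this sketch.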
| Field Summary | |
|---|---|
| static KeywordTokenizer | INSTANCE |
| Method Summary | |
|---|---|
| java.lang.String[] | tokenize(java.lang.String s) |
| opennlp.tools.util.Span[] | tokenizePos(java.lang.String s) |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final KeywordTokenizer INSTANCE
| Method Detail |
|---|
public java.lang.String[] tokenize(java.lang.String s)
Specified by: tokenize in interface opennlp.tools.tokenize.Tokenizer

public opennlp.tools.util.Span[] tokenizePos(java.lang.String s)
Specified by: tokenizePos in interface opennlp.tools.tokenize.Tokenizer