Class BreakIteratorWrapper
java.lang.Object
    org.apache.lucene.analysis.icu.segmentation.BreakIteratorWrapper
final class BreakIteratorWrapper extends java.lang.Object
Wraps RuleBasedBreakIterator, making object reuse convenient and emitting a rule status for emoji sequences.
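The reuse pattern described above can be illustrated with a minimal sketch. It is not the Lucene class: `ReusableBreaker` and its `split` helper are hypothetical names, and the JDK's `java.text.BreakIterator` is swapped in for ICU's `RuleBasedBreakIterator` so the example runs without ICU4J. The sketch also assumes that boundaries are reported relative to `start`, which this page does not state.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Hypothetical stand-in for the wrapper pattern; not the Lucene implementation. */
final class ReusableBreaker {
  // Created once by the caller and reused for every buffer passed to setText.
  private final BreakIterator delegate;

  ReusableBreaker(BreakIterator delegate) {
    this.delegate = delegate;
  }

  /** Points the shared iterator at text[start .. start + length). */
  void setText(char[] text, int start, int length) {
    // Copying into a String keeps the sketch short; the documented class holds a
    // CharArrayIterator field (textIterator), presumably to avoid such a copy.
    delegate.setText(new String(text, start, length));
    delegate.first();
  }

  /** Current boundary, relative to the start of the slice (an assumption of this sketch). */
  int current() {
    return delegate.current();
  }

  /** Advances to the next boundary, or returns BreakIterator.DONE at the end of the slice. */
  int next() {
    return delegate.next();
  }

  /** Hypothetical helper: returns the text between consecutive breaks of one slice. */
  static List<String> split(ReusableBreaker breaker, char[] buf, int start, int length) {
    breaker.setText(buf, start, length);
    List<String> out = new ArrayList<>();
    int from = breaker.current();
    for (int to = breaker.next(); to != BreakIterator.DONE; to = breaker.next()) {
      out.add(new String(buf, start + from, to - from));
      from = to;
    }
    return out;
  }

  public static void main(String[] args) {
    ReusableBreaker breaker = new ReusableBreaker(BreakIterator.getWordInstance(Locale.ROOT));
    // The same wrapper instance handles two different buffers in turn.
    System.out.println(split(breaker, "xxhello worldyy".toCharArray(), 2, 11));
    System.out.println(split(breaker, "foo bar".toCharArray(), 0, 7));
  }
}
```

The point of the design is that the break iterator is built once and only its text is swapped, so a tokenizer can process many buffers without constructing a new iterator for each one.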
-
Field Summary
Modifier and Type                                       Field
(package private) static com.ibm.icu.text.UnicodeSet    EMOJI
(package private) static com.ibm.icu.text.UnicodeSet    EMOJI_RK
private com.ibm.icu.text.RuleBasedBreakIterator         rbbi
private int                                             start
private int                                             status
private char[]                                          text
private CharArrayIterator                               textIterator
-
Constructor Summary
Constructor
BreakIteratorWrapper(com.ibm.icu.text.RuleBasedBreakIterator rbbi)
-
Method Summary
Modifier and Type         Method                                        Description
private int               calcStatus(int current, int next)             Returns the current rule status for the text between breaks.
(package private) int     current()
(package private) int     getRuleStatus()
private boolean           isEmoji(int current, int next)                Returns true if the current text represents an emoji character or sequence.
(package private) int     next()
(package private) void    setText(char[] text, int start, int length)
-
Field Detail
-
textIterator
private final CharArrayIterator textIterator
-
rbbi
private final com.ibm.icu.text.RuleBasedBreakIterator rbbi
-
text
private char[] text
-
start
private int start
-
status
private int status
-
EMOJI_RK
static final com.ibm.icu.text.UnicodeSet EMOJI_RK
-
EMOJI
static final com.ibm.icu.text.UnicodeSet EMOJI
-
Method Detail
-
current
int current()
-
getRuleStatus
int getRuleStatus()
-
next
int next()
-
calcStatus
private int calcStatus(int current, int next)
Returns the current rule status for the text between breaks; this status determines the token type.
-
isEmoji
private boolean isEmoji(int current, int next)
Returns true if the current text represents an emoji character or sequence.
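As a rough illustration of what a check over the text between two breaks might look like, here is a simplified sketch. It is not the Lucene code: `EmojiSketch`, `inEmojiSet`, and `inSelectorDependentSet` are hypothetical, the hard-coded code point ranges are a tiny stand-in for the `EMOJI` and `EMOJI_RK` UnicodeSets, and the idea that some characters count as emoji only when followed by U+FE0F or U+20E3 is an assumption, since this page does not describe what either set contains.

```java
/** Hypothetical, simplified emoji test; not the documented implementation. */
final class EmojiSketch {
  // Assumption: stand-in for characters that are emoji only when a selector or keycap follows.
  static boolean inSelectorDependentSet(int cp) {
    return cp == '#' || cp == '*' || (cp >= '0' && cp <= '9');
  }

  // Assumption: stand-in for a full emoji set (one pictographic range plus the set above).
  static boolean inEmojiSet(int cp) {
    return (cp >= 0x1F300 && cp <= 0x1FAFF) || inSelectorDependentSet(cp);
  }

  /** Returns true if text[current .. next) starts with an emoji character or sequence. */
  static boolean isEmoji(char[] text, int current, int next) {
    if (current >= next) {
      return false; // empty span
    }
    int cp = Character.codePointAt(text, current, next);
    if (!inEmojiSet(cp)) {
      return false;
    }
    if (inSelectorDependentSet(cp)) {
      // Plain digits, '#', and '*' are ordinary text unless U+FE0F (emoji
      // presentation selector) or U+20E3 (combining keycap) follows directly.
      int trailer = current + Character.charCount(cp);
      return trailer < next && (text[trailer] == 0xFE0F || text[trailer] == 0x20E3);
    }
    return true;
  }

  public static void main(String[] args) {
    char[] keycap = "1\uFE0F\u20E3".toCharArray();
    char[] digit = "1".toCharArray();
    System.out.println(isEmoji(keycap, 0, keycap.length));
    System.out.println(isEmoji(digit, 0, digit.length));
  }
}
```

Reading the first code point with `Character.codePointAt` rather than a single `char` matters here because many emoji lie outside the Basic Multilingual Plane and occupy two `char` values.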
-
setText
void setText(char[] text, int start, int length)