FIELD: physics, computer technology.
SUBSTANCE: the invention concerns methods and systems for text segmentation. The method involves accessing a symbol string (204), identifying a long lexeme (206), recording the adjoining symbols within the long lexeme (208), identifying lexemes in the symbol string while keeping the recorded adjoining symbols together, and determining multiple lexeme combinations (210), the number of lexeme combinations being reduced by means of the recorded adjoining symbols.
EFFECT: increased speed of text segmentation.
22 cl, 3 dwg
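The steps named in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation. The function names, the `min_long` length threshold, the greedy longest-match rule for finding long lexemes, and the toy lexicon are all assumptions made for illustration. The sketch records the boundary positions that fall inside a long lexeme and then skips any segmentation that would split there, which reduces the number of lexeme combinations.

```python
from typing import List, Set


def find_glued_boundaries(text: str, lexicon: Set[str], min_long: int) -> Set[int]:
    """Return boundary positions that lie strictly inside a long lexeme.

    A "long lexeme" is assumed here to be a lexicon entry of at least
    min_long symbols, found by a greedy left-to-right longest match.
    """
    glued: Set[int] = set()
    max_len = max((len(w) for w in lexicon), default=0)
    i = 0
    while i < len(text):
        match_len = 0
        # Try the longest candidate first, down to the assumed threshold.
        for length in range(min(max_len, len(text) - i), min_long - 1, -1):
            if text[i:i + length] in lexicon:
                match_len = length
                break
        if match_len:
            # Record every boundary between adjoining symbols of the lexeme.
            glued.update(range(i + 1, i + match_len))
            i += match_len
        else:
            i += 1
    return glued


def segmentations(text: str, lexicon: Set[str], glued: Set[int]) -> List[List[str]]:
    """Enumerate lexeme combinations, never cutting at a glued boundary."""
    results: List[List[str]] = []

    def extend(start: int, path: List[str]) -> None:
        if start == len(text):
            results.append(list(path))
            return
        for end in range(start + 1, len(text) + 1):
            if end in glued:
                continue  # adjoining symbols must stay together
            piece = text[start:end]
            if piece in lexicon:
                path.append(piece)
                extend(end, path)
                path.pop()

    extend(0, [])
    return results


# Toy data (hypothetical): single letters stand in for symbols.
lexicon = {"a", "b", "c", "d", "ab", "bc", "abc"}
text = "abcd"

glued = find_glued_boundaries(text, lexicon, min_long=3)
unconstrained = segmentations(text, lexicon, set())
constrained = segmentations(text, lexicon, glued)

print(len(unconstrained))  # 4 combinations without the recorded symbols
print(constrained)         # [['abc', 'd']]
```

In this toy case the long lexeme "abc" glues boundaries 1 and 2, so four candidate combinations shrink to one, which is the kind of reduction the abstract credits for the speed gain.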
Dates
2009-02-27—Published
2003-12-30—Filed