This collator ensures all characters, supplementary and non-supplementary, have the same binary collating sequence as UTF-8.
这个排序器确保所有字符(补充字符和非补充字符)采用与 UTF-8 一样的二进制排序次序。
2
While this is somewhat simpler than identifying a "word", the completely naive approach of looking at every (overlapping) sequence of three bytes is non-optimal.
The logic path is unpredictable and requires activity to be detected and correlated by time and sequence across multiple applications (non-linear processing).