U-Tokenizer is a project created by Uniwits.com. U-Tokenizer converts texts into tokens that can be used for search and other analyses. On CJK texts, U-Tokenizer does not simply turn texts into contiguous character pairs with overlapping characters. On the contrary, U-Tokenizer tries to cut texts at word boundaries that are meanful to human readings.
In the following form, you can paste a piece of texts and have a try.