Thanks for all. The problem is done.
The solution is to use ultraedit convert function(convert ascii to utf-8(unicode edition)). Originally, i use the notepad++ to convert utf-8 without BOM, and it seems not work as expected.
Now, the split function can work correctly without any force_encoding, but the input from html should be converted using force_encoding("UTF-8").
Thanks again for TIG.