jieba is a Python library for segmenting Chinese text - from the GitHub Page https://github.com/fxsjy/jieba#jieba-1 : Jieba (Chinese for to stutter) Chinese text segmentation: built to be the best Python Chinese word segmentation module.