本次的主要更新:
1) 改进了对标点符号的处理,之前的版本会过滤掉所有的标点符号;
2) 允许用户在自定义词典中添加词性;
3) 改进了关键词提取的功能jieba.analyse.extract_tags;
4) 修复了一个在pypy解释器下运行的bug.
在线演示:http://jiebademo.ap01.aws.af.cm/
>>> import jiebaTraceback (most recent call last): File "<stdin>", line 1, in <module> File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/__init__.py", line 5, in <module> import finalseg File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 13, in <module> prob_start = load_model("prob_start.py") File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 10, in load_model tab = eval(open(prob_p_path,"rb").read()) File "<string>", line 1 {'B': -0.26268660809250016, ^SyntaxError: invalid syntax报错
评论删除后,数据将无法恢复
引用来自“fanlu”的评论
>>> import jieba
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/__init__.py", line 5, in <module>
import finalseg
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 13, in <module>
prob_start = load_model("prob_start.py")
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 10, in load_model
tab = eval(open(prob_p_path,"rb").read())
File "<string>", line 1
{'B': -0.26268660809250016,
^
SyntaxError: invalid syntax
报错
引用来自“fanlu”的评论
>>> import jieba
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/__init__.py", line 5, in <module>
import finalseg
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 13, in <module>
prob_start = load_model("prob_start.py")
File "/usr/local/lib/python2.6/site-packages/jieba-0.26-py2.6.egg/jieba/finalseg/__init__.py", line 10, in load_model
tab = eval(open(prob_p_path,"rb").read())
File "<string>", line 1
{'B': -0.26268660809250016,
^
SyntaxError: invalid syntax
报错