Drawing a word cloud in Python


Text content: data (contains many text entries; the code below treats it as a pandas Series)

1. Tokenize:

import jieba

# Split each text entry into a list of words
data_cut = data.apply(jieba.lcut)

2. Remove stop words:

stoplist.txt: link: https://pan.baidu.com/s/1lN1J8aUFOwqXpYMzuqVA7w extraction code: nk7z

with open(r'D:\数据文件\stoplist.txt', encoding='utf-8') as f:
    txt = f.read()
stop = txt.split()
stop = stop + [' ']   # add the space character as a stop word too
data_after = data_cut.apply(lambda x: [i for i in x if i not in stop])
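The filtering step above checks every token against a list, which is a linear scan per token. A minimal sketch of the same idea using a set for membership tests is shown here; the sample tokens, the stop words, and the helper name remove_stop_words are all hypothetical stand-ins, not part of the original code.

```python
# Hypothetical tokenized documents standing in for data_cut
docs = [
    ["我", "喜欢", " ", "词云"],
    ["词云", "的", "效果", "很", "好"],
]

# Hypothetical stop words; in the article these are read from stoplist.txt
stop_words = {"我", "的", "很", " "}


def remove_stop_words(tokens, stop_words):
    """Return the tokens that are not in stop_words, preserving order."""
    return [t for t in tokens if t not in stop_words]


cleaned = [remove_stop_words(doc, stop_words) for doc in docs]
print(cleaned)
```

With the real data, the same effect could probably be had by converting stop to a set before calling data_cut.apply.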

3. Count word frequencies:

import pandas as pd
from tkinter import _flatten

tmp = pd.Series(_flatten(list(data_after)))   # flatten the 2-D list into 1-D
num = tmp.value_counts()                      # frequency of each word
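The flatten-and-count step above relies on _flatten, which is a private helper inside tkinter. As a rough alternative sketch, the same result can be obtained with the standard library alone; the sample data and the names docs_after, flat, and freq are hypothetical.

```python
from collections import Counter
from itertools import chain

# Hypothetical stop-word-filtered documents standing in for data_after
docs_after = [["喜欢", "词云"], ["词云", "效果", "好"]]

# Flatten the list of lists into a single list of words
flat = list(chain.from_iterable(docs_after))

# Count how many times each word occurs
freq = Counter(flat)
print(freq.most_common(1))
```

A Counter is a dict subclass, so it should be usable wherever a word-to-frequency mapping is expected, such as WordCloud.fit_words.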

4. Draw:

from wordcloud import WordCloud
import matplotlib.pyplot as plt

pic = plt.imread(r'D:\数据文件\aixin.jpg')   # image used as the mask shape
wc = WordCloud(
    background_color='white',
    mask=pic,
    font_path=r'C:/Windows/Fonts/simsun.ttc'   # a font that can render Chinese
)
wc2 = wc.fit_words(num)
plt.imshow(wc2)
plt.axis('off')
plt.show()
