English Text Preprocessing

Author: _龙雀 | Published: 2019-06-12 18:55


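import re

import nltk
from nltk.corpus import stopwords

# Download the NLTK stop-word list (needed once per environment)
nltk.download('stopwords')

def text_to_list(text):
    """Lowercase and clean English text, then return its tokens without stop words."""
    text = str(text)
    text = text.lower()

    # Clean the text: replace every character outside the allowed set with a space.
    # Note: inside the character class, "+-=" is a range, so : ; < are kept as well.
    text = re.sub(r"[^A-Za-z0-9^,!.\/'+-=]", " ", text)
    # Expand common contractions (order matters: "can't" must run before "n't")
    text = re.sub(r"what's", "what is ", text)
    text = re.sub(r"\'s", " ", text)
    text = re.sub(r"\'ve", " have ", text)
    text = re.sub(r"can't", "cannot ", text)
    text = re.sub(r"n't", " not ", text)
    text = re.sub(r"i'm", "i am ", text)
    text = re.sub(r"\'re", " are ", text)
    text = re.sub(r"\'d", " would ", text)
    text = re.sub(r"\'ll", " will ", text)
    # Drop commas, and pad "!" and ":" so they become separate tokens
    text = re.sub(r",", " ", text)
    text = re.sub(r"!", " ! ", text)
    text = re.sub(r":", " : ", text)
    text = re.sub(r"e - mail", "email", text)
    text = text.split()

    # Remove stop words (a set gives fast membership tests)
    stops = set(stopwords.words('english'))
    clean_text = [token for token in text if token not in stops]

    return clean_text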
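
To see what the cleaning rules do without downloading the NLTK corpus, here is a minimal sketch. It copies the substitution rules from text_to_list into a table and swaps the NLTK list for a tiny hand-picked stop-word set. The names DEMO_STOPS, RULES and demo_clean are hypothetical and used only for illustration; the real stopwords.words('english') list is much longer, so actual output will differ.

```python
import re

# Hypothetical stand-in for stopwords.words('english'); an assumption for this demo only.
DEMO_STOPS = {"i", "am", "is", "a", "the", "what", "not",
              "to", "have", "will", "are", "would"}

# (pattern, replacement) pairs applied in order, copied from text_to_list.
RULES = [
    (r"[^A-Za-z0-9^,!.\/'+-=]", " "),
    (r"what's", "what is "),
    (r"\'s", " "),
    (r"\'ve", " have "),
    (r"can't", "cannot "),
    (r"n't", " not "),
    (r"i'm", "i am "),
    (r"\'re", " are "),
    (r"\'d", " would "),
    (r"\'ll", " will "),
    (r",", " "),
    (r"!", " ! "),
    (r":", " : "),
    (r"e - mail", "email"),
]

def demo_clean(text, stops=DEMO_STOPS):
    """Lowercase, apply each rule in order, split, and drop stop words."""
    text = str(text).lower()
    for pattern, repl in RULES:
        text = re.sub(pattern, repl, text)
    return [tok for tok in text.split() if tok not in stops]

print(demo_clean("What's the plan? I can't wait, I'm ready!"))
# → ['plan', 'cannot', 'wait', 'ready', '!']
```

Note that the stop-word filter runs after the contractions are expanded, so words produced by the expansion (such as "is", "am" or "not") are themselves removed if they appear in the stop-word list.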

Article link: https://www.haomeiwen.com/subject/ferrfctx.html
