0
点赞
收藏
分享

微信扫一扫

weka使用教程2--对文档进行分类


1 使用的数据集

根据语句内容,判断该语句是乐观还是悲观。

weka使用教程2--对文档进行分类_weka零基础使用

2 使用NativeBayesMultinomialText,这个算法支持文本分类

weka使用教程2--对文档进行分类_weka零基础使用_02

3  把文档转为词向量,对文档进行分类

(1)选择StringToWordVector过滤器

weka使用教程2--对文档进行分类_weka零基础使用_03

 

 

 

G

M

T

 

 



Detect languageAfrikaansAlbanianAmharicArabicArmenianAzerbaijaniBasqueBelarusianBengaliBosnianBulgarianCatalanCebuanoChichewaChinese (Simplified)Chinese (Traditional)CorsicanCroatianCzechDanishDutchEnglishEsperantoEstonianFilipinoFinnishFrenchFrisianGalicianGeorgianGermanGreekGujaratiHaitian CreoleHausaHawaiianHebrewHindiHmongHungarianIcelandicIgboIndonesianIrishItalianJapaneseJavaneseKannadaKazakhKhmerKoreanKurdishKyrgyzLaoLatinLatvianLithuanianLuxembourgishMacedonianMalagasyMalayMalayalamMalteseMaoriMarathiMongolianMyanmar (Burmese)NepaliNorwegianPashtoPersianPolishPortuguesePunjabiRomanianRussianSamoanScots GaelicSerbianSesothoShonaSindhiSinhalaSlovakSlovenianSomaliSpanishSundaneseSwahiliSwedishTajikTamilTeluguThaiTurkishUkrainianUrduUzbekVietnameseWelshXhosaYiddishYorubaZulu

 

AfrikaansAlbanianAmharicArabicArmenianAzerbaijaniBasqueBelarusianBengaliBosnianBulgarianCatalanCebuanoChichewaChinese (Simplified)Chinese (Traditional)CorsicanCroatianCzechDanishDutchEnglishEsperantoEstonianFilipinoFinnishFrenchFrisianGalicianGeorgianGermanGreekGujaratiHaitian CreoleHausaHawaiianHebrewHindiHmongHungarianIcelandicIgboIndonesianIrishItalianJapaneseJavaneseKannadaKazakhKhmerKoreanKurdishKyrgyzLaoLatinLatvianLithuanianLuxembourgishMacedonianMalagasyMalayMalayalamMalteseMaoriMarathiMongolianMyanmar (Burmese)NepaliNorwegianPashtoPersianPolishPortuguesePunjabiRomanianRussianSamoanScots GaelicSerbianSesothoShonaSindhiSinhalaSlovakSlovenianSomaliSpanishSundaneseSwahiliSwedishTajikTamilTeluguThaiTurkishUkrainianUrduUzbekVietnameseWelshXhosaYiddishYorubaZulu

 

 

 



 

 

 

 


Text-to-speech function is limited to 200 characters



 

Options : History : Feedback : Donate

Close


(2)去掉停用词,以及一些无意义的词

使用pattern去掉数字

weka使用教程2--对文档进行分类_weka的简单使用_04

删除无用词

weka使用教程2--对文档进行分类_jar_05

 (2)编辑生成词向量的文件,把第一列设置为分类

weka使用教程2--对文档进行分类_jar_06

 (3)选择NativeBayes进行分类测试

weka使用教程2--对文档进行分类_weka的简单使用_07

举报

相关推荐

0 条评论