zhangcong-zc/Text_Classification

Text Classification Based on Chinese SogouNews

Models

  • TextCNN
  • TextRNN
  • LEAM
  • HAN
  • FastText
  • Char-CNN
  • Bi-LSTM-Attention
  • Transformer

Dataset:

Ten news categories were selected from SogouNews (http://www.sogou.com/labs/resource/cs.php), with 5,000 samples per category, giving a dataset of 50,000 samples in total:

it, women, business, sports, yule, learning, travel, auto, health, house
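
The selection described above (ten categories, a fixed number of samples each) can be sketched as follows. This is only an illustration, not the procedure used to build this repository's dataset: the function name `balanced_subset`, the `seed` parameter, and the in-memory list of `(label, content)` pairs are all assumptions made for the example.

```python
import random
from collections import defaultdict

# The ten category labels listed in this README.
CATEGORIES = ["it", "women", "business", "sports", "yule",
              "learning", "travel", "auto", "health", "house"]


def balanced_subset(samples, categories=CATEGORIES, per_class=5000, seed=0):
    """Pick `per_class` random samples for each category (hypothetical helper).

    `samples` is an iterable of (label, content) pairs. Samples whose label
    is not in `categories` are ignored.
    """
    by_label = defaultdict(list)
    for label, content in samples:
        if label in categories:
            by_label[label].append((label, content))

    rng = random.Random(seed)  # fixed seed so the selection is repeatable
    subset = []
    for label in categories:
        pool = by_label[label]
        if len(pool) < per_class:
            raise ValueError(
                "category %r has only %d samples, need %d"
                % (label, len(pool), per_class)
            )
        subset.extend(rng.sample(pool, per_class))

    rng.shuffle(subset)  # avoid long runs of a single label
    return subset


# Toy usage: 5 documents per category plus one out-of-scope label.
toy = [(c, "doc %d" % i) for c in CATEGORIES for i in range(5)]
toy.append(("other", "ignored"))
small = balanced_subset(toy, per_class=3)
```

With `per_class=5000` and ten categories, the same call would return 50,000 samples, matching the dataset size stated above.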

Data format:

label \t content
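
A minimal sketch of reading files in this format, assuming one sample per line, UTF-8 encoding, and a single tab between the label and the text. The helper names `parse_lines` and `load_dataset` are hypothetical and are not taken from this repository's code.

```python
from collections import Counter


def parse_lines(lines):
    """Turn 'label<TAB>content' lines into a list of (label, content) pairs."""
    samples = []
    for line_no, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        # Split on the first tab only, so tabs inside the content survive.
        label, sep, content = line.partition("\t")
        if not sep:
            raise ValueError("line %d has no tab separator" % line_no)
        samples.append((label.strip(), content.strip()))
    return samples


def load_dataset(path, encoding="utf-8"):
    """Read a dataset file from disk (the encoding is an assumption)."""
    with open(path, encoding=encoding) as handle:
        return parse_lines(handle)


# Toy usage with in-memory lines instead of a real file.
demo = ["sports\t国足今晚迎战对手\n", "it\t新款手机发布\n", "\n"]
samples = parse_lines(demo)
label_counts = Counter(label for label, _ in samples)
```

Counting labels after loading, as in `label_counts`, is a quick way to confirm that each category has the expected number of samples.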

Requirements

Python 3.6
TensorFlow 1.15

Model comparison:

[Figure: model comparison results]

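  • The results above come from basic testing and validation of the models only, and the parameters are not necessarily optimal. If you need to use these models in a real project, tune them yourself.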