Chinese-Text-Classification-Pytorch

中文文本分类，TextCNN，TextRNN，FastText，TextRCNN，BiLSTM_Attention, DPCNN, Transformer, 基于pytorch，开箱即用。

先看原项目：https://github.com/649453932/Bert-Chinese-Text-Classification-Pytorch.git

本项目增加了预测类 my_classifier.py

介绍

数据以字为单位输入模型，预训练词向量使用搜狗新闻 Word+Character 300d，点这里下载

在 utils.py 文件中可以提取预训练词向量

环境

python 3.12
cuda 12.1

pip install -r requirements.txt 安装依赖，若安装的 Pytorch 不支持 CUDA，先卸载 pip uninstall torch，后安装 pip install torch==2.3.1+cu121 -f https://download.pytorch.org/whl/cu121/torch_stable.html

更换自己的数据集

如果用字，按照我数据集的格式来格式化你的数据。
如果用词，提前分好词，词之间用空格隔开，python run.py --model TextCNN --word True
使用预训练词向量：utils.py的main函数可以提取词表对应的预训练词向量。

使用说明

# 训练并测试：
# TextCNN 89个品目平均82%准确率
python run.py --model TextCNN --embedding random

# TextRNN 89个品目平均83%准确率
python run.py --model TextRNN

# TextRNN_Att 89个品目平均84%准确率
python run.py --model TextRNN_Att --embedding random

# TextRCNN 89个品目平均82%准确率
python run.py --model TextRCNN --embedding random

# FastText 89个品目86准确率
python run.py --model FastText --embedding random

# DPCNN
python run.py --model DPCNN --embedding random

# Transformer
python run.py --model Transformer --embedding random

参数

模型都在models目录下，超参定义和模型定义在同一文件中。

模型使用

python my_classifier.py

更轻量的server项目

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
goods/data		goods/data
models		models
LICENSE		LICENSE
README.md		README.md
my_classifier.py		my_classifier.py
requirements.txt		requirements.txt
run.py		run.py
train_eval.py		train_eval.py
utils.py		utils.py
utils_fasttext.py		utils_fasttext.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chinese-Text-Classification-Pytorch

介绍

环境

更换自己的数据集

使用说明

参数

模型使用

About

Releases

Packages

Languages

License

AriesYB/Chinese-Text-Classification-Pytorch

Folders and files

Latest commit

History

Repository files navigation

Chinese-Text-Classification-Pytorch

介绍

环境

更换自己的数据集

使用说明

参数

模型使用

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages