caver

Multi-label Text Classification Toolkit

These details have not been verified by PyPI

Project links

Homepage

Project description

# Caver: a toolkit for multilabel text classification.

[中文版](./README_zh.md)

---

Rising a torch in the cave to see the words on the wall. This is the **Caver**.

Tag short text in 3 lines

```
from caver import CaverModel
model = CaverModel("./checkpoint_path", device="cpu")

sentence = ["看英语学美剧靠谱吗", "科比携手姚明出任 2019 篮球世界杯全球大使"]

model.predict(sentence[0], top_k=3)
>>> ['英语学习', '英语', '美剧']

model.predict(sentence[1], top_k=10)
>>> ['篮球', 'NBA', '体育', 'NBA 球员', '运动']
```

[Documents](https://guokr.github.io/Caver)

## Requirements

* PyTorch
* tqdm
* torchtext
* numpy
* Python3

## Get it

```
$ pip install caver --user
```

## Did you guys have some pre-trained models

Yes, we have released two pre-trained models on Zhihu NLPCC2018 open dataset.

```
$ wget -O - https://github.com/guokr/Caver/releases/download/0.1/checkpoints_char_cnn.tar.gz | tar zxvf -
$ wget -O - https://github.com/guokr/Caver/releases/download/0.1/checkpoints_char_lstm.tar.gz | tar zxvf -
```

## How to train on your own dataset

```
$ python3 train.py --input_data_dir {path to your origin dataset}
--output_data_dir {path to store the preprocessed dataset}
--train_filename train.tsv
--valid_filename valid.tsv
--checkpoint_dir {path to save the checkpoints}
--model {fastText/CNN/LSTM}
--batch_size {16, you can modify this for you own}
--epoch {10}

```

## How to setup the models for inference
Basically just setup the model and target labels, you can check [examples](./examples).

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2

Jan 10, 2019

0.1

Dec 25, 2018

This version

0.0.7

Dec 25, 2018

0.0.6

Dec 23, 2018

0.0.4

Dec 16, 2018

0.0.1

Jul 25, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

caver-0.0.7.tar.gz (12.9 kB view hashes)

Uploaded Dec 25, 2018 Source

Hashes for caver-0.0.7.tar.gz

Hashes for caver-0.0.7.tar.gz
Algorithm	Hash digest
SHA256	`c7ed1f3426f935e71f0ef842d65599df4767b9793b53d4eb749863b350e9a80e`
MD5	`ef141e220124f76f2bbfc0eb58706f4a`
BLAKE2b-256	`69bd6f51df775866bdcc8b116d25f6b926b22315f193014868734f710389082d`