Few-Shot Named Entity Recognition using Span Markers
Project description
SpanMarker for Named Entity Recognition
SpanMarker is a framework for training powerful Named Entity Recognition models using familiar encoders such as BERT, RoBERTa and DeBERTa. Tightly implemented on top of the 🤗 Transformers library, SpanMarker can take advantage of its valuable functionality.
Based on the PL-Marker paper, SpanMarker breaks the mold through its accessibility and ease of use. Crucially, SpanMarker works out of the box with many common encoders such as bert-base-cased
and roberta-large
, and automatically works with datasets using the IOB
, IOB2
, BIOES
, BILOU
or no label annotation scheme.
Documentation
Feel free to have a look at the documentation.
Installation
You may install the span_marker
Python module via pip
like so:
pip install span_marker
Quick Start
Please have a look at our Getting Started notebook for details on how SpanMarker is commonly used. It explains the following snippet in more detail.
Colab | Kaggle | Gradient | Studio Lab |
---|---|---|---|
from datasets import load_dataset
from span_marker import SpanMarkerModel, Trainer
from transformers import TrainingArguments
def main():
dataset = load_dataset("DFKI-SLT/few-nerd", "supervised")
labels = dataset["train"].features["ner_tags"].feature.names
model_name = "bert-base-cased"
model = SpanMarkerModel.from_pretrained(model_name, labels=labels)
args = TrainingArguments(
output_dir="my_span_marker_model",
learning_rate=5e-5,
gradient_accumulation_steps=2,
per_device_train_batch_size=4,
per_device_eval_batch_size=4,
num_train_epochs=1,
save_strategy="steps",
eval_steps=200,
logging_steps=50,
fp16=True,
warmup_ratio=0.1,
dataloader_num_workers=2,
)
trainer = Trainer(
model=model,
args=args,
train_dataset=dataset["train"].select(range(8000)),
eval_dataset=dataset["validation"].select(range(2000)),
)
trainer.train()
trainer.save_model("my_span_marker_model/checkpoint-final")
metrics = trainer.evaluate()
print(metrics)
if __name__ == "__main__":
main()
Because this work is based on PL-Marker, you may expect similar results to its Papers with Code Leaderboard results. Tests, documentation and further information on expected performance will come soon.
Pretrained Models
tomaarsen/span-marker-bert-base-fewnerd-fine-super
is a model that I have trained in just 4 hours on the finegrained, supervised Few-NERD dataset. It reached a 0.7020 Test F1, competitive in the all-time Few-NERD leaderboard. My training script resembles the one that you can see above.- See this Weights and Biases report for training details.
- Try the model out online using this 🤗 Space.
Changelog
See CHANGELOG.md for news on all SpanMarker versions.
License
See [LICENSE] for the current license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file span_marker-1.0.0.tar.gz
.
File metadata
- Download URL: span_marker-1.0.0.tar.gz
- Upload date:
- Size: 33.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bd96d9ec8b3093311c4462ff6e4f7cf3744d2d8e8096e02c3ff9fcbfac389618 |
|
MD5 | f176d3df677f276c4d5ed0c2d94a7b42 |
|
BLAKE2b-256 | 52bee544721ebf3c67f49ea5d2eeba550852e219138c47e0d74c2d6676bcd5e4 |
File details
Details for the file span_marker-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: span_marker-1.0.0-py3-none-any.whl
- Upload date:
- Size: 32.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 12645f92121c9e036d417e04c5e550859e1de335ab99254d8e3da5dfb2d56ba2 |
|
MD5 | 9ad9e1b7a72efd5c2c0066e428a79a61 |
|
BLAKE2b-256 | 9d779c6ebb368565f4b97c9b4cb8468c6d8637c4ac46e27bcc98fa12b9171e64 |