13 projects
synthetic-dataset-generator
Build datasets with natural language
observers
🤗 Observers: A lightweight library for (generative) AI observability, enabling insights into model interactions and everything that comes with it.
dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
fast-sentence-transformers
This repository contains code to run faster sentence-transformers. Simply, faster, sentence-transformers.
classy-classification
Have you every struggled with needing a Spacy TextCategorizer but didn't have the time to train one from scratch? Classy Classification is the way to go!
spacy-setfit
crosslingual-coreference
A multi-lingual approach to AllenNLP CoReference Resolution, along with a wrapper for spaCy.
concise-concepts
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity confidence scores!
adept-augmentations
A Python library aimed at adeptly, augmenting NLP training data.
peal
A package dedicated to using PEFT for active-learning, hence PEAL.
argilla-plugins
🔌 Open-source plugins for with practical features for Argilla using listeners.
mutate-nlp
Text data synthesize and pseudo labelling using LLMs
piontologyextractor
A package that can deal with extracting useful info from our Graph Ontology.