qary

"An open framework and dataset for building a distributed-agent chatbot based on _Natural Language Processing in Action_."

These details have been verified by PyPI

Maintainers

cetinca hobs JohnMay thompsgj

These details have not been verified by PyPI

Project links

Homepage

Project description

Use NLP in Action to build a virtual assistant that actually assists! Most bots manipulate you to make money for their corporate masters. Your bot can help protect you and amplify your abilities and prosocial instincts.

This hybrid chatbot combines 4 techniques explained in NLP in Action:

1. search: [chatterbot](https://github.com/gunthercox/ChatterBot), [will](https://github.com/skoczen/will)
2. pattern matching and response templates: Alexa, [AIML](https://github.com/keiffster/program-y)
3. generative deep learning: [robot-bernie](https://github.com/nlpia/robot-bernie), [movie-bot](https://github.com/totalgood/nlpia/blob/master/src/nlpia/book/examples/ch10_movie_dialog_chatbot.py)
4. grounding: [snips](https://github.com/snipsco/snips-nlu)

The presentations for San Diego Python User Group are in docs/

Install

You’ll want to install and use the conda package manager within Anaconda3, especially if your development environment is not a open standard operating system like Linux.

git clone git@github.com:nlpia/qary
cd qary
conda env create -n nlpia -f environment.yml  # or environment-windoze.yml
conda activate nlpia
pip install --editable .

Usage

$ bot --help
usage: bot [-h] [--version] [--name STR] [-p] [-b STR] [-v] [-vv]
           [words [words ...]]

Command line bot application, e.g. bot how do you work?

positional arguments:
  words                Words to pass to bot as an utterance or conversational
                       statement requiring a bot reply or action.

optional arguments:
  -h, --help           show this help message and exit
  --version            show program's version number and exit
  --name STR           IRC nick or CLI command name for the bot
  -p, --persist        Don't exit. Retain language model in memory and
                       maintain dialog until user says 'exit', 'quit' or 'bye'
                       (this is the default behavior if you do not provide a statement)
  -b STR, --bots STR   comma-separated list of bot personalities to load
                       default: pattern,parul,search_fuzzy,time,eliza
  -v, --verbose        set loglevel to INFO
  -vv, --very-verbose  set loglevel to DEBUG

Examples

You can run bot just like any other command line app, giving it your statement/query as an argument.

$ bot hello
# 2019-11-21 12:42:13,620 WARNING:nlpia.constants:107:            <module> Starting logger in nlpia.constants...
# 100%|█████████████████████████████████████████████████████████████████████████████████████████████| 64350/64350 [00:00<00:00, 540679.58it/s]
BOT: Hello

And if you want quicker turnaround on your bot you can just run it in peristent mode (without any positional arguments for your statement or words).

Travis’s probabilistic reply selector is working nicely to chose a reply from multiple sources (default settings):

pattern_bots.py: regex patterns and greeting templates
fuzzy_search_bots.py: movie dialog fuzzy matching
parul_bots.py: Wikipedia searches using conventional TFIDF like a search engine
eliza_bots.py: A python port of the ELIZA therapist bot
time_bots.py: A time and productivity tracker that parses your git logs and bash history

$ bot
(nlpia) hobs@Hobsons-MacBook-Air:~/code/chatbot/qary$ bot
# 2019-11-21 12:59:05,854 WARNING:nlpia.constants:107:            <module> Starting logger in nlpia.constants...
# 100%|█████████████████████████████████████████████████████████████████████████████████████████████| 64350/64350 [00:00<00:00, 495935.48it/s]
YOU: Hi
BOT: hey there. tired of breathing?
YOU: Hello
BOT: How do you do. Please state your problem.
YOU: What is an ELIZA chatbot?
BOT: joey never told you we went out, did he?
YOU: ^[[A
BOT: I am sorry! I dont understand you
YOU: What is an ELIZA chatbot?
BOT: Does that question interest you ?
YOU: What is a chatbot?
BOT: hello barbie is an internet-connected version of the doll that uses a chatbot provided by the company toytalk, which previously used the chatbot for a range of smartphone-based characters for children.
YOU: Hello
BOT: hello.
YOU: Hello
BOT: How do you do. Please state your problem.
YOU: bye
$

Work in Progress

Travis (@travis-harper): model management, context filtering, and the addition of more conversational agents
Nima (@hulkgeek): question answering bot based on his state of the art question classifier
Xavier (@spirovanni): employment counselor for workforce.org and the city of San Diego
Hobson (@hobson): infrastructure (CI, webapp) and framework features (nltk->spacy, USE vectors)
Erturgrul: Turkish wikipedia QA bot (parul bot)
You: What big chatbot idea would you like to make a reality?

Ideas

Please submit your feature ideas github issues. Here are a few ideas to get you started.

movie dialog in django database to hold the statement->response pairs
1. graph schema compatible with MxGraph (draw.io) and other js libraries for editing graphs/flow charts.
2. ubuntu dialog corpus in db
3. mindfulness faq corpus in db
4. famous quotes as responses to the statement “tell me something inspiring”
5. jokes for “tell me a joke”
6. data science faq
7. nlpia faq
8. psychology/self-help faq
html django template so there is a web interface to the app rather than just the command line command bot
use Django Rest Framework to create a basic API that returns json containing a reply to any request sent to the local host url, like http://localhost:8000/api?statement='Hello world' might return {‘reply’: ‘Hello human!’}
have the command line app use the REST API from #3 rather than the slow reloading of the csv file every time you talk to the bot
use database full text search to find appropriate statements in the database that we have a response for
use semantic search instead of text similarity (full text search or fuzzywyzzy text matches)
1. add embedding vectors (300D document vectors from spacy) to each statement and response in the db
2. create a semantic index of the document vectors using annoy so “approximate nearest neighbors” (semantic matches) can be found quickly
3. load the annoy index of the document vectors every time the server is started and use it to find the best reply in the database.
4. use universal sentence encodings instead of docvecs from spacy.
create a UX for dialog graph creation/design:
1. install mxgraph in the django app
2. create a basic page based on this mxgraph example so the user can build and save dialog to the db as a graph: tutorial, example app
3. convert the dialog graph into a set of records/rows in the qary db so it acts
tag different dialog graphs in the db so the user can turn them on/off for their bot
1. allow the user to prioritize some dialogs/models over others
2. allow the user to create their own weighting function to prioritize individual statements produced by the api
train a character-based generative model
1. decoder half of autoencoder to generate text based on docvecs from spacy
2. decoder part of autoencoder to generate text based on universal sentence encodings
3. train model to generate reply embeddings (doc vecs and/or use vecs) using statement embeddings (dialog engine encoder-decoder using docvecs or use vecs for the encoder half
add a therapy/mindfulness-coach feature to respond with mindfulness ideas to some queries/statements
add the “translate ‘this text’ to spanish” feature
1. train character-based LSTM models on english-spanish, english-french, english-german, english<->whatever
2. add module for this to the django app/api
AIML engine fallback

Inspiration

A lot of the patterns and ideas were gleaned from other awesome prosocial chatbots and modular open source frameworks.

Mental Health Coaches

WYSA from London is free
- https://www.techinasia.com/ai-chatbot-wysa-touchkin-penguin
- open source (touchkin)?
- ionic?
- passive sensing of sleep patterns (accelerometers?)
- guided meditation
- exercise suggestions
- free text dialog with buttons to suggest replies
- based on open source touchkin/mindlogger ?
- list of alternative apps
Replika from US is paywalled
- personality profile test
- pay to unlock “skills” training
Youper (thank you Maria and tangibleai.com)

Open Source Frameworks

will
- lang: python
- web: zeromq
- db: redis, couchbase, flat file, user-defined
- integrations: hipchat, rocketchat, shell, slack
ai-chatbot-framework
- lang: python
- web: flask
- orm: flask?
- db: mongodb
- nice general json syntax for specifying intent/goals for conversation manager (agent)
rasa
- lang: python
- web: sanic (async)
- orm: sqlalchemy
- db: sqlite
- rich, complex, mature framework
botpress
- javascript (typescript)
- meta-framework allowing your to write your own modules in javascript
Program-Y
- python
- web: flask (rest), sanic (async)
- db: aiml flat files (XML)
- integrations: facebook messenger, google search, kik, line, alexa, webchat, viber

Project details

These details have been verified by PyPI

Maintainers

cetinca hobs JohnMay thompsgj

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.7.27

Dec 6, 2022

0.7.26

Nov 21, 2022

0.7.25

Nov 19, 2022

0.7.24

Nov 19, 2022

0.7.23

Oct 7, 2022

0.7.21

Jun 29, 2022

0.7.19

Jun 13, 2022

0.7.18

Jun 2, 2022

0.7.17

Jun 1, 2022

0.7.15

Jun 1, 2022

0.7.14

May 16, 2022

0.7.13

May 14, 2022

0.7.12

May 14, 2022

0.7.11

May 13, 2022

0.7.9

May 8, 2022

0.7.8

May 5, 2022

0.7.5

Apr 26, 2022

0.7.4

Apr 26, 2022

0.7.3

Mar 26, 2022

0.7.2

Oct 21, 2021

0.7.1

Sep 8, 2021

0.6.29

Jun 12, 2021

0.6.28

Apr 11, 2021

0.6.27

Apr 8, 2021

0.6.26

Apr 8, 2021

0.6.25

Apr 8, 2021

0.6.24

Apr 8, 2021

0.6.23

Apr 7, 2021

0.6.22

Feb 9, 2021

0.6.21

Feb 3, 2021

0.6.20

Feb 3, 2021

0.6.19

Feb 3, 2021

0.6.18

Feb 3, 2021

0.6.17

Feb 3, 2021

0.6.16

Feb 3, 2021

0.6.15

Jan 31, 2021

0.6.14

Jan 27, 2021

0.6.13

Jan 26, 2021

0.6.12

Jan 25, 2021

0.6.11

Jan 25, 2021

0.6.10

Jan 25, 2021

0.6.9

Jan 23, 2021

0.6.8

Jan 22, 2021

0.6.7

Jan 22, 2021

0.6.5

Jan 22, 2021

0.6.4

Jan 22, 2021

0.6.3

Jan 22, 2021

0.6.2

Jan 22, 2021

0.6.1

Oct 9, 2020

0.6.0

Oct 9, 2020

0.5.21

Oct 9, 2020

0.5.20

Sep 30, 2020

0.5.19

Sep 30, 2020

0.5.18

Aug 13, 2020

0.5.17

Jul 22, 2020

0.5.16

Jul 17, 2020

0.5.15

Jun 30, 2020

0.5.14

Jun 30, 2020

0.5.13

Jun 26, 2020

0.5.12

Jun 26, 2020

0.5.11

Jun 22, 2020

0.5.10

Jun 22, 2020

0.5.9

Jun 13, 2020

0.5.8

Jun 13, 2020

0.5.7

Jun 13, 2020

0.5.6

Jun 13, 2020

0.5.5

Jun 12, 2020

0.5.4

Jun 11, 2020

0.5.0

Jun 1, 2020

0.4.17

May 28, 2020

0.4.16

May 28, 2020

0.4.15

May 28, 2020

0.4.11

May 28, 2020

0.4.10

May 24, 2020

0.4.9

May 13, 2020

0.4.8

May 13, 2020

0.4.7

May 9, 2020

0.4.6

May 9, 2020

0.4.5

May 9, 2020

0.4.4

Apr 25, 2020

0.4.3

Apr 23, 2020

0.4.2

Apr 23, 2020

0.4.1

Apr 23, 2020

0.4.0

Apr 23, 2020

0.3.18

Apr 23, 2020

0.3.16

Apr 22, 2020

0.3.15

Apr 22, 2020

0.3.14

Apr 21, 2020

0.3.13

Apr 21, 2020

0.3.12

Apr 21, 2020

0.3.11

Apr 21, 2020

0.3.10

Apr 16, 2020

0.3.9

Apr 16, 2020

0.3.8

Apr 15, 2020

0.3.7

Apr 2, 2020

0.3.6

Apr 1, 2020

0.3.5

Mar 30, 2020

0.3.4

Mar 26, 2020

This version

0.3.2

Mar 26, 2020

0.3.1

Mar 19, 2020

0.3.0

Mar 19, 2020

0.0.1

Apr 7, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

qary-0.3.2-py2.py3-none-any.whl (2.0 MB view details)

Uploaded Mar 26, 2020 Python 2 Python 3

File details

Details for the file qary-0.3.2-py2.py3-none-any.whl.

File metadata

Download URL: qary-0.3.2-py2.py3-none-any.whl
Upload date: Mar 26, 2020
Size: 2.0 MB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: Python-urllib/3.7

File hashes

Hashes for qary-0.3.2-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`7b98e72c0ffad0710f22f2cdc0642f59409aff81ae54265d6ba61d4eecbc503c`
MD5	`1a7cabd840a02b2626d684aa3705e6bd`
BLAKE2b-256	`e952dd0f43e16572034051f49ba5aa279c530bf30cce87e91291359663707c80`