NotDiamond Python library for easy OpenAI integration.

These details have not been verified by PyPI

Project links

Project description

Getting started with Not Diamond

Not Diamond helps you leverage multiple LLMs dynamically, sending queries to the best-suited model in real-time.

Key features

Not a proxy: When we receive a query, our backend service returns a recommendation of the model you should call and the actual call goes out client-side. So we never see your response outputs.
Privacy preserving: To protect your privacy, we use fuzzy hashing to conceal your query strings from us, and our model recommendation engine works entirely using the fuzzy hashed queries. This means we also never see the raw query strings you send to an LLM.
Maximize performance: By dynamically routing to the best-suited LLM for each query, we improve your overall LLM output quality and enhance product quality. Our recommendations are based on millions of data points from rigorous evaluation benchmarks and real-world data.
Reduce cost and latency: Most of the time, a small and specialized model can optimally handle a given query. We can help you determine this in real-time, cutting your LLM inference costs while reducing latency.

👍 Free to use!

Not Diamond is free up to 100K monthly requests. Beyond this pay just $10 per 10K requests.

Installation

Requires Python 3.9+

pip install notdiamond

API keys

Sign up and get a Not Diamond API key.

Create a .env file with your Not Diamond API key, and the API keys of the models you want to route between.

OPENAI_API_KEY="YOUR_OPENAI_API_KEY"
GOOGLE_API_KEY="YOUR_GOOGLE_API_KEY"
ANTHROPIC_API_KEY="YOUR_ANTHROPIC_API_KEY"
NOTDIAMOND_API_KEY="YOUR_NOTDIAMOND_API_KEY"

Alternatively, you can also set API keys programmatically as described further below.

📘 API keys

The notdiamond library uses your API keys client-side to call the LLM we recommend. We never pass your keys to our servers.

🚧 Model access

Since notdiamond calls the LLMs client-side using your keys, we will only call models you have access to. You can also use our router to determine the best model to call regardless of whether you have access or not (see example). Our router supports most of the popular open and proprietary models (see full list).

Drop me a line if you have a specific model requirement and we're happy to work with you to support it.

Example

If you already have existing projects in either OpenAI SDK or LangChain, check out our OpenAI and Langchain integration guides. Otherwise, continue reading.

Create a main.py file in the same folder as the .env file you created earlier, or try it in Colab.

from notdiamond.llms.llm import NDLLM
from notdiamond.prompts.prompt import NDPrompt, NDContext, NDQuery, NDPromptTemplate


# Define your prompt and query
prompt = NDPrompt("You are a world class software developer.") # The system prompt, defines the LLM's role
query = NDQuery("Write a merge sort in Python.") # The specific query written by an end-user

# Define the prompt template to combine prompt and query into a single string
prompt_template = NDPromptTemplate("About you: {prompt}\n{query}",
                                   partial_variables={"prompt": prompt, "query": query})

# Define the available LLMs you'd like to route between
llm_providers = ['openai/gpt-3.5-turbo', 'openai/gpt-4','openai/gpt-4-1106-preview', 'openai/gpt-4-turbo-preview',
                 'anthropic/claude-2.1', 'anthropic/claude-3-sonnet-20240229', 'anthropic/claude-3-opus-20240229',
                 'google/gemini-pro']

# Create the NDLLM object -> like a 'meta-LLM' combining all of the specified models
nd_llm = NDLLM(llm_providers=llm_providers)

# After fuzzy hashing the inputs, the best LLM is determined by the ND API and the LLM is called client-side
result, session_id, provider = nd_llm.invoke(prompt_template=prompt_template)


print("ND session ID: ", session_id)  # A unique ID of the invoke. Important for future references back to ND API
print("LLM called: ", provider.model)  # The LLM routed to
print("LLM output: ", result.content)  # The LLM response

👍 Run it!

python main.py

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.4

Apr 2, 2024

0.1.3

Apr 2, 2024

0.1.2

Mar 30, 2024

This version

0.1.1

Mar 29, 2024

0.1.0

Mar 29, 2024

0.1.0a0 pre-release

Mar 29, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

notdiamond_openai-0.1.1.tar.gz (144.9 kB view details)

Uploaded Mar 29, 2024 Source

Built Distribution

notdiamond_openai-0.1.1-py3-none-any.whl (260.2 kB view details)

Uploaded Mar 29, 2024 Python 3

File details

Details for the file notdiamond_openai-0.1.1.tar.gz.

File metadata

Download URL: notdiamond_openai-0.1.1.tar.gz
Upload date: Mar 29, 2024
Size: 144.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.12.2

File hashes

Hashes for notdiamond_openai-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`f0455d658d5a4d23ca111c32ab80fdca052959860c2c739781223809464c2cae`
MD5	`5cf327371dc1588dcdbb256c3ee289bb`
BLAKE2b-256	`c92f23f7cb3599030ee32bc49d8d5aef479754165c599247fb98271207802190`

See more details on using hashes here.

File details

Details for the file notdiamond_openai-0.1.1-py3-none-any.whl.

File metadata

Download URL: notdiamond_openai-0.1.1-py3-none-any.whl
Upload date: Mar 29, 2024
Size: 260.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.12.2

File hashes

Hashes for notdiamond_openai-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`03c534d352b60f844ab08b2d740210b4ec95e301b01e0e4286ebd8437dcb88e9`
MD5	`c8d9168eb822599f63e311ca9ad7fd28`
BLAKE2b-256	`acdf2dd061fee5011172ed80541c071264ca70de4f4dde80cd7e71d5ab9aee86`