LangServe 🦜️🏓

Overview

LangServe helps developers deploy LangChain runnables and chains as a REST API.

This library is integrated with FastAPI and uses pydantic for data validation.

In addition, it provides a client that can be used to call runnables deployed on a server. A JavaScript client is available in LangChainJS.

Features

Input and Output schemas automatically inferred from your LangChain object, and enforced on every API call, with rich error messages
API docs page with JSONSchema and Swagger
Efficient /invoke, /batch and /stream endpoints with support for many concurrent requests on a single server
/stream_log endpoint for streaming all (or some) intermediate steps from your chain/agent
Playground page at /playground with streaming output and intermediate steps
Built-in (optional) tracing to LangSmith; just add your API key (see Instructions)
All built with battle-tested open-source Python libraries like FastAPI, Pydantic, uvloop and asyncio.
Use the client SDK to call a LangServe server as if it were a Runnable running locally (or call the HTTP API directly)

Limitations

Client callbacks are not yet supported for events that originate on the server
Does not work with pydantic v2 yet

Security

Versions 0.0.13 through 0.0.15 contain a vulnerability: the playground endpoint allows access to arbitrary files on the server. This was resolved in 0.0.16.
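To check whether a pinned version falls in the affected range, comparing the numeric version parts is enough. The following is a minimal sketch that assumes plain X.Y.Z version strings; `parse_version` and `is_affected` are hypothetical helpers, not part of LangServe.

```python
def parse_version(version: str) -> tuple[int, ...]:
    """Turn '0.0.14' into (0, 0, 14). Assumes a purely numeric X.Y.Z string."""
    return tuple(int(part) for part in version.split("."))


def is_affected(version: str) -> bool:
    """Return True for versions 0.0.13 through 0.0.15 (inclusive)."""
    return (0, 0, 13) <= parse_version(version) <= (0, 0, 15)


print(is_affected("0.0.14"))  # True
print(is_affected("0.0.16"))  # False
```

The installed version can be read with `importlib.metadata.version("langserve")` from the standard library and passed to `is_affected`.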

LangChain CLI 🛠️

Use the LangChain CLI to bootstrap a LangServe project quickly.

To use the LangChain CLI, make sure that you have a recent version of langchain installed, along with typer (pip install langchain typer or pip install "langchain[cli]").

langchain ../path/to/directory

And follow the instructions...

Examples

For more examples, see the examples directory.

Server

Here's a server that deploys an OpenAI chat model, an Anthropic chat model, and a chain that uses the Anthropic model to tell a joke about a topic.

#!/usr/bin/env python
from fastapi import FastAPI
from langchain.prompts import ChatPromptTemplate
from langchain.chat_models import ChatAnthropic, ChatOpenAI
from langserve import add_routes


app = FastAPI(
    title="LangChain Server",
    version="1.0",
    description="A simple API server using LangChain's Runnable interfaces",
)

add_routes(
    app,
    ChatOpenAI(),
    path="/openai",
)

add_routes(
    app,
    ChatAnthropic(),
    path="/anthropic",
)

model = ChatAnthropic()
prompt = ChatPromptTemplate.from_template("tell me a joke about {topic}")
add_routes(
    app,
    prompt | model,
    path="/chain",
)

if __name__ == "__main__":
    import uvicorn

    uvicorn.run(app, host="localhost", port=8000)

Docs

If you've deployed the server above, you can view the generated OpenAPI docs using:

curl localhost:8000/docs

Make sure to add the /docs suffix.

The root URL below returns a 404 until you define a route for it with @app.get("/"):

localhost:8000

Client

Python SDK

from langchain.schema import SystemMessage, HumanMessage
from langchain.prompts import ChatPromptTemplate
from langchain.schema.runnable import RunnableMap
from langserve import RemoteRunnable

openai = RemoteRunnable("http://localhost:8000/openai/")
anthropic = RemoteRunnable("http://localhost:8000/anthropic/")
joke_chain = RemoteRunnable("http://localhost:8000/chain/")

joke_chain.invoke({"topic": "parrots"})

# or async (top-level await works in a notebook; in a script, wrap these calls in an async function)
await joke_chain.ainvoke({"topic": "parrots"})

prompt = [
    SystemMessage(content='Act like either a cat or a parrot.'),
    HumanMessage(content='Hello!')
]

# Supports astream
async for msg in anthropic.astream(prompt):
    print(msg, end="", flush=True)

prompt = ChatPromptTemplate.from_messages(
    [("system", "Tell me a long story about {topic}")]
)

# Can define custom chains
chain = prompt | RunnableMap({
    "openai": openai,
    "anthropic": anthropic,
})

chain.batch([{ "topic": "parrots" }, { "topic": "cats" }])

In TypeScript (requires LangChain.js version 0.0.166 or later):

import { RemoteRunnable } from "langchain/runnables/remote";

const chain = new RemoteRunnable({
  url: `http://localhost:8000/chain/`,
});
const result = await chain.invoke({
  topic: "cats",
});

Python using requests:

import requests
response = requests.post(
    "http://localhost:8000/chain/invoke/",
    json={'input': {'topic': 'cats'}}
)
response.json()

You can also use curl:

curl --location --request POST 'http://localhost:8000/chain/invoke/' \
    --header 'Content-Type: application/json' \
    --data-raw '{
        "input": {
            "topic": "cats"
        }
    }'

Endpoints

The following code:

...
add_routes(
  app,
  runnable,
  path="/my_runnable",
)

adds the following endpoints to the server:

POST /my_runnable/invoke - invoke the runnable on a single input
POST /my_runnable/batch - invoke the runnable on a batch of inputs
POST /my_runnable/stream - invoke on a single input and stream the output
POST /my_runnable/stream_log - invoke on a single input and stream the output, including output of intermediate steps as it's generated
GET /my_runnable/input_schema - JSON schema for input to the runnable
GET /my_runnable/output_schema - JSON schema for output of the runnable
GET /my_runnable/config_schema - JSON schema for config of the runnable
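The endpoint list above can be turned into request URLs and bodies with a few lines of standard-library Python. This is a rough sketch, not LangServe code: `endpoint_url`, `invoke_body` and `batch_body` are hypothetical helpers. The "input" key matches the requests and curl examples in the Client section; the "inputs" key for /batch is an assumption and should be checked against the generated API docs.

```python
import json

# Endpoint names and HTTP methods, taken from the list above.
ENDPOINTS = {
    "invoke": "POST",
    "batch": "POST",
    "stream": "POST",
    "stream_log": "POST",
    "input_schema": "GET",
    "output_schema": "GET",
    "config_schema": "GET",
}


def endpoint_url(base_url: str, path: str, name: str) -> str:
    """Join the server base URL, the runnable path, and an endpoint name."""
    if name not in ENDPOINTS:
        raise ValueError(f"unknown endpoint: {name}")
    return f"{base_url.rstrip('/')}/{path.strip('/')}/{name}"


def invoke_body(single_input) -> str:
    """JSON body for /invoke: the input is wrapped under the "input" key."""
    return json.dumps({"input": single_input})


def batch_body(inputs) -> str:
    """JSON body for /batch. ASSUMPTION: the list goes under "inputs"."""
    return json.dumps({"inputs": list(inputs)})


print(ENDPOINTS["invoke"], endpoint_url("http://localhost:8000", "/my_runnable", "invoke"))
print(invoke_body({"topic": "cats"}))
```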

Playground

You can find a playground page for your runnable at /my_runnable/playground. This exposes a simple UI to configure and invoke your runnable with streaming output and intermediate steps.

Installation

For both client and server:

pip install "langserve[all]"

Alternatively, use pip install "langserve[client]" for client code and pip install "langserve[server]" for server code.

Legacy Chains

LangServe works with both Runnables (constructed via LangChain Expression Language) and legacy chains (inheriting from Chain). However, some of the input schemas for legacy chains may be incomplete or incorrect, leading to errors. This can be fixed by updating the input_schema property of those chains in LangChain. If you encounter any errors, please open an issue in this repository, and we will work to address it.

Handling Authentication

If you need to add authentication to your server, please reference FastAPI's security documentation and middleware documentation.

Deployment

Deploy to GCP

You can deploy to GCP Cloud Run using the following command:

gcloud run deploy [your-service-name] --source . --port 8001 --allow-unauthenticated --region us-central1 --set-env-vars=OPENAI_API_KEY=your_key
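Cloud Run sends requests to the container port named in the deploy command and passes that port to the container through the PORT environment variable. The server therefore has to listen on all interfaces rather than on localhost as in the server example. A minimal sketch, assuming the app is started from Python; `bind_address` is a hypothetical helper:

```python
import os


def bind_address(env=None, default_port: int = 8000) -> tuple[str, int]:
    """Pick the host and port to listen on.

    Reads PORT from the environment (Cloud Run sets it to the container
    port) and falls back to default_port for local runs.
    """
    env = os.environ if env is None else env
    port = int(env.get("PORT", default_port))
    # 0.0.0.0 accepts connections from outside the container;
    # "localhost" would only accept connections from inside it.
    return "0.0.0.0", port


host, port = bind_address({"PORT": "8001"})
print(host, port)  # 0.0.0.0 8001
```

The result can be passed to the call from the server example: uvicorn.run(app, host=host, port=port).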

Advanced

Files

LLM applications often deal with files. File processing can be implemented with several different architectures; at a high level:

The file may be uploaded to the server via a dedicated endpoint and processed using a separate endpoint
The file may be uploaded by either value (bytes of file) or reference (e.g., s3 url to file content)
The processing endpoint may be blocking or non-blocking
If significant processing is required, the processing may be offloaded to a dedicated process pool

Determine which architecture is appropriate for your application.

Currently, to upload files by value to a runnable, use base64 encoding for the file (multipart/form-data is not supported yet).

Here's an example that shows how to use base64 encoding to send a file to a remote runnable.

Remember, you can always upload files by reference (e.g., s3 url) or upload them as multipart/form-data to a dedicated endpoint.
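Uploading by value amounts to base64-encoding the bytes on the client and decoding them on the server. The sketch below shows that round trip with the standard library only. The field names "file" and "num_chars" mirror the FileProcessingRequest model in the File Upload Widget section and are assumptions for any other runnable; `build_file_request` and `read_file_from_request` are hypothetical helpers.

```python
import base64
import json


def build_file_request(file_bytes: bytes, num_chars: int = 100) -> str:
    """Build the JSON body for an /invoke call that carries a file by value."""
    # base64 turns arbitrary bytes into ASCII text that is safe inside JSON.
    encoded = base64.b64encode(file_bytes).decode("utf-8")
    return json.dumps({"input": {"file": encoded, "num_chars": num_chars}})


def read_file_from_request(body: str) -> bytes:
    """Server-side counterpart: recover the original bytes from the body."""
    return base64.b64decode(json.loads(body)["input"]["file"])


body = build_file_request(b"%PDF-1.4 example bytes")
print(read_file_from_request(body))  # b'%PDF-1.4 example bytes'
```

Base64 output is roughly a third larger than the raw bytes, which is one reason to prefer upload by reference for large files.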

Custom Input and Output Types

Input and Output types are defined on all runnables.

You can access them via the input_schema and output_schema properties.

LangServe uses these types for validation and documentation.

If you want to override the default inferred types, you can use the with_types method.

Here's a toy example to illustrate the idea:

from typing import Any

from fastapi import FastAPI
from langchain.schema.runnable import RunnableLambda

from langserve import add_routes

app = FastAPI()


def func(x: Any) -> int:
    """Mistyped function that should accept an int but accepts anything."""
    return x + 1


runnable = RunnableLambda(func).with_types(
    input_schema=int,
)

add_routes(app, runnable)

Custom User Types

Inherit from CustomUserType if you want the data to de-serialize into a pydantic model rather than the equivalent dict representation.

At the moment, this type only works server side and is used to specify the desired decoding behavior. When a type inherits from CustomUserType, the server keeps the decoded value as a pydantic model instead of converting it into a dict.

from fastapi import FastAPI
from langchain.schema.runnable import RunnableLambda

from langserve import add_routes
from langserve.schema import CustomUserType

app = FastAPI()


class Foo(CustomUserType):
    bar: int


def func(foo: Foo) -> int:
    """Sample function that expects a Foo type which is a pydantic model"""
    assert isinstance(foo, Foo)
    return foo.bar

# Note that the input and output type are automatically inferred!
# You do not need to specify them.
# runnable = RunnableLambda(func).with_types( # <-- Not needed in this case
#     input_schema=Foo,
#     output_schema=int,
# )
add_routes(app, RunnableLambda(func), path="/foo")

Playground Widgets

The playground allows you to define custom widgets for your runnable from the backend.

A widget is specified at the field level and shipped as part of the JSON schema of the input type
A widget must contain a key called type with the value being one of a well known list of widgets
Other widget keys will be associated with values that describe paths in a JSON object

General schema:

type JsonPath = number | string | (number | string)[];
type NameSpacedPath = { title: string; path: JsonPath }; // Using title to mimic JSON schema, but can use namespace
type OneOfPath = { oneOf: JsonPath[] };

type Widget = {
    type: string; // Some well known type (e.g., base64file, chat etc.)
    [key: string]: JsonPath | NameSpacedPath | OneOfPath;
};
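Because widgets are defined from the Python backend, it can help to check a widget dict against the general schema before shipping it. The following is a rough sketch of such a check, reading the schema strictly (no extra keys inside NameSpacedPath or OneOfPath). `is_json_path` and `is_valid_widget` are hypothetical helpers, and the "chat" widget keys in the usage lines are made up for illustration.

```python
def is_json_path(value) -> bool:
    """JsonPath = number | string | (number | string)[]"""

    def is_key(item) -> bool:
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(item, (int, float, str)) and not isinstance(item, bool)

    if isinstance(value, list):
        return all(is_key(item) for item in value)
    return is_key(value)


def is_valid_widget(widget: dict) -> bool:
    """Check a widget dict against the general schema above."""
    if not isinstance(widget.get("type"), str):
        return False  # "type" is required and must be a string
    for key, value in widget.items():
        if key == "type" or is_json_path(value):
            continue
        if isinstance(value, dict):
            # NameSpacedPath: { title: string; path: JsonPath }
            if (set(value) == {"title", "path"}
                    and isinstance(value["title"], str)
                    and is_json_path(value["path"])):
                continue
            # OneOfPath: { oneOf: JsonPath[] }
            if (set(value) == {"oneOf"}
                    and isinstance(value["oneOf"], list)
                    and all(is_json_path(path) for path in value["oneOf"])):
                continue
        return False
    return True


print(is_valid_widget({"type": "base64file"}))                 # True
print(is_valid_widget({"type": "chat", "input": "messages"}))  # True
print(is_valid_widget({"input": "messages"}))                  # False
```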

File Upload Widget

Allows creation of a file upload input in the UI playground for files that are uploaded as base64 encoded strings. Here's the full example.

Snippet:

from pydantic import Field

from langserve import CustomUserType


# ATTENTION: Inherit from CustomUserType instead of BaseModel otherwise
#            the server will decode it into a dict instead of a pydantic model.
class FileProcessingRequest(CustomUserType):
    """Request including a base64 encoded file."""

    # The extra field is used to specify a widget for the playground UI.
    file: str = Field(..., extra={"widget": {"type": "base64file"}})
    num_chars: int = 100



langserve 0.0.19
