Python Asynchronous Task for AsyncIO
Project description
PATIO
PATIO is an acronym for Python Asynchronous Tasks for AsyncIO.
Motivation
I wanted to create an easily extensible library, for distributed task execution,
like celery
, only targeting asyncio as the main
design approach.
By design, the library should be suitable for small projects and the really large distributed projects. The general idea is that the user simply splits the projects code base into functions of two roles – background tasks and triggers of these background tasks. Also, this should help your project to scale horizontally. It allows to make workers or callers available across the network using embedded TCP, or using plugins to communicate through the existing messaging infrastructure.
Quickstart
The minimal example, which executes tasks in a thread pool:
import asyncio
from functools import reduce
from patio import Registry
from patio.broker import MemoryBroker
from patio.executor import ThreadPoolExecutor
rpc = Registry()
@rpc("mul")
def multiply(*args: int) -> int:
return reduce(lambda x, y: x * y, args)
async def main():
async with ThreadPoolExecutor(rpc, max_workers=4) as executor:
async with MemoryBroker(executor) as broker:
print(
await asyncio.gather(
*[broker.call("mul", 1, 2, 3) for _ in range(100)]
)
)
if __name__ == '__main__':
asyncio.run(main())
The ThreadPoolExecutor
in this example is the entity that will execute the
tasks. If the tasks in your project are asynchronous, you can select
AsyncExecutor
, and then the code will look like this:
import asyncio
from functools import reduce
from patio import Registry
from patio.broker import MemoryBroker
from patio.executor import AsyncExecutor
rpc = Registry()
@rpc("mul")
async def multiply(*args: int) -> int:
# do something asynchronously
await asyncio.sleep(0)
return reduce(lambda x, y: x * y, args)
async def main():
async with AsyncExecutor(rpc, max_workers=4) as executor:
async with MemoryBroker(executor) as broker:
print(
await asyncio.gather(
*[broker.call("mul", 1, 2, 3) for _ in range(100)]
)
)
if __name__ == '__main__':
asyncio.run(main())
These examples may seem complicated, but don't worry, the next section details the general concepts and hopefully a lot will become clear to you.
The main concepts
The main idea in developing this library was to create a maximally modular and extensible system that can be expanded with third-party integrations or directly in the user's code.
The basic elements from which everything is built are:
Registry
- Key-Value like store for the functionsExecutor
- The thing which execute functions from the registryBroker
- The actor who distributes tasks in your distributed (or local) system.
Registry
This is a container of functions for their subsequent execution. You can register a function by specific name or without it, in which case the function is assigned a unique name that depends on the source code of the function.
This registry does not necessarily have to match on the calling and called sides, but for functions that you register without a name it is must be, and then you should not need to pass the function name but the function itself when you will call it.
An instance of the registry must be transferred to the broker, the first broker in the process of setting up will block the registry to write, that is, registering new functions will be impossible.
An optional project
parameter, this is essentially like a namespace
that will help avoid clash functions in different projects with the same
name. It is recommended to specify it and the broker should also use this
parameter, so it should be the same value within the same project.
You can either manually register elements or use a registry instance as a decorator:
from patio import Registry
rpc = Registry(project="example")
# Will be registered with auto generated name
@rpc
def mul(a, b):
return a * b
@rpc('div')
def div(a, b):
return a / b
def pow(a, b):
return a ** b
def sub(a, b):
return a - b
# Register with auto generated name
rpc.register(pow)
rpc.register(sub, "sub")
Alternatively using register
method:
from patio import Registry
rpc = Registry(project="example")
def pow(a, b):
return a ** b
def sub(a, b):
return a - b
# Register with auto generated name
rpc.register(pow)
rpc.register(sub, "sub")
Finally, you can register functions explicitly, as if it were just a dictionary:
from patio import Registry
rpc = Registry(project="example")
def mul(a, b):
return a * b
rpc['mul'] = mul
Executor
An Executor is an entity that executes local functions from registry. The following executors are implemented in the package:
AsyncExecutor
- Implements pool of asynchronous tasksThreadPoolExecutor
- Implements pool of threadsProcessPoolExecutor
- Implements pool of processesNullExecutor
- Implements nothing and exists just for forbid execute anything explicitly.
Its role is to reliably execute jobs without taking too much so as not to cause a denial of service, or excessive memory consumption.
The executor instance is passing to the broker, it's usually applies it to the whole registry. Therefore, you should understand what functions the registry must contain to choose kind of an executor.
Broker
The basic approach for distributing tasks is to shift the responsibility for the distribution to the user implementation. In this way, task distribution can be implemented through third-party brokers, databases, or something else.
This package is implemented by the following brokers:
MemoryBroker
- To distribute tasks within a single process. A very simple implementation, in case you don't know yet how your application will develop and just want to leave the decision of which broker to use for later, while laying the foundation for switching to another broker.TCPBroker
- Simple implementation of the broker just using TCP, both Server and Client mode is supported for both the task executor and the task provider.
MemoryBroker
It's a good start if you don't need to assign tasks in a distributed fashion right now.
In fact, it's a simple way to run tasks in the executor from the other places in your project.
TCPBroker
It allows you to make your tasks distributed without resorting to external message brokers.
The basic idea of TCP broker implementation is that in terms of performing tasks, there is no difference between them, it is just a way to establish a connection, both the server and the client can be the one who performs tasks and the one who sets them, and it is also possible in a mixed mode.
In other words, deciding who will be the server and who will be the client in your system is just a way to connect and find each other in your distributed system.
Here are the ways of organizing communication between the server and the clients.
Server centric scheme example
This diagram describes a simple example, if there is one server and one client exchanging messages via TCP.
One client multiple servers example
This is an example of how a client establishes connections to a set server.
Full mesh example
Full mesh scheme, all clients are connected to all servers.
Authorization
Authorization takes place at the start of the connection,
for this the parameter key=
(b''
is by default) must contain the same keys
for client and server.
It is important to understand that this is not 100% protection against attacks like MITM etc.
This approach should only be used if the client and server are on a trusted
network. In order to secure traffic as it traverses the Internet, the
ssl_context=
parameter should be prepended to both the server and the client
(see example bellow).
Examples
The examples below will hopefully help you figure this out.
Server executing tasks
from functools import reduce
import asyncio
from patio import Registry
from patio.broker.tcp import TCPServerBroker
from patio.executor import ThreadPoolExecutor
rpc = Registry(project="test", auto_naming=False)
def mul(*args):
return reduce(lambda x, y: x * y, args)
async def main():
rpc.register(mul, "mul")
async with ThreadPoolExecutor(rpc) as executor:
async with TCPServerBroker(executor) as broker:
# Start IPv4 server
await broker.listen(address='127.0.0.1')
# Start IPv6 server
await broker.listen(address='::1', port=12345)
await broker.join()
if __name__ == "__main__":
asyncio.run(main())
Client calling tasks remotely
import asyncio
from patio import Registry
from patio.broker.tcp import TCPClientBroker
from patio.executor import NullExecutor
rpc = Registry(project="test", auto_naming=False)
async def main():
async with NullExecutor(rpc) as executor:
async with TCPClientBroker(executor) as broker:
# Connect to the IPv4 address
await broker.connect(address='127.0.0.1')
# Connect to the IPv6 address (optional)
await broker.connect(address='::1', port=12345)
print(
await asyncio.gather(*[
broker.call('mul', i, i) for i in range(10)
]),
)
if __name__ == "__main__":
asyncio.run(main())
Examples with SSL
The task comes down to passing the ssl context to the server and to the client.
Below you will see an example of how to make a couple of self-signed certificates, and an authorization CA. Original post here.
This is just an example, and if you want to use your own certificates, just create an ssl context as required by your security policy.
Certificate authority creation
Attention: this is the key used to sign the certificate requests, anyone holding this can sign certificates on your behalf. So keep it in a safe place!
openssl req -x509 \
-sha256 -days 3650 \
-nodes \
-newkey rsa:2048 \
-subj "/CN=Patio Example CA/C=CC/L=West Island" \
-keyout CA.key -out CA.pem
Server certificate creation
First create server private key:
openssl genrsa -out server.key 2048
Then create the certificate request signing this key:
openssl req \
-new -sha256 \
-key server.key \
-subj "/CN=server.example.net/C=CC/L=West Island" \
-out server.csr
Sign this request by CA:
openssl x509 -req \
-days 365 -sha256 \
-in server.csr \
-CA CA.pem \
-CAkey CA.key \
-CAcreateserial \
-out server.pem
This should be enough to encrypt the traffic.
Server with SSL executing tasks
from functools import reduce
import asyncio
import ssl
from patio import Registry
from patio.broker.tcp import TCPServerBroker
from patio.executor import ThreadPoolExecutor
rpc = Registry(project="test", auto_naming=False)
def mul(*args):
return reduce(lambda x, y: x * y, args)
async def main():
rpc.register(mul, "mul")
ssl_context = ssl.SSLContext()
ssl_context.load_verify_locations("path/to/CA.pem")
ssl_context.load_cert_chain("path/to/server.pem", "path/to/server.key")
async with ThreadPoolExecutor(rpc) as executor:
async with TCPServerBroker(executor, ssl_context=ssl_context) as broker:
# Start IPv4 server
await broker.listen(address='127.0.0.1')
# Start IPv6 server
await broker.listen(address='::1', port=12345)
await broker.join()
if __name__ == "__main__":
asyncio.run(main())
Client calling tasks remotely
import asyncio
import ssl
from patio import Registry
from patio.broker.tcp import TCPClientBroker
from patio.executor import NullExecutor
rpc = Registry(project="test", auto_naming=False)
async def main():
ssl_context = ssl.create_default_context(cafile="path/to/CA.pem")
async with NullExecutor(rpc) as executor:
async with TCPClientBroker(executor, ssl_context=ssl_context) as broker:
# Connect to the IPv4 address
await broker.connect(address='127.0.0.1')
# Connect to the IPv6 address (optional)
await broker.connect(address='::1', port=12345)
print(
await asyncio.gather(*[
broker.call('mul', i, i) for i in range(10)
]),
)
if __name__ == "__main__":
asyncio.run(main())
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file patio-0.1.1.tar.gz
.
File metadata
- Download URL: patio-0.1.1.tar.gz
- Upload date:
- Size: 18.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.0 CPython/3.10.2 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 67d67d14da0a366ad999be60f614a51a9935f8c048d8e2cdda35319ea0461691 |
|
MD5 | 38e246427e1798066c4ccfa748fff916 |
|
BLAKE2b-256 | 456ea258f3645de18dc7c4f6ec71525e719849b31748df945bcc0a5331e2fbf1 |
File details
Details for the file patio-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: patio-0.1.1-py3-none-any.whl
- Upload date:
- Size: 20.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.0 CPython/3.10.2 Darwin/22.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c92f67a907e20d4d34664bc41616f1621f28aa47687534962f45907a59789992 |
|
MD5 | 4794e0aa20f0f18c2a5e0e8a6724d9e5 |
|
BLAKE2b-256 | d238bcddd279dfdbecee743e099a0093b696b6d54bf88f164621f58f3fda5132 |