Visiongraph is a high level computer vision framework.

Project description

Visiongraph

Visiongraph is a high level computer vision framework that includes predefined modules to quickly create and run algorithms on images. It is based on opencv and includes other computer vision frameworks like Intel openVINO and Google MediaPipe.

Here an example on how to start a webcam capture and display the image:

import visiongraph as vg
vg.create_graph(vg.VideoCaptureInput()).then(vg.ImagePreview()).open()

The main goal is to implement a platform independent and high performance framework for day-to-day computer vision tasks.

Installation

Visiongraph supports python 3.9 and 3.10. Other versions might work as well but are not officially supported.

To install visiongraph with all dependencies call pip like this:

pip install "visiongraph[all]"

🚨 Please note that visiongraph is in an early alpha phase and the API will still undergo changes.

It is also possible to only install certain packages depending on your needs:

pip install "visiongraph[realsense, openvino, mediapipe, onnx, media, azure, numba, opencv-contrib]"

Development

To develop visiograph itself it is recommended to clone this repository and install the dependencies like this:

# in the visiongraph directory
pip install -e ".[all]"

Build

To build a new wheel package of visiongraph run the following command in the root directory.

python setup.py bdist_wheel

Examples

To demonstrate the possibilities of visiongraph there are already implemented examples ready for you to try out. Here is a list of the current examples:

SimpleVisionGraph - SSD object detection & tracking of live webcam input with 5 lines of code.
VisionGraphExample - A face detection and tracking example with custom events.
InputExample - A basic input example that determines the center if possible.
RealSenseDepthExample - Display the RealSense or Azure Kinect depth map.
FaceDetectionExample - A face detection pipeline example.
FindFaceExample - A face recognition example to find a target face.
CascadeFaceDetectionExample - A face detection pipeline that also predicts other feature points of the face.
HandDetectionExample - A hand detection pipeline example.
PoseEstimationExample - A pose estimation pipeline which annotates the generic pose keypoints.
ProjectedPoseExample - Project the pose estimation into 3d space with the RealSense camera.
ObjectDetectionExample - An object detection & tracking example.
InstanceSegmentationExample - Intance Segmentation based on COCO80 dataset.
InpaintExample - GAN based inpainting example.
MidasDepthExample - Realtime depth prediction with the midas-small network.
RGBDSmoother - Smooth RGB-D depth map videos with a one-euro filter per pixel.

There are even more examples where visiongraph is currently in use:

Spout/Syphon RGB-D Example - Share RGB-D images over spout or syphon.
WebRTC Input - WebRTC input example for visiongraph

Documentation

This documentation is intended to provide an overview of the framework. A full documentation will be available later.

Graph

The core component of visiongraph is the BaseGraph class. It contains and handles all the nodes of the graph. A BaseGraph can run on the same thread as called or a new thread or process. The nodes in the graph are just a list, the graph itself is created by nesting nodes into each other.

Graph Node

A GraphNode is a single step in the graph. It has a input and output type and processes the data within the process() method.

Graph Builder

The graph builder helps to create new graphs on a single line in python. It creates a VisionGraph object which is a child of the BaseGraph. The following code snippet is an example of the graph builder which creates a smooth pose estimation graph.

import visiongraph as vg

graph = vg.create_graph(name="Smooth Pose Estimation",
                            input_node=vg.VideoCaptureInput(0),
                            handle_signals=True) \
        .apply(ssd=vg.sequence(vg.OpenPoseEstimator.create(), vg.MotpyTracker(), vg.LandmarkSmoothFilter()),
               image=vg.passthrough()) \
        .then(vg.ResultAnnotator(image="image"), vg.ImagePreview()) \
        .open()

Input

Supported are image, video, webcam, RealSense and Azure Kinect input types.

Estimator

Usually an estimator is a graph node which takes an image as an input and estimates an information about the content. This could be a pose estimation or a face detection. It is also possible to have a transformation of the image, for example de-blurring it or estimate the depth map.

Object Detection Tracker

Object detection trackers allow a detected object to be assigned an id that remains the same across successive frames.

DSP (Digital Signal Processing)

To filter noisy estimations or inputs, the DSP package provides different filters which can be applied directly into a graph.

Recorder

To record incoming frames or annotated results, multiple frame recorders are provided.

Assets

Most estimators use big model and weight descriptions for their neural networks. To keep visiongraph small and easy to install, these assets are hosted externally on github. Visiongraph provides a system to directly download and cache these files.

Argparse

To support rapid prototyping many graph and estimator options are already provided to add to the argparse parser.

Logging

To enable logging for visiongraph imports please set the following environment variable:

# zsh / bash
export VISIONGRAPH_LOGLEVEL=INFO

# cmd
set VISIONGRAPH_LOGLEVEL=INFO

# powershell
$env:VISIONGRAPH_LOGLEVEL="INFO"

Roadmap

Next roadmap points:

Async input and network model (run when ready)

About

Included Libraries

Parts of these libraries are directly included and adapted to work with visiongraph.

motpy - simple multi object tracking library (MIT License)
motrackers - Multi-object trackers in Python (MIT License)
OneEuroFilter-Numpy - (MIT License)

For more information about the dependencies have a look at the requirements.txt.

Project details

Release history Release notifications | RSS feed

1.0.0b2 pre-release

Sep 25, 2024

1.0.0b1 pre-release

Sep 10, 2024

0.1.60.1

Aug 28, 2024

0.1.60

Aug 28, 2024

0.1.59.1

Aug 23, 2024

0.1.59

Aug 23, 2024

0.1.58.2

Aug 16, 2024

0.1.58.1

Jun 7, 2024

0.1.58

Jun 7, 2024

0.1.57.4

May 27, 2024

0.1.57.3

May 27, 2024

0.1.57.2

Apr 23, 2024

0.1.57.1

Apr 23, 2024

0.1.57

Apr 17, 2024

0.1.56

Apr 3, 2024

0.1.55.3

Feb 28, 2024

0.1.55.2

Feb 22, 2024

0.1.55.1

Feb 22, 2024

0.1.54.1

Feb 19, 2024

0.1.53.1

Jan 17, 2024

0.1.53

Jan 17, 2024

0.1.52

Nov 24, 2023

0.1.51.3

Nov 2, 2023

0.1.51.2

Nov 2, 2023

0.1.51.1

Oct 24, 2023

0.1.50.3

Oct 13, 2023

0.1.50.2

Aug 10, 2023

0.1.50.1

Aug 9, 2023

0.1.50

Aug 9, 2023

0.1.49

Aug 4, 2023

0.1.48.2

Jul 27, 2023

0.1.48.1

Jul 27, 2023

This version

0.1.48

Jul 3, 2023

0.1.47.5

Jun 29, 2023

0.1.47.4

Jun 28, 2023

0.1.47.3

Jun 26, 2023

0.1.47.2

Jun 26, 2023

0.1.47.1

Jun 26, 2023

0.1.46.1

Jun 21, 2023

0.1.45.1

May 22, 2023

0.1.45

May 19, 2023

0.1.44.5

May 3, 2023

0.1.44.4

May 2, 2023

0.1.44.3

Apr 27, 2023

0.1.44.2

Apr 19, 2023

0.1.44.1

Apr 14, 2023

0.1.44

Apr 14, 2023

0.1.43

Mar 30, 2023

0.1.42

Mar 27, 2023

0.1.41

Mar 27, 2023

0.1.40

Mar 24, 2023

0.1.35

Mar 22, 2023

0.1.34

Jan 30, 2023

0.1.33

Jan 16, 2023

0.1.32.2

Dec 7, 2022

0.1.32

Dec 7, 2022

0.1.31.2

Oct 31, 2022

0.1.30.4

Oct 14, 2022

0.1.30.3

Oct 10, 2022

0.1.30.2

Oct 8, 2022

0.1.30.1

Oct 7, 2022

0.1.29.8

Oct 5, 2022

0.1.29.7

Oct 5, 2022

0.1.29.6

Sep 27, 2022

0.1.29.5

Sep 27, 2022

0.1.29.4

Sep 27, 2022

0.1.29.3

Sep 26, 2022

0.1.29.2

Sep 20, 2022

0.1.29.1

Sep 12, 2022

0.1.28.2

Sep 12, 2022

0.1.28.1

Aug 13, 2022

0.1.27.2

Aug 9, 2022

0.1.27

Aug 5, 2022

0.1.26

Jun 28, 2022

0.1.25

Jun 20, 2022

0.1.24

May 30, 2022

0.1.23.7

May 30, 2022

0.1.23.6

May 25, 2022

0.1.23.5

May 23, 2022

0.1.23.4

May 22, 2022

0.1.23.3

May 21, 2022

0.1.23.2

May 20, 2022

0.1.23.1

May 20, 2022

0.1.23

May 19, 2022

0.1.22.3

May 8, 2022

0.1.22.2

Apr 27, 2022

0.1.22.1

Apr 27, 2022

0.1.22

Apr 27, 2022

0.1.21

Apr 22, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

visiongraph-0.1.48-py3-none-any.whl (240.6 kB view details)

Uploaded Jul 3, 2023 Python 3

File details

Details for the file visiongraph-0.1.48-py3-none-any.whl.

File metadata

Download URL: visiongraph-0.1.48-py3-none-any.whl
Upload date: Jul 3, 2023
Size: 240.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for visiongraph-0.1.48-py3-none-any.whl
Algorithm	Hash digest
SHA256	`72cb1448b384ce7af60e1d583e78bc7af6f93ed2587e040b0934327304987440`
MD5	`53887d1ea5ff7b7b93fcda765d7f377a`
BLAKE2b-256	`48abe3673ce37af513e18730483679a72e5c0f3c005504573145099b18d25df3`