fluent-logger

A Python logging handler for Fluentd event collector

These details have been verified by PyPI

Maintainers

These details have not been verified by PyPI

Project links

Project description

Many web/mobile applications generate huge amount of event logs (c,f. login, logout, purchase, follow, etc). To analyze these event logs could be really valuable for improving the service. However, the challenge is collecting these logs easily and reliably.

Fluentd and Fluent Bit solves that problem by having: easy installation, small footprint, plugins, reliable buffering, log forwarding, etc.

fluent-logger-python is a Python library, to record the events from Python application.

Requirements

Python 3.7+
msgpack
IMPORTANT: Version 0.8.0 is the last version supporting Python 2.6, 3.2 and 3.3
IMPORTANT: Version 0.9.6 is the last version supporting Python 2.7 and 3.4
IMPORTANT: Version 0.10.0 is the last version supporting Python 3.5 and 3.6

Installation

This library is distributed as ‘fluent-logger’ python package. Please execute the following command to install it.

$ pip install fluent-logger

Configuration

Fluentd daemon must be launched with a tcp source configuration:

<source>
  type forward
  port 24224
</source>

To quickly test your setup, add a matcher that logs to the stdout:

<match app.**>
  type stdout
</match>

Usage

FluentSender Interface

sender.FluentSender is a structured event logger for Fluentd.

By default, the logger assumes fluentd daemon is launched locally. You can also specify remote logger by passing the options.

from fluent import sender

# for local fluent
logger = sender.FluentSender('app')

# for remote fluent
logger = sender.FluentSender('app', host='host', port=24224)

For sending event, call emit method with your event. Following example will send the event to fluentd, with tag ‘app.follow’ and the attributes ‘from’ and ‘to’.

# Use current time
logger.emit('follow', {'from': 'userA', 'to': 'userB'})

# Specify optional time
cur_time = int(time.time())
logger.emit_with_time('follow', cur_time, {'from': 'userA', 'to':'userB'})

To send events with nanosecond-precision timestamps (Fluent 0.14 and up), specify nanosecond_precision on FluentSender.

# Use nanosecond
logger = sender.FluentSender('app', nanosecond_precision=True)
logger.emit('follow', {'from': 'userA', 'to': 'userB'})
logger.emit_with_time('follow', time.time(), {'from': 'userA', 'to': 'userB'})

You can detect an error via return value of emit. If an error happens in emit, emit returns False and get an error object using last_error method.

if not logger.emit('follow', {'from': 'userA', 'to': 'userB'}):
    print(logger.last_error)
    logger.clear_last_error() # clear stored error after handled errors

If you want to shutdown the client, call close() method.

logger.close()

Event-Based Interface

This API is a wrapper for sender.FluentSender.

First, you need to call sender.setup() to create global sender.FluentSender logger instance. This call needs to be called only once, at the beginning of the application for example.

Initialization code of Event-Based API is below:

from fluent import sender

# for local fluent
sender.setup('app')

# for remote fluent
sender.setup('app', host='host', port=24224)

Then, please create the events like this. This will send the event to fluentd, with tag ‘app.follow’ and the attributes ‘from’ and ‘to’.

from fluent import event

# send event to fluentd, with 'app.follow' tag
event.Event('follow', {
  'from': 'userA',
  'to':   'userB'
})

event.Event has one limitation which can’t return success/failure result.

Other methods for Event-Based Interface.

sender.get_global_sender # get instance of global sender
sender.close # Call FluentSender#close

Handler for buffer overflow

You can inject your own custom proc to handle buffer overflow in the event of connection failure. This will mitigate the loss of data instead of simply throwing data away.

import msgpack
from io import BytesIO

def overflow_handler(pendings):
    unpacker = msgpack.Unpacker(BytesIO(pendings))
    for unpacked in unpacker:
        print(unpacked)

logger = sender.FluentSender('app', host='host', port=24224, buffer_overflow_handler=overflow_handler)

You should handle any exception in handler. fluent-logger ignores exceptions from buffer_overflow_handler.

This handler is also called when pending events exist during close().

Python logging.Handler interface

This client-library also has FluentHandler class for Python logging module.

import logging
from fluent import handler

custom_format = {
  'host': '%(hostname)s',
  'where': '%(module)s.%(funcName)s',
  'type': '%(levelname)s',
  'stack_trace': '%(exc_text)s'
}

logging.basicConfig(level=logging.INFO)
l = logging.getLogger('fluent.test')
h = handler.FluentHandler('app.follow', host='host', port=24224, buffer_overflow_handler=overflow_handler)
formatter = handler.FluentRecordFormatter(custom_format)
h.setFormatter(formatter)
l.addHandler(h)
l.info({
  'from': 'userA',
  'to': 'userB'
})
l.info('{"from": "userC", "to": "userD"}')
l.info("This log entry will be logged with the additional key: 'message'.")

You can also customize formatter via logging.config.dictConfig

import logging.config
import yaml

with open('logging.yaml') as fd:
    conf = yaml.load(fd)

logging.config.dictConfig(conf['logging'])

You can inject your own custom proc to handle buffer overflow in the event of connection failure. This will mitigate the loss of data instead of simply throwing data away.

import msgpack
from io import BytesIO

def overflow_handler(pendings):
    unpacker = msgpack.Unpacker(BytesIO(pendings))
    for unpacked in unpacker:
        print(unpacked)

A sample configuration logging.yaml would be:

logging:
    version: 1

    formatters:
      brief:
        format: '%(message)s'
      default:
        format: '%(asctime)s %(levelname)-8s %(name)-15s %(message)s'
        datefmt: '%Y-%m-%d %H:%M:%S'
      fluent_fmt:
        '()': fluent.handler.FluentRecordFormatter
        format:
          level: '%(levelname)s'
          hostname: '%(hostname)s'
          where: '%(module)s.%(funcName)s'

    handlers:
        console:
            class : logging.StreamHandler
            level: DEBUG
            formatter: default
            stream: ext://sys.stdout
        fluent:
            class: fluent.handler.FluentHandler
            host: localhost
            port: 24224
            tag: test.logging
            buffer_overflow_handler: overflow_handler
            formatter: fluent_fmt
            level: DEBUG
        none:
            class: logging.NullHandler

    loggers:
        amqp:
            handlers: [none]
            propagate: False
        conf:
            handlers: [none]
            propagate: False
        '': # root logger
            handlers: [console, fluent]
            level: DEBUG
            propagate: False

Asynchronous Communication

Besides the regular interfaces - the event-based one provided by sender.FluentSender and the python logging one provided by handler.FluentHandler - there are also corresponding asynchronous versions in asyncsender and asynchandler respectively. These versions use a separate thread to handle the communication with the remote fluentd server. In this way the client of the library won’t be blocked during the logging of the events, and won’t risk going into timeout if the fluentd server becomes unreachable. Also it won’t be slowed down by the network overhead.

The interfaces in asyncsender and asynchandler are exactly the same as those in sender and handler, so it’s just a matter of importing from a different module.

For instance, for the event-based interface:

from fluent import asyncsender as sender

# for local fluent
sender.setup('app')

# for remote fluent
sender.setup('app', host='host', port=24224)

# do your work
...

# IMPORTANT: before program termination, close the sender
sender.close()

or for the python logging interface:

import logging
from fluent import asynchandler as handler

custom_format = {
  'host': '%(hostname)s',
  'where': '%(module)s.%(funcName)s',
  'type': '%(levelname)s',
  'stack_trace': '%(exc_text)s'
}

logging.basicConfig(level=logging.INFO)
l = logging.getLogger('fluent.test')
h = handler.FluentHandler('app.follow', host='host', port=24224, buffer_overflow_handler=overflow_handler)
formatter = handler.FluentRecordFormatter(custom_format)
h.setFormatter(formatter)
l.addHandler(h)
l.info({
  'from': 'userA',
  'to': 'userB'
})
l.info('{"from": "userC", "to": "userD"}')
l.info("This log entry will be logged with the additional key: 'message'.")

...

# IMPORTANT: before program termination, close the handler
h.close()

NOTE: please note that it’s important to close the sender or the handler at program termination. This will make sure the communication thread terminates and it’s joined correctly. Otherwise the program won’t exit, waiting for the thread, unless forcibly killed.

Circular queue mode

In some applications it can be especially important to guarantee that the logging process won’t block under any circumstance, even when it’s logging faster than the sending thread could handle (backpressure). In this case it’s possible to enable the circular queue mode, by passing True in the queue_circular parameter of asyncsender.FluentSender or asynchandler.FluentHandler. By doing so the thread doing the logging won’t block even when the queue is full, the new event will be added to the queue by discarding the oldest one.

WARNING: setting queue_circular to True will cause loss of events if the queue fills up completely! Make sure that this doesn’t happen, or it’s acceptable for your application.

Testing

Testing can be done using pytest.

$ pytest tests

Release

$ # Download dist.zip for release from GitHub Action artifact.
$ unzip -d dist dist.zip
$ pipx twine upload dist/*

Contributors

Patches contributed by those people.

License

Apache License, Version 2.0

Project details

These details have been verified by PyPI

Maintainers

arcivanov kzk naoki repeatedly yyuu

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.11.1

Jun 6, 2024

This version

0.11.0

Feb 29, 2024

0.10.0

Mar 11, 2021

0.9.6

Mar 10, 2020

0.9.5

Feb 24, 2020

0.9.4

Oct 16, 2019

0.9.3

Apr 11, 2018

0.9.2

Feb 21, 2018

0.9.1

Jan 14, 2018

0.9.0

Dec 15, 2017

0.8.0

Dec 13, 2017

0.7.0

Dec 6, 2017

0.6.2

Dec 4, 2017

0.6.1

Dec 3, 2017

0.6.0

Nov 10, 2017

0.5.3

Aug 23, 2017

0.5.2

Aug 8, 2017

0.5.1

Aug 8, 2017

0.5.0

Apr 4, 2017

0.4.6

Jan 30, 2017

0.4.5

Jan 24, 2017

0.4.4

Sep 20, 2016

0.4.3

Jul 26, 2016

0.4.2

Jul 21, 2016

0.4.1

Apr 14, 2016

0.4.0

Jun 3, 2015

0.3.5

Jan 30, 2015

0.3.4

Jun 13, 2014

0.3.3

Aug 20, 2012

0.3.2

Aug 17, 2012

0.3.1

Jan 13, 2012

0.3.0

Jan 13, 2012

0.2.0

Oct 15, 2011

0.1.1

Oct 3, 2011

0.1.0

Oct 1, 2011

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fluent_logger-0.11.0.tar.gz (15.8 kB view details)

Uploaded Feb 29, 2024 Source

Built Distribution

fluent_logger-0.11.0-py3-none-any.whl (12.7 kB view details)

Uploaded Feb 29, 2024 Python 3

File details

Details for the file fluent_logger-0.11.0.tar.gz.

File metadata

Download URL: fluent_logger-0.11.0.tar.gz
Upload date: Feb 29, 2024
Size: 15.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for fluent_logger-0.11.0.tar.gz
Algorithm	Hash digest
SHA256	`813cf44b70ad204f7c161b79604892c89bc92df63fcd132f3c6734bab56e6527`
MD5	`834955cd6aeff0819d8a5f50375f1b4b`
BLAKE2b-256	`b9fe32dd1e09cd40292ab2b34759957d179b5fce0bdeb80dea42725e8f303eff`

See more details on using hashes here.

File details

Details for the file fluent_logger-0.11.0-py3-none-any.whl.

File metadata

Download URL: fluent_logger-0.11.0-py3-none-any.whl
Upload date: Feb 29, 2024
Size: 12.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for fluent_logger-0.11.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3485cc006294b8cb6faf2a832ba088da55b7daef3f99fd379468623108c7e2c7`
MD5	`54f461a8b93766e787e0e1d9f6c93e34`
BLAKE2b-256	`1d20bb3f05568e128e82bc9d27bbd85509f5777a1a3b15381c7553d02dffdc76`