Skip to main content

Utilities to work with Scala/Java code with py4j

Project description

Version_Status Travis Conda Forge Conda_Forge

A set of compatibility routines for making it easier to interact with Scala from Python.

Occasionally Python-focused data shops need to use JVM languages for performance reasons. Generally this necessitates throwing away whole repositories of Python code and starting over or resorting to service architectures (e.g., Apache thrift) which increase system complexity.

You don’t have to.

Using py4j and Spylon you can readily interact with Scala code for more performance critical sections of your whilst leaving the rest unmodified.

Alternatively you can use it as a bridge to allow building wrappers for a Scala/Java codebase.

Installation

Spylon can be installed either from pip or conda-forge

Usage

The simplest way to use spylon is to use it to help with writing PySpark jobs. If you want to supply your own jars to load for usage as Spark user defined functions, you’d want to supply the jar with the udf implementation to spark via spark-submit.

For an easier interactive experience you can make use of the supplied Apache Spark launcher to make it simpler to instantiate a PySpark application from inside a python Jupyter notebook.

Extensions

Spylon is designed as an easy to extend toolkit. Since Apache Spark is a major user or Py4J, some special use cases have been implemented for that and its an example of some use cases for Spylon.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spylon-0.2.4.tar.gz (91.8 kB view details)

Uploaded Source

File details

Details for the file spylon-0.2.4.tar.gz.

File metadata

  • Download URL: spylon-0.2.4.tar.gz
  • Upload date:
  • Size: 91.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for spylon-0.2.4.tar.gz
Algorithm Hash digest
SHA256 94387e3f26215b5436bb8e150e31dd026dd5681b1fc43b4b626098843cf0eecb
MD5 c941f5edeb921612c9046ba7e501daf2
BLAKE2b-256 bf1ac418a6b74e561798a0b1a3a800081a51d19240c38b0a21a642974d7eaee8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page