Skip to main content

Replicate files to another computer for remote development

Project description

File Replicator

Replicate files one-way to another computer e.g. for remote development.

A key use-case is to keep in sync a directory of development files from a computer on which the files are edited with a copy of those files in a docker container running on a remote docker host.

Dependencies are:

  • Python and some Python packages on the development machine
  • Ability to run bash on the remote machine with connected stdin

Nothing is installed remotely.

This has only been tested between two Linux machines.

How it works

The approach is to run a small bash program on the remote end which is able to add/update new files in (potentially) new directories. It receives commands over stdin, endlessly waiting for:

  • an absolute path to a filename
  • a newline
  • an integer number of bytes
  • a newline
  • that many bytes of data
  • ...repeat...

The controlling end then simply sends files over to the stdin of the receiving bash program. Establishing the connection to the remote end is outside the remit of this tool. Instead it accepts as an argument the command to make such a connection. See examples below.

It has two key operations:

  1. recursively walking a source tree of files and sending them over the wire to the destination
  2. watching for changes or new files and directories before sending them over the wire to the destination

So there is no "difference algorithm" like rsync, no attempt to compress, the connection is made entirely using standard means like ssh and docker, no ports to open, and even the bash program on the remote end is sent over every time so nothing is installed remotely.

This is sufficient for editing code on a local computer and automatically replicating to a remote server or docker container.

Usage and examples

See help:

Usage: file-replicator [OPTIONS] SRC_DIR DEST_PARENT_DIR
                       [CONNECTION_COMMAND]...

  Replicate files to another computer e.g. for remote development.

  SRC_DIR is the source directory on this machine.

  DEST_PARENT_DIR is the (absolute) destination parent directory on the
  remote machine accessed using the CONNECTION_COMMAND.

  The CONNECTION_COMMAND must result in a running instance of bash ready to
  receive commands on stdin.

  Example CONNECTION_COMMANDS include:

      ssh some.host.com bash

      docker exec -i my_container bash

      docker-compose exec -T my_container bash

  So a full use of the tool might look like:

      file-replicator code_dir /home/code -- docker exec -i a_container bash

  (the use of "--" prevents any further processing of command line arguments
  by file-replicator, leaving them all for docker)

  Initially, all files and required directories are recursively copied. Then
  waits for changes before copying each modified or new file.

  Note that empty directories are not replicated until they contain a file.

  Lastly, the only time the tool deletes files or directories is if called
  with the optional --clean-out-first switch.

Options:
  --clean-out-first  Optionally start by cleaning out the destination
                     directory.
  --help             Show this message and exit.

For example, to replicate files from local directory my_project_dir to directory /home/code/my_project_dir on remote machine called my.server.com:

file-replicator my_project_dir /home/code ssh my.server.com bash

As another example, to replicate files from local directory my_project_dir to directory /home/code/my_project_dir in a running docker container called my_container on a potentially remote host (depending upon the DOCKER* environment variables e.g. as set by docker-machine eval):

file-replicator my_project_dir /home/code -- docker exec -i my_container bash

Or to do the same but using docker-compose instead:

file-replicator my_project_dir /home/code -- docker-compose exec -T my_container bash

Lastly, as a degenerate example which doesn't actually connect to a remote machine at all but replicates into /tmp/my_project_dir:

file-replicator my_project_dir /tmp bash

Limitations

Due to limitations with inotify (race conditions around watching for changes in newly created directories), it is possible that the watching-for-changes phase becomes out of step. In which case, just restart the whole program. (the tool includes some self-restarting behaviour, but ultimately a full restart may sometimes be needed).

Information printed to stdout indicates when this happens.

Tests

============================= test session starts ==============================
platform linux -- Python 3.6.7, pytest-3.10.1, py-1.7.0, pluggy-0.8.0 -- /home/tcorbettclark/.cache/pypoetry/virtualenvs/file-replicator-py3.6/bin/python
cachedir: .pytest_cache
rootdir: /home/tcorbettclark/code/file-replicator, inifile:
collecting ... collected 8 items

tests/test_lib.py::test_empty_directories_are_not_copied PASSED            [ 12%]
tests/test_lib.py::test_copy_one_file PASSED                               [ 25%]
tests/test_lib.py::test_copy_file_with_unusual_characters_in_name PASSED   [ 37%]
tests/test_lib.py::test_make_missing_parent_directories PASSED             [ 50%]
tests/test_lib.py::test_replicate_all_files PASSED                         [ 62%]
tests/test_lib.py::test_detect_and_copy_new_file PASSED                    [ 75%]
tests/test_lib.py::test_detect_and_copy_modified_file PASSED               [ 87%]
tests/test_lib.py::test_detect_and_copy_new_file_in_new_directories PASSED [100%]

=========================== 8 passed in 3.93 seconds ===========================

Contributions

Pull-requests welcome. Please considering including tests.

The package is maintained using poetry (https://poetry.eustace.io) and pyenv (https://github.com/pyenv/pyenv).

The code is formatted using black (https://black.readthedocs.io/en/stable).

It is tested using pytest (poetry run pytest). Note that in order to run these tests the current user must be able to ssh to localhost without a password.

Commit checklist

  1. check version both in pyproject.toml and file_replicator/__init__.py
  2. check git tag
  3. isort -rc .
  4. black .
  5. pytest -v
  6. update this README.md with the latest output from the tests
  7. update this README.md with the latest output from the --help option

TODO

  • Add option to exclude certain files
  • Add docs to show an example output. Possibly a screenshot so it looks nice.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

file-replicator-0.1.2.tar.gz (7.2 kB view details)

Uploaded Source

Built Distribution

file_replicator-0.1.2-py3-none-any.whl (12.9 kB view details)

Uploaded Python 3

File details

Details for the file file-replicator-0.1.2.tar.gz.

File metadata

  • Download URL: file-replicator-0.1.2.tar.gz
  • Upload date:
  • Size: 7.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.10 CPython/3.6.7 Linux/4.4.0-139-generic

File hashes

Hashes for file-replicator-0.1.2.tar.gz
Algorithm Hash digest
SHA256 eecb64f2980bc19265b0df1a58154ba4231069ed2dfa8df36cee9266c1c08f84
MD5 3799cda22633ef58410661c815ec739e
BLAKE2b-256 16fb54261e8a65405089298d6618a06a3e9d4aadfe52726fe9e04f0a95565d4e

See more details on using hashes here.

File details

Details for the file file_replicator-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: file_replicator-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 12.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.10 CPython/3.6.7 Linux/4.4.0-139-generic

File hashes

Hashes for file_replicator-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2fdb6d4032594ce8a84f8ec321e614ed79be5bb3e6cc0cc09ea1c252d8c970b8
MD5 fabf6e20373690c1c5b93efcbdfd9051
BLAKE2b-256 b4f55c1818a3ad086b469c7d85902137ff9de666bae8c88ebba0e39840b6079c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page