Cleans the LaTeX code of your paper to submit to arXiv.

arxiv_latex_cleaner

This tool cleans the LaTeX code of your paper so that you can submit it to arXiv. From a folder containing all your code, e.g. /path/to/latex/, it creates a new folder, /path/to/latex_arXiv/, that is ready to ZIP and upload to arXiv.

Example call:

python -m arxiv_latex_cleaner /path/to/latex --im_size 500 --images_whitelist='{"images/im.png":2000}'
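
The whitelist value is easy to mis-quote by hand. Below is a minimal sketch of building the same call from Python. The helper name build_command is hypothetical and not part of the tool, and the sketch assumes the whitelist is accepted as a JSON-style dictionary, as the usage text suggests.

```python
import json


def build_command(input_folder, im_size=None, images_whitelist=None):
    """Build an argument list mirroring the example call above.

    Hypothetical helper for illustration; only flags listed in the
    tool's usage text are emitted.
    """
    cmd = ["python", "-m", "arxiv_latex_cleaner", input_folder]
    if im_size is not None:
        cmd += ["--im_size", str(im_size)]
    if images_whitelist:
        # Assumption: the tool reads this value as a JSON-style dictionary.
        cmd += ["--images_whitelist", json.dumps(images_whitelist)]
    return cmd


cmd = build_command("/path/to/latex", im_size=500,
                    images_whitelist={"images/im.png": 2000})
print(cmd)
```

Passing the resulting list to subprocess.run(cmd, check=True) avoids shell quoting issues, because the dictionary travels as a single argument.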

Setup:

git clone https://github.com/google-research/arxiv-latex-cleaner
cd arxiv-latex-cleaner/

Optionally, this may be installed and used as a command-line program:

python setup.py install
arxiv_latex_cleaner --help

Main features:

Privacy-oriented

  • Removes all auxiliary files (.aux, .log, .out, etc.).
  • Removes all comments from your code (yes, those are visible on arXiv and you do not want them to be). This also covers \begin{comment}\end{comment} environments.
  • Optionally removes user-defined commands listed in commands_to_delete (such as a \todo{} command that you would otherwise redefine as the empty string before submitting).
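
To illustrate what comment removal involves, here is a minimal sketch. It is not the tool's actual implementation, and the function name strip_comments is hypothetical. It assumes a comment starts at any % not preceded by a backslash, so it does not handle edge cases such as \\% or verbatim environments.

```python
import re

# Matches a comment environment, including its content (non-greedy).
COMMENT_ENV = re.compile(r"\\begin\{comment\}.*?\\end\{comment\}", re.DOTALL)
# Matches an unescaped '%' and everything after it on the same line.
LINE_COMMENT = re.compile(r"(?<!\\)%.*")


def strip_comments(text):
    """Remove comment environments and % comments from LaTeX source."""
    text = COMMENT_ENV.sub("", text)
    kept = []
    for line in text.splitlines():
        if line.lstrip().startswith("%"):
            continue  # the whole line is a comment: drop it
        # Keep a bare '%' so LaTeX still ignores the line break after it.
        kept.append(LINE_COMMENT.sub("%", line))
    return "\n".join(kept)


print(strip_comments("a % secret\n% whole line\n50\\% done"))
```

The escaped \% in the last line is left untouched, while the trailing comment on the first line is reduced to a bare %.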

Size-oriented

There is a 10MB limit on arXiv submissions, so to make it fit:

  • Removes all unused .tex files (those that are not in the root and not included in any other .tex file).
  • Removes all unused images that take up space (those that are not actually included in any used .tex file).
  • Optionally resizes all images to im_size pixels, to reduce the size of the submission. You can whitelist some images to skip the global size using images_whitelist.
  • Optionally compresses .pdf files using ghostscript (Linux and Mac only). You can whitelist some PDFs to skip the global resolution using images_whitelist.
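
The unused-image check above can be sketched as follows. This is an illustration under stated assumptions, not the tool's code: the names referenced_images and unused_images are hypothetical, only \includegraphics is considered, and the .tex sources are assumed to have had their comments removed already.

```python
import re
from pathlib import PurePosixPath

# Captures the path argument of \includegraphics, with optional [options].
INCLUDE_RE = re.compile(r"\\includegraphics(?:\[[^\]]*\])?\{([^}]+)\}")


def referenced_images(tex_sources):
    """Collect every path passed to \\includegraphics in the given sources."""
    refs = set()
    for source in tex_sources:
        refs.update(match.strip() for match in INCLUDE_RE.findall(source))
    return refs


def unused_images(image_paths, tex_sources):
    """Return the image paths that no source includes."""
    refs = referenced_images(tex_sources)
    unused = []
    for path in image_paths:
        # LaTeX allows the file extension to be omitted in the include.
        stem = str(PurePosixPath(path).with_suffix(""))
        if path not in refs and stem not in refs:
            unused.append(path)
    return sorted(unused)


tex = [r"\includegraphics[width=\linewidth]{images/a.png} \includegraphics{images/c}"]
print(unused_images(["images/a.png", "images/b.png", "images/c.pdf"], tex))
```

Here only images/b.png is reported, since images/c.pdf is matched through its extension-less reference.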

Usage:

usage: arxiv_latex_cleaner@v0.1.0 [-h] [--resize_images] [--im_size IM_SIZE]
                                  [--compress_pdf]
                                  [--pdf_im_resolution PDF_IM_RESOLUTION]
                                  [--images_whitelist IMAGES_WHITELIST]
                                  [--commands_to_delete COMMANDS_TO_DELETE [COMMANDS_TO_DELETE ...]]
                                  input_folder

Clean the LaTeX code of your paper to submit to arXiv. Check the README for
more information on the use.

positional arguments:
  input_folder          Input folder containing the LaTeX code.

optional arguments:
  -h, --help            show this help message and exit
  --resize_images       Resize images.
  --im_size IM_SIZE     Size of the output images (in pixels, longest side).
                        Fine tune this to get as close to 10MB as possible.
  --compress_pdf        Compress PDF images using ghostscript (Linux and Mac
                        only).
  --pdf_im_resolution PDF_IM_RESOLUTION
                        Resolution (in dpi) to which the tool resamples the
                        PDF images.
  --images_whitelist IMAGES_WHITELIST
                        Images (and PDFs) that won't be resized to the default
                        resolution, but to the one provided here. Value is in
                        pixels for images, and in dpi for PDFs, as in --im_size
                        and --pdf_im_resolution, respectively. Format is a
                        dictionary as: '{"path/to/im.jpg": 1000}'
  --commands_to_delete COMMANDS_TO_DELETE [COMMANDS_TO_DELETE ...]
                        LaTeX commands that will be deleted. Useful for e.g.
                        user-defined \todo commands.
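
As a worked example of how --im_size and --images_whitelist interact, the sketch below computes the output dimensions for one image. The function name target_size is hypothetical, and two details are assumptions rather than documented behaviour: the aspect ratio is preserved, and images already smaller than the limit are left unchanged.

```python
def target_size(width, height, im_size, whitelist=None, path=None):
    """Return (width, height) after limiting the longest side.

    A whitelisted path uses its own limit instead of the global im_size.
    Assumption: smaller images are never upscaled.
    """
    limit = (whitelist or {}).get(path, im_size)
    longest = max(width, height)
    if longest <= limit:
        return width, height
    scale = limit / longest
    return max(1, round(width * scale)), max(1, round(height * scale))


whitelist = {"images/im.png": 2000}
print(target_size(4000, 2000, 500, whitelist, "images/other.png"))
print(target_size(4000, 2000, 500, whitelist, "images/im.png"))
```

With the values from the example call, a 4000x2000 image is reduced to 500x250 by default, and to 2000x1000 if it is the whitelisted images/im.png.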

Note

This is not an officially supported Google product.

Source Distribution

arxiv_latex_cleaner-0.1.2.tar.gz (9.3 kB view details)

Uploaded Source

Built Distribution

arxiv_latex_cleaner-0.1.2-py3-none-any.whl (15.5 kB view details)

Uploaded Python 3

File details

Details for the file arxiv_latex_cleaner-0.1.2.tar.gz.

File metadata

  • Download URL: arxiv_latex_cleaner-0.1.2.tar.gz
  • Size: 9.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.8

File hashes

Hashes for arxiv_latex_cleaner-0.1.2.tar.gz
Algorithm Hash digest
SHA256 ace8162de6370e5e6ad7c46eb57b717f90c8aa79cd4508ec021524600d1b083e
MD5 3af763ebd460a38695f29d7d635eb307
BLAKE2b-256 30e457d660df4636f4769da9b23b056d0160fea3d4acac01e2ff1a0c7205d1db

File details

Details for the file arxiv_latex_cleaner-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: arxiv_latex_cleaner-0.1.2-py3-none-any.whl
  • Size: 15.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.6.8

File hashes

Hashes for arxiv_latex_cleaner-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 72b206728dc146acec93409d2c6342d3e64a1db3323b6391b2c8bcbb7c9a1c7b
MD5 86aad262a09ce2ebe7377d86e2b8c898
BLAKE2b-256 319b7675d706dae430b4cac55b089ea504e765d6b0dbcd6aeb73f2729636f4ed
