Pure-Python Universal Game Translator
Project description
pyugt - Python Universal Game Translator
pyugt is a pure-python universal game translator: it takes screenshots from a region you select on your screen, uses OCR (via Tesseract v5) to extract the characters, then feeds it to a machine translator (Google Translate) to then show you a translated text.
Since it works directly on images, there is no need to hack the game or anything to access the text. It is also cross-platform (support for Windows and Linux - MacOSX does not yet work because hotkeys are not supported for now).
Here is a demo:
Of course, since the translation is done by a machine, don't expect a very nice translation, but for games where no translation is available, it can be sufficient to understand the gist and be able to play.
This software was inspired by the amazing work of Seth Robinson on UGT (Universal Game Translator).
How to install & update
First, install Tesseract v5, installers are provided by UB Mannheim. Make sure to install the additional languages you want to translate from (eg, Japanese).
Also, you need to install a Python interpreter. Anaconda is a good one.
Then, install this software:
pip install --upgrade pyugt
Or maybe what can be more easy if you want to edit the config file, is to download the archive from Github, unzip anywhere, cd in the folder and type:
python setup.py develop
Note the software was tested on Windows 10 x64 with Python 3.7 (Anaconda). It should also work on other Python versions and on Linux but this was not tested (please let me know if you try on Linux!).
How to use
-
First, you need to configure the config file
config.ini
. A sample config file is provided with the software that should work fine on Windows, but on other platforms or in some cases you may need to edit it, particularly to setup the path to the Tesseract binaries. The config file also allows you to change the hotkeys and the monitor to screenshot from, and a few other things such as the source and target languages (by default, the source is japanese and target is english). -
Then, you can launch the script:
python -m pyugt
-
Then, use the hotkey to first select a region to capture from (default hotkey:
CTRL+F2
). The selected region does not need to be very precise, but need to contain the text to translate. -
Finally, use the hotkey to translate what is shown in the region (default:
CTRL+F3
). This will display a window with the original text and the translated text. Make sure to close this window before requesting another translation, else it won't work (because of the GUI waiting loop).
IMPORTANT NOTE: The software is still in alpha stage (and may forever stay in this state). It IS working, but sometimes the hotkeys glitch and they do not work anymore. If this happens, simply focus the Python console and hit CTRL+C
to force quit the app, then launch it again. The selected region is saved in the config file, so you don't have to redo this step everytime.
Shortcomings & advantages
Compared to UGT, the translated text is not overlaid over a screenshot of the original text. This could maybe be done as Tesseract provides some functions (image_to_boxes()
or image_to_osd()
or image_to_data()
). PRs are very welcome if anyone would like to give it a try!
UGT can also directly select and translate the active window. We dropped this feature because it's platform dependent, so a region selection seemed like the most reliable and cross-platform way to implement screen capture, even if it adds one additional step.
On the other hand, there are several advantages to our approach:
-
it's cross-platform (Windows & Linux currently, MacOSX could be supported if we find a python module to register global hotkeys on it),
-
we use Tesseract so that OCR is done locally (instead of via the Google Cloud Vision API) so we only send text which is a lot smaller footprint and thus less expensive (generally free),
-
Regions can be selected, so that unnecessary screen objects that may confuse the OCR can be elimited with a carefully selected region,
-
We enforce the source and target languages, so that both the OCR and translator know what to expect, instead of trying to autodetect, which may fail particularly when there are names that may be written in another language or character form (eg, not in Kanji).
License
This software is made by Stephen Larroque and is published under the MIT Public License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pyugt-0.2.2.tar.gz
.
File metadata
- Download URL: pyugt-0.2.2.tar.gz
- Upload date:
- Size: 9.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 48c692dcf2d78ac41a6e78f44182bcaff3c96d97f53b70f2725ba9b81ae9eb23 |
|
MD5 | 13fd09cd9df22c073787057cee249e4a |
|
BLAKE2b-256 | c6e0dd14e9a6d305be8f9a5b4504f54802e07410726434f5888798f51697544c |
File details
Details for the file pyugt-0.2.2-py3-none-any.whl
.
File metadata
- Download URL: pyugt-0.2.2-py3-none-any.whl
- Upload date:
- Size: 8.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | be3db4e53ac20e932bc944b04f80ef018b7d1d2ac1fc61ddb8e005263a7546a3 |
|
MD5 | bc98ccf411cee862e61ab9d114e6975b |
|
BLAKE2b-256 | c5e1795a54ea0eaf06ffec9c9f4f82d06f483ae80951d4913a58c4a056e6d050 |