MSOComprex

1) Presentation
Lingua et Machina is a language
tools provider:
- Translation Memory (CAT)
- aligner
- statistical and neural automatic translation
- automatic terminology extraction and lexico-semantic network building
- specific on-demand tools
For more information, see:
http://www.lingua-et-machina.com/
MSOComprex is provided as is, with
no warranty, at your own risk.
You can use it freely, for any purpose.
You can redistribute it, at the condition you are preserving this README,
with its full content.
Reverse ingineering, decompilation, and any kind of modification are
forbidden.
For any question, please contact us here:
support@lingua-et-machina.com
MSOComprex is a small peace of
software which will enable you to:
- remove images from a large MS Office file (DOCX, PPTX, or XLSX), to make
it smaller
- resize images from a large MS Office file (DOCX, PPTX, or XLSX), to make
it smaller
- restore original images after having change the text, typically using a
CAT tool to translate it,
with or without resizing these images,
at the condition that the images and the page layouts weren't
modified.
Typical use case: when using MS Office to convert a PDF file to a DOCX file,
MS Word is often producing very huge file size. With a 4M PDF file, you may
obtain
a 130M DOCX file. This is due to the fact that MS Word is resampling all
images
according with high DPI. It's then very hard to send such huge file size by
email,
or simply to upload it to a online web service, like a CAT tool.
2) INSTALL MSOComprex
Download the zip archive here:
http://www.lingua-et-machina.com/MSOComprex.zip
Unzip this file somewhere on your disk, for example in a folder:
c:\MSOComprex
Download and install Java, see here:
https://www.java.com/
3) REMOVE IMAGES FROM A MS OFFICE FILE (DOCX, PPTX, or XLSX)
- open the MSOComprex folder in a file browser
- drag&drop your MS Office file over the "MSOComprex.bat" file
- you should get a new file suffixed with "_CPX", for example "test.docx"
will give you "test_CPX.docx". In this file, all images should be
replaced
by a small green-cross icon.
4) RESIZE IMAGES FROM A MS OFFICE FILE (DOCX, PPTX, or XLSX)
- open the MSOComprex folder in a file browser
- drag&drop your MS Office file over the "MSOComprex_600px.bat" file
- you should get a new file suffixed with "_CPX", for example "test.docx"
will give you "test_CPX.docx". In this new file, images having a
width
larger than 600px should be reduced to a width of 600px
For different image max width, you may use the other provided BAT files,
or create your own with your own size by copying/editing one of them.
5) RESTORE IMAGES AFTER TEXT MODIFICATION
On order to work poperly, only the text may have been modified.
IMAGES AND PAGE LAYOUTS MUST NOT HAVE BEEN MODIFIED !
- open the MSOComprex folder in a file browser
- drag&drop both your MS Office file and the "_CPX" file over the
"MSOComprex.bat" file
- you should get a new file suffixed with "_CPX_RESTORED", for example
"test.docx"
and "test_CPX.docx" will give you "test_CPX_RESTORED.docx". In this
file,
all images should be replaced back by the large original images.
If you want to resize original images while replacing them, use for example
"MSOComprex_600px.bat", or one other according to the max width expected.
Best regards,
L&M team