MSOComprex

MSOComprex

1) Presentation

Lingua et Machina is a language tools provider:
- Translation Memory (CAT)
- aligner
- statistical and neural automatic translation
- automatic terminology extraction and lexico-semantic network building
- specific on-demand tools

For more information, see:
http://www.lingua-et-machina.com/

MSOComprex is provided as is, with no warranty, at your own risk.
You can use it freely, for any purpose.
You can redistribute it, at the condition you are preserving this README, with its full content.
Reverse ingineering, decompilation, and any kind of modification are forbidden.

For any question, please contact us here:
support@lingua-et-machina.com

MSOComprex is a small piece of software which will enable you to:
- remove images from a large MS Office file (DOCX, PPTX, or XLSX), to make it smaller
- resize images from a large MS Office file (DOCX, PPTX, or XLSX), to make it smaller
- restore original images after having change the text, typically using a CAT tool to translate it,
  with or without resizing these images,
  at the condition that the images and the page layouts weren't modified.

Typical use case: when using MS Office to convert a PDF file to a DOCX file,
MS Word is often producing very huge file size. With a 4M PDF file, you may obtain
a 130M DOCX file. This is due to the fact that MS Word is resampling all images
according with high DPI. It's then very hard to send such huge file size by email,
or simply to upload it to a online web service, like a CAT tool.

2) INSTALL MSOComprex


Download the zip archive here:
http://www.lingua-et-machina.com/MSOComprex.zip

Unzip this file somewhere on your disk, for example in a folder:
c:\MSOComprex

Download and install Java, see here:
https://www.java.com/

3) REMOVE IMAGES FROM A MS OFFICE FILE (DOCX, PPTX, or XLSX)


- open the MSOComprex folder in a file browser
- drag&drop your MS Office file over the "MSOComprex.bat" file
- you should get a new file suffixed with "_CPX", for example "test.docx"
  will give you "test_CPX.docx". In this file, all images should be replaced
  by a small green-cross icon.

ScreenRemove

4) RESIZE IMAGES FROM A MS OFFICE FILE (DOCX, PPTX, or XLSX)


- open the MSOComprex folder in a file browser
- drag&drop your MS Office file over the "MSOComprex_600px.bat" file
- you should get a new file suffixed with "_CPX", for example "test.docx"
  will give you "test_CPX.docx". In this new file, images having a width
  larger than 600px should be reduced to a width of 600px

For different image max width, you may use the other provided BAT files,
or create your own with your own size by copying/editing one of them.

5) RESTORE IMAGES AFTER TEXT MODIFICATION


On order to work poperly, only the text may have been modified.
IMAGES AND PAGE LAYOUTS MUST NOT HAVE BEEN MODIFIED !

- open the MSOComprex folder in a file browser
- drag&drop both your MS Office file and the "_CPX" file over the "MSOComprex.bat" file
- you should get a new file suffixed with "_CPX_RESTORED", for example "test.docx"
  and "test_CPX.docx" will give you "test_CPX_RESTORED.docx". In this file,
  all images should be replaced back by the large original images.

If you want to resize original images while replacing them, use for example
"MSOComprex_600px.bat", or one other according to the max width expected.

ScreenRestore

Best regards,

L&M team