Popular Post wibr Posted June 13, 2014 at 09:26 AM Popular Post Report Posted June 13, 2014 at 09:26 AM So, for everyone who is interested: https://github.com/jiong3/glocutch Run it on the commandline with: python cut.py glossika.pdf The script will create folders containing the 1000 png files for each language (traditional, simplified, english, pinyin ...) This should support all available Chinese pdf-files. except the TRVL folder. I will add support for this later. Edit: Just added support for TRVL folder For me this script has served it's purpose, so I don't want to provide any future support. But feel free to ask questions here if something doesn't work or use github. You need python2.x + numpy + opencv and imagemagick (the convert command for pdf -> png). If you don't have this installed, for linux use the repositories, for mac use homebrew and for windows use google. ;-) 6 Quote
crt32 Posted December 11, 2014 at 12:00 PM Report Posted December 11, 2014 at 12:00 PM Are you by any chance working on slicing the new updated layout? (If not, I will try myself probably this weekend.) Quote
wibr Posted December 12, 2014 at 05:54 AM Author Report Posted December 12, 2014 at 05:54 AM No sorry, I won't do the whole procedure again... First I would try to adapt the parameters of the script at the beginning. If you have any specific questions about the code (I hope I can still understand it myself), feel free to ask. Quote
crt32 Posted December 13, 2014 at 10:38 AM Report Posted December 13, 2014 at 10:38 AM Thanks, I used your code as reference but ended up using a different approach https://github.com/crt32/GlossikaSlice It should work for any of the PDFs with the new layout, but I can't test it. Quote
Recommended Posts
Join the conversation
You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.