Friday Posted October 19, 2011 at 10:59 AM
I have CC-CEDICT as a single file. How can I import it into a spreadsheet? In the original file, some words show an extra field with simplified characters, so when I try to import the information, some rows have more cells and don't line up.
jbradfor Posted October 19, 2011 at 01:39 PM
[In CC-CEDICT, all lines have both simplified and traditional.] I haven't found a perfect way, because the format can't be split cleanly on any single delimiter character. The best way I've found is a two-pass approach: import the file once using a space as the column delimiter, and then a second time using "[" and "]" as the column delimiters. You will then have the characters in the first two columns of the first import, and the pinyin and definitions in the second and third columns of the second import.
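If scripting is an option, the same split can be done in a single pass. Below is a minimal Python sketch, not a tested converter for every release of the dictionary. It assumes each entry looks like `Traditional Simplified [pin1 yin1] /gloss 1/gloss 2/`, that neither headword contains a space, and that comment lines start with `#`. The function names `parse_cedict_line` and `cedict_to_csv` are made up for illustration.

```python
import csv
import re

# Assumed entry layout: Traditional Simplified [pin1 yin1] /gloss 1/gloss 2/
ENTRY_RE = re.compile(r"^(\S+)\s+(\S+)\s+\[([^\]]*)\]\s+/(.*)/\s*$")


def parse_cedict_line(line):
    """Return [traditional, simplified, pinyin, definitions], or None to skip."""
    line = line.strip()
    # Skip blank lines and comment lines (assumed to start with '#').
    if not line or line.startswith("#"):
        return None
    match = ENTRY_RE.match(line)
    if match is None:
        # The line does not fit the assumed layout; skip it rather than guess.
        return None
    traditional, simplified, pinyin, glosses = match.groups()
    # Glosses are separated by '/'; join them so they fit in one cell.
    return [traditional, simplified, pinyin, "; ".join(glosses.split("/"))]


def cedict_to_csv(lines, out):
    """Write parsed entries to the file-like object `out`; return the row count."""
    writer = csv.writer(out)
    writer.writerow(["traditional", "simplified", "pinyin", "definitions"])
    count = 0
    for line in lines:
        row = parse_cedict_line(line)
        if row is not None:
            writer.writerow(row)
            count += 1
    return count
```

To use it, open the dictionary file with `encoding="utf-8"`, open the output file with `newline=""`, and pass both to `cedict_to_csv`. Because every row has exactly four cells, the columns should line up when the CSV is opened in a spreadsheet. Some spreadsheet programs only detect UTF-8 if the output is written with `encoding="utf-8-sig"`.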
Friday Posted October 22, 2011 at 09:53 PM
Great, I will try it out. Thanks.