uni419 Posted February 28, 2015 at 03:15 PM Report Posted February 28, 2015 at 03:15 PM Anyone know of tools which would allow for mass generation of monolingual definitions from a .csv file? (preferably also feed back into a .csv file) Quote
edelweis Posted February 28, 2015 at 09:48 PM Report Posted February 28, 2015 at 09:48 PM Do you mean, you have a list of Chinese words in one column of a csv file, and you want to output a csv file with the words in one column, and the Chinese definitions in another column? I don't know any tool that does this. But if you have a C-C dictionary file in text format, it should not be difficult to create a small program to read the words, search the dictionary, and output the entries... For instance if you wanted entries from CEDICT, it would be a simple matter of selecting the input column with sed or awk (there are examples of csv column selection on the internet), grep-ing the word in the CEDICT file to obtain the definition line, etc. But CEDICT is not monolingual. 1 Quote
uni419 Posted February 28, 2015 at 11:47 PM Author Report Posted February 28, 2015 at 11:47 PM Yeah, no programming experience whatsoever though. I'll look around on the internet and see what I can throw together. I'm trying to use the Chinese-Chinese dictionary included with OSX. Quote
imron Posted March 1, 2015 at 02:26 AM Report Posted March 1, 2015 at 02:26 AM Out of curiosity, where can one find the C-C dictionary included with OSX? Quote
uni419 Posted March 1, 2015 at 04:19 AM Author Report Posted March 1, 2015 at 04:19 AM My bad, it's just 现代汉语规范词典, I'd imported it into OSX‘s dictionary app Quote
xiaokaka Posted March 1, 2015 at 09:21 AM Report Posted March 1, 2015 at 09:21 AM I'm pretty sure that 现代汉语规范词典 is included in the dictionary app in OSX (since I have it as well and I've never installed/imported any external dictionaries), you just have to manually enable it (and maybe download it, I don't remember). 1 Quote
uni419 Posted March 1, 2015 at 09:32 AM Author Report Posted March 1, 2015 at 09:32 AM Ah, cool. So circling back around to the original topic, lets say I wanted to do a pull request on around 3-4k of words, with split definitions, example sentences and part of speech exported to a csv. Import file is also a cvs. Dictionary used is the copy of 现代汉语规范词典 which is apparently included with OSX, does anyone know how I'd either A.) How to do that (guessing unlikely) B.) Where I can find resources to educate myself enough to hack some together myself (feeling this is probably more likely) ? Quote
tysond Posted March 1, 2015 at 10:23 PM Report Posted March 1, 2015 at 10:23 PM If you can get the dictionary as a CSV format, you could do it in Excel. =VLOOKUP(column of 3-4k words, table of dictionary, column # of definition in dictionary, TRUE) should do the trick. 1 Quote
uni419 Posted March 2, 2015 at 01:29 PM Author Report Posted March 2, 2015 at 01:29 PM So now the question becomes, does anyone where i could get my hands on a C-C dictionary in CSV format? Quote
Recommended Posts
Join the conversation
You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.