chinesemadrush Posted January 13, 2017 at 09:50 AM Report Posted January 13, 2017 at 09:50 AM Hi everyone, I am currently studying chinese using Anki and it's not very helpful to remember the words without understanding how they are used. Hence, I wanted to include sample sentences in my flashcards. Manually extracting them from websites like yellowbridge would be way too time consuming. May I know if any of you have good suggestions? Thank you! 1 Quote
fabiothebest Posted January 13, 2017 at 10:05 AM Report Posted January 13, 2017 at 10:05 AM There is a program called Chinese Text Analyser by @imron that is very good for segmenting text, keeping track of known and unknown words and create wordlists. You are especially interested in sentences though. Another thing that you could do is using sub2srs for creating flashcards based on movie subtitles. If you want to extract sentences from websites, I'm afraid you need to do it manually. There aren't programs specifically made for that I think. There are some general scrapers but for general purpose and you should know how to customize and use them. I'm also interested in suggestions from others. We could make a list of such websites with example sentences and then see if there are any programs for extracting them. If there isn't anything I might consider coding a script for extracting the sentences. I should take into account the different layout of every website, that's why making a list of websites would be useful. I don't call myself a professional programmer, but that's something I could do and there are some other programmers in this forum so it's something that could be done if it doesn't exist yet. 2 Quote
Guest realmayo Posted January 13, 2017 at 10:42 AM Report Posted January 13, 2017 at 10:42 AM Consider downloading and playing with the popular Anki deck called Chinese Sentences and audio, spoon fed https://ankiweb.net/shared/info/2003820603 Also ask yourself if it really is time-consuming to add them manually: after all, you're only really having to paste the word into a website and then paste an example sentence or two into Anki. The rest of the time is spent reading the example sentences that the website returns and picking which examples you want to select. That can be viewed as part of the process of learning and understanding more about how a given word is used. Quote
fabiothebest Posted January 13, 2017 at 11:12 AM Report Posted January 13, 2017 at 11:12 AM I was aware about the existence of an anki deck with sentences although I didn't try it yet. I don't know if it contains mistakes or not. I'll try it. Someone who tried it can give a feedback. Hmm I'm not actually sure that learning many sentences this way with Anki can be beneficial. Maybe it's better to just search the usage of the words you are studying and need to use at the moment. Otherwise there is also Glossika, that is more listen and repeat type. You just play the audio file, you don't need to switch cards. The quality of the sentences matters though, so since I haven't tried the anki deck yet I can't really judge. I think that self made materials are the best because they are personal and based on your needs. Materials made by others may be less interesting or useful for you but also contain things that you wouldn't think of because you haven't been exposed to them, so it's worth trying something like that anyway. Anyone has his own learning method. There are many ways to learn Chinese. It's important to set some goals and stick to them. Quote
imron Posted January 13, 2017 at 01:43 PM Report Posted January 13, 2017 at 01:43 PM There is a program called Chinese Text Analyser by @imron that is very good for segmenting text, keeping track of known and unknown words and create wordlists. You are especially interested in sentences though Chinese Text Analyser can extract sentences too, with optional cloze deletion of the word in question, just choose the 'Sentence' or 'Cloze Sentence' field from the export word list dialog box. So for example, you could export cloze deleted sentences for the top 20 most frequent unknown words in a given document. Even better, with the new Lua scripting support you can get CTA to process all files in a directory (and all sub-directories) and create anki-compatible cloze deleted sentences for all 'mostly known sentences' found (where mostly know means that great than a certain percentage of all words in the sentence are known). In fact, one of the example scripts provided (anki-cloze.lua) does exactly that, and is explained step by step in the Lua example documentation. 2 Quote
Yadang Posted January 13, 2017 at 09:16 PM Report Posted January 13, 2017 at 09:16 PM Another thing that you could do is using sub2srs for creating flashcards based on movie subtitles We also have a post dedicated to Anki decks that have been made using Subs2SRS to cut the movie into little fragments of text and the corresponding audio. You can then use the decks with Imron's Chinese Text Analyzer to tag all of the sentences containing words you don't know, and provide definitions and pinyin. Quote
chinesemadrush Posted January 14, 2017 at 10:11 AM Author Report Posted January 14, 2017 at 10:11 AM Thanks everyone for the inputs. Would explore the various options mentioned. @fabiothebest, I previously tried using scrapping tools on websites but they often have this anti-botting system that requires you to identify you are not a robot after a few words or so. For example, they would ask you to type out certain words on screen into a box. Any suggestions you have in mind? Thanks cone again. Quote
fabiothebest Posted January 14, 2017 at 12:37 PM Report Posted January 14, 2017 at 12:37 PM @chinesemadrush If I have time I'll try and if I come up with something usable, I'll post it here. 1 Quote
Flickserve Posted January 15, 2017 at 12:15 AM Report Posted January 15, 2017 at 12:15 AM Are you making cards with just the word on the front and then the sentences on the back? I have made a lot of Anki cards. I am afraid there is no easy way to select sentences when you create your own individual cards. Because the sentences you select are personalised to your own knowledge. For instance, I ignore long sentences or sentences which have a lot of unknown vocabulary. At my low intermediate stage, it doesn't help to include such sentences. Quote
chinesemadrush Posted January 18, 2017 at 02:30 PM Author Report Posted January 18, 2017 at 02:30 PM @fabiothebest thank you! @Flickserve Nope, I make them with words on the front and at the back I have the meaning, example sentences etc. Quote
roddy Posted January 18, 2017 at 10:01 PM Report Posted January 18, 2017 at 10:01 PM @chinesemadrush - your email address isn't working. Can you update it, or opt out of emails Quote
Recommended Posts
Join the conversation
You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.