ipsi() Posted August 5, 2007 at 05:40 AM Report Posted August 5, 2007 at 05:40 AM EDIT: Ok, by flagrantly abusing my University's decision to give me a homepage, you can now view the HTML files here: http://www.mcs.vuw.ac.nz/~thorbuandr/. Please try not to view it too much I like the Computer Science department (free internet, homepages, ability to VNC in and avoid the great firewall, etc. ). EDIT: Uploaded new files, changed encoding to UTF-8 on both files due to Big5 not supporting certain traditional characters according to EmEditor... One of the things that did annoy me about the NPCR 1 book was that there was Pinyin all the way through for the main texts (and pinyin for supplementary words in the Reading Comprehension and Paraphrasing bit), so I've gone through and typed up all the texts into characters only. I'm also going to do a Pinyin version at some point. I've also used HanConv to convert them into Traditional Characters, so I would appreciate it if someone who knows traditional characters could take a look and tell me if there are errors. There's probably grammer errors in there due to the authors wanting simplicity, so if you spot those they're probably not my fault. If you spot a character on the other hand that's just blatantly wrong, then tell me and I'll see if I've made a boo-boo. I've already checked it once, but that might not be enough... They're HTML files as I was going to put them on my Treo, and that's probably the easiest format. Sad, but there we go. As for the table borders, Blazer doesn't like displaying tables properly without borders I don't think I'm violating copyright or anything, but I can't find a way to contact the authors, so I guess I (or an admin) will just take it down if they complain. I'll also probably do book two, and maybe three, just so I can have them on my PDA (and so people can have pinyin versions of those two). Finally, apologies for putting them up as TXT files. We're not allowed to upload HTML files. Just change the extension and they'll work fine. If people would prefer, I'll chuck up doc or PDF versions or something. NPCR1Chars.txt NPCR1Trad.txt 1 Quote
skylee Posted August 5, 2007 at 09:48 AM Report Posted August 5, 2007 at 09:48 AM I see only gibberish in the file NPCR1Chars.txt . Re your file NPCR1Trad.txt, I've noticed these errors - Lesson 4 part 2 我是語言學院的學生。我姓林,叫林娜。我是英國人。你姓什么? [should be 麽] 我不是加拿大人,我是美國人,也是語言學院的學主。[should be 生] Lesson 5 parts 1 & 2 沒關系。[should be 係] Lesson 6 part 1 林娜,昨天的京劇怎么樣?[should be 麽] 太好了!什么時候去?[should be 麽; 什麽 is ok but some people, like me, prefer 甚麽] ** All the 么 in the file are wrong. Lesson 7 part 1 謝謝(看名片)啊,您是張教援。 [should be 授] 您是語言學院的教援,認識您,我們很高興。[should be 授] 張教援,您忙不忙?[should be 授] Lesson 7 part 2 那是語言學院的漢語老師女老師姓陳,男老師姓楊。[seems to me a punctuation mark before 女老師 is missing] 田小姐不是老師,她是語言學院的醫生。[a doctor of the language institute, this is new] Lesson 8 part 1 他媽媽姓丁,叫丁云,是中國人。[i suspect it should be 雲. But since it is a name it can be anything.] Lesson 9 part 2 當然喝紅葡萄酒,我們還吃壽面。[should be 麵] 吃壽面?真有意思。[should be 麵] 好,十一月十二號我們再來吃壽面。 [should be 麵] 他們在北京烤鴨店吃烤鳴和壽面,喝紅葡萄酒。 [should be 麵] 宋華說那天他們再來吃壽面和烤鴨。 [should be 麵] Lesson 10 part 1 這兒有沒有書和報?[i don't think we call newspaper like this. should be 報紙。] Lesson 11 part 1 哪里,我的漢語不太好。[should be 裡 or 裏] Lesson 11 part 2 現在去還是下去?[looks like something is missing, probably "午" between "下" and "去"] 你愿意吃中藥還是愿意吃西藥?[should be 願] 我愿意吃中藥。[should be 願] Lesson 12 part 1 我認識了一個漂亮的姑娘,她愿意做我女朋友。[should be 願] 祝賀你!這是好事啊。[probably nobody speaks like this. 恭喜你 is more appropriate.] 我租了一件很合適的房子。[should be 間] Lesson 14 part 1 祝你圣誕快樂 [should be 聖, all the 圣 are wrong] 你的臟衣服太多了。 [should be 髒] 我們也向你們。[should be 想] 1 Quote
ipsi() Posted August 5, 2007 at 04:28 PM Author Report Posted August 5, 2007 at 04:28 PM Thanks very much for that! On the topic of the NPCR1Chars.txt file, I suspect you haven't set your encoding right. Download EmEditor and force it to use GB2312 encoding for that file (if it doesn't detect it automatically). I'll fix those errors shortly and upload. In response to specific errors: Some punctuation was missing, thanks In lesson 7, it really does say "医生", believe it or not. Regarding "祝賀你“, I'm merely reprinting what's in the textbook. While nobody may actually *say* that, it's what's in the textbook. Given that I'm reproducing this to help people get the most out of the textbook, replacing it with what should be said isn't exactly the best idea in the world. The same logic applies to "報紙", that's not what's in the textbook. Could be a typo, but that's another discussion entirely. I also prefer "甚麽", and it's what the textbook says is the traditional version, so we'll go with that. I think I've fixed all the errors. Something to note, though, is that Big5 encoding doesn't support 麽, or 零(or is it 〇?) which I find very strange. So I've converted it to UTF-8 encoding. I don't think it makes much difference (I'll do the same with the simplified one). I'm also going with 丁云 being 丁雲, as the character 王小云's traditional name is 王小雲. I'll edit my original post in a second to upload the new versions. Thanks again for the help! Quote
ipsi() Posted August 5, 2007 at 05:07 PM Author Report Posted August 5, 2007 at 05:07 PM I've updated my original post with somewhere to actually view the HTML files. Yay. Quote
Recommended Posts
Join the conversation
You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.