Jump to content
Chinese-Forums
  • Sign Up

Chengyu and 4-character fixed expressions


Recommended Posts

Posted

I am looking for an electronic list of chengyu and other 4-character fixed expressions for the purpose of determining the most commonly used. I have a list of about 14,000, but I know there are over 40,000. Does anyone have a list or know where I could find one? Thanks

Posted

Thank you. I am familiar with this website and, although useful, it is not really in a format that I can easily make use of.

Posted

I'm not aware of anything available electronically. However, the owners of Chengyu.info and OneaDay.org might be aware of something.

Incidentally, if you can make that list of 14,000 idioms public, I'm sure plenty of people on here would be interested. Is fo, there's an attachment function, or you can email it to me at admin@chinese-forums.com and I'll make it available. If not, never mind.

Roddy

Posted

14,000, that is pretty impressive. Definitely, if you care to share that would be greatly appreciated as Roddy mentioned.

But I'm not sure if finding more of them will meet your stated goal: "determining the most commonly used"

For this purpose may I suggest using a lexical database or using one of many linguistic evaluation tools that can analyze mass quantities of text. From there, you can do statistical analysis to determine the frequency of usage. That way you can focus on learning those that are most useful first-- prioritizing.

I have been working on doing something similar in my own content production. I have a good friend who has two masters in linguistics. She maybe will to give you more specific advice as she has helped me. I can't guarantee though because she is terrifically busy, but a few p.m. me and give me your contact information I can pass it on to her and she is better at some of these things.

Also an introductory data mining course might be of some use to you.

Also you might want to try to identify authors of similar compilations:

One good example is here: http://kamares.ucsd.edu/~arobert/hanziData.html

And this site: A Review of Chinese Word Lists Accessible on the Internet

The main page here also has good sources: http://technology.chtsai.org/

These people of obviously use data mining techniques in the past and may be of assistance to you and you can learn a little bit more about it. Furthermore, I believe SourceForge has a tool that you can use.

Good luck, and if you are willing out of love to see your list.

OK just as I about to post, I realize that you may be using your compiled list as being input data from which to use a lexical scanner to look for matches. So maybe never mind. But maybe this helps, I don't know.

Good luck,

Join the conversation

You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Unfortunately, your content contains terms that we do not allow. Please edit your content to remove the highlighted words below.
Click here to reply. Select text to quote.

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...